biaffine_dep¶

Biaffine dependency parser.

class hanlp.components.parsers.biaffine.biaffine_dep.BiaffineDependencyParser[source]¶

Biaffine dependency parsing (Dozat & Manning 2017).

build_criterion(**kwargs)[source]¶

Implement this method to build criterion (loss function).

Parameters: **kwargs – The subclass decides the method signature.

build_dataloader(data, shuffle, device, training=False, logger=None, gradient_accumulation=1, sampler_builder=None, batch_size=None, **kwargs) → torch.utils.data.dataloader.DataLoader[source]¶

Build dataloader for training, dev and test sets. It’s suggested to build vocabs in this method if they are not built yet.

Parameters

data – Data representing samples, which can be a path or a list of samples.
batch_size – Number of samples per batch.
shuffle – Whether to shuffle this dataloader.
device – Device tensors should be loaded onto.
logger – Logger for reporting some message if dataloader takes a long time or if vocabs has to be built.
**kwargs – Arguments from **self.config.

build_metric(**kwargs)[source]¶

Implement this to build metric(s).

Parameters: **kwargs – The subclass decides the method signature.

build_model(training=True, **kwargs) → torch.nn.modules.module.Module[source]¶

Build model.

Parameters

training – True if called during training.
**kwargs – **self.config.

build_optimizer(epochs, trn, gradient_accumulation, **kwargs)[source]¶

Implement this method to build an optimizer.

Parameters: **kwargs – The subclass decides the method signature.

build_vocabs(dataset, logger=None, transformer=None)[source]¶

Override this method to build vocabs.

Parameters

trn – Training set.
logger – Logger for reporting progress.

evaluate_dataloader(loader: hanlp.common.dataset.PadSequenceDataLoader, criterion, logger=None, filename=None, output=False, ratio_width=None, metric=None, **kwargs)[source]¶

Evaluate on a dataloader.

Parameters

data – Dataloader which can build from any data source.
criterion – Loss function.
metric – Metric(s).
output – Whether to save outputs into some file.
**kwargs – Not used.

execute_training_loop(trn, dev, devices, epochs, logger, patience, save_dir, optimizer, gradient_accumulation, **kwargs)[source]¶

Implement this to run training loop.

Parameters

trn – Training set.
dev – Development set.
epochs – Number of epochs.
criterion – Loss function.
optimizer – Optimizer(s).
metric – Metric(s)
save_dir – The directory to save this component.
logger – Logger for reporting progress.
devices – Devices this component and dataloader will live on.
ratio_width – The width of dataset size measured in number of characters. Used for logger to align messages.
**kwargs – Other hyper-parameters passed from sub-class.

fit(trn_data, dev_data, save_dir, feat=None, n_embed=100, pretrained_embed=None, transformer=None, average_subwords=False, word_dropout=0.2, transformer_hidden_dropout=None, layer_dropout=0, scalar_mix: Optional[int] = None, embed_dropout=0.33, n_lstm_hidden=400, n_lstm_layers=3, hidden_dropout=0.33, n_mlp_arc=500, n_mlp_rel=100, mlp_dropout=0.33, lr=0.002, transformer_lr=5e-05, mu=0.9, nu=0.9, epsilon=1e-12, grad_norm=5.0, decay=0.75, decay_steps=5000, weight_decay=0, warmup_steps=0.1, separate_optimizer=False, patience=100, lowercase=False, epochs=50000, tree=False, proj=False, punct=False, min_freq=2, logger=None, verbose=True, unk='<unk>', max_sequence_length=512, batch_size=None, sampler_builder=None, gradient_accumulation=1, devices: Optional[Union[float, int, List[int]]] = None, transform=None, secondary_encoder=None, **kwargs)[source]¶

Fit to data, triggers the training procedure. For training set and dev set, they shall be local or remote files.

Parameters

trn_data – Training set.
dev_data – Development set.
save_dir – The directory to save trained component.
batch_size – The number of samples in a batch.
epochs – Number of epochs.
devices – Devices this component will live on.
logger – Any logging.Logger instance.
seed – Random seed to reproduce this training.
finetune – True to load from save_dir instead of creating a randomly initialized component. str to specify a different save_dir to load from.
eval_trn – Evaluate training set after each update. This can slow down the training but provides a quick diagnostic for debugging.
_device_placeholder – True to create a placeholder tensor which triggers PyTorch to occupy devices so other components won’t take these devices as first choices.
**kwargs – Hyperparameters used by sub-classes.

Returns

Any results sub-classes would like to return. Usually the best metrics on training set.

fit_dataloader(trn, optimizer, scheduler, criterion, epoch, logger, history: hanlp.common.structure.History, transformer_optimizer=None, transformer_scheduler=None, gradient_accumulation=1, **kwargs)[source]¶

Fit onto a dataloader.

Parameters

trn – Training set.
criterion – Loss function.
optimizer – Optimizer.
metric – Metric(s).
logger – Logger for reporting progress.
**kwargs – Other hyper-parameters passed from sub-class.

on_config_ready(**kwargs)[source]¶

Called when config is ready, either during fit or load. Subclass can perform extra initialization tasks in this callback.

Parameters: **kwargs – Not used.

predict(data: Any, batch_size=None, batch_max_tokens=None, conll=True, **kwargs)[source]¶

Predict on data fed by user. Users shall avoid directly call this method since it is not guarded with torch.no_grad and will introduces unnecessary gradient computation. Use __call__ instead.

Parameters

*args – Sentences or tokens.
**kwargs – Used in sub-classes.

HanLP Documentation

biaffine_dep

biaffine_dep¶