Skip to content

Commit

Permalink
add truncation
Browse files Browse the repository at this point in the history
  • Loading branch information
karinazad committed Mar 4, 2025
1 parent 763f4ab commit 98360cc
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions src/lobster/data/_ume_haha_datamodule.py
Original file line number Diff line number Diff line change
Expand Up @@ -126,6 +126,7 @@ def _get_dataset_node(self, dataset_info: DatasetInfo, split: Split) -> Dataset:
text_key="sequence",
return_tensors="pt",
padding="max_length",
truncation=True,
max_length=self._tokenizer_max_length,
num_workers=self._num_workers,
)
Expand Down

0 comments on commit 98360cc

Please sign in to comment.