Releases: makcedward/nlpaug
Releases · makcedward/nlpaug
Release 0.0.10
- Add aug_max to control maximum number of augmented item
- Fix ContextualWordEmbsAug (for BERT) error when input is longer than max sequence length
- Add RandomWordAug Substitute action
- Fix ContextualWordEmbsAug error when no augmented data
- Support multi thread processing (for CPU only) to speed up the augmentation
- Fix KeyboardAug error #55
Release 0.0.9
- Added Swap Mode (adjacent, middle and random) for RandomAug (character level)
- Added SynonymAug (WordNet/ PPDB) and AntonymAug (WordNet)
- WordNetAug is deprecated. Uses SynonymAug instead
- Introduce parameter n. Returning more than 1 augmented data. Changing output format from text (or numpy) to list of text (or numpy) if n > 1
- Introduce parameter temperature in ContextualWordEmbsAug and ContextualWordEmbsForSentenceAug to control the randomness
- aug_n parameter is deprecated. This parameter will be replaced by top_k parameter
- Fixed tokenization issue #48
- Upgraded transformers dependency (or pytorch_transformer) to 2.0.0
- Upgraded PyTorch dependency to 1.2.0
- Added SplitAug
Release 0.0.8
- BertAug is replaced by ContextualWordEmbsAug
- Support GPU (for ContextualWordEmbsAug only) #26
- Upgraded pytorch_transformer to 1.1.0 version #33
- ContextualWordEmbsAug suuports both BERT and XLNet model
- Removed librosa dependency
- Add ContextualWordEmbsForSentenceAug for generating next sentence
- Fix sampling issue #38
Release 0.0.7
- Add new augmenter (CropAug, LoudnessAug, MaskAug)
- QwertyAug is deprecated. It will be replaced by KeyboardAug
- Remove StopWordsAug. It will be replaced by RandomWordAug
- Code refactoring
- Added model download function for word2vec, GloVe and fasttext
0.0.6: Merge pull request #17 from makcedward/dev
0.0.6 release