thaitextaug.word2vec¶

Modules¶

class thaitextaug.word2vec.Word2VecAug(model: str, tokenize: object, type: str = 'file')¶

augment(sentence: str, n_sent: int = 1, p: float = 0.7) → List[Tuple[str]]¶

Parameters

Returns

list of synonyms

Return type

List[Tuple[str]]

modify_sent(sent, p=0.7) → List[List[str]]¶

Parameters

Return type

List[List[str]]

class thaitextaug.word2vec.BPEmbAug(lang: str = 'th', vs: int = 100000, dim: int = 300)¶

Thai Text Augment using word2vec from BPEmb

augment(sentence: str, n_sent: int = 1, p: float = 0.7) → List[Tuple[str]]¶

Text Augment using word2vec from BPEmb

Parameters

Returns

list of synonyms

Return type

List[Tuple[str]]

tokenizer(text: str) → List[str]¶

class thaitextaug.word2vec.Thai2fitAug¶

Text Augment using word2vec from Thai2Fit

augment(sentence: str, n_sent: int = 1, p: float = 0.7) → List[Tuple[str]]¶

Text Augment using word2vec from Thai2Fit

Parameters

Returns

list of text augment

Return type

List[Tuple[str]]

tokenizer(text: str) → List[str]¶