It offers functions for splitting, parsing, and tokenizing big text data files and for creating a vocabulary from them. It also includes functions for building a document-term matrix and extracting information from it (term associations, most frequent terms), for calculating token statistics (collocations, look-up tables, string dissimilarities), and for working with sparse matrices. Lastly, it includes functions for word vector representations (e.g. 'GloVe', 'fasttext') and ...
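
To make the document-term matrix idea concrete, here is a minimal sketch in plain Python. It is only an illustration of the concept and does not use or mirror the package's own API: the function names `tokenize`, `build_dtm`, and `most_frequent_terms` are hypothetical, the tokenizer is a naive whitespace split, and the matrix is dense rather than sparse.

```python
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (word.strip(".,;:!?\"'()") for word in text.lower().split())
    return [t for t in tokens if t]


def build_dtm(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in document i."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    # The vocabulary is the sorted set of all terms seen in any document.
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, matrix


def most_frequent_terms(vocabulary, matrix, n=3):
    """Sum each column of the matrix and return the n highest-count terms."""
    totals = [sum(column) for column in zip(*matrix)]
    # Sort by descending count, breaking ties alphabetically.
    ranked = sorted(zip(vocabulary, totals), key=lambda pair: (-pair[1], pair[0]))
    return ranked[:n]


docs = ["The cat sat on the mat.", "The dog sat."]
vocab, dtm = build_dtm(docs)
print(vocab)
print(dtm)
print(most_frequent_terms(vocab, dtm, n=2))
```

Each row of the matrix is a document and each column a vocabulary term. For large corpora most entries are zero, which is why a sparse representation is normally preferred over the nested lists used here.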

License: GPL 3.0
Categories: Word Embedding
Tags: nlp, embedding, text, rlang, cran, ai, word
HomePage: https://github.com/mlampros/textTinyR
Ranking: #726929 in MvnRepository, #14 in Word Embedding

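| Series | Version | Vulnerabilities | Repository | Usages | Date |
|---|---|---|---|---|---|
| 1.1.x | 1.1.0-b2 | | BeDataDriven | 0 | May 12, 2022 |
| 1.1.x | 1.1.0-b1 | | BeDataDriven | 0 | May 12, 2022 |
| 1.0.x | 1.0.8-b1 | | BeDataDriven | 0 | May 12, 2022 |
| 1.0.x | 1.0.7-b6 | | BeDataDriven | 0 | May 12, 2022 |
| 1.0.x | 1.0.7-b5 | | BeDataDriven | 0 | May 12, 2022 |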