It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and ...
| License | GPL 3.0 |
|---|---|
| Categories | Word Embedding |
| Tags | nlpembeddingtextrlangcranaiword |
| HomePage | https://github.com/mlampros/textTinyR 🔍 Inspect URL |
| Ranking | #726929 in MvnRepository (See Top Artifacts) #14 in Word Embedding |
| Version ▼ | Vulnerabilities | Repository | Usages | Date | |
|---|---|---|---|---|---|
1.1.x | 1.1.0-b2 | BeDataDriven |
0
| May 12, 2022 | |
| 1.1.0-b1 | BeDataDriven |
0
| May 12, 2022 | ||
1.0.x | 1.0.8-b1 | BeDataDriven |
0
| May 12, 2022 | |
| 1.0.7-b6 | BeDataDriven |
0
| May 12, 2022 | ||
| 1.0.7-b5 | BeDataDriven |
0
| May 12, 2022 |