It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and ...
| License | GPL 3.0 |
|---|---|
| Categories | Word Embedding |
| Tags | nlpembeddingtextrlangcranaiword |
| HomePage | https://github.com/mlampros/textTinyR 🔍 Inspect URL |
| Date | May 12, 2022 |
| Files | pom (6 KB) jar (285 KB) View All |
| Repositories | BeDataDriven |
| Ranking | #726929 in MvnRepository (See Top Artifacts) #14 in Word Embedding |
Note: this artifact is located at BeDataDriven repository (https://nexus.bedatadriven.com/content/groups/public/)
Compile Dependencies (18)
Provided Dependencies (4)
| Category/License | Group / Artifact | Version | Updates | |
|---|---|---|---|---|
| org.renjin » compiler | 0.8.2500 | 0.9.2726 | ||
GPL 2.0 | org.renjin.cran » Rcpp | 0.12.13-renjin-13 | 0.12.13-renjin-15 | |
| org.renjin.cran » BH | 1.62.0-1-b13 | 1.69.0-1-b3 | ||
GPL 2.0 | org.renjin.cran » RcppArmadillo | 0.7.900.2.0-b6 | 0.9.500.2.0-b1 |
Test Dependencies (1)
| Category/License | Group / Artifact | Version | Updates | |
|---|---|---|---|---|
MIT | org.renjin.cran » testthat | 1.0.2-renjin-14 | 1.0.2-renjin-17 |
Licenses
| License | URL |
|---|---|
| GPL-3 |
Developers
| Name | Dev Id | Roles | Organization | |
|---|---|---|---|---|
| Lampros Mouselimis | mouselimislampros<at>gmail.com |