Tools for measuring similarity among documents and detecting passages which have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language.
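
The shingled n-gram tokenization and similarity functions mentioned above can be illustrated with a minimal sketch. This is not the package's own API: the function names `shingle` and `jaccard_similarity` below are hypothetical, the code is Python rather than R, and it assumes plain whitespace tokenization with lowercasing.

```python
def shingle(text, n=3):
    """Return the set of overlapping word n-grams (shingles) in text.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard_similarity(a, b):
    """Jaccard similarity |a & b| / |a | b| of two sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox leaps over the lazy dog"

# The two documents differ in one word, so 4 of the 10 distinct shingles are shared.
print(jaccard_similarity(shingle(doc_a), shingle(doc_b)))  # prints 0.4
```

Comparing every pair of documents this way grows quadratically with corpus size, which is presumably why the package also offers minhash and locality sensitive hashing to narrow down candidate pairs first.
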
| License | MIT |
|---|---|
| Tags | rlangcran |
| HomePage | https://github.com/ropensci/textreuse |
| Ranking | #350918 in MvnRepository |
| Used By | 1 artifact |

| Branch | Version | Vulnerabilities | Repository | Usages | Date |
|---|---|---|---|---|---|
| 0.1.x | 0.1.4-b15 | | BeDataDriven | | May 29, 2022 |
| 0.1.x | 0.1.4-b14 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b13 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b12 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b11 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b10 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b9 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b8 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b7 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b6 | | BeDataDriven | 0 | May 29, 2022 |
| 0.1.x | 0.1.4-b4 | | BeDataDriven | 0 | May 29, 2022 |
