NLPA is a framework designed to operate in conjunction with BDP4J (https://github.com/sing-group/bdp4j). It can extract text from Twitter, YouTube comments, text files, raw email files (.eml), and WARC (Web Archive) files. The extracted text can be preprocessed into a Dataset using task (org.bdp4j.pipe.Pipe) definitions. The framework includes more than 30 preprocessing tasks for transforming the text.
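
The idea of chaining preprocessing tasks over extracted text can be sketched roughly as follows. This is only an illustration: the class TextPipelineSketch and its methods are hypothetical names invented here and are not part of NLPA or of the org.bdp4j.pipe.Pipe API, whose real signatures are not shown on this page. Each task is modeled as a plain function from text to text, applied in order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

// Hypothetical stand-in for a chain of preprocessing tasks.
// This is NOT the real org.bdp4j.pipe.Pipe API; it only mirrors the concept.
final class TextPipelineSketch {
    private final List<UnaryOperator<String>> tasks = new ArrayList<>();

    // Register a task; tasks run in the order they were added.
    TextPipelineSketch add(UnaryOperator<String> task) {
        tasks.add(task);
        return this;
    }

    // Feed the text through every task, passing each result to the next one.
    String run(String text) {
        String current = text;
        for (UnaryOperator<String> task : tasks) {
            current = task.apply(current);
        }
        return current;
    }

    // Example chain (assumed tasks): strip tags, lowercase, normalize whitespace.
    static TextPipelineSketch example() {
        return new TextPipelineSketch()
            .add(s -> s.replaceAll("<[^>]*>", " "))       // replace HTML-like tags with a space
            .add(s -> s.toLowerCase(Locale.ROOT))         // lowercase, locale-independent
            .add(s -> s.trim().replaceAll("\\s+", " "));  // trim and collapse whitespace runs
    }
}
```

Calling TextPipelineSketch.example().run(text) applies the three assumed tasks in sequence. In the real framework the tasks operate on instances that end up in a Dataset rather than on bare strings, so treat this purely as a conceptual sketch.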

License: GPL 3.0
HomePage: https://github.com/sing-group/nlpa
Ranking: #839060 in MvnRepository

Version  Vulnerabilities  Repository  Usages  Date
2.0.x
2.0.0    -                SingGroup   0       Jul 26, 2021
1.0.x
1.0.6    -                SingGroup   0       Dec 17, 2020
1.0.5    -                SingGroup   0       Feb 07, 2020
1.0.4    -                SingGroup   0       Jan 20, 2020
1.0.3    -                SingGroup   0       Jan 20, 2020
1.0.2    -                SingGroup   0       Jan 20, 2020
1.0.1    -                SingGroup   0       Jan 20, 2020
1.0.0    -                SingGroup   0       Sep 19, 2019