Indexed Artifacts (28.7M)

Popular Categories

Artifacts using WebArchive Commons (35)

Sort: popular | newest
The Slimmed Down LOCKSS Daemon Core
Last Release on Dec 19, 2021
The Archive Commons Code Libraries project contains general Java utility libraries, as used by the Heritrix crawler and other projects.
Last Release on Sep 23, 2021
LOCKSS repository core infrastructure
Last Release on Jan 25, 2022
OpenWayback Core Java Classes
Last Release on Mar 19, 2021
LOCKSS Repository Service
Last Release on Jan 25, 2022
WARC Hadoop Recordreaders
Last Release on Nov 27, 2020
LOCKSS repository client
Last Release on May 9, 2019
NLPA is a framework designed to operate in conjuction with BDP4J (https://github.com/sing-group/bdp4j) and able to extract texts from Twitter, Youtube Comments, text files, raw email files (.eml) or WARC (Web Archive) files. The extracted text can be preprocessed into a Dataset using task (org.bdp4j.pipe.Pipe) definitions. This framework incorporates more than 30 preprocessing tasks to transform the text.
Last Release on Jul 26, 2021
LOC Wayback
Last Release on Mar 14, 2018
LOCKSS Configuration Service
Last Release on Jan 3, 2022