An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
This release consists of two files:
archivespark-assembly-3.0.1.jar
contains the actual ArchiveSpark classesarchivespark-assembly-3.0.1-deps.jar
provides all required dependencies as one big packageThe release is also available on Maven Central: https://search.maven.org/#artifactdetails|com.github.helgeho|archivespark_2.11|3.0.1|jar
This release consists of two files:
archivespark-assembly-3.0.jar
contains the actual ArchiveSpark classesarchivespark-assembly-3.0-deps.jar
provides all required dependencies as one big packageThe release is also available on Maven Central: https://search.maven.org/#artifactdetails|com.github.helgeho|archivespark_2.11|3.0|jar
This release consists of two files:
archivespark-assembly-2.7.6.jar
contains the actual ArchiveSpark classesarchivespark-assembly-2.7.6-deps.jar
provides all required dependencies as one big packageThe release is also available on Maven Central: https://search.maven.org/#artifactdetails|com.github.helgeho|archivespark_2.11|2.7.6|jar
This release consists of two files:
archivespark-assembly-2.7.5.jar
contains the actual ArchiveSpark classesarchivespark-assembly-2.7.5-deps.jar
provides all required dependencies as one big packageThe release is also available on Maven Central: https://search.maven.org/#artifactdetails|com.github.helgeho|archivespark_2.11|2.7.5|jar
This release consists of two files:
archivespark-assembly-2.7.jar
contains the actual ArchiveSpark classesarchivespark-assembly-2.7-deps.jar
provides all required dependencies as one big packageThe release is also available on Maven Central: https://search.maven.org/#artifactdetails|com.github.helgeho|archivespark_2.11|2.7|jar