An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Scala versions:
2.11
Latest version
[![archivespark Scala version support](https://index.scala-lang.org/helgeho/archivespark/archivespark/latest.svg)](https://index.scala-lang.org/helgeho/archivespark/archivespark)
JVM badge
[![archivespark Scala version support](https://index.scala-lang.org/helgeho/archivespark/archivespark/latest-by-scala-version.svg?platform=jvm)](https://index.scala-lang.org/helgeho/archivespark/archivespark)