helgeho / archivespark   3.0

MIT License GitHub

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Scala versions: 2.11

Latest version

[![archivespark Scala version support](https://index.scala-lang.org/helgeho/archivespark/archivespark/latest.svg)](https://index.scala-lang.org/helgeho/archivespark/archivespark)

JVM badge

[![archivespark Scala version support](https://index.scala-lang.org/helgeho/archivespark/archivespark/latest-by-scala-version.svg?platform=jvm)](https://index.scala-lang.org/helgeho/archivespark/archivespark)