nielsenbe / spark-wiki-parser   1.0

Apache License 2.0 GitHub

A Spark based framework for parsing and extracting Media Wiki dumps (Wikipedia, Wiktionary, Wikidata)

Scala versions: 2.11