A Spark-based framework for parsing and extracting MediaWiki dumps (Wikipedia, Wiktionary, Wikidata)