epiconcept-paris / sparkly   1.0

MIT License GitHub

A set of utilities for developing portable machine learning Apache Spark applications, developed by Epiconcept

Scala versions: 2.12

Sparkly: portable machine learning Spark applications

A set of utilities for developing portable machine learning Apache Spark applications, developed by Epiconcept.

Sparkly site

Report bugs & issues

Main functionalities

  • Simplifies parameter management
  • Implements a set of ML transformers that extend Spark ML
  • Implements a storage-independent API for manipulating files produced by Spark jobs
  • Adds a new grid search with reporting for AI modules
  • Integrates Apache Lucene with DataFrames to allow fuzzy search
  • Integrates Apache Lucene to implement index fetch datasets