Approximate Nearest Neighbors in Spark
sparkml extend library implements calculation algorithm
Divolte default Avro schema to use as external dependency
Impatient fork of Ammonite
Analytics + AI Platform for Apache Spark and BigDL
C4E, a Scala or Spark library for local and distributed Clustering.
Offline Recommender System Evaluation for Spark
Spark-based approximate nearest neighbor search using locality-sensitive hashing
An implementation of DBSCAN runing on top of Apache Spark
Run spark calculations from Ammonite
General Vectorization Lib for Machine Learning Tools
An extension to the amazing Spark framework for better functional programming.
Implementation of Random Ferns for Apache Spark
This project generalizes the Spark MLLIB Batch and Streaming K-Means clusterers in every practical way.
FITS data source for Spark SQL and DataFrames
SANSA Stack OWL (Web Ontology Language) API
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Data model generator based on Scala case classes
A library for Spark DataFrame using MinIO Select API