-
microsoft/synapseml
Simple and Distributed Machine Learning
Scala versions: 2.12 2.11 -
johnsnowlabs/spark-nlp
State of the Art Natural Language Processing
Scala versions: 2.12 2.11 -
h2oai/sparkling-water
Sparkling Water provides H2O functionality inside Spark cluster
Scala versions: 2.12 2.11 2.10 -
azure/azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
Scala versions: 2.11 2.10 -
jelmerk/hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Scala versions: 2.13 2.12 2.11 -
databrickslabs/automl-toolkit
Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Model selection and training, Hyper parameter optimization and selection, Model interprability.
Scala versions: 2.11 -
isarn/isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Scala versions: 2.12 2.11 2.10 -
astrolabsoftware/spark3d
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Scala versions: 2.11 -
ozancicek/artan
Online latent state estimation with Spark
Scala versions: 2.12 2.11 -
sb-ai-lab/replay
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
Scala versions: 2.12