-
microsoft/synapseml 1.0.7
Simple and Distributed Machine Learning
Scala versions: 2.12 -
johnsnowlabs/spark-nlp 5.5.0
State of the Art Natural Language Processing
Scala versions: 2.12 -
h2oai/sparkling-water 2.4.13
Sparkling Water provides H2O functionality inside a Spark cluster
Scala versions: 2.11 -
azure/azure-cosmosdb-spark 3.7.0
Apache Spark Connector for Azure Cosmos DB
Scala versions: 2.11 -
jelmerk/hnswlib 1.1.2
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Scala versions: 2.13 2.12 2.11 -
databrickslabs/automl-toolkit 0.7.2
Toolkit for Apache Spark ML providing feature clean-up, a feature importance calculation suite, information gain selection, distributed SMOTE, model selection and training, hyperparameter optimization and selection, and model interpretability.
Scala versions: 2.11 -
isarn/isarn-sketches-spark 0.6.0-sp3.2
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Scala versions: 2.12 -
astrolabsoftware/spark3d 0.3.1
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Scala versions: 2.11 -
sb-ai-lab/replay
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
-
ozancicek/artan 0.5.1
Online latent state estimation with Spark
Scala versions: 2.12