-
sansa-stack/sansa-inference
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
Scala (JVM): 2.11 -
bizreach/aws-kinesis-scala
Scala client for Amazon Kinesis. Also provides write to Kinesis capability for Apache Spark or Spark Streaming.
Scala (JVM): 2.11 2.12 -
sansa-stack/sansa-query
SANSA Query Layer
Scala (JVM): 2.11 -
sansa-stack/sansa-owl
SANSA Stack OWL (Web Ontology Language) API
Scala (JVM): 2.11 -
helgeho/archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for Web archives and archival collections, developed by the Internet Archive and L3S Research Center.
Scala (JVM): 2.11 -
flipkart-incubator/spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
Scala (JVM): 2.10 2.11 -
ponkin/bloom
Java implementation of probabilistic data structures.
Scala (JVM): 2.11 -
deeplearning4j/nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Scala (JVM): 2.10 2.11 -
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala (JVM): 2.11 -
sansa-stack/sansa-rdf
SANSA RDF Library
Scala (JVM): 2.11 -
microsoft/mobius
C# and F# language binding and extensions to Apache Spark
Scala (JVM): 2.10 2.11 -
interestinglab/waterdrop
An easy-to-use, scalable big data processing tool
Scala (JVM): 2.11 -
h2oai/h2o-3
Open Source Fast Scalable Machine Learning Platform For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), ...)
Scala (JVM): 2.10 2.11