Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
Lightweight Scala kernel for Jupyter / IPython 3
A Scala feature transformation library for data science and machine learning
A tool for data sampling, data generation, and data diffing
Process authoring tool for Apache Flink
SANSA RDF Library
flink-jpmml is a fresh-made library for dynamic real time machine learning predictions built on top of PMML standard models and Apache Flink streaming engine
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
A Spark datasource for the HadoopOffice library
SANSA Machine Learning Layer
SANSA Stack OWL (Web Ontology Language) API
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
A type class for data of all sizes.
SANSA Parent Project for managing common dependencies, plugins, meta-data, properties, etc.
Allow to pipe the result of an Elasticsearch query into a Flink data set
XGBoost4J for Scala with Mac and Linux binaries