Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow
Lightweight Scala kernel for Jupyter / IPython 3
A Scala feature transformation library for data science and machine learning
Process authoring tool for Apache Flink
SANSA RDF Library
flink-jpmml is a library for dynamic real-time machine learning predictions, built on top of PMML standard models and the Apache Flink streaming engine
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
A Spark datasource for the HadoopOffice library
SANSA Stack OWL (Web Ontology Language) API
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
SANSA Machine Learning Layer
A type class for data of all sizes
CodeFeedr core infrastructure
Scala ADT support for Apache Flink
Pipes the result of an Elasticsearch query into a Flink data set
SANSA Parent Project for managing common dependencies, plugins, meta-data, properties, etc.