Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Lightweight Scala kernel for Jupyter / IPython 3
A Scala feature transformation library for data science and machine learning
A tool for data sampling, data generation, and data diffing
Process authoring tool for Apache Flink
SANSA RDF Library
flink-jpmml is a fresh-made library for dynamic real time machine learning predictions built on top of PMML standard models and Apache Flink streaming engine
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
A Spark datasource for the HadoopOffice library
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
SANSA Machine Learning Layer
SANSA Stack OWL (Web Ontology Language) API
A type class for data of all sizes.
SANSA Parent Project for managing common dependencies, plugins, meta-data, properties, etc.
Allow to pipe the result of an Elasticsearch query into a Flink data set
XGBoost4J for Scala with Mac and Linux binaries