Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
REST job server for Apache Spark
A Scala API for Apache Beam and Google Cloud Dataflow.
Deploy fat JARs. Restart processes. (port of codahale/assembly-sbt)
DataStax Spark Cassandra Connector
Microsoft Machine Learning for Apache Spark
Elasticsearch Scala Client - Reactive, Non Blocking, Type Safe, HTTP Client
The Guardian’s image management system
The Programming Language Designed For Big Data and AI
Waves node application
GeoTrellis is a geographic data processing engine for high performance applications.
A testing tool for Scala and Java developers
TensorFlow API for the Scala Programming Language
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
Avro schema generation and serialization / deserialization for Scala
Scorex 2.0 Core
The DAML smart contract language
The missing MatPlotLib for Scala + Spark
An sbt plugin to create awesome microsites for your project