Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
REST job server for Apache Spark
A Scala API for Apache Beam and Google Cloud Dataflow.
DataStax Spark Cassandra Connector
Deploy fat JARs. Restart processes. (port of codahale/assembly-sbt)
Microsoft Machine Learning for Apache Spark
Elasticsearch Scala Client - Reactive, Non Blocking, Type Safe, HTTP Client
The Guardian’s image management system
The Programming Language Designed For Big Data and AI
GeoTrellis is a geographic data processing engine for high performance applications.
Waves node application
TensorFlow API for the Scala Programming Language
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
Avro schema generation and serialization / deserialization for Scala
Scorex 2.0 Core
The missing MatPlotLib for Scala + Spark
An sbt plugin to create awesome microsites for your project