Spark MLlib wrapper for the Snowball framework
A Spark datasource for the HadoopOffice library
Schema registry for CSV, TSV, JSON, Avro, and Parquet schemas. Supports schema inference and a GraphQL API.
An independent MapR-DB Connector for Apache Spark that fully utilizes MapR-DB secondary indexes
Deriving Spark DataFrame schemas from case classes
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Spark-Transformers: Library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
An open source framework for building data analytic applications.
SnappyData - The Spark Database. Stream, Transact, Analyze, Predict in one cluster
Spark data source for the Cognite Data Platform
Bagging Estimator for Apache Spark ML
PageRank in Spark
Optics for Spark DataFrames
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Spark RDD to read from and write to HBase
Geospatial Raster support for Spark DataFrames
Generate Scala case classes based on a Spark DataFrame schema
Apache Spark test helper functions with pretty error messages