Upserts, Deletes And Incremental Processing on Big Data.
The Programming Language Designed For Big Data and AI
C# and F# language binding and extensions to Apache Spark
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
A library for Spark DataFrame using MinIO Select API
🚀 Validation DSL for data pipelines
A remote CLI interface for MapR
Mapflablup is a library to flat ➖ and blowup 🎈 Map Collection