An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Snowplow Enrichment jobs and library
ClickHouse Native Protocol JDBC implementation
Apache Spark based framework for analysis A/B experiments
Streaming data processor with a simple plugin framework and a modernized SQL interface.
The open standard for data logging