11 results
-
feathr-ai/feathr 1.0.0
Feathr – A scalable, unified data and AI engineering platform for enterprise
Scala versions: 2.12 -
swoop-inc/spark-alchemy 1.2.1
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Scala versions: 2.12 -
setl-framework/setl 1.0.0-SNAPSHOT
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.12 2.11 -
treeverse/lakefs 0.14.1
lakeFS - Data version control for your data lake | Git for data
Scala versions: 2.12 -
galliaproject/gallia-core 0.6.1
A schema-aware Scala library for data transformation
Scala versions: 3.x 2.13 2.12 -
starlake-ai/starlake 1.3.0
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
Scala versions: 2.13 2.12 -
coxautomotivedatasolutions/spark-distcp 0.2.5
A re-implementation of Hadoop DistCP in Apache Spark
Scala versions: 2.13 -
vitaliihonta/scala-ql 0.1.0
Data manipulation and reporting for Scala.
Scala versions: 3.x 2.13 2.12 -
prophecy-io/spark-ai 0.1.9
Toolbox for building Generative AI applications on top of Apache Spark.
Scala versions: 2.12 -
harrystech/hyppo-worker 0.7.5
The hyppo data ingestion system worker components
Scala versions: 2.11