Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.
Arc-Jupyter is an interactive Jupyter Notebooks Extenstion for building Arc data pipelines via Jupyter Notebooks.
Provides the MongoDBExtract and MongoDBLoad stages
Provides ElasticsearchExtract and ElasticsearchLoad stages
Creates a list of formatted dates to easily calculate delta processing periods.
Provides the DebeziumTransform stage
Provides the CassandraExtract, CassandraExecute, and CassandraLoad stages
arc-dataquality-udf-plugin defines a set of data quality/validation user defined functions.
Plugin to support extract and load using the spark-bigquery-connector
Provides the CypherTransform and GraphTransform stages
Provides the DeltaLakeExtract and DeltaLakeLoad stages
Provides KafkaExtract, KafkaLoad and KafkaCommitExecute stages
Provides the SASExtract stage
Provides GeoSpark UDFs functionality to Arc.