-
timvw/adobe-analytics-datafeed-datasource 0.1.0
Apache Spark data source for Adobe Analytics Data Feed
Scala versions: 2.12
-
eto-ai/rikai 0.1.14
Parquet-based ML data format optimized for working with unstructured data
Scala versions: 2.13 2.12
-
zuinnote/hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
-
cdapio/cdap 6.10.1
An open source framework for building data analytic applications.
Scala versions: 2.12
-
turtlemonvh/ionic-spark-utils 0.0.2
Utilities for working with Ionic encryption via Spark.
Scala versions: 2.12 2.11
-
h2oai/h2o-3 3.30.0.3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Scala versions: 2.11
-
chitralverma/sparkml-extensions 0.1
Scala versions: 2.11
-
raistlintao/sparkmodelhelper 1.2.0
Scala library for extracting useful information from a trained Spark model (DecisionTreeClassificationModel)
Scala versions: 2.12
-
mongodb/mongo-spark 10.4.0
The MongoDB Spark Connector
Scala versions: 2.13 2.12
-
a2mz/microspark 2.0
Micro Spark REST API
Scala versions: 2.12 2.11