MLSQL is a SQL-Based language, and it's also a distributed compute engine based on Spark. The design goal of the MLSQL is to unify Big Data and Machine Learning, one language, one platform.
Find more examples on our user guide.
Get PreBuild Distribution
- The lasted version is MLSQL v1.3.0
- You can download from MLSQL Website
- Spark 2.3.2/2.4.3 are tested
Run PreBuild Distribution:
cp streamingpro-spark_2.x-x.x.x.tar.gz /tmp cd /tmp && tar xzvf streamingpro-spark_2.x-x.x.x.tar.gz cd /tmp/streamingpro-spark_2.x-x.x.x ## make sure spark distribution is available ## visit http://127.0.0.1:9003 export SPARK_HOME="....." ; ./start-local.sh
# clone project git clone https://github.com/allwefantasy/streamingpro . cd streamingpro ## configure build envs export MLSQL_SPARK_VERSIOIN=2.4 export DRY_RUN=false export DISTRIBUTION=false ## build ./dev/package.sh
Fork and Contribute
If you are planning to contribute to this repository, we first request you to create an issue at our Issue page even if the topic is not related to source code itself (e.g., documentation, new idea and proposal).
This is an active open source project for everyone, and we are always open to people who want to use this system or contribute to it. This guide document introduce how to contribute to MLSQL.
- Zhu William/allwefantasy#gmail.com
- Chen Fu/cfmcgrady#gmail.com
- Geng Yifei/pigeongeng#gmail.com