cookieai / cookie-datasets

Read well-known ML datasets in Apache Spark

GitHub

cookie-datasets

Join the chat at https://gitter.im/cookieai/cookie-datasets Build Status

Welcome! cookie-datasets is a library of DataFrame readers for Apache Spark. The library supports a number of popular data formats in the machine learning community, including MNIST, CIFAR, and IRIS.

Want to learn more? See the wiki.

Help wanted! See the issues section. Feel free to suggest new formats to be supported.