Xskipper is an Extensible Data Skipping Framework for Apache Spark.
To get started, see the Quick Start Guide.
See the Xskipper site for more information.
Run as a project
To build a project using the Xskipper binaries from the Maven Central Repository, use the following Maven coordinates:
Include Xskipper in a Maven project by adding it as a dependency in the project's POM file. Use the artifact built for Scala 2.12, as indicated by the _2.12 suffix of the artifact ID.
<dependency>
  <groupId>io.xskipper</groupId>
  <artifactId>xskipper-core_2.12</artifactId>
  <version>1.2.3</version>
</dependency>
Include Xskipper in an SBT project by adding the following line to its build.sbt file:
libraryDependencies += "io.xskipper" %% "xskipper-core" % "1.2.3"
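For context, the dependency line above might sit in a build.sbt like the following. This is a minimal sketch, not the project's official build definition: the project name, the exact Scala patch version, the choice of the spark-sql module, and the "provided" scope are all assumptions made for illustration. Only the Xskipper coordinates, the Scala 2.12 line, and the Spark 3.0.0 requirement come from this README.

```scala
// build.sbt (illustrative sketch; adjust to your own project)

// Hypothetical project name, used only for this example.
name := "xskipper-example"

// Any 2.12.x release should match the xskipper-core_2.12 artifact;
// 2.12.10 is an assumed patch version.
scalaVersion := "2.12.10"

libraryDependencies ++= Seq(
  // Spark 3.0.0 is the version this README lists as required.
  // "provided" assumes Spark is supplied by the cluster at runtime;
  // drop the scope when running locally from SBT.
  "org.apache.spark" %% "spark-sql" % "3.0.0" % "provided",

  // Xskipper coordinates as given above; %% appends the _2.12 suffix.
  "io.xskipper" %% "xskipper-core" % "1.2.3"
)
```

With `%%`, SBT derives the artifact suffix from `scalaVersion`, so keeping the Scala version on the 2.12 line is what makes this resolve to the same artifact as the Maven snippet.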
Xskipper is compiled using SBT.
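To compile, run the standard SBT compile task (sbt compile).
To generate artifacts, run the standard SBT package task (sbt package).
To execute tests, run the standard SBT test task (sbt test).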
Refer to the SBT documentation for more commands.
Xskipper tracks issues in GitHub and prefers to receive contributions as pull requests.
Xskipper currently requires Apache Spark 3.0.0.
- IEEE Big Data 2020 paper - Extensible Data Skipping (arXiv version)
Apache License 2.0, see LICENSE.
This software has been developed under the BigDataStack project, as part of the holistic solution for big data applications and operations. BigDataStack has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 779747.