The project uses Java 11, with Zulu OpenJDK as the preferred JDK distribution. To switch to the project's JDK whenever working with this repository, install SDKMAN and run the following command in the project root directory:

```sh
sdk env
```

The project uses sbt v1.7.+ for building and running tests; sbt can be downloaded by following the instructions here.
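For reference, `sdk env` reads the `.sdkmanrc` file in the project root and switches the shell to the JDK pinned there. Its contents take the following form (the exact version identifier shown is illustrative; `sdk list java` shows the identifiers available locally):

```
# .sdkmanrc — consumed by `sdk env` when run in the project root
java=11.0.21-zulu
```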
The plugin is built and tested against Spark v3.3.0 and Hadoop v3.3.+. Instructions for setting up a Hadoop and Spark installation on a machine with VEs attached can be found on the project website, [here](https://sparkcyclone.io/docs/spark-sql/getting-started/hadoop-and-spark-installation-guide) and here. In addition, instructions for configuring a local (custom) installation of Spark against an established Hadoop cluster can be found here.
For Windows, make sure Hadoop is configured as per Hadoop on Windows, and set the appropriate `HADOOP_HOME` (use winutils as needed). The files should look like this:

```
C:/hadoop-3.2.1/bin/hadoop.dll
...
```

Also add the `bin` directory to the `PATH`.
For cluster-mode/detection tests that run in the `VectorEngine` scope, make sure that `$SPARK_HOME/work` is writable:

```sh
$ mkdir -p /opt/spark/work && chmod -R 777 /opt/spark/work
```

Instructions can be found here to lower the latency of SSH connections, which is likely needed for software development involving VEs on a remote server (in general, a roughly 40% decrease in latency can be observed).
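The linked latency instructions are not reproduced here, but one common way to cut the per-connection overhead of repeated SSH, scp, and rsync sessions to a remote VE server is OpenSSH connection multiplexing. A sketch of the relevant `~/.ssh/config` entry (the host alias is illustrative, and `~/.ssh/sockets` must exist, e.g. via `mkdir -p ~/.ssh/sockets`):

```
# ~/.ssh/config — reuse one TCP/auth handshake across sessions to the same host
Host ve-server
    ControlMaster auto
    ControlPath ~/.ssh/sockets/%r@%h-%p
    ControlPersist 10m
```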
The sbt console should be launched with a large amount of heap memory available:

```sh
SBT_OPTS="-Xmx16g" sbt
```

To build the plugin, simply run the following in the sbt console:

```
show assembly
```

The location of the assembled fat JAR will be displayed.
A shortcut is provided in the sbt console to copy the built plugin JAR to a pre-determined directory in the filesystem:

```
// Copy the JAR to /opt/cyclone/${USER}/
deploy
```

See Testing and CI for more information on how to run Spark Cyclone tests at different levels.
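As a usage sketch, the deployed JAR can then be attached to a Spark application via `--jars`; the JAR filename and application JAR below are illustrative placeholders, not names taken from this document:

```sh
# Attach the assembled plugin JAR to a Spark job (filenames illustrative)
$SPARK_HOME/bin/spark-submit \
  --jars /opt/cyclone/${USER}/spark-cyclone-sql-plugin.jar \
  your-application.jar
```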