A structured file is a kind of text file, with lines, where each line is a record, the fields of which are separated by a separator (eg. tabs, comma, pipe, etc). After registering a data set, the system creates a table that shows the names of the file's columns and their type, making it easier for the user to construct a sql query.
- Install Eclipse
- Import the project as a Maven project.
- Install Spark
The project uses a Maven wrapper so there is no need to install it to your system as long as you have the JAVA_HOME environmental variable pointing to your Java 8 installation and SPARK_HOME enviromental variable pointing to your Spark installation folder.
A video tutorial of the tool can be found here