Pinax

Pinax is a Java tool that manages structured files as data sources with a schema.

A structured file is a text file in which each line is a record and the fields of each record are separated by a delimiter (e.g. tab, comma, pipe). After a data set is registered, the system creates a table that shows the names of the file's columns and their types, making it easier for the user to construct an SQL query.
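To make the idea of "column names and types" concrete, here is a minimal sketch of how a schema could be derived from a delimited file. It is not Pinax's actual implementation: the class `SchemaSketch`, its methods, and the three type names (`INT`, `DOUBLE`, `STRING`) are hypothetical, and the sketch assumes that the first line of the file is a header row.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the Pinax code base.
public class SchemaSketch {

    /** Classifies a single field value as INT, DOUBLE, or STRING. */
    static String inferType(String value) {
        try {
            Integer.parseInt(value.trim());
            return "INT";
        } catch (NumberFormatException ignored) {
            // not an integer, try a wider type
        }
        try {
            Double.parseDouble(value.trim());
            return "DOUBLE";
        } catch (NumberFormatException ignored) {
            // not numeric at all
        }
        return "STRING";
    }

    /** Combines the types seen so far in a column into the narrowest type that fits both. */
    static String widen(String a, String b) {
        if (a.equals(b)) return a;
        if (a.equals("STRING") || b.equals("STRING")) return "STRING";
        return "DOUBLE"; // INT mixed with DOUBLE
    }

    /** Assumes lines.get(0) is a header row; returns column name -> inferred type, in file order. */
    static Map<String, String> inferSchema(List<String> lines, String separator) {
        String sep = Pattern.quote(separator); // treat "|" and similar literally, not as regex
        String[] names = lines.get(0).split(sep, -1);
        Map<String, String> schema = new LinkedHashMap<>();
        for (String line : lines.subList(1, lines.size())) {
            String[] fields = line.split(sep, -1);
            for (int i = 0; i < names.length && i < fields.length; i++) {
                schema.merge(names[i], inferType(fields[i]), SchemaSketch::widen);
            }
        }
        // Columns that never received a value default to STRING.
        for (String name : names) {
            schema.putIfAbsent(name, "STRING");
        }
        return schema;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
                "id|name|price",
                "1|apple|0.5",
                "2|pear|2");
        inferSchema(lines, "|")
                .forEach((name, type) -> System.out.println(name + "\t" + type));
    }
}
```

For the three sample lines, the sketch reports `id` as `INT`, `name` as `STRING`, and `price` as `DOUBLE`, since a column that mixes integer and decimal values is widened to the decimal type.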

Setup

Eclipse Installation Requirements

  • Install Eclipse
  • Import the project as a Maven project.

Apache Spark Installation Requirements

Maven

The project uses a Maven wrapper, so there is no need to install Maven on your system, as long as the JAVA_HOME environment variable points to your Java 8 installation and the SPARK_HOME environment variable points to your Spark installation folder.
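Before building, it can help to confirm that both variables are visible to the JVM. The following is a small sketch for that purpose; the class `EnvCheck` and its method are hypothetical and not shipped with the project. It only checks that the variables are set, not that they point to valid Java 8 or Spark installations.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper; not part of the Pinax code base.
public class EnvCheck {

    /** Returns the names of the required variables that are unset or blank in env. */
    static List<String> missingVariables(Map<String, String> env) {
        List<String> missing = new ArrayList<>();
        for (String name : new String[] {"JAVA_HOME", "SPARK_HOME"}) {
            String value = env.get(name);
            if (value == null || value.trim().isEmpty()) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        List<String> missing = missingVariables(System.getenv());
        if (missing.isEmpty()) {
            System.out.println("JAVA_HOME and SPARK_HOME are set.");
        } else {
            System.out.println("Missing environment variables: " + missing);
        }
    }
}
```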

Video Tutorial

A video tutorial of the tool can be found here