Skip to content

A system for handling structured files as data sources with a schema

Notifications You must be signed in to change notification settings

DAINTINESS-Group/Pinax

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pinax

Java tool that manages structured files as data sources with a schema.

A structured file is a kind of text file, with lines, where each line is a record, the fields of which are separated by a separator (eg. tabs, comma, pipe, etc). After registering a data set, the system creates a table that shows the names of the file's columns and their type, making it easier for the user to construct a sql query.

Setup

Eclipse Installation Requirements

  • Install Eclipse
  • Import the project as a Maven project.

Apache Spark Installation Requirements

Maven

The project uses a Maven wrapper so there is no need to install it to your system as long as you have the JAVA_HOME environmental variable pointing to your Java 8 installation and SPARK_HOME enviromental variable pointing to your Spark installation folder.

Video Tutorial

A video tutorial of the tool can be found here

About

A system for handling structured files as data sources with a schema

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages