Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. It has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution and and its optimizer.
- Tajo Wiki
- Getting Started
- Query Language
- Configuration Guide
- Backup and Restore Guide
- Functions
- Tajo Interactive Shell
- Java 1.8 or higher
- Hadoop 2.3.0 or higher
- [email protected] - To discuss and ask general development issues.
- [email protected] - To discuss and ask end-user questions/issues.
- [email protected] - To see notifications made in the Tajo issue tracking system, review board, and Jenkins CI.
- [email protected] - To monitor commits to the source repository.
To subscribe to the mailing lists, please send an email to:
${listname}[email protected]
For example, to subscribe to dev, send an email from your desired subscription address to:
and follow the instructions from there.