A one-stop integration framework for massive data

What is Apache InLong?
Features
When should I use InLong?
Build InLong
Deploy InLong
Contribute to InLong
Contact Us
Documentation
License

What is Apache InLong?

Stargazers Over Time	Contributors Over Time

Apache InLong is a one-stop integration framework for massive data that provides automatic, secure and reliable data transmission capabilities. InLong supports both batch and stream data processing at the same time, which offers great power to build data analysis, modeling and other real-time applications based on streaming data.

InLong (应龙) is a divine beast in Chinese mythology who guides the river into the sea, and it is regarded as a metaphor of the InLong system for reporting data streams.

InLong was originally built at Tencent, which has served online businesses for more than 8 years, to support massive data (data scale of more than 80 trillion pieces of data per day) reporting services in big data scenarios. The entire platform has integrated 5 modules: Ingestion, Convergence, Caching, Sorting, and Management, so that the business only needs to provide data sources, data service quality, data landing clusters and data landing formats, that is, the data can be continuously pushed from the source to the target cluster, which greatly meets the data reporting service requirements in the business big data scenario.

For getting more information, please visit our project documentation at https://inlong.apache.org/.

Features

Apache InLong offers a variety of features:

Ease of Use: a SaaS-based service platform. Users can easily and quickly report, transfer, and distribute data by publishing and subscribing to data based on topics.
Stability & Reliability: derived from the actual online production environment. It delivers high-performance processing capabilities for 10 trillion-level data streams and highly reliable services for 100 billion-level data streams.
Comprehensive Features: supports various types of data access methods and can be integrated with different types of Message Queue (MQ). It also provides real-time data extract, transform, and load (ETL) and sorting capabilities based on rules. InLong also allows users to plug features to extend system capabilities.
Service Integration: provides unified system monitoring and alert services. It provides fine-grained metrics to facilitate data visualization. Users can view the running status of queues and topic-based data statistics in a unified data metric platform. Users can also configure the alert service based on their business requirements so that users can be alerted when errors occur.
Scalability: adopts a pluggable architecture that allows you to plug modules into the system based on specific protocols. Users can replace components and add features based on their business requirements.

When should I use InLong?

InLong is based on MQ and aims to provide a one-stop, practice-tested module pluggable integration framework for massive data, based on this system, users can easily build stream-based data applications. It is suitable for environments that need to quickly build a data reporting platform, as well as an ultra-large-scale data reporting environment that InLong is very suitable for, and an environment that needs to automatically sort and land the reported data.

You can use InLong in the following ways：

Integrate InLong, manage data streams through SDK.
Use the InLong command-line tool to view and create data streams.
Visualize your operations on InLong dashboard.

Supported Data Nodes (Updating)

Type	Name	Version	Architecture
Extract Node	Auto Push	None	Standard
	File	None	Standard
	Kafka	2.x	Lightweight, Standard
	MongoDB	>= 3.6	Lightweight, Standard
	MQTT	>= 3.1	Standard
	MySQL	5.6, 5.7, 8.0.x	Lightweight, Standard
	Oracle	11,12,19	Lightweight
	PostgreSQL	9.6, 10, 11, 12	Lightweight, Standard
	Pulsar	2.8.x	Lightweight
	Redis	2.6.x	Standard
	SQLServer	2012, 2014, 2016, 2017, 2019	Lightweight, Standard
Load Node	Auto Consumption	None	Standard
	ClickHouse	20.7+	Lightweight, Standard
	Elasticsearch	6.x, 7.x	Lightweight, Standard
	Greenplum	4.x, 5.x, 6.x	Lightweight, Standard
	HBase	2.2.x	Lightweight, Standard
	HDFS	2.x, 3.x	Lightweight, Standard
	Hive	1.x, 2.x, 3.x	Lightweight, Standard
	Iceberg	0.12.x	Lightweight, Standard
	Hudi	0.12.x	Lightweight, Standard
	Kafka	2.x	Lightweight, Standard
	MySQL	5.6, 5.7, 8.0.x	Lightweight, Standard
	Oracle	11, 12, 19	Lightweight, Standard
	PostgreSQL	9.6, 10, 11, 12	Lightweight, Standard
	SQLServer	2012, 2014, 2016, 2017, 2019	Lightweight, Standard
	TDSQL-PostgreSQL	10.17	Lightweight, Standard
	Doris	>= 0.13	Lightweight, Standard
	StarRocks	>= 2.0	Lightweight, Standard
	Kudu	>= 1.12.0	Lightweight, Standard
	Redis	>= 3.0	Lightweight, Standard

Build InLong

More detailed instructions can be found at Quick Start section in the documentation.

Requirements:

Java JDK 8
Maven 3.6.1+
Docker 19.03.1+

Compile and install:

mvn clean install -DskipTests

(Optional) Compile using docker image:

docker pull maven:3.6-openjdk-8
docker run -v `pwd`:/inlong  -w /inlong maven:3.6-openjdk-8 mvn clean install -DskipTests

after compile successfully, you could find distribution file at inlong-distribution/target.

Deploy InLong

Develop InLong

Contribute to InLong

Report any issue on GitHub Issue
Code pull request according to How to contribute.

Contact Us

Join Apache InLong mailing lists:

Name Scope

[email protected] Development-related discussions Subscribe Unsubscribe Archives
Ask questions on Apache InLong Slack

Documentation

Home page: https://inlong.apache.org/
Issues: https://github.com/apache/inlong/issues

Name		Name	Last commit message	Last commit date
Latest commit History 3,106 Commits
.github		.github
.idea		.idea
bin		bin
codestyle		codestyle
conf		conf
docker		docker
inlong-agent		inlong-agent
inlong-audit		inlong-audit
inlong-common		inlong-common
inlong-dashboard		inlong-dashboard
inlong-dataproxy		inlong-dataproxy
inlong-distribution		inlong-distribution
inlong-manager		inlong-manager
inlong-sdk		inlong-sdk
inlong-sort-standalone		inlong-sort-standalone
inlong-sort		inlong-sort
inlong-tools/grafana/dashboards		inlong-tools/grafana/dashboards
inlong-tubemq		inlong-tubemq
licenses		licenses
.asf.yaml		.asf.yaml
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.licenserc.yaml		.licenserc.yaml
CHANGES.md		CHANGES.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
SECURITY.md		SECURITY.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A one-stop integration framework for massive data

What is Apache InLong?

Features

When should I use InLong?

Supported Data Nodes (Updating)

Build InLong

Deploy InLong

Develop InLong

Contribute to InLong

Contact Us

Documentation

License

About

Releases

Packages

Languages

License

lucaspeng12138/incubator-inlong

Folders and files

Latest commit

History

Repository files navigation

A one-stop integration framework for massive data

What is Apache InLong?

Features

When should I use InLong?

Supported Data Nodes (Updating)

Build InLong

Deploy InLong

Develop InLong

Contribute to InLong

Contact Us

Documentation

License

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages