Skip to content

Latest commit

 

History

History
121 lines (91 loc) · 6.23 KB

README_en.md

File metadata and controls

121 lines (91 loc) · 6.23 KB

##Elastic-Job - distributed scheduled job solution License: Hex.pm

Maven Central: Maven Status Elastic-Job is extracted from dd-job which is a component of ddframe. Elastic-Job just removed monitor and integrated standards parts from dd-job. It based on matured open-source productions like Quartz, Zookeeper and its client Curator.

Other components of ddframe also can open-source independently. Dangdang has already released DubboX, which is core component of dd-soa.

The relationship between Elastic-Job and ddframe is in this picture:

Evolution diagram of ddframe

##Contributors

Features

  • Scheduled job: Based on CRON expression to execute jobs.
  • Registry center: Based on Zookeeper and its client Curator to implement global job register center, use to register, monitor control and coordinate distributed jobs.
  • Sharding: Split single task to many task items, execute parallel on multiple servers.
  • Scalability: Server crashed or new server online, elastic-job will re-sharding when next job trigger, will not affect current running jobs.
  • Multiple job modes: Now support Simple, ThroughputDataFlow and SequenceDataFlow job modes.
  • Failover: Server crashed does not trigger re-sharding, only do it when next task trigger. Enable failover can notify other idle servers to pull orphan task items.
  • Execution status collection: Monitor execution status and statistics process success and failure count, collect previous trigger time, completed time and next trigger time.
  • Pause, resume and disable: Pause or resume jobs, and disable servers (usually disabled during system launching).
  • Misfired job re-trigger: Record missing jobs automatically, and trigger them after previous task completed. Please reference Quartz misfire.
  • Data processed concurrently: Use concurrent threads processing fetched data, to improve throughput.
  • Idempotency: Judge duplicate task items, restrict repeatable task items execute. Because enable idempotency need monitor job execution status, the performance for instantaneous jobs maybe low.
  • Failure tolerance: If job servers lost connection from registry center, job will stop immediately which to prevent registry center assign crashed task items to other job servers, but current job servers still running, then cause duplicated task items running.
  • Data offset store: Store offset of last processed data into Zookeeper.
  • Spring support: Integrate spring framework, customized namespace, place-holder supported etc.
  • Web console: Support web console to manage jobs and register centers.

Related documents

Downloads

Release notes

Directory structure

Usage

User guide

Limitations

Web console

Source codes compile problems

Theory illustrate

Job sharding strategy

InfoQ news

Quick Start

  • Add maven dependencies

Elastic-Job has deployed to Maven Central Repository, add dependencies in your pom.xml file.

<!-- add elastic-job core module -->
<dependency>
    <groupId>com.dangdang</groupId>
    <artifactId>elastic-job-core</artifactId>
    <version>${lasted.release.version}</version>
</dependency>

<!-- add elastic-job spring module, use namespace -->
<dependency>
    <groupId>com.dangdang</groupId>
    <artifactId>elastic-job-spring</artifactId>
    <version>${lasted.release.version}</version>
</dependency>
  • Job development
public class MyElasticJob extends AbstractThroughputDataFlowElasticJob<Foo> {
    
    @Override
    protected List<Foo> fetchData(JobExecutionMultipleShardingContext context) {
        List<Foo> result = // get data from database by sharding items
        return result;
    }
    
    @Override
    protected boolean processData(JobExecutionMultipleShardingContext context, Foo data) {
        // process data
        return true;
    }
}
  • Job configuration
<?xml version="1.0" encoding="UTF-8"?>
<beans xmlns="http://www.springframework.org/schema/beans"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns:reg="http://www.dangdang.com/schema/ddframe/reg" 
    xmlns:job="http://www.dangdang.com/schema/ddframe/job" 
    xsi:schemaLocation="http://www.springframework.org/schema/beans 
                        http://www.springframework.org/schema/beans/spring-beans.xsd 
                        http://www.dangdang.com/schema/ddframe/reg 
                        http://www.dangdang.com/schema/ddframe/reg/reg.xsd 
                        http://www.dangdang.com/schema/ddframe/job 
                        http://www.dangdang.com/schema/ddframe/job/job.xsd 
                        ">
    <!--configure registry center -->
    <reg:zookeeper id="regCenter" server-lists="yourhost:2181" namespace="dd-job" base-sleep-time-milliseconds="1000" max-sleep-time-milliseconds="3000" max-retries="3" />
    
    <!--configure job -->
    <job:simple id="oneOffElasticJob" class="xxx.MyElasticJob" registry-center-ref="regCenter" cron="0/10 * * * * ?"   sharding-total-count="3" sharding-item-parameters="0=A,1=B,2=C" />
</beans>