feat(split): child notify parent catch up #390

hycdong · 2020-02-10T01:41:31Z

Simple partition split process

Meta receives the client's partition split request and changes the partition count (split: meta start partition split #286)
Replica notices that the partition count has changed during on_config_sync
Parent partition creates the child partition (split: parent replica create child replica #291)
Parent prepares states for the child to learn (feat(split): parent replica prepare states #299)
Child partition asynchronously learns states from the parent (feat(split): child replica learn parent prepare list and checkpoint #309; feat(split): child replica apply private logs, in-memory mutations and catch up parent #319)
Child notifies the parent that it has caught up
Meta server registers the child partitions
Child partition becomes active, and the parent resumes serving reads and writes

For more discussion of partition split, see issue #69 and the partition split design doc.
This PR implements part of the fifth step of partition split, which is shown in bold in the process description.

What this pr solved

child_notify_catch_up: the child partition sends a notify_catch_up_request to the primary parent.
parent_handle_child_catch_up: the primary parent handles the child's catch-up request; once all child partitions have caught up, the parent starts copying mutations to the children synchronously.
sync_point: the first decree after the parent starts sending write requests to the child synchronously. Once sync_point is committed, the parent considers the child to have learned all data, and the primary parent updates the partition_count of the child replica group (not in this PR, but in feat(split): update group partition count #392).
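The parent-side bookkeeping described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the code in this PR: the struct `parent_split_state`, its fields, and the choice of `last_prepared_decree + 1` as the sync_point are all hypothetical stand-ins for the real replica state.

```cpp
#include <cassert>
#include <cstdint>
#include <set>
#include <string>

// Hypothetical, simplified model of what the primary parent tracks during split.
// None of these names come from the real replica class.
struct parent_split_state {
    std::set<std::string> expected_children;  // every child replica in the group
    std::set<std::string> caught_up_children; // children that sent notify_catch_up
    bool sync_copy_enabled = false;           // parent copies mutations synchronously
    int64_t sync_point = -1;                  // first decree written with sync copy
    int64_t last_prepared_decree = 0;

    // Handles one catch-up notification. Returns true once every expected child
    // has caught up; at that moment sync copy is enabled and sync_point is fixed.
    bool on_child_catch_up(const std::string &child) {
        assert(last_prepared_decree >= 0);
        if (expected_children.count(child) == 0) {
            return false; // notification from a replica we do not know: ignore it
        }
        caught_up_children.insert(child); // duplicates are harmless in a set
        if (caught_up_children.size() < expected_children.size()) {
            return false; // still waiting for other children
        }
        if (!sync_copy_enabled) {
            sync_copy_enabled = true;
            // Assumption: the next decree is the first one sent synchronously.
            sync_point = last_prepared_decree + 1;
        }
        return true;
    }

    // The child is considered to hold all data once sync_point is committed.
    bool child_has_all_data(int64_t last_committed_decree) const {
        return sync_copy_enabled && last_committed_decree >= sync_point;
    }
};
```

Keeping the caught-up children in a set makes a retried notification idempotent, which matters because the child may resend the request after a timeout.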

neverchanje

I suggest creating a separate class for this function to reduce the complexity of replica. The same applies to the other functions in replica_split.cpp.

Of course, this is only a suggestion, because the refactoring may take considerable time.

class child_notify_caught_up : public pipeline::when<>
{
public:
  void run() { // aka replica::child_notify_catch_up
    notify_catch_up_rpc rpc;
    // ec is assumed to be the error code passed to the RPC callback
    rpc.call([this](error_code ec) {
      if (ec == ERR_TIMEOUT) {
        // repeat() is inherited from pipeline::when; it replaces a tasking::enqueue-style retry,
        // so you can get rid of the thread pool, tracker, task_code...
        repeat(1_s);
        return;
      }
      ...
    });
  }
};

So how do we execute child_notify_caught_up::run() (i.e. child_notify_catch_up)?

class partition_split_executor : public pipeline::base {
public:
  explicit partition_split_executor(replica* r) {
    // set up the executing environment
    thread_pool(LPC_PARTITION_SPLIT).task_tracker(&_tracker).from(_child_notify_caught_up);
  }
private:
  task_tracker _tracker;
  child_notify_caught_up _child_notify_caught_up;
};

// executing child_notify_caught_up
partition_split_executor executor(r);
executor.run_pipeline();

As you can see, pipeline::base is designed for a more complicated situation; more specifically,
it's designed for multi-stage pipelines. A single function is not worth this refactoring.

Perhaps unconsciously, the split procedure is already modeled as multiple logical stages, and each failed stage either retries (pipeline::repeat) or cleans up. It's natural to turn this execution logic into a coded pipeline.

// on_add_child(group_check)
// -> LPC_CREATE_CHILD: create_child_replica
// -> LPC_PARTITION_SPLIT: child_init_replica
// -> LPC_PARTITION_SPLIT: parent_prepare_states (repeat if fail)
// -> LPC_PARTITION_SPLIT: child_copy_prepare_list
// -> LPC_PARTITION_SPLIT_ASYNC_LEARN: child_learn_states
// -> LPC_PARTITION_SPLIT: child_catch_up_states
//   -->> LPC_CATCHUP_WITH_PRIVATE_LOGS: catch_up_with_private_logs
// -> LPC_PARTITION_SPLIT: child_notify_catch_up + RPC_SPLIT_NOTIFY_CATCH_UP
// -> LPC_PARTITION_SPLIT: parent_check_sync_point_commit
// -> ...
class partition_split_executor : public pipeline::base {
public:
  explicit partition_split_executor(replica* parent) {
    // set up the executing environment
    task_tracker(&_tracker);
  }
  void start_split() {
    thread_pool(LPC_CREATE_CHILD)
      .from(_create_child_replica) // initial stage
      .link(_child_init_replica, LPC_PARTITION_SPLIT, _child->thread_hash())
      .link(_parent_prepare_states, LPC_PARTITION_SPLIT, _parent->thread_hash())
      .link(_child_copy_prepare_list, LPC_PARTITION_SPLIT, _child->thread_hash())
      .link(_child_learn_states, LPC_PARTITION_SPLIT_ASYNC_LEARN)
      .link(_child_catch_up_states, LPC_PARTITION_SPLIT, _child->thread_hash())
      .link(_child_notify_catch_up) // LPC_PARTITION_SPLIT, _child->thread_hash()
      .link(_parent_check_sync_point_commit); // end stage

    // a stage that is forked from main pipeline, since it's not the
    // direct edge from `child_catch_up_states` to `_child_notify_catch_up`.
    fork(_catch_up_with_private_logs, 
         LPC_CATCHUP_WITH_PRIVATE_LOGS).link(_child_notify_catch_up);
  }
private:
  task_tracker _tracker;

  replica* _parent;
  replica* _child;
  split_states *_split_states;
  replica_stub *_stub;
};

// `when<>` and `result<>` mean that this stage takes no argument
// and passes an empty result to the next stage.
class child_catch_up_states : public pipeline::when<>, public pipeline::result<> {
public:
  void run() {
    if(...) {
      // -> `_child_notify_catch_up`
      step_down_next_stage();
    } else {
      _catch_up_with_private_logs->async();
    }
  }
};

The advantage of the pipeline design is that it decouples the new logic from replica, leaving replica as merely a "data class". The pipeline is what organizes the code, so people who run into problems with split can quickly understand the entire execution logic. Duplication uses this technique too: very little additional code in replica serves duplication.
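To make the idea concrete without depending on rDSN, here is a toy sketch of stages chained with `link` and retried on failure. It is not the pipeline::base API: `toy_pipeline`, `stage_result`, and the synchronous `run` loop are hypothetical, and a real pipeline would schedule each stage on a thread pool and retry after a delay instead of looping.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Outcome of running one stage once.
enum class stage_result { done, retry, fail };

// Hypothetical, single-threaded stand-in for a multi-stage pipeline.
class toy_pipeline {
public:
    // Appends a named stage; returns *this so calls can be chained.
    toy_pipeline &link(std::string name, std::function<stage_result()> fn) {
        _stages.push_back({std::move(name), std::move(fn)});
        return *this;
    }

    // Runs the stages in order and records every attempt in `trace`.
    // A stage returning `retry` is run again, at most `max_retries` extra times.
    // Returns false if any stage fails or exhausts its retries.
    bool run(int max_retries, std::vector<std::string> &trace) {
        assert(max_retries >= 0);
        for (auto &s : _stages) {
            int retries = 0;
            for (;;) {
                trace.push_back(s.name);
                const stage_result r = s.fn();
                if (r == stage_result::done) {
                    break; // move on to the next stage
                }
                if (r == stage_result::fail || ++retries > max_retries) {
                    return false; // give up: caller is expected to clean up
                }
            }
        }
        return true;
    }

private:
    struct stage {
        std::string name;
        std::function<stage_result()> fn;
    };
    std::vector<stage> _stages;
};
```

The point of the sketch is only that the order of stages and the retry policy live in one place, so the objects the stages operate on do not need to know about either.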

I really hope we can at least consider this refactoring. We can discuss it further if needed.

hycdong · 2020-03-10T06:00:00Z

I will consider your refactoring suggestion after all the core split code is merged. On the one hand, this refactoring is a big project. On the other hand, I am not familiar with the pipeline logic and am not sure whether it is suitable for split. I will study it while merging the remaining split code and then discuss it with you.

hycdong and others added 20 commits September 20, 2019 19:18

child async learn and catch up

cea7e44

small fix

7d2beff

Merge branch 'master' into child_async_learn_2

caa40fb

merge master and fix conflicy

79a906c

Merge branch 'child_async_learn_2' of github.com:hycdong/rdsn into child_async_learn_2

8df5b0d

Merge branch 'master' into child_async_learn_2

a1ae859

Merge branch 'master' into child_async_learn_2

c8a25ca

Merge branch 'master' into child_async_learn_2

a2c9827

Merge branch 'master' into child_async_learn_2

436cdf6

merge master

bd64cce

Merge branch 'master' into child_async_learn_2

4c91133

Merge branch 'master' into child_async_learn_2

94c6ca3

Merge branch 'master' into child_async_learn_2

fcd9404

fix according to review

49fc63b

Merge branch 'pegasus' into child_async_learn_2

335add8

add comments, remove useless ut

7bc22db

fix bug and format code

24f3e5b

remove useless file

ee40711

add code and test

57653c4

fix bugs

cfb024b

This was referenced Feb 14, 2020

feat(split): register child partition #391

Merged

feat(split): update group partition count #392

Closed

weekly-digest bot mentioned this pull request Feb 16, 2020

Weekly Digest (9 February, 2020 - 16 February, 2020) #400

Closed

hycdong added 2 commits February 20, 2020 10:48

small fix

3087881

small fix

f97d59c

hycdong changed the title from "feat(split): child notify parent catch up [WIP]" to "feat(split): child notify parent catch up" Feb 20, 2020

hycdong marked this pull request as ready for review February 20, 2020 06:14

hycdong added the component/split label Feb 20, 2020

weekly-digest bot mentioned this pull request Feb 23, 2020

Weekly Digest (16 February, 2020 - 23 February, 2020) #402

Closed

Merge branch 'master' into child_catch_up

588370a

Merge branch 'master' into child_catch_up

9bfebdb

weekly-digest bot mentioned this pull request Mar 1, 2020

Weekly Digest (23 February, 2020 - 1 March, 2020) #412

Closed

hycdong and others added 2 commits March 2, 2020 15:54

Merge branch 'master' into child_catch_up

c6ccea7

Merge branch 'master' into child_catch_up

5d19f80

neverchanje reviewed Mar 9, 2020

src/dist/replication/replication.thrift (outdated, resolved)

src/dist/replication/lib/replica_split.cpp (outdated, resolved)

acelyc111 reviewed Mar 9, 2020

hycdong added 2 commits March 10, 2020 14:13

fix by code review

d64703e

refactor notift_catch_up_rpc

aacc3a3

acelyc111 approved these changes Mar 10, 2020

Merge branch 'master' into child_catch_up

3ffba9b

neverchanje approved these changes Mar 11, 2020

acelyc111 merged commit 3a2691e into XiaoMi:master Mar 11, 2020

neverchanje mentioned this pull request Mar 30, 2020

Release 1.12.3 apache/incubator-pegasus#506

Closed

neverchanje pushed a commit that referenced this pull request Mar 31, 2020

feat(split): child notify parent catch up (#390)

2beeeab

neverchanje added the 1.12.3 label Apr 17, 2020

hycdong deleted the child_catch_up branch April 22, 2020 01:34

hycdong mentioned this pull request Oct 21, 2020

feat(split): add update_child_group_partition_count #645

Merged

This was referenced Oct 28, 2020

feat(split): add splitting_replicas while on_config_sync #653

Merged

feat(split): parent group update partition count #654

Merged

hycdong mentioned this pull request Nov 26, 2020

feat(split): secondary start split #675

Merged

hycdong mentioned this pull request Jan 14, 2021

feat(split): add child copy mutation synchronously #727

Merged

hycdong mentioned this pull request Aug 31, 2021

Feature: support partition split apache/incubator-pegasus#754

Closed
