We have code in CatalogManager (shown below) that waits for just RF tablet servers to be up before trying to create the txn status table. However, by default we pick a tablet count that depends on the number of tablet servers alive at the moment of creation, e.g. 8 * number of servers. In the worst case, only the minimum number of servers (default RF = 3) are up at creation time, so the table gets just 24 tablets. In a big enough cluster, say > 24 nodes, that is not enough to guarantee one txn status tablet leader on every node!
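Concretely: with the default of 8 tablets per server and only RF = 3 servers registered when the table is created, the txn status table gets 8 × 3 = 24 tablets. On a 30-node cluster those 24 tablet leaders can cover at most 24 nodes, so at least 6 nodes can never host a txn status tablet leader, no matter how the load balancer moves them.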
if (FLAGS_enable_ysql) {
  LOG_WITH_PREFIX(INFO)
      << "YSQL is enabled, will create the transaction status table when "
      << FLAGS_replication_factor << " tablet servers are online";
  master_->ts_manager()->SetTSCountCallback(FLAGS_replication_factor, [this] {
    LOG_WITH_PREFIX(INFO)
        << FLAGS_replication_factor
        << " tablet servers registered, creating the transaction status table";
    // Retry table creation until it succeeds. It might fail initially because placement UUID
    // of live replicas is set through an RPC from YugaWare, and we won't be able to calculate
    // the number of primary (non-read-replica) tablet servers until that happens.
    while (true) {
      const auto s = CreateTransactionsStatusTableIfNeeded(/* rpc */ nullptr);
      if (s.ok()) {
        break;
      }
      LOG_WITH_PREFIX(WARNING) << "Failed creating transaction status table, retrying: " << s;
      SleepFor(MonoDelta::FromSeconds(1));
    }
  });
}
We should add a new flag for this, so a user can specify how many servers to wait for before pre-creating the table. Our platform could then set it at universe creation, since we always know how many nodes a universe is created with. There are no backwards compatibility issues, and for a sensible default we can just default this flag to replication_factor. A rough sketch of that wiring follows.
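A minimal sketch of what this could look like, assuming the flag is defined as a standard gflag in the master. The flag name txn_table_wait_ts_count is taken from the commit summary below; the fall-back-to-RF logic here is illustrative, not the final implementation:

// Sketch: new gflag, with 0 meaning "fall back to replication_factor".
DEFINE_int32(txn_table_wait_ts_count, 0,
             "Number of tablet servers to wait for before creating the transaction "
             "status table. If 0, waits for replication_factor tablet servers.");

// In CatalogManager, replace the hard-coded FLAGS_replication_factor:
const int32_t wait_ts_count = FLAGS_txn_table_wait_ts_count > 0
    ? FLAGS_txn_table_wait_ts_count
    : FLAGS_replication_factor;
master_->ts_manager()->SetTSCountCallback(wait_ts_count, [this, wait_ts_count] {
  LOG_WITH_PREFIX(INFO)
      << wait_ts_count
      << " tablet servers registered, creating the transaction status table";
  // ... retry loop as above ...
});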
…before the transactions table is created
Summary:
Currently in the catalog manager, we wait for just RF tablet servers before creating the
transaction status table. However, when we move to large clusters (> 24 nodes), just RF
tablet servers might not be enough to guarantee one txn status tablet leader on every node.
Support has been added to specify the number of tablet servers to wait for before creating the
transaction table. Users can set the flag "txn_table_wait_ts_count" for this purpose. The default
value of this flag is RF.
Test Plan:
./bin/yb-ctl create --master_flags "txn_table_wait_ts_count=3" --tserver_flags "txn_table_wait_ts_count=3"
Verify that the transaction table hasn't been created, despite RF being 1.
./bin/yb-ctl add_node --tserver_flags "txn_table_wait_ts_count=3"
./bin/yb-ctl add_node --tserver_flags "txn_table_wait_ts_count=3"
Verify that the transaction table gets created once the third tablet server is up.
Also, added a unit test in create-table-itest.cc that simulates the above sequence (see the sketch after this commit message).
Reviewers: bogdan, rsami
Reviewed By: bogdan, rsami
Subscribers: zyu, ybase
Differential Revision: https://phabricator.dev.yugabyte.com/D10258
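A hedged sketch of what such a test could look like, assuming YugabyteDB's ExternalMiniCluster-based test fixture; the test name, flag plumbing, and timeouts here are illustrative, and the actual test in create-table-itest.cc may differ:

// Sketch only: simulates the test plan above inside an ExternalMiniCluster.
TEST_F(CreateTableITest, TxnTableWaitsForTsCount) {
  std::vector<std::string> flags = {"--txn_table_wait_ts_count=3"};
  ASSERT_NO_FATALS(StartCluster(flags, flags, /* num_tablet_servers */ 1));

  // With only 1 tablet server up, the transaction status table must not exist yet.
  ASSERT_FALSE(ASSERT_RESULT(client_->TableExists(
      client::YBTableName(YQL_DATABASE_CQL, "system", "transactions"))));

  // Bring up two more tablet servers to reach the configured count of 3.
  ASSERT_OK(cluster_->AddTabletServer());
  ASSERT_OK(cluster_->AddTabletServer());

  // The table should now get created; poll until it shows up.
  ASSERT_OK(WaitFor([&]() -> Result<bool> {
    return client_->TableExists(
        client::YBTableName(YQL_DATABASE_CQL, "system", "transactions"));
  }, MonoDelta::FromSeconds(30), "transaction status table created"));
}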
cc @psudheer21 @m-iancu