Skip to content

Commit

Permalink
sql: introduce crdb_internal.transaction_contention_events virtual table
Browse files Browse the repository at this point in the history
This commit introduces `crdb_internal.transaction_contention_events`
virtual table. This virtual tables exposes transaction contention events
annotated with transaction fingerprint IDs for transactions that have
finished executing. This allows this virtual table to be joined into the
statement statistics and transaction statistics tables.
The new virtual table require either VIEWACTIVITYREDACTED OR
VIEWACTIVITY role option to access. However, if user has
VIEWACTIVTYREDACTED role, the contending key will be redacted. The contention
events are stored in memory. The amount of contention events stored is
controlled via 'sql.contention.event_store.capacity' cluster setting.

The new table has the following schema:

CREATE TABLE crdb_internal.transaction_contention_events (
    collection_ts                TIMESTAMPTZ NOT NULL,

    blocking_txn_id              UUID NOT NULL,
    blocking_txn_fingerprint_id  BYTES NOT NULL,

    waiting_txn_id               UUID NOT NULL,
    waiting_txn_fingerprint_id   BYTES NOT NULL,

    contention_duration          INTERVAL NOT NULL,
    contending_key               BYTES NOT NULL
)

* collected_ts: stores the timestamp of when the contention event was
collected
* blocking_txn_id: stores the transaction ID of the blocking transaction
of the contention event. This column can be joined into
`crdb_internal.cluster_contention_events` or
`crdb_internal.node_contention_events` table.
* blocking_txn_fingerprint_id: stores the transaction fingerprint ID of
the blocking transaction fingerprint IDs. This can be used to join into
the `crdb_internal.statement_statistics` and
`crdb_internal.transaction_statistics` tables to surface historical
information of the transactions that caused the contention.
* waiting_txn_id: stores the transaction ID of the waiting transaction
in the contention event. Similar to `blocking_txn_id`, this column can
be joined into `crdb_internal.cluster_contention_events` and
`crdb_internal.node_contention_events` tables.
* waiting_txn_fingerprint_id: stores the transaction fingerprint ID of
the waiting transaction. Similar to `blocking_txn_fingerprint_id`, this
column can be joined to `crdb_internal.statement_statistics` and
`crdb_internal.transaction_statistics` tables.
* contention_duration: stores the amount of time the waiting transaction
spent waiting for the blocking transaction.
* contending_key: stores the key that caused the contention. If
the user has VIEWACTIVITYREDACTED role option, this column is redacted.

Resolves cockroachdb#75904

Release note (sql change): introducing
`crdb_internal.transaction_contention_events` virtual table, that exposes
historical transaction contention events. The events exposed in the new virtual
table also include transaction fingerprint IDs for both blocking and
waiting transactions. This allows the new virtual table to be joined
into statement statistics and transaction statistics tables.
The new virtual table require either VIEWACTIVITYREDACTED OR
VIEWACTIVITY role option to access. However, if user has
VIEWACTIVTYREDACTED role, the contending key will be redacted. The contention
events are stored in memory. The amount of contention events stored is
controlled via 'sql.contention.event_store.capacity' cluster setting.

Release note (api change): introducing
GET `/_status/transactioncontentionevents` endpoint, that returns
cluster-wide in-memory historical transaction contention events.
The endpoint require either VIEWACTIVITYREDACTED OR VIEWACTIVITY role
option to access. However, if user has VIEWACTIVTYREDACTED role, the
contending key will be redacted. The contention events are stored in memory.
The amount of contention events stored is controlled via
'sql.contention.event_store.capacity' cluster setting.

Release Justification: Low risk, high benefit change
  • Loading branch information
Azhng authored and RajivTS committed Mar 6, 2022
1 parent 1874cf3 commit f023e40
Show file tree
Hide file tree
Showing 30 changed files with 2,344 additions and 1,636 deletions.
45 changes: 45 additions & 0 deletions docs/generated/http/full.md
Original file line number Diff line number Diff line change
Expand Up @@ -4440,6 +4440,51 @@ Response object for issuing Transaction ID Resolution.



## TransactionContentionEvents

`GET /_status/transactioncontentionevents`

TransactionContentionEvents returns a list of un-aggregated contention
events sorted by the collection timestamp.

Support status: [reserved](#support-status)

#### Request Parameters







| Field | Type | Label | Description | Support status |
| ----- | ---- | ----- | ----------- | -------------- |
| node_id | [string](#cockroach.server.serverpb.TransactionContentionEventsRequest-string) | | | [reserved](#support-status) |







#### Response Parameters







| Field | Type | Label | Description | Support status |
| ----- | ---- | ----- | ----------- | -------------- |
| events | [cockroach.sql.contentionpb.ExtendedContentionEvent](#cockroach.server.serverpb.TransactionContentionEventsResponse-cockroach.sql.contentionpb.ExtendedContentionEvent) | repeated | | [reserved](#support-status) |







## RequestCA

`GET /_join/v1/ca`
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -97,6 +97,7 @@ crdb_internal table_indexes table NULL NULL NULL
crdb_internal table_row_statistics table NULL NULL NULL
crdb_internal tables table NULL NULL NULL
crdb_internal tenant_usage_details view NULL NULL NULL
crdb_internal transaction_contention_events table NULL NULL NULL
crdb_internal transaction_statistics view NULL NULL NULL
crdb_internal zones table NULL NULL NULL

Expand Down
2 changes: 2 additions & 0 deletions pkg/ccl/serverccl/statusccl/BUILD.bazel
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,8 @@ go_library(
"//pkg/roachpb",
"//pkg/security",
"//pkg/server/serverpb",
"//pkg/sql",
"//pkg/sql/contention",
"//pkg/sql/pgwire",
"//pkg/sql/sqlstats/persistedsqlstats",
"//pkg/sql/tests",
Expand Down
20 changes: 20 additions & 0 deletions pkg/ccl/serverccl/statusccl/tenant_status_test.go
Original file line number Diff line number Diff line change
Expand Up @@ -713,6 +713,26 @@ SET TRACING=off;
t.Errorf("did not expect contention event in controlled cluster, but it was found")
}
}

testutils.SucceedsWithin(t, func() error {
err = testHelper.testCluster().tenantContentionRegistry(1).FlushEventsForTest(ctx)
if err != nil {
return err
}

resp := &serverpb.TransactionContentionEventsResponse{}
testHelper.
testCluster().
tenantHTTPClient(t, 1).
GetJSON("/_status/transactioncontentionevents", resp)

if len(resp.Events) == 0 {
return errors.New("expected transaction contention events being populated, " +
"but it is not")
}

return nil
}, 5*time.Second)
}

func testIndexUsageForTenants(t *testing.T, testHelper *tenantTestHelper) {
Expand Down
29 changes: 19 additions & 10 deletions pkg/ccl/serverccl/statusccl/tenant_test_utils.go
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,8 @@ import (
"github.com/cockroachdb/cockroach/pkg/roachpb"
"github.com/cockroachdb/cockroach/pkg/security"
"github.com/cockroachdb/cockroach/pkg/server/serverpb"
"github.com/cockroachdb/cockroach/pkg/sql"
"github.com/cockroachdb/cockroach/pkg/sql/contention"
"github.com/cockroachdb/cockroach/pkg/sql/pgwire"
"github.com/cockroachdb/cockroach/pkg/sql/sqlstats/persistedsqlstats"
"github.com/cockroachdb/cockroach/pkg/sql/tests"
Expand All @@ -37,11 +39,12 @@ type serverIdx int
const randomServer serverIdx = -1

type testTenant struct {
tenant serverutils.TestTenantInterface
tenantConn *gosql.DB
tenantDB *sqlutils.SQLRunner
tenantStatus serverpb.SQLStatusServer
tenantSQLStats *persistedsqlstats.PersistedSQLStats
tenant serverutils.TestTenantInterface
tenantConn *gosql.DB
tenantDB *sqlutils.SQLRunner
tenantStatus serverpb.SQLStatusServer
tenantSQLStats *persistedsqlstats.PersistedSQLStats
tenantContentionRegistry *contention.Registry
}

func newTestTenant(
Expand All @@ -62,13 +65,15 @@ func newTestTenant(
status := tenant.StatusServer().(serverpb.SQLStatusServer)
sqlStats := tenant.PGServer().(*pgwire.Server).SQLServer.
GetSQLStatsProvider().(*persistedsqlstats.PersistedSQLStats)
contentionRegistry := tenant.ExecutorConfig().(sql.ExecutorConfig).ContentionRegistry

return &testTenant{
tenant: tenant,
tenantConn: tenantConn,
tenantDB: sqlDB,
tenantStatus: status,
tenantSQLStats: sqlStats,
tenant: tenant,
tenantConn: tenantConn,
tenantDB: sqlDB,
tenantStatus: status,
tenantSQLStats: sqlStats,
tenantContentionRegistry: contentionRegistry,
}
}

Expand Down Expand Up @@ -172,6 +177,10 @@ func (c tenantCluster) tenantStatusSrv(idx serverIdx) serverpb.SQLStatusServer {
return c.tenant(idx).tenantStatus
}

func (c tenantCluster) tenantContentionRegistry(idx serverIdx) *contention.Registry {
return c.tenant(idx).tenantContentionRegistry
}

func (c tenantCluster) cleanup(t *testing.T) {
for _, tenant := range c {
tenant.cleanup(t)
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/testdata/zip/partial1
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,7 @@ debug zip --concurrency=1 --cpu-profile-duration=0s /dev/null
[cluster] retrieving SQL data for crdb_internal.invalid_objects... writing output: debug/crdb_internal.invalid_objects.txt... done
[cluster] retrieving SQL data for crdb_internal.index_usage_statistics... writing output: debug/crdb_internal.index_usage_statistics.txt... done
[cluster] retrieving SQL data for crdb_internal.table_indexes... writing output: debug/crdb_internal.table_indexes.txt... done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events... writing output: debug/crdb_internal.transaction_contention_events.txt... done
[cluster] requesting nodes... received response... converting to JSON... writing binary output: debug/nodes.json... done
[cluster] requesting liveness... received response... converting to JSON... writing binary output: debug/liveness.json... done
[node 1] node status... converting to JSON... writing binary output: debug/nodes/1/status.json... done
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/testdata/zip/partial1_excluded
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,7 @@ debug zip /dev/null --concurrency=1 --exclude-nodes=2 --cpu-profile-duration=0
[cluster] retrieving SQL data for crdb_internal.invalid_objects... writing output: debug/crdb_internal.invalid_objects.txt... done
[cluster] retrieving SQL data for crdb_internal.index_usage_statistics... writing output: debug/crdb_internal.index_usage_statistics.txt... done
[cluster] retrieving SQL data for crdb_internal.table_indexes... writing output: debug/crdb_internal.table_indexes.txt... done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events... writing output: debug/crdb_internal.transaction_contention_events.txt... done
[cluster] requesting nodes... received response... converting to JSON... writing binary output: debug/nodes.json... done
[cluster] requesting liveness... received response... converting to JSON... writing binary output: debug/liveness.json... done
[node 1] node status... converting to JSON... writing binary output: debug/nodes/1/status.json... done
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/testdata/zip/partial2
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,7 @@ debug zip --concurrency=1 --cpu-profile-duration=0 /dev/null
[cluster] retrieving SQL data for crdb_internal.invalid_objects... writing output: debug/crdb_internal.invalid_objects.txt... done
[cluster] retrieving SQL data for crdb_internal.index_usage_statistics... writing output: debug/crdb_internal.index_usage_statistics.txt... done
[cluster] retrieving SQL data for crdb_internal.table_indexes... writing output: debug/crdb_internal.table_indexes.txt... done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events... writing output: debug/crdb_internal.transaction_contention_events.txt... done
[cluster] requesting nodes... received response... converting to JSON... writing binary output: debug/nodes.json... done
[cluster] requesting liveness... received response... converting to JSON... writing binary output: debug/liveness.json... done
[node 1] node status... converting to JSON... writing binary output: debug/nodes/1/status.json... done
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/testdata/zip/testzip
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,7 @@ debug zip --concurrency=1 --cpu-profile-duration=1s /dev/null
[cluster] retrieving SQL data for crdb_internal.invalid_objects... writing output: debug/crdb_internal.invalid_objects.txt... done
[cluster] retrieving SQL data for crdb_internal.index_usage_statistics... writing output: debug/crdb_internal.index_usage_statistics.txt... done
[cluster] retrieving SQL data for crdb_internal.table_indexes... writing output: debug/crdb_internal.table_indexes.txt... done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events... writing output: debug/crdb_internal.transaction_contention_events.txt... done
[cluster] requesting nodes... received response... converting to JSON... writing binary output: debug/nodes.json... done
[cluster] requesting liveness... received response... converting to JSON... writing binary output: debug/liveness.json... done
[cluster] requesting CPU profiles
Expand Down
3 changes: 3 additions & 0 deletions pkg/cli/testdata/zip/testzip_concurrent
Original file line number Diff line number Diff line change
Expand Up @@ -103,6 +103,9 @@ zip
[cluster] retrieving SQL data for crdb_internal.table_indexes...
[cluster] retrieving SQL data for crdb_internal.table_indexes: done
[cluster] retrieving SQL data for crdb_internal.table_indexes: writing output: debug/crdb_internal.table_indexes.txt...
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events...
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events: done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events: writing output: debug/crdb_internal.transaction_contention_events.txt...
[cluster] retrieving SQL data for crdb_internal.zones...
[cluster] retrieving SQL data for crdb_internal.zones: done
[cluster] retrieving SQL data for crdb_internal.zones: writing output: debug/crdb_internal.zones.txt...
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/testdata/zip/testzip_tenant
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,7 @@ debug zip --concurrency=1 --cpu-profile-duration=1s /dev/null
[cluster] retrieving SQL data for crdb_internal.invalid_objects... writing output: debug/crdb_internal.invalid_objects.txt... done
[cluster] retrieving SQL data for crdb_internal.index_usage_statistics... writing output: debug/crdb_internal.index_usage_statistics.txt... done
[cluster] retrieving SQL data for crdb_internal.table_indexes... writing output: debug/crdb_internal.table_indexes.txt... done
[cluster] retrieving SQL data for crdb_internal.transaction_contention_events... writing output: debug/crdb_internal.transaction_contention_events.txt... done
[cluster] requesting nodes... received response... converting to JSON... writing binary output: debug/nodes.json... done
[cluster] requesting liveness... received response...
[cluster] requesting liveness: last request failed: rpc error: ...
Expand Down
1 change: 1 addition & 0 deletions pkg/cli/zip_cluster_wide.go
Original file line number Diff line number Diff line change
Expand Up @@ -110,6 +110,7 @@ var debugZipTablesPerCluster = []string{
"crdb_internal.invalid_objects",
"crdb_internal.index_usage_statistics",
"crdb_internal.table_indexes",
"crdb_internal.transaction_contention_events",
}

// nodesInfo holds node details pulled from a SQL or storage node.
Expand Down
3 changes: 3 additions & 0 deletions pkg/rpc/auth_tenant.go
Original file line number Diff line number Diff line change
Expand Up @@ -107,6 +107,9 @@ func (a tenantAuthorizer) authorize(
case "/cockroach.server.serverpb.Status/CancelLocalQuery":
return a.authTenant(tenID)

case "/cockroach.server.serverpb.Status/TransactionContentionEvents":
return a.authTenant(tenID)

case "/cockroach.roachpb.Internal/GetSpanConfigs":
return a.authGetSpanConfigs(tenID, req.(*roachpb.GetSpanConfigsRequest))

Expand Down
1 change: 1 addition & 0 deletions pkg/server/serverpb/status.go
Original file line number Diff line number Diff line change
Expand Up @@ -38,6 +38,7 @@ type SQLStatusServer interface {
TableIndexStats(context.Context, *TableIndexStatsRequest) (*TableIndexStatsResponse, error)
UserSQLRoles(context.Context, *UserSQLRolesRequest) (*UserSQLRolesResponse, error)
TxnIDResolution(context.Context, *TxnIDResolutionRequest) (*TxnIDResolutionResponse, error)
TransactionContentionEvents(context.Context, *TransactionContentionEventsRequest) (*TransactionContentionEventsResponse, error)
}

// OptionalNodesStatusServer is a StatusServer that is only optionally present
Expand Down
18 changes: 18 additions & 0 deletions pkg/server/serverpb/status.proto
Original file line number Diff line number Diff line change
Expand Up @@ -1565,6 +1565,16 @@ message TxnIDResolutionResponse {
(gogoproto.nullable) = false];
}

message TransactionContentionEventsRequest {
string node_id = 1 [(gogoproto.customname) = "NodeID"];
}

message TransactionContentionEventsResponse {
repeated cockroach.sql.contentionpb.ExtendedContentionEvent events = 1 [
(gogoproto.nullable) = false
];
}

service Status {
// Certificates retrieves a copy of the TLS certificates.
rpc Certificates(CertificatesRequest) returns (CertificatesResponse) {
Expand Down Expand Up @@ -1977,4 +1987,12 @@ service Status {
// Client is responsible to perform retries if the requested transaction ID
// is not returned in the RPC response.
rpc TxnIDResolution(TxnIDResolutionRequest) returns (TxnIDResolutionResponse) {}

// TransactionContentionEvents returns a list of un-aggregated contention
// events sorted by the collection timestamp.
rpc TransactionContentionEvents(TransactionContentionEventsRequest) returns (TransactionContentionEventsResponse) {
option (google.api.http) = {
get: "/_status/transactioncontentionevents"
};
}
}
100 changes: 100 additions & 0 deletions pkg/server/status.go
Original file line number Diff line number Diff line change
Expand Up @@ -412,6 +412,27 @@ func (b *baseStatusServer) localTxnIDResolution(
return resp
}

func (b *baseStatusServer) localTransactionContentionEvents(
shouldRedactContendingKey bool,
) *serverpb.TransactionContentionEventsResponse {
registry := b.sqlServer.execCfg.ContentionRegistry

resp := &serverpb.TransactionContentionEventsResponse{
Events: make([]contentionpb.ExtendedContentionEvent, 0),
}
// Ignore error returned by ForEachEvent() since if our own callback doesn't
// return error, ForEachEvent() also doesn't return error.
_ = registry.ForEachEvent(func(event *contentionpb.ExtendedContentionEvent) error {
if shouldRedactContendingKey {
event.BlockingEvent.Key = []byte{}
}
resp.Events = append(resp.Events, *event)
return nil
})

return resp
}

// A statusServer provides a RESTful status API.
type statusServer struct {
*baseStatusServer
Expand Down Expand Up @@ -3000,3 +3021,82 @@ func (s *statusServer) TxnIDResolution(

return statusClient.TxnIDResolution(ctx, req)
}

func (s *statusServer) TransactionContentionEvents(
ctx context.Context, req *serverpb.TransactionContentionEventsRequest,
) (*serverpb.TransactionContentionEventsResponse, error) {
ctx = s.AnnotateCtx(propagateGatewayMetadata(ctx))

if err := s.privilegeChecker.requireViewActivityOrViewActivityRedactedPermission(ctx); err != nil {
return nil, err
}

user, isAdmin, err := s.privilegeChecker.getUserAndRole(ctx)
if err != nil {
return nil, serverError(ctx, err)
}

shouldRedactContendingKey := false
if !isAdmin {
shouldRedactContendingKey, err =
s.privilegeChecker.hasRoleOption(ctx, user, roleoption.VIEWACTIVITYREDACTED)
if err != nil {
return nil, serverError(ctx, err)
}
}

if s.gossip.NodeID.Get() == 0 {
return nil, status.Errorf(codes.Unavailable, "nodeID not set")
}

if len(req.NodeID) > 0 {
requestedNodeID, local, err := s.parseNodeID(req.NodeID)
if err != nil {
return nil, status.Errorf(codes.InvalidArgument, err.Error())
}
if local {
return s.localTransactionContentionEvents(shouldRedactContendingKey), nil
}

statusClient, err := s.dialNode(ctx, requestedNodeID)
if err != nil {
return nil, err
}
return statusClient.TransactionContentionEvents(ctx, req)
}

dialFn := func(ctx context.Context, nodeID roachpb.NodeID) (interface{}, error) {
statusClient, err := s.dialNode(ctx, nodeID)
return statusClient, err
}

rpcCallFn := func(ctx context.Context, client interface{}, _ roachpb.NodeID) (interface{}, error) {
statusClient := client.(serverpb.StatusClient)
return statusClient.TransactionContentionEvents(ctx, &serverpb.TransactionContentionEventsRequest{
NodeID: "local",
})
}

resp := &serverpb.TransactionContentionEventsResponse{
Events: make([]contentionpb.ExtendedContentionEvent, 0),
}

if err := s.iterateNodes(ctx, "txn contention events for node",
dialFn,
rpcCallFn,
func(nodeID roachpb.NodeID, nodeResp interface{}) {
txnContentionEvents := nodeResp.(*serverpb.TransactionContentionEventsResponse)
resp.Events = append(resp.Events, txnContentionEvents.Events...)
},
func(nodeID roachpb.NodeID, nodeFnError error) {
},
); err != nil {
return nil, err
}

sort.Slice(resp.Events, func(i, j int) bool {
return resp.Events[i].CollectionTs.Before(resp.Events[j].CollectionTs)
})

return resp, nil
}
Loading

0 comments on commit f023e40

Please sign in to comment.