Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sql: pg_catalog.pg_statistic_ext is incompletely defined #88108

Closed
knz opened this issue Sep 18, 2022 · 0 comments · Fixed by #93274
Closed

sql: pg_catalog.pg_statistic_ext is incompletely defined #88108

knz opened this issue Sep 18, 2022 · 0 comments · Fixed by #93274
Assignees
Labels
A-sql-pgcatalog A-sql-pgcompat Semantic compatibility with PostgreSQL A-sql-vtables Virtual tables - pg_catalog, information_schema etc C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-sql-foundations SQL Foundations Team (formerly SQL Schema + SQL Sessions)

Comments

@knz
Copy link
Contributor

knz commented Sep 18, 2022

Found while working on #88061

Describe the problem

The vtable pg_statistic_ext is not empty, but the data populated inside it is unusable.

  • stxnamespace is missing.
  • stxkind is missing.
  • stxstattarget is missing (could be -1)

To Reproduce

The following SQL query should report the statistics for a table:

WITH stat AS (
SELECT oid,
       stxrelid::pg_catalog.regclass AS tb,
       stxnamespace::pg_catalog.regnamespace AS nsp,
       stxname,
       pg_get_statisticsobjdef_columns(oid) AS columns,
       'd' = any(stxkind) AS hasndist,
       'f' = any(stxkind) AS hasdeps,
       'm' = any(stxkind) AS hasmcv,
       stxstattarget
  FROM pg_catalog.pg_statistic_ext stat
 WHERE stxrelid = %[1]s)

  SELECT '"'||nsp||'.'||stxname||'"'||
         IF((hasndist OR hasdeps OR hasmcv) AND NOT (hasndist AND hasdeps AND hasmcv),
            '('||
            IF(hasndist,
               'ndistinct' || IF(hasdeps OR hasmcv, ', ', ''),
               '')||
            IF(hasdeps, 'dependencies' || IF(hasmcv, ', ', ''), '')||
            IF(hasmcv, 'mcv', '')||
            ')',
           '')||
         ' ON ' || columns || ' FROM ' || tb ||
         IF(stxstattarget <> -1 AND stxstattarget IS NOT NULL,
            '; STATISTICS ' || stxstattarget::STRING, '')
         AS "Statistics objects"
    FROM stat
ORDER BY stat.oid

Alternatively, instead of pg_get_statisticsobjdef_columns the following should also work:

                    (SELECT pg_catalog.string_agg(pg_catalog.quote_ident(attname),', ')
                  FROM pg_catalog.unnest(stxkeys) s(attnum)
                JOIN pg_catalog.pg_attribute a ON (stxrelid = a.attrelid AND
                a.attnum = s.attnum AND NOT attisdropped)) AS columns

cc @rafiss for triage

Jira issue: CRDB-19692

Epic: CRDB-23454

@knz knz added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. A-sql-pgcompat Semantic compatibility with PostgreSQL A-sql-vtables Virtual tables - pg_catalog, information_schema etc A-sql-pgcatalog labels Sep 18, 2022
@rafiss rafiss added the T-sql-foundations SQL Foundations Team (formerly SQL Schema + SQL Sessions) label Oct 3, 2022
@craig craig bot closed this as completed in 56d249d Dec 20, 2022
rafiss added a commit that referenced this issue Dec 21, 2022
@exalate-issue-sync exalate-issue-sync bot reopened this Jan 4, 2023
craig bot pushed a commit that referenced this issue Feb 24, 2023
88061: clisqlshell: new infrastructure for describe commands r=rafiss,ZhouXing19 a=knz

Fixes #95320.
Epic: CRDB-23454

The SQL shell (`cockroach sql`, `demo`) now
supports the client-side commands `\l`, `\dn`, `\d`, `\di`, `\dm`,
`\ds`, `\dt`, `\dv`, `\dC`, `\dT`, `\dd`, `\dg`, `\du`, `\df` and `\dd` in a
way similar to `psql`, including the modifier flags `S` and `+`, for
convenience for users migrating from PostgreSQL.

A notable difference is that when a pattern argument is specified, it
should use the SQL "LIKE" syntax (with `%` representing the wildcard
character) instead of PostgreSQL's glob-like syntax (with `*`
representing wildcards).

Issues discovered:

- [x] join bug:  #88096
- [x] semi-join exec error #91012
- [x] `pg_table_is_visible` should return true when given a valid index OID and the index is valid.  #88097
- [x] missing pkey column in pg_index:  #88106
- [x] missing stored columns in pg_index: #88107 
- [x] pg_statistic_ext has problems #88108
- [x] missing view def on materialized views  #88109
- [x] missing schema comments: #88098
- [x] missing pronamespace for functions #94952
- [x] broken pg_function_is_visible for UDFs #94953
- [x] generated columns #92545
- [x] indnullsnotdistinct #92583
- [x] missing prokind #95288
- [x] missing function comments in obj_description #95292
- [x] planning regression #95633

96397: builtins: mark some pg_.* builtins as strict r=DrewKimball a=mgartner

Builtins defined using the UDF `Body` field will be wrapped in a `CASE`
expression if they are strict, i.e., `CalledOnNullInput=false`. When the
builtin is inlined, the `CASE` expression prevents decorrelation,
leaving a slow apply-join in the query plan. This caused a significant
regression of some ORM introspection queries.

Some of these builtins have filters that cause the SQL body to return no rows
if any of the arguments is NULL. In this case, the builtin will have the same
behavior whether or not it is defined as being strict. We can safely optimize
these builtins by setting `CalledOnNullInput=true`.

The following conditions are sufficient to prove that `CalledOnNullInput` can
be set for a builtin function with a SQL body:

  1. The WHERE clause of the SQL query *null-rejects* every argument of the
     builtin. Operators like `=` and `<` *null-reject* their operands because
     they filter rows for which an operand is NULL.

  2. The arguments are not used elsewhere in the query. This is not strictly
     necessary, but simplifies the proof because it ensures NULL arguments will
     not cause the builtin to error.

Examples of SQL statements that would allow `CalledOnNullInput` to be set:
```
SELECT * FROM tab WHERE $1=1 AND $2='two';

SELECT * FROM tab WHERE $1 > 0;
```

Fixes #96218
Fixes #95569

Epic: None

Release note: None


97585: cli: don't scope TLS client certs to a specific tenant by default r=stevendanna a=knz

Epic: CRDB-23559
Fixes: #97584

This commit changes the default for `--tenant-scope` from "only the system tenant" to "cert valid for all tenants".

Note that the scoping is generally useful for security, and it is used in CockroachCloud. However, CockroachCloud does not use our CLI code to generate certs and sets its cert tenant scopes on its own.

Given that our CLI code is provided for convenience and developer productivity, and we don't expect certs generated here to be used in multi-tenant deployments where tenants are adversarial to each other, defaulting to certs that are valid on every tenant is a good choice.

Release note: None


Co-authored-by: Raphael 'kena' Poss <[email protected]>
Co-authored-by: Marcus Gartner <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-sql-pgcatalog A-sql-pgcompat Semantic compatibility with PostgreSQL A-sql-vtables Virtual tables - pg_catalog, information_schema etc C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. T-sql-foundations SQL Foundations Team (formerly SQL Schema + SQL Sessions)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants