diff --git a/roadmap.md b/roadmap.md new file mode 100644 index 0000000000000..a8b57db5440df --- /dev/null +++ b/roadmap.md @@ -0,0 +1,185 @@ +# TiDB Roadmap + +This roadmap brings you what's coming in the 1-year future, so you can see the new features or improvements in advance, follow the progress, learn about the key milestones on the way, and give feedback as the development work goes on. In the course of development, this roadmap is subject to change based on user needs and feedback. If you have a feature request or want to prioritize a feature, please file an issue on [GitHub](https://github.com/pingcap/tidb/issues). + +✅: The feature or improvement is already available in TiDB. + +> **Safe harbor statement:** +> +> *Any unreleased features discussed or referenced in our documents, roadmaps, blogs, websites, press releases, or public statements that are not currently available ("unreleased features") are subject to change at our discretion and may not be delivered as planned or at all. Customers acknowledge that purchase decisions are solely based on features and functions that are currently available, and that PingCAP is not obliged to deliver aforementioned unreleased features as part of the contractual agreement unless otherwise stated.* + +## TiDB kernel + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
DomainFeatureDescription
Scalability & Stability
  • ✅ Optimize resource isolation in heavy read scenarios.
  • ✅ Optimize resource isolation in heavy (batch) write scenarios.
  • Provide resource management capability for background process.
  • Support resource management framework.
  • Provide a basic resource management and control framework to effectively control the resource squeeze of background tasks on front-end tasks (user operations), and improve cluster stability.
  • Refine resource management in the multi-service aggregation scenario.
Enhance the plan cache feature.
  • Support in-session subquery, expression index, and prepared plan cache for partitions, which expands the usage scenarios of plan cache.
  • Support plan cache for general SQL statements in a session to save cache resources, improve the hit rate of general execution plans, and improve SQL performance.
  • Support cross-session plan cache, save cache resources, improve the hit rate of general execution plans, and improve SQL performance. In general scenarios, reusing execution plans can improve memory utilization and to achieve higher throughputs.
Support dynamic region.Support dynamic region size adjustment (heterogeneous) and huge region size for scenarios with fast business growth and a large amount of data.
SQLSupport the JSON function.
  • ✅ Expression index
  • Multi-value index
  • ✅ TiFlash supports JSON function pushdown
In business scenarios that require flexible schema definitions, the application can use JSON to store information for ODS, transaction indicators, commodities, game characters, and props.
  • ✅ Support cluster-level flashback.
  • ✅ Support database-level flashback.
In game rollback scenarios, the flashback can be used to achieve a fast rollback of the current cluster. This solves the common problems in the gaming industry such as version errors and bugs.
✅ Support time to live (TTL).This feature enables automatic data cleanup in limited data archiving scenarios.
Support foreign key constraints.Supports foreign key constraints compatible with MySQL syntax, and provides DB-level referential integrity check capabilities.
✅ Support non-transactional DML for insert and update operations.
  • Implement a DDL parallel execution framework.
  • Provide DDL pause/resume capability.
Implement a distributed parallel DDL execution framework, so that DDL tasks executed by only one TiDB Owner node can be coordinated and executed by all TiDB nodes in the cluster. Improve the execution speed of DDL tasks and cluster resource utilization.
By converting the execution of DDL tasks to distributed mode, this feature accelerates the execution speed of DDL tasks and improves the utilization of computing resources in the entire cluster. At present, DDL tasks that need to improve the speed include large table indexing and lossy column type modification tasks.
Hybrid Transactional and Analytical Processing (HTAP)✅ Support TiFlash result write-back.

Support INSERT INTO SELECT.

  • Easily write analysis results in TiFlash back to TiDB.
  • Provide complete ACID transactions, more convenient and reliable than general ETL solutions.
  • Set a hard limit on the threshold of intermediate result size, and report an error if the threshold is exceeded.
  • Support fully distributed transactions, and remove or relax the limit on the intermediate result size.

These features combined enable a way to materialize intermediate results. The analysis results can be easily reused, which reduces unnecessary ad-hoc queries, improves the performance of BI and other applications (by pulling results directly) and reduces system load (by avoiding duplicated computation), thereby improving the overall data pipeline efficiency and reducing costs. It will make TiFlash an online service.

✅ Support FastScan for TiFlash.
  • FastScan provides weak consistency but faster table scan capability.
  • Further optimize the join order, shuffle, and exchange algorithms to improve computing efficiency and boost performance for complex queries.
  • Add a fine-grained data sharding mechanism to optimize the COUNT(DISTINCT) function and high cardinality aggregation.

This feature improves the basic computing capability of TiFlash, and optimizes the performance and reliability of the underlying algorithms of the columnar storage and MPP engine.

MaintenanceSupport rule-based SQL blocklist.In multi-service aggregation scenarios, provide SQL management and control capabilities, and improve cluster stability by prohibiting high-resource-consuming SQL statements.
+ +## Diagnosis and maintenance + + + + + + + + + + + + + + + + + + + + +
DomainFeatureDescription
SQL tuning for HTAP workloadsProvide SQL execution information from the perspective of applications.Provide a dashboard that displays a SQL execution overview from the perspective of applications in HTAP workloads.
Provide suggestions on optimizing SQL for TiFlash and TiKV in HTAP workloads.For one or several HTAP scenarios, provide suggestions on SQL optimization.
+ +## Data backup and migration + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
DomainFeatureDescription
Backup and restoreAWS EBS or GCP persistent disk snapshot-based backup and restore.Support backup and restore based on AWS EBS or GCP persistent disk snapshots.
Point-in-time recovery (PITR)Table-level and database-level PITR.BR supports table-level or database-level PITR.
Data replication to downstream systems via TiCDCImprove TiCDC scalability and reduce replication latency.Increase TiCDC's scalability by spanning data changes for single table to multiple TiCDC nodes and reduce replication latency by removing sorting stage.
✅ Support replicating data to object storage such as S3.TiCDC supports replicating data changes to common object storage services.
Data migration✅ Continuous data verification during data migration.DM supports online data verification during migration from MySQL compatible database to TiDB.
+ +## Security + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
DomainFeatureDescription
Password complexity check✅ A strong password is required.To improve security, empty passwords and weak passwords are not allowed.
The required password length is not less than 8. The password must contain an uppercase letter, a lowercase letter, a number, and a character.
Password expiration✅ TiDB provides password expiration management and requires users to change passwords regularly.Reduce the security risk of password cracking or leakage caused by using the same password for a long time.
Password reuse policy✅ TiDB provides a password reuse policy.Restrict password reuse and improve password security.
Password anti-brute force cracking✅ Accounts will be locked in case of consecutive incorrect passwords.Lock the account under continuous wrong passwords to prevent the password from being cracked by brute force.
Log redaction
  • Support data redaction in execution plans in TiDB Dashboard.
  • Enhance data redaction in TiDB-related logs.
Redact sensitive information in execution plans and various logs to enhance the security of user data.
Column-level access controlTiDB supports column-level privilege management.TiDB already supports cluster-level, database-level, and table-level privilege management. On top of that, TiDB will support column-level privilege management to meet the principle of least privilege and provide fine-grained data access control.
Audit logging capability enhancementSupport configurable audit log policies, configurable audit filters (filter by objects, users, and operation types), and visual access to audit logs.Improve the completeness and usability of the audit log feature.