Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add release notes for 0.252 #16013

Merged
merged 1 commit into from
May 4, 2021

Conversation

vaishnavibatni
Copy link
Contributor

@vaishnavibatni vaishnavibatni commented Apr 27, 2021

Missing Release Notes

Andrii Rosa

Chi Tsai

Tal Galili

Extracted Release Notes

All Commits

  • 99dcd76 Add version to FunctionMetadata (Rongrong Zhong)
  • b496843 Add support for returning partial results (Nikhil Collooru)
  • bb50c29 Add ability to read session properties from HdfsContext (Bhavani Hari)
  • 130863f Refactor planner to use FunctionImplementationType.isExternal (Rongrong Zhong)
  • 82833b8 Remove redundant projection added in HashGenerationOptimizer (Rongrong Zhong)
  • 421d7f6 Do not create local projection for constant in PlanRemoteProjections (Rongrong Zhong)
  • 53ebcda Use segmented Slice in SliceDictionaryWriter (Arunachalam Thirupathi)
  • 3e0c6a0 Handle casting to HiveSplit (Rohit Jain)
  • 2cfbf87 Fix MAP_UNION aggregate function to support ORDERY BY clause (Naveen Kumar Mahadevuni)
  • 3c8c4ec Fix code bug with param wrongly masking a field (Sreeni Viswanadha)
  • a34cf43 Bring retry queries to the beginning of the queue (James Sun)
  • 2a2ac84 Handle duplicated retry requests (James Sun)
  • 338aeb2 Add tests for query retry (James Sun)
  • 0fe4236 Support query retry only for auto commit transactions (James Sun)
  • 68f6cff Throw error upon purged retryable queries (James Sun)
  • ad51d4c Add query retry local limit enforcement (James Sun)
  • 2919bb9 Add query retry logic for transient failures (James Sun)
  • cce720c Add retriable flag to error code (James Sun)
  • 28dcba7 Remove unused timeoutExecutor for queued queries (James Sun)
  • 6a1d1d7 Adding the chi square distribution, which is central to many statistical procedures (https://en.wikipedia.org/wiki/Chi-square_distribution) (Main task for statistical functions in presto hackathon #15798) (Tal Galili)
  • 69a5216 Extend join optimizations based on empty source. (Ahmad Ghazal)
  • 6c8b45c Catch Exception in MemoryRevokingSchedulers (Rebecca Schlussel)
  • 751e2e3 Add per-query limit based spilling strategy (Rebecca Schlussel)
  • 6c09beb Move getMemoryAlreadyBeingRevoked to utility class (Rebecca Schlussel)
  • 8269d72 Refactor TaskRevocableMemoryListener (Rebecca Schlussel)
  • b6c170e Refactor MemoryPoolListener (Rebecca Schlussel)
  • 24243ad Support logging formatted prepared query (Shixuan Fan)
  • 60edac5 Update /v1/info/state API (Dongliang Chen)
  • 7c8b6f5 Enable timeout by default in Alluxio (Ke Wang)
  • 3e850ba Fix HttpRemoteTask task stats and queue space update order (James Petty)
  • a7f7a35 Rename some tests to follow naming convention (Rebecca Schlussel)
  • f05484a Reenable join spill test (Rebecca Schlussel)
  • 848f735 Fix join spill race condition (Rebecca Schlussel)
  • 46cd0e8 Only set join operators count if running with spill (Rebecca Schlussel)
  • 83fe8e7 Implement toString() for JDBC driver Array result (Junyi Huang)
  • f084ff4 Release notes for 0.251 (Costin V Cozianu)
  • 2dac158 Upgrade to drift 1.33 (Mayank Garg)
  • 413584a Don't rewrite IF to CASE (Sreeni Viswanadha)
  • d0e46d5 Upgrade zstd-jni to v1.4.9-2 (Chen Li)
  • 543567c Fix order inconsistency between Hive Metastore partition information and actual partition values (sophiashang)
  • b1306fc Add Drift serialization compatibility for NodeStatus (Tim Meehan)
  • 3ab2e8e Add Drift serialization compatibility for session functions (Tim Meehan)
  • 8c0b1ff Add support for passthrough Elasticsearch queries (v-jizhang)
  • 3513d07 Revert coordinator handling of out-of-task scope error categorization (tanjialiang)
  • 33cdd9b Optimize partitioned table write for Presto on Spark (Andrii Rosa)
  • d35dbfe Refactor PrestoSparkRddFactory (Andrii Rosa)
  • 75ec89c Remove duplicate logger setting (Andrii Rosa)
  • 7aa7331 Reformat TestPrestoSparkQueryRunner (Andrii Rosa)
  • e4c9e77 Run join, order by and window tests with Presto on Spark (Andrii Rosa)
  • 5060556 Minor refactor of Presto on Spark (Vic Zhang)
  • d1f947e Fix Presto on Spark query logging (Vic Zhang)
  • 32cd62a Reenable Presto on Spark integration Docker based test (Andrii Rosa)
  • 94942fb Fix Presto on Spark failure during startup (Andrii Rosa)
  • 8803df3 Fix evictor type for Alluxio cache (Bin Fan)
  • 4078f50 Add file based password authenticator documentation (imjalpreet)
  • c64f2a6 Add file based password authenticator plugin (imjalpreet)
  • b5b6ccd Print query id for Presto on Spark query (Vic Zhang)
  • 7f1395e Refactor SerializedPage checksum (Andrii Rosa)
  • 8c464c7 Fix spooling output buffer unit test (Chi Tsai)

@arhimondr
Copy link
Member

Presto on Spark Changes

* Fix a bug when Presto on Spark doesn't start because the temporary storage is not initialized

@rongrong
Copy link
Contributor

General Changes
* Add functions :func:`chisquared_cdf` and :func:`inverse_chisquared_cdf`


General Changes
_______________
* Fix hanging join operator when spill is enabled and the probe side finishes before the hash builder starts.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix the join operator to prevent hanging when spill is enabled and the probe side finishes before the hash builder starts

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

*spilling is enabled

presto-docs/src/main/sphinx/release/release-0.252.rst Outdated Show resolved Hide resolved
_______________
* Fix hanging join operator when spill is enabled and the probe side finishes before the hash builder starts.
* Add file based password authenticator plugin.
* Add new spilling strategy to spill when the user + system + revocable memory on a node exceeds max_total_memory_per_node. This can be enabled by setting ``experimental.spiller.task-spilling-strategy`` to ``PER_QUERY_MEMORY_LIMIT``.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``. This can be enabled by setting the system configuration ``....`` to ``...``.

* Fix hanging join operator when spill is enabled and the probe side finishes before the hash builder starts.
* Add file based password authenticator plugin.
* Add new spilling strategy to spill when the user + system + revocable memory on a node exceeds max_total_memory_per_node. This can be enabled by setting ``experimental.spiller.task-spilling-strategy`` to ``PER_QUERY_MEMORY_LIMIT``.
* Add support for logging formatted prepared query. This could be enabled by setting session property ``log_formatted_query_enabled`` to ``true``.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can be enabled ...

* Add file based password authenticator plugin.
* Add new spilling strategy to spill when the user + system + revocable memory on a node exceeds max_total_memory_per_node. This can be enabled by setting ``experimental.spiller.task-spilling-strategy`` to ``PER_QUERY_MEMORY_LIMIT``.
* Add support for logging formatted prepared query. This could be enabled by setting session property ``log_formatted_query_enabled`` to ``true``.
* Add support for passthrough Elasticsearch queries.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Move to elasticsearch connector

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should frame this better -

Add support for running Presto queries over the results of Elasticsearch queries. See :doc:`connector/elasticsearch` on how to use it

@aweisberg to confirm if I rephrased this correctly

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes sort of. We already supported it. The difference is these are passthrough in that you don't need to escape the query contents because they are base32 encoded.

Maybe

Add support for running base32 encoding ElasticSearch queries that are passed through. See :doc:`connector/elasticsearch` on how to use it

* Add support for returning partial results for the queries by setting `partial_results_enabled` session property. Additionally `partial_results_max_execution_time_multiplier`, `partial_results_completion_ratio_threshold` session properties can be set to configure the max execution time multiplier and minimum completion ratio threshold for the queries.
* Add automatic query retry functionality for transient failures. This can be enabled by setting per-query-retry-limit to a non-zero integer to indicate the per query retry count.
* Store dictionary elements in Segmented Slice.
* Update /v1/info/state API to mark coordinator active when 1) state is not shutting down, 2) active workers >= minimum required workers.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add support to coordinator endpoint ``/v1/info/state`` to return ``ACTIVE`` when the coordinator is not shutting down and the cluster has the minimum required workers.


Hive Changes
____________
* Enable timeout by default in Alluxio.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Update Alluxio cache config property ``cache.alluxio.timeout-enabled`` to be ``true`` by default.


JDBC Changes
____________
* Implement ``toString()`` for ``java.sql.Array`` results.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add support to serialize ``PrestoArray`` to a string.

Presto On Spark Changes
_______________________
* Improve bucketed table write parallelism.
* Fix a bug when Presto on Spark doesn't start because the temporary storage is not initialized
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

End with .


General Changes
_______________
* Fix hanging join operator when spill is enabled and the probe side finishes before the hash builder starts.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

*spilling is enabled

presto-docs/src/main/sphinx/release/release-0.252.rst Outdated Show resolved Hide resolved
presto-docs/src/main/sphinx/release/release-0.252.rst Outdated Show resolved Hide resolved
**Highlights**
==============
* Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``.
* Add support for logging formatted prepared query.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this highlight worthy ?


**Highlights**
==============
* Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this highlight worthy?

==============
* Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``.
* Add support for logging formatted prepared query.
* Add support to coordinator endpoint ``/v1/info/state`` to return ``ACTIVE`` when the coordinator is not shutting down and the cluster has the minimum required workers.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this highlight worthy ?

For me highlights is something significantly new. These things are listed in the section just below - so we don't need to highlight everything.

* Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``.
* Add support for logging formatted prepared query.
* Add support to coordinator endpoint ``/v1/info/state`` to return ``ACTIVE`` when the coordinator is not shutting down and the cluster has the minimum required workers.
* Add support for returning partial results for the queries and max execution time multiplier and minimum completion ratio threshold for the queries.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add support for returning partial results should be enough

presto-docs/src/main/sphinx/release/release-0.252.rst Outdated Show resolved Hide resolved
presto-docs/src/main/sphinx/release/release-0.252.rst Outdated Show resolved Hide resolved
==============
* Add support for returning partial results.
* Add automatic query retry functionality for transient failures.
* Add support for running base32 encoding ElasticSearch queries that are passed through.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix this one as well

* Fix the join operator to prevent hanging when spilling is enabled and the probe side finishes before the hash builder starts
* Add new spilling strategy to spill when a combination of user, system and recovable memory on a node exceeds ``max_total_memory_per_node``. This can be enabled by setting the system configuration ``experimental.spiller.task-spilling-strategy`` to ``PER_QUERY_MEMORY_LIMIT``.
* Add support for logging formatted prepared query. This can be enabled by setting session property ``log_formatted_query_enabled`` to ``true``.
* Add support for returning partial results for the queries by setting ``partial_results_enabled`` session property. Additionally ``partial_results_max_execution_time_multiplier``, ``partial_results_completion_ratio_threshold`` session properties can be set to configure the max execution time multiplier and minimum completion ratio threshold for the queries.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: there is an extra space before session properties can be


General Changes
_______________
* Fix the join operator to prevent hanging when spilling is enabled and the probe side finishes before the hash builder starts
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

end with .


Security Changes
________________
* Add file based password authenticator plugin. See :doc:`security/password-file`.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this URI also probably needs fixing. Please generate the HTML pages and check that everything looks correct

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you check if this works ? I don't think that this is correct

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I'm sorry - I missed adding the change to the commit before pushing it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It works now.

@vaishnavibatni vaishnavibatni force-pushed the release-notes-0.252 branch 3 times, most recently from f2d98f1 to 6bf1854 Compare May 3, 2021 23:06
@mbasmanova mbasmanova merged commit 047dae0 into prestodb:master May 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants