Update/main #104974

benwtrent · 2024-01-31T13:29:52Z

Updates lucene_snapshot with main, there was a minor conflict as main was updated with 9.9.2 and the previous merge failed.

This commit upgrades to Lucene 9.9.2.

Clarify that in this situation there is a rebalancing move that would improve the cluster balance, but there's some reason why rebalancing is not happening. Also points at the `can_rebalance_cluster_decisions` as well as the node-by-node decisions since the action needed could be described in either place.

This change fixes the engine to apply the current codec when retrieving documents from the translog. We need to use the same codec than the main index in order to ensure that all the source data is indexable. The internal codec treats some fields differently than the default one, for instance dense_vectors are limited to 1024 dimensions. This PR ensures that these customizations are applied when indexing document for translog retrieval. Closes elastic#104639 Co-authored-by: Elastic Machine <[email protected]>

* Add DELETE endpoint for /_connector/_secret/{id} * Add endpoint to write_connector_secrets cluster privilege

This commit merges Aggregations into InternalAggregations in order to remove the unnecessary hierarchy.

* [Profiling] Add the number of cores to HostMetadata * Update AWS pricelist (remove cost_factor, add usd_per_hour) * Switch cost calculations from 'cost_factor' to 'usd_per_hour' * Remove superfluous CostEntry.toXContent() * Check for Number type in CostEntry.fromSource() * Add comment

During a promotable relocation, a `get_from_translog` sent by the unpromotable shard to handle a real-time get might encounter `ShardNotFoundException` or `IndexNotFoundException`. In these cases, we should retry. This is just for `GET`. I'll open a second PR for `mGET`. The relevant IT is in the Stateless PR. Relates ES-5727

…quest() method always returns the same object (elastic#104881)

…y API index (elastic#104916) * [DOCS] Adds get setting and update settings asciidoc files to security API index. * [DOCS] Fixes references in docs.

…#104891) SearchStats#count incorrectly counts the number of documents (or rows) in which a document appears instead of the actual number of values. This PR fixes this by looking at the term frequency instead of the doc count. Fix elastic#104795

…4930) This fixes a javadoc comment that was broken by elastic#104881

…elastic#104324)

…lastic#104625) This adds support for the `type` parameter, for sorting, to the Query API key API. The type for an API Key can currently be either `rest` or `cross_cluster`. This was overlooked in elastic#103695 when support for the `type` parameter was first introduced only for querying.

)

This adds support for the `match` query type to the Query API key Information API. Note that since string values associated to API Keys are mapped as `keywords`, a `match` query with no analyzer parameter is effectively equivalent to a `term` query for such fields (e.g. `name`, `username`, `realm_name`). Relates: elastic#101691

elastic#104909)

Today, we allow ESQL to execute against an unlimited number of shards concurrently on each node. This can lead to cases where we open and hold too many shards, equivalent to opening too many file descriptors or using too much memory for FieldInfos in ValuesSourceReaderOperator. This change limits the number of concurrent shards to 10 per node. This number was chosen based on the _search API, which limits it to 5. Besides the primary reason stated above, this change has other implications: We might execute fewer shards for queries with LIMIT only, leading to scenarios where we execute only some high-priority shards then stop. For now, we don't have a partial reduce at the node level, but if we introduce one in the future, it might not be as efficient as executing all shards at the same time. There are pauses between batches because batches are executed sequentially one by one. However, I believe the performance of queries executing against many shards (after can_match) is less important than resiliency. Closes elastic#103666

* Document nested expressions for stats * More docs * Apply suggestions from review - count-distinct.asciidoc - Content restructured, moving the section about approximate counts to end of doc. - count.asciidoc - Clarified that omitting the `expression` parameter in `COUNT` is equivalent to `COUNT(*)`, which counts the number of rows. - percentile.asciidoc - Moved the note about `PERCENTILE` being approximate and non-deterministic to end of doc. - stats.asciidoc - Clarified the `STATS` command - Added a note indicating that individual `null` values are skipped during aggregation * Comment out mentioning a buggy behavior * Update sum with inline function example, update test file * Fix typo * Delete line * Simplify wording * Fix conflict fix typo --------- Co-authored-by: Liam Thompson <[email protected]> Co-authored-by: Liam Thompson <[email protected]>

* Pushing input type through to cohere request * switching logic to allow request to always override * Fixing failure * Removing getModelId calls * Addressing feedback * Switching to enumset

…nd TermsGroupByIT (elastic#104898)

…stic#104927) This introduces a second implementation of RequestBuilder (elastic#104778). As opposed to ActionRequestBuilder, ActionRequestLazyBuilder does not create its request until the request() method is called, and does not hold onto that request (so each call to request() gets a new request instance). This PR also updates BulkRequestBuilder to inherit from ActionRequestLazyBuilder as an example of its use.

…or of the telemetry.* settings (elastic#104917)

…04935)

…upport for VERSION (elastic#104911)

…c#104615)

Our readEnum code instantiates/clones enum value arrays on read. Normally, this doesn't matter much but the two spots adjusted here are visibly hot during bulk indexing, causing GBs of allocations during e.g. the http_logs indexing run.

Fix pushed down filters for binary comparisons that compare a byte/short/int/long with an out of range value, like WHERE some_int_field < 1E300.

…104966)

…lastic#104928) We have functions that generate lucene geometries scattered in different places of the code. This commit moves everything under a utility class.

github-actions · 2024-01-31T13:30:03Z

Documentation preview:

✨ Changed pages

elasticsearchmachine · 2024-01-31T13:30:16Z

Pinging @elastic/es-search (Team:Search)

thecoop and others added 30 commits January 30, 2024 11:02

Change release version lookup to an instance method (elastic#104902)

1395edf

Upgrade to Lucene 9.9.2 (elastic#104753)

920beee

This commit upgrades to Lucene 9.9.2.

[Connector Secrets] Add delete API endpoint (elastic#104815)

50afaad

* Add DELETE endpoint for /_connector/_secret/{id} * Add endpoint to write_connector_secrets cluster privilege

Merge Aggregations into InternalAggregations (elastic#104896)

7c8bb14

This commit merges Aggregations into InternalAggregations in order to remove the unnecessary hierarchy.

indicating fix for 8.12.1 for int8_hnsw (elastic#104912)

332dd8c

Removing the assumption from some tests that the request builder's re…

6c9551a

…quest() method always returns the same object (elastic#104881)

[DOCS] Adds get setting and update settings asciidoc files to securit…

79d6c3e

…y API index (elastic#104916) * [DOCS] Adds get setting and update settings asciidoc files to security API index. * [DOCS] Fixes references in docs.

Reuse APMMeterService of APMTelemetryProvider (elastic#104906)

a3b1d86

Mute more tests that tend to leak searchhits (elastic#104922)

4567a84

Adding request source for cohere (elastic#104926)

422e6f6

Fixing a broken javadoc comment in ReindexDocumentationIT (elastic#10…

e74a79f

…4930) This fixes a javadoc comment that was broken by elastic#104881

Fix enabling / disabling of APM agent "recording" in APMAgentSettings (…

9ea187d

…elastic#104324)

Apply publish plugin to es-opensaml-security-api project (elastic#104933

0623eb0

)

[Connectors API] Relax strict response parsing for get/list operations (

d2d28ec

elastic#104909)

[ML] Passing input type through to cohere request (elastic#104781)

92dd213

* Pushing input type through to cohere request * switching logic to allow request to always override * Fixing failure * Removing getModelId calls * Addressing feedback * Switching to enumset

[Transform] Unmute 2 remaining continuous tests: HistogramGroupByIT a…

a4ddd32

…nd TermsGroupByIT (elastic#104898)

Update versions to skip after backport to 8.12 (elastic#104953)

4c44633

Update/Cleanup references to old tracing.apm.* legacy settings in fav…

dbf59c5

…or of the telemetry.* settings (elastic#104917)

Exclude tests that do not work in a mixed cluster scenario (elastic#1…

7764fdb

…04935)

ES|QL: Improve type validation in aggs for UNSIGNED_LONG and better s…

1daa324

…upport for VERSION (elastic#104911)

jedrazb and others added 7 commits January 31, 2024 11:25

[Connector API] Make update configuration action non-additive (elasti…

9589496

…c#104615)

ESQL: Correct out-of-range filter pushdowns (elastic#99961)

1da5f99

Fix pushed down filters for binary comparisons that compare a byte/short/int/long with an out of range value, like WHERE some_int_field < 1E300.

[DOCS] Dense vector element type should be float for OpenAI (elastic#…

2cbe23a

…104966)

Fix test assertions (elastic#104963)

5013ea4

Move functions that generate lucene geometries under a utility class (e…

3821745

…lastic#104928) We have functions that generate lucene geometries scattered in different places of the code. This commit moves everything under a utility class.

Merge branch 'main' into update/main

287f1a3

benwtrent added >non-issue :Search/Search Search-related issues that do not fall into other categories auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) labels Jan 31, 2024

benwtrent requested a review from a team as a code owner January 31, 2024 13:29

elasticsearchmachine added the Team:Search Meta label for search team label Jan 31, 2024

fixing index versions

477eae8

benwtrent merged commit 47ca7ae into elastic:lucene_snapshot Jan 31, 2024
14 of 15 checks passed

benwtrent deleted the update/main branch January 31, 2024 16:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update/main #104974

Update/main #104974

benwtrent commented Jan 31, 2024

github-actions bot commented Jan 31, 2024

elasticsearchmachine commented Jan 31, 2024

Update/main #104974

Update/main #104974

Conversation

benwtrent commented Jan 31, 2024

github-actions bot commented Jan 31, 2024

elasticsearchmachine commented Jan 31, 2024