Add specification for `SegmentMax-16` #28103

p-wysocki · 2024-12-17T15:08:03Z

Details:

Specification for tf.math.segment_max (https://www.tensorflow.org/api_docs/python/tf/math/segment_max)

Tickets:

CVS-158914

Signed-off-by: p-wysocki <[email protected]>

rkazants

Let us have EmbeddingSegmentsMax similar to EmbeddingSegmentsSum.
It should also have default index (defining default value for empty segment)

rkazants · 2024-12-17T18:51:48Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+* Segment_4: ``[]``
+* Segment_5: ``[data[6], data[7]]``
+
+When there are no values in a segment, ``output[segment]`` is set to 0.


we should have default value for empty segments, otherwise, we will have additional computation graph (that is not trivial) to compute empty segments and replace zero value

The default value seems to be 0, according to https://www.tensorflow.org/api_docs/python/tf/raw_ops/SegmentMax. I don't think we should expand the op on our own, especially since we only expect it to come from TF FE.

There is also V2 https://www.tensorflow.org/api_docs/python/tf/raw_ops/SegmentMaxV2 where the default has been changed to numeric_limits<T>::lowest(). Adding attribute for default value seems to be a simple solution to support both cases, but to enable V2 at once we would also need to consider "num_segments" input.

rkazants · 2024-12-17T18:52:26Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+
+* **1**: *data*
+
+  * **Description**: The numerical data on which SegmentMax operation will be performed. **Required.**


please define input shapes and output shape for each input and describe what dimensions are equal

data can have any rank and dimensions, so it's described as ND of any numerical type. segment_ids are specified to be a 1D tensor of non-negative, sorted integer numbers of size equal to the size of the first dimension of the input tensor.

Could you please specify what's missing? I think the shapes are covered, but I may be missing something.

p-wysocki · 2024-12-18T12:34:19Z

Let us have EmbeddingSegmentsMax similar to EmbeddingSegmentsSum.

EmbeddingSegmentX is for sparse inputs, while SegmentMax I'm implementing (to unlock some models) accepts dense inputs, so I don't think it should be added as EmbeddingSegmentMax. I'm moving the discussion to internal channels, if it results in changes, I'll apply them to the PR.

Signed-off-by: p-wysocki <[email protected]>

mitruska · 2025-01-07T11:52:21Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+
+**Outputs**
+
+* **1**: The output tensor of type *T* and the same shape as the ``input`` tensor with the exception for the first dimension, which is equal to the count of unique segment IDs.


Suggested change

* **1**: The output tensor of type *T* and the same shape as the ``input`` tensor with the exception for the first dimension, which is equal to the count of unique segment IDs.

* **1**: The output tensor of type *T* and almost the same shape as the ``data`` input tensor with the exception for the first dimension, which is equal to the count of unique segment IDs (calculated as ``max(segment_ids) + 1``).

Maybe instead almost use,

The output tensor has same rank and dimensions as the ``data`` input tensor except first dimension which is calculated as ``max(segment_ids) + 1``

?

mitruska · 2025-01-07T12:28:11Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+* Segment_4: ``[]``
+* Segment_5: ``[data[6], data[7]]``
+
+When there are no values in a segment, ``output[segment]`` is set to 0.


There is also V2 https://www.tensorflow.org/api_docs/python/tf/raw_ops/SegmentMaxV2 where the default has been changed to numeric_limits<T>::lowest(). Adding attribute for default value seems to be a simple solution to support both cases, but to enable V2 at once we would also need to consider "num_segments" input.

mitruska · 2025-01-07T12:37:33Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+* **2**: *segment_ids*
+
+  * **Description**: Controls how the data is divided into segments. **Required.**
+  * **Range of values**: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the input tensor.
+  * **Type**: *T_IDX*
+
+**Outputs**
+
+* **1**: The output tensor of type *T* and the same shape as the ``input`` tensor with the exception for the first dimension, which is equal to the count of unique segment IDs.


The style of the "Inputs" section description follows rather the "Attributes" style.
Consider alignment with other spec documents.

mitruska · 2025-01-07T12:48:53Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+* **2**: *segment_ids*
+
+  * **Description**: Controls how the data is divided into segments. **Required.**
+  * **Range of values**: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the input tensor.


I can see that unsorted segment_ids may lead to error or undefined behavior (implementation specific, depends on the hardware).
Should we specify a common behavior for OV op?
Can be clarified at the plugin implementation stage.

mitruska · 2025-01-07T12:50:50Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+* **2**: *segment_ids*
+
+  * **Description**: Controls how the data is divided into segments. **Required.**
+  * **Range of values**: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the input tensor.


Suggested change

* **Range of values**: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the input tensor.

* **Range of values**: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the ``data`` input tensor.

praasz · 2025-01-13T08:09:59Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+SegmentMax
+===================


Suggested change

SegmentMax

===================

SegmentMax

==========

Should number = be same as heading length?

praasz · 2025-01-13T08:18:27Z

...ocumentation/openvino-ir-format/operation-sets/operation-specs/arithmetic/segment-max-16.rst

+
+**Outputs**
+
+* **1**: The output tensor of type *T* and the same shape as the ``input`` tensor with the exception for the first dimension, which is equal to the count of unique segment IDs.


Maybe instead almost use,

The output tensor has same rank and dimensions as the ``data`` input tensor except first dimension which is calculated as ``max(segment_ids) + 1``

?

p-wysocki added 2 commits December 17, 2024 16:03

Add spec

2ad30f1

Signed-off-by: p-wysocki <[email protected]>

Minor changes

fd26731

Signed-off-by: p-wysocki <[email protected]>

p-wysocki requested review from mitruska, mmikolajcz and PiotrKrzem December 17, 2024 15:08

p-wysocki requested a review from a team as a code owner December 17, 2024 15:08

p-wysocki requested review from zKulesza and removed request for a team December 17, 2024 15:08

github-actions bot added the category: docs OpenVINO documentation label Dec 17, 2024

rkazants requested changes Dec 17, 2024

View reviewed changes

rkazants reviewed Dec 17, 2024

View reviewed changes

p-wysocki requested a review from rkazants December 18, 2024 12:34

Typos

5dea327

Signed-off-by: p-wysocki <[email protected]>

mitruska reviewed Jan 7, 2025

View reviewed changes

praasz reviewed Jan 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add specification for `SegmentMax-16` #28103

Add specification for `SegmentMax-16` #28103

p-wysocki commented Dec 17, 2024

rkazants left a comment

rkazants Dec 17, 2024

p-wysocki Dec 18, 2024 •

edited

Loading

mitruska Jan 7, 2025

rkazants Dec 17, 2024

p-wysocki Dec 18, 2024 •

edited

Loading

p-wysocki commented Dec 18, 2024

mitruska Jan 7, 2025

praasz Jan 13, 2025

mitruska Jan 7, 2025

mitruska Jan 7, 2025

mitruska Jan 7, 2025

mitruska Jan 7, 2025

praasz Jan 13, 2025

praasz Jan 13, 2025


		* 1: data

		* Description: The numerical data on which SegmentMax operation will be performed. Required.


		Outputs

		* 1: The output tensor of type T and the same shape as the ``input`` tensor with the exception for the first dimension, which is equal to the count of unique segment IDs.

	* Range of values: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the input tensor.
	* Range of values: 1D tensor of non-negative, sorted integer numbers. Its size is equal to the size of the first dimension of the ``data`` input tensor.

Add specification for SegmentMax-16 #28103

Are you sure you want to change the base?

Add specification for SegmentMax-16 #28103

Conversation

p-wysocki commented Dec 17, 2024

Details:

Tickets:

rkazants left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

p-wysocki Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

p-wysocki Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

p-wysocki commented Dec 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Add specification for `SegmentMax-16` #28103

Add specification for `SegmentMax-16` #28103

p-wysocki Dec 18, 2024 •

edited

Loading

p-wysocki Dec 18, 2024 •

edited

Loading