[Spec] Add alignment mode description and Mersenne-Twister description to RandomUniform-8 #26142

Merged

RandomUniform

**Detailed description**:

*RandomUniform* operation generates random numbers from a uniform distribution in the range ``[minval, maxval)``.
The generation algorithm is based on an underlying random integer generator that uses either the Philox or the Mersenne-Twister algorithm.
Philox is a counter-based pseudo-random generator, while Mersenne-Twister is a state-based generator; both produce uint32 values. A single Philox
invocation returns four random values derived from the given *key* and *counter*, which are initialized with the *global_seed* and *op_seed*
attributes respectively. Mersenne-Twister instead maintains a single *state* array, which is initialized using *global_seed* only.

Algorithm selection makes it possible to align the output of OpenVINO's *RandomUniform* operation with the corresponding operations in TensorFlow and PyTorch.
The *alignment* attribute selects which framework the output should be aligned with. TensorFlow uses the Philox algorithm and PyTorch uses the Mersenne-Twister algorithm.
For TensorFlow, this operation is equivalent to tf.raw_ops.RandomUniform(shape, dtype, global_seed, op_seed) when dtype represents a real number, and to tf.raw_ops.RandomUniformInt(shape, min_val, max_val, dtype, global_seed, op_seed) for integer types. Internally, both of these functions are executed by tf.random.uniform(shape, min_val, max_val, dtype, global_seed, op_seed), where for a floating-point dtype the output goes through an additional conversion to fall within the given range.
For PyTorch, this operation is equivalent to torch.Tensor(shape, dtype).uniform_(min_val, max_val) when dtype represents a real number, and to torch.Tensor(shape, dtype).random_(min_val, max_val) for integer types. Internally, both of these functions are executed by torch.rand(shape, dtype) with the default generator and layout. The seed of these functions is provided by calling torch.manual_seed(global_seed); the op_seed value is ignored.
By default, the output is aligned with TensorFlow (the Philox algorithm), which keeps this behavior backward compatible.
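
As an illustrative sketch only (the seed values and shape below are arbitrary, not part of the specification), the two alignment modes roughly correspond to the following framework calls:

.. code-block:: python

   import tensorflow as tf
   import torch

   # alignment = TENSORFLOW: Philox, keyed by (global_seed, op_seed).
   tf_out = tf.raw_ops.RandomUniform(shape=[2, 3], dtype=tf.float32,
                                     seed=150, seed2=10)

   # alignment = PYTORCH: Mersenne-Twister, seeded by global_seed only;
   # op_seed has no PyTorch counterpart and is ignored.
   torch.manual_seed(150)
   pt_out = torch.empty(2, 3, dtype=torch.float32).uniform_(0.0, 1.0)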

If both seed values are equal to zero, RandomUniform generates a non-deterministic sequence.

**Philox Algorithm Explanation**:

.. math::

For integer values:
where *x* is a uint32 random value.


Example 1. *RandomUniform* output with ``global_seed`` = 150, ``op_seed`` = 10, ``output_type`` = f32, ``alignment`` = TENSORFLOW:

.. code-block:: xml
:force:
[0.5197197 0.22727466 0.991374 ]]


Example 2. *RandomUniform* output with ``global_seed`` = 80, ``op_seed`` = 100, ``output_type`` = double, ``alignment`` = TENSORFLOW:

.. code-block:: xml
:force:
[2.67008206 2.36423758]]


Example 3. *RandomUniform* output with ``global_seed`` = 80, ``op_seed`` = 100, ``output_type`` = i32, ``alignment`` = TENSORFLOW:

.. code-block:: xml
:force:
output = [[65 70 56]
[59 82 92]]

-------------------------------------------------------

**Mersenne-Twister Algorithm Explanation**:

Link to the original paper: `Mersenne Twister: a 623-dimensionally equidistributed uniform pseudo-random number generator <https://dl.acm.org/doi/10.1145/272991.272995>`__.

The Mersenne-Twister algorithm generates random numbers by initializing a state array with a seed and then iterating through a series of transformations.
In the formulas below, *i* denotes the index of an element in the state array.

The initial state array is generated recursively using the following formula:

.. math::

   state[0] = global\_seed \& 0xffffffff \\
   state[i] = (1812433253 * (state[i-1] \oplus (state[i-1] >> 30)) + i) \& 0xffffffff

where *i* cannot exceed 623 (the state array holds 624 values, indexed from 0).
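
A minimal Python sketch of this initialization (the function name is hypothetical; 624 is the MT19937 state size):

.. code-block:: python

   def mt19937_init(global_seed: int) -> list:
       """Build the 624-element state array from global_seed."""
       state = [global_seed & 0xFFFFFFFF] + [0] * 623
       for i in range(1, 624):
           prev = state[i - 1]
           # state[i] = 1812433253 * (state[i-1] xor (state[i-1] >> 30)) + i,
           # truncated to 32 bits.
           state[i] = (1812433253 * (prev ^ (prev >> 30)) + i) & 0xFFFFFFFF
       return state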

The output is generated by tempering the state array:

.. math::

   y = state[i] \\
   y = y \oplus (y >> u) \\
   y = y \oplus ((y << s) \& b) \\
   y = y \oplus ((y << t) \& c) \\
   y = y \oplus (y >> l)

where u, s, t, l, b, and c are constants, listed further below.
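
A Python sketch of the tempering step, with the MT19937 constants (given below) inlined:

.. code-block:: python

   def temper(y: int) -> int:
       """Temper one 32-bit state value into an output value."""
       y ^= y >> 11                 # y = y xor (y >> u),       u = 11
       y ^= (y << 7) & 0x9D2C5680   # y = y xor ((y << s) & b), s = 7
       y ^= (y << 15) & 0xEFC60000  # y = y xor ((y << t) & c), t = 15
       y ^= y >> 18                 # y = y xor (y >> l),       l = 18
       return y & 0xFFFFFFFF        # keep the result within uint32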

Whenever all state values have been used, a new state array is generated recursively as follows:

.. math::

   current\_state = state[i] \\
   next\_state = state[i+1] \text{ if } i+1 <= 623 \text{ else } state[0] \\
   next\_m\_state = state[i+m] \text{ if } i+m <= 623 \text{ else } state[i+m-624] \\
   twisted\_state = (((current\_state \& 0x80000000) | (next\_state \& 0x7fffffff)) >> 1) \oplus (next\_state \& 1 ? 0x9908b0df : 0) \\
   state[i] = next\_m\_state \oplus twisted\_state

where m is a constant (all indices wrap modulo the state size of 624).
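
The same regeneration ("twist") step expressed as a Python sketch, with m = 397:

.. code-block:: python

   def twist(state: list) -> None:
       """Regenerate all 624 state values in place."""
       for i in range(624):
           current = state[i]
           nxt = state[(i + 1) % 624]       # indices wrap modulo 624
           next_m = state[(i + 397) % 624]
           twisted = ((current & 0x80000000) | (nxt & 0x7FFFFFFF)) >> 1
           if nxt & 1:
               twisted ^= 0x9908B0DF
           state[i] = next_m ^ twisted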

For parity with PyTorch, the values of the constants are set as follows:

.. math::

   u = 11 \\
   s = 7 \\
   b = 0x9d2c5680 \\
   t = 15 \\
   c = 0xefc60000 \\
   l = 18 \\
   m = 397

These values follow the recommendations from the linked paper for MT19937.
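
Putting the pieces together, a hypothetical generator built from the sketches above would draw uint32 values like this:

.. code-block:: python

   class MT19937Sketch:
       """Illustrative composition of the init/twist/temper sketches above."""

       def __init__(self, global_seed: int):
           self.state = mt19937_init(global_seed)
           self.index = 624  # force a twist before the first draw

       def next_uint32(self) -> int:
           if self.index >= 624:
               twist(self.state)  # all state values used: regenerate
               self.index = 0
           y = temper(self.state[self.index])
           self.index += 1
           return y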

To convert a given unsigned integer value (denoted as x below) to a specific type, a simple conversion is performed.
For float32, mantissa_digits is the significand bit count of float including the implicit bit (equal to std::numeric_limits<float>::digits == FLT_MANT_DIG == 24):

.. math::

   mantissa\_digits = 24 \\
   mask = uint32((uint64(1) << mantissa\_digits) - 1) \\
   divisor = float(1) / (uint64(1) << mantissa\_digits) \\
   output = float((x \& mask) * divisor)

For float16, mantissa_digits is the significand bit count of float16 including the implicit bit (equal to 11):

.. math::

   mantissa\_digits = 11 \\
   mask = uint32((uint64(1) << mantissa\_digits) - 1) \\
   divisor = float(1) / (uint64(1) << mantissa\_digits) \\
   output = float16((x \& mask) * divisor)

For bfloat16, mantissa_digits is the significand bit count of bfloat16 including the implicit bit (equal to 8):

.. math::

   mantissa\_digits = 8 \\
   mask = uint32((uint64(1) << mantissa\_digits) - 1) \\
   divisor = float(1) / (uint64(1) << mantissa\_digits) \\
   output = bfloat16((x \& mask) * divisor)
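
The three single-word conversions differ only in mantissa_digits, so they can be sketched with one parameterized helper (names hypothetical):

.. code-block:: python

   def to_unit_float(x: int, mantissa_digits: int) -> float:
       """Map a uint32 onto [0, 1), keeping mantissa_digits random bits."""
       mask = (1 << mantissa_digits) - 1
       divisor = 1.0 / (1 << mantissa_digits)
       return (x & mask) * divisor

   x = 0x9E3779B9               # an example uint32 drawn from the generator
   f32 = to_unit_float(x, 24)   # float32
   f16 = to_unit_float(x, 11)   # float16
   bf16 = to_unit_float(x, 8)   # bfloat16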

For float64 (double precision requires the use of two uint32 values, denoted as x and y below), mantissa_digits is the significand bit count of double including the implicit bit (equal to std::numeric_limits<double>::digits == DBL_MANT_DIG == 53):

.. math::

   value = (uint64(x) << 32) + y \\
   mantissa\_digits = 53 \\
   mask = (uint64(1) << mantissa\_digits) - 1 \\
   divisor = double(1) / (uint64(1) << mantissa\_digits) \\
   output = double((value \& mask) * divisor)
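
A sketch of the double-precision variant, which consumes two uint32 values:

.. code-block:: python

   def to_unit_double(x: int, y: int) -> float:
       """Combine two uint32 values into one double in [0, 1)."""
       value = (x << 32) + y      # value = (uint64(x) << 32) + y
       mask = (1 << 53) - 1       # mantissa_digits = 53
       divisor = 1.0 / (1 << 53)
       return (value & mask) * divisor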

After conversion, all of the floating-point values above fall within the range [0, 1). To map them onto the requested range [min, max), a simple transformation is performed:

.. math::

output = x * (max - min) + min

For integer types, no special conversion operation is performed, except for int64 when either min or max exceeds the maximum value of uint32; in that case a simple operation to standardize the values is performed.
This special int64 behavior (an optimization) matches the expected output of PyTorch, where normally a concatenation of two uint32 values always occurs.
In other words (a sketch of this logic follows the formulas below):

.. math::

   \text{if output is of int32 dtype: } output = int32(x) \\
   \text{else if output is of int64 dtype and } (min <= max(uint32) \text{ and } max <= max(uint32))\text{: } output = int64(x) \\
   \text{else: } output = int64((uint64(x) << 32) + y) \text{ (uses 2 uint32s instead of one)} \\
   output = output \% (max - min) + min
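
The integer path as a Python sketch (is_int64 and the other names below are hypothetical):

.. code-block:: python

   UINT32_MAX = 0xFFFFFFFF

   def to_int(x: int, y: int, min_val: int, max_val: int,
              is_int64: bool) -> int:
       """Map one or two uint32 values onto [min_val, max_val)."""
       if not is_int64:
           value = x                # int32: a single uint32 suffices
       elif min_val <= UINT32_MAX and max_val <= UINT32_MAX:
           value = x                # int64 optimization: still one word
       else:
           value = (x << 32) + y    # concatenate two uint32 values
       return value % (max_val - min_val) + min_val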

Example 1. *RandomUniform* output with ``global_seed`` = 150, ``output_type`` = f32, ``alignment`` = PYTORCH:

.. code-block:: xml
   :force:

   input_shape = [ 3, 3 ]
   output = [[0.6789123  0.31274895 0.91842768]
             [0.9312087  0.13456984 0.49623574]
             [0.5082716  0.23938411 0.97856429]]


Example 2. *RandomUniform* output with ``global_seed`` = 80, ``output_type`` = double, ``alignment`` = PYTORCH:

.. code-block:: xml
   :force:

   input_shape = [ 2, 2 ]
   minval = 2
   maxval = 10
   output = [[8.34928537 6.12348725]
             [3.76852914 2.89564172]]

Example 3. *RandomUniform* output with ``global_seed`` = 80, ``output_type`` = i32, ``alignment`` = PYTORCH:

.. code-block:: xml
   :force:

   input_shape = [ 2, 3 ]
   minval = 50
   maxval = 100
   output = [[89 73 68]
             [95 78 61]]

**Attributes**:

Expand All @@ -236,6 +386,14 @@ Example 3. *RandomUniform* output with ``global_seed`` = 80, ``op_seed`` = 100,
* **Default value**: 0
* **Required**: *Yes*

* ``alignment``

* **Description**: the framework to align the output to.
* **Range of values**: TENSORFLOW, PYTORCH
* **Type**: ``string``
* **Default value**: TENSORFLOW
* **Required**: *No*

**Inputs**:

* **1**: ``shape`` - 1D tensor of type *T_SHAPE* describing output shape. **Required.**

**Outputs**:

* **1**: A tensor with type specified by the attribute *output_type* and shape defined by ``shape`` input tensor, with values aligned to the framework selected by the ``alignment`` attribute.

**Types**
