RFC 100 text: Add RFC for float16 support #10146

eschnett · 2024-06-06T00:36:52Z

See #10144.

doc/source/development/rfc/rfc100_float16_support.rst

rouault · 2024-06-06T13:48:59Z

@eschnett Once you're happy with the RFC, it would also be appropriate to mention it by an email to the gdal-dev mailing list (https://lists.osgeo.org/pipermail/gdal-dev/). Cf https://lists.osgeo.org/pipermail/gdal-dev/2024-February/058548.html for an example.
Typically the RFC lifecycle is:

draft a preliminary version as you did, make it known on gdal-dev to get early feedback on it
work on a candidate implementation, revise the RFC text
ask for any remaining comments
call for a vote to officially endorse it (cf https://lists.osgeo.org/pipermail/gdal-dev/2024-March/058635.html)

doc/source/development/rfc/rfc100_float16_support.rst

Co-authored-by: Even Rouault <[email protected]>

eschnett · 2024-06-06T20:05:09Z

Thinking about automatically converting float16 to float32 a bit more: I think it would be confusing to choose GDAL's behaviour in this respect on which compiler flags and versions were used to build GDAL. It would probably be better to make GDAL be consistent – it should either always automatically convert to float32, or never do that automatically. And given the current behaviour, it would be quite inconvenient if one was suddenly presented with a float16 dataset.

Maybe there should be a new flag, passed to GDAL when opening a dataset, that specifies whether float16 data should be automatically converted to float32 (the default setting), or should be preserved as float16?

rouault · 2024-06-06T20:35:07Z

Thinking about automatically converting float16 to float32 a bit more: I think it would be confusing to choose GDAL's behaviour in this respect on which compiler flags and versions were used to build GDAL

That's a good point. To give some perspective: in the past when we have introduced GDT_Int64 and GDT_UInt64, before their addition, the Zarr driver automatically exposed arrays with those types as Float64, as that was the closest (somewhat) compatible available data type. And we didn't add an option to allow int64/uint64 to be presented when we added those types. That was documented as a change with some breaking potential in the MIGRATION_GUIDE.TXT file. Similarly when GDT_Int8 has been introduced, such data type was handled in a very clumsy way before (it was presented as unsigned byte + a metadata item saying "actually it is signed"). After the change, those datasets were returned with the new type. So the practice up to now has been to adopt the new data type. I think it would be fine to do the same here, that is when the GDAL build has _Float16 support and the dataset is Float16, then use GDT_Float16 as the declared data type. Float16 datasets are sufficiently esoteric (at least that's my perspective) that it is likely acceptable to do so.
Having a specific flag to allow/disable Float16 adds some long term complications in GDAL internals (also probably minor given that drivers having Float16 capabilities still need to handle both exposing as Float16 or promotion to Float32 depending on if the GDAL build as Float16 capabilities). Also to be noted that the HDF5 library, when built with Float16 support, automatically uses it by default (which broke GDAL which didn't expect this new data type, but I found that acceptable)

rouault · 2024-06-27T09:28:19Z

@eschnett not sure what your plans are, but at that point, I believe that working on a candidate implementation would be the appropriate step forward

eschnett · 2024-06-27T13:12:57Z

@rouault I have an implementation but got stuck writing tests. It turns out to be difficult to test from Python because SWIG doesn't support Float16. I was looking into some C++ tests but got sidetracked by another project. I'll push my current state.

Any preference whether this should be a separate PR from the one with the RFC?

rouault · 2024-06-27T14:06:17Z

Thanks for the update. No rush, just wanted to know how things were progressing

because SWIG doesn't support Float16.

is it itself an issue? I don't anticipate a lot of GDAL API methods to actually return Float16 scalars. Most Float16 specific tests should be around RasterIO(), and RasterIO() at the SWIG level is handle by the swig/include/python specifics (and swig/include/gdal_array.i) with dedicate C wrapping code and Python code.

Any preference whether this should be a separate PR from the one with the RFC?

yes, the candidate implementation would be better into a separate PR

github-actions · 2024-07-26T02:40:17Z

The GDAL project highly values your contribution and would love to see this work merged! Unfortunately this PR has not had any activity in the last 28 days and is being automatically marked as "stale". If you think this pull request should be merged, please check

that all unit tests are passing
that all comments by reviewers have been addressed
that there is enough information for reviewers, in particular link
to any issues which this pull request fixes
that you have written unit tests where possible
In case you should have any uncertainty, please leave a comment and we will be happy to help you proceed with this pull request.
If there is no further activity on this pull request, it will be closed in 2 weeks.

github-actions · 2024-08-09T02:41:35Z

While we hate to see this happen, this PR has been automatically closed because it has not had any activity in the last 6 weeks. If this pull request should be reconsidered, please follow the guidelines in the previous comment and reopen this pull request. Or, if you have any further questions, just ask! We love to help, and if there's anything the GDAL project can do to help push this PR forward please let us know how we can assist.

eschnett · 2024-08-09T13:26:17Z

Just leaving a comment here – yes, I am still interested in this, and I am still working on it, but there's another higher-priority project taking up my time at the moment.

github-actions · 2024-09-08T03:58:23Z

The GDAL project highly values your contribution and would love to see this work merged! Unfortunately this PR has not had any activity in the last 28 days and is being automatically marked as "stale". If you think this pull request should be merged, please check

that all unit tests are passing
that all comments by reviewers have been addressed
that there is enough information for reviewers, in particular link
to any issues which this pull request fixes
that you have written unit tests where possible
In case you should have any uncertainty, please leave a comment and we will be happy to help you proceed with this pull request.
If there is no further activity on this pull request, it will be closed in 2 weeks.

github-actions · 2024-09-23T02:45:57Z

While we hate to see this happen, this PR has been automatically closed because it has not had any activity in the last 6 weeks. If this pull request should be reconsidered, please follow the guidelines in the previous comment and reopen this pull request. Or, if you have any further questions, just ask! We love to help, and if there's anything the GDAL project can do to help push this PR forward please let us know how we can assist.

rouault · 2024-09-23T10:41:14Z

(keep alive)

github-actions · 2024-10-23T02:45:55Z

The GDAL project highly values your contribution and would love to see this work merged! Unfortunately this PR has not had any activity in the last 28 days and is being automatically marked as "stale". If you think this pull request should be merged, please check

that all unit tests are passing
that all comments by reviewers have been addressed
that there is enough information for reviewers, in particular link
to any issues which this pull request fixes
that you have written unit tests where possible
In case you should have any uncertainty, please leave a comment and we will be happy to help you proceed with this pull request.
If there is no further activity on this pull request, it will be closed in 2 weeks.

rouault · 2024-10-23T13:02:32Z

(keep alive)

doc/source/development/rfc/rfc100_float16_support.rst

MIGRATION_GUIDE.TXT

schwehr · 2024-11-07T18:14:55Z

doc/source/development/rfc/rfc100_float16_support.rst

+-------
+
+This RFC adds support for the IEEE 16-bit floating point data type
+(aka ``half``, ``float16``). It adds new pixel data types


What is this two back quote style?

A double backquote is rendered as "code" style, similar to a single backquote in Github markdown. I've taken it from RFC 99.

doc/source/development/rfc/rfc100_float16_support.rst

rouault · 2024-11-07T21:55:05Z

For the remaining spelling issues (once you've americanized behaviour->behavior), you can add suppressions like in

gdal/doc/source/thanks.rst

Line 135 in 0defeab

.. below is an allow-list for spelling checker.

rouault · 2024-11-08T17:27:30Z

doc/source/development/rfc/rfc100_float16_support.rst

+  routines. This type should be supported without converting to
+  float32.
+
+- C++23 will introduce native support for ``std::float16_t``. However,


eschnett#1 is an attempt of aliasing GFloat16 on std::float16_t if it is available

Thanks! I'll probably merge it and then hash out the details.

Once the RFC text has been updated taking into account above to reflect how C++23 std::float16_t will be handled, I'd be happy to give my +1 to the RFC.

Woohoo! Thank you both! I'm the same as Even for a +1.

Add RFC for float16 support

3402ba0