Use UTF-8 for Linux #12

zeule · 2024-08-28T06:26:31Z

The wchar_t in Linux is very wide, 32 bits, and thus almost unused,
which, in turn, results in poor support in tooling. And that's to be
expected when UTF-8 is the default. At the same time, Iconv, which is
part of glibc, is universally available. Thus we can use it to convert
UCS-2 into UTF-8 in Linux and always use UTF-8 in the library API.

So that IDEs can list them.

Simplify script, fix install() for the bundled onigiruma, fix using pkg-config in the find module.

This allows to use find_package() with the build tree.

The wchar_t in Linux is very wide, 32 bits, and thus almost unused, which, in turn, results in poor support in tooling. And that's to be expected when UTF-8 is the default. At the same time, Iconv, which is part of glibc, is universally available. Thus we can use it to convert UCS-2 into UTF-8 in Linux and always use UTF-8 in the library API.

zeule · 2024-09-03T08:21:20Z

@hzdbyte, which other project do I have to adapt to merge in this change? Only the bundled GUI, or qgen as well, for example?

hzdbyte · 2024-09-04T23:27:12Z

To be honest I don't really like using iconv as dependency.
The library doesn't strictly depend on wchar_t since it has it's own string processing implementation, thus it can use any type of strings. I'd still prefer to keep the internal string representation based on fixed length type (it can be "short"/2bytes for example). We can implement UTF8 support in the special version of bindings to be used by clients/players with UTF8 strings (that'd be next to the "default" one).

zeule · 2024-09-05T05:02:11Z

But Iconv is a part of glibc, thus it brings no new dependency package-wise. Utf8 is the standard for Linux locales. Can that change your attitude?

hzdbyte · 2024-09-05T17:50:22Z

Wouldn't it be better to create utf8 binding that'd convert strings back and forth for utf8 players?

zeule added 7 commits August 28, 2024 07:46

Do not install system wxWidgets targets

6bca9ae

Add headers to target sources

dcc8158

So that IDEs can list them.

Tune up finding/installing oniguruma

35cdb70

Simplify script, fix install() for the bundled onigiruma, fix using pkg-config in the find module.

Add build tree cmake export

f7ed212

This allows to use find_package() with the build tree.

Add missing include

b9261f9

Move the endianness test to the build script

f8fce6c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use UTF-8 for Linux #12

Use UTF-8 for Linux #12

zeule commented Aug 28, 2024

zeule commented Sep 3, 2024

hzdbyte commented Sep 4, 2024

zeule commented Sep 5, 2024

hzdbyte commented Sep 5, 2024

Use UTF-8 for Linux #12

Are you sure you want to change the base?

Use UTF-8 for Linux #12

Conversation

zeule commented Aug 28, 2024

zeule commented Sep 3, 2024

hzdbyte commented Sep 4, 2024

zeule commented Sep 5, 2024

hzdbyte commented Sep 5, 2024