Hebrew support #7512

BLooperZ · 2024-11-05T22:32:52Z

This is an attempt to add hebrew support when rendering text

this is not yet fully working though

I couldn't figure out how to add freebidi dependency on platforms other than linux

on linux, there are some cursors displayed at random positions, even without the bidi conversion (on windows it seems to work correctly)

I hope there aren't other issues with cursor positioning
(and I haven't even added bidi support for text input😅)

help and comments will be greatly appreciated

Thank you for the amazing work on🙏

glebm

Can you please send a separate PR with just he.po? Makes reviewing easier, and we can submit it right away.

glebm · 2024-11-06T22:26:23Z

Source/engine/render/text_render.cpp

@@ -167,7 +168,7 @@ OptionalClxSpriteList LoadFont(GameFontTables size, text_color color, uint16_t r
 	const std::string_view language_code = GetLanguageCode();
 	const std::string_view language_tag = language_code.substr(0, 2);
 	if (language_tag == "zh" || language_tag == "ja" || language_tag == "ko"
-	    || (language_tag == "tr" && row == 0)) {
+		|| (language_tag == "tr" && row == 0)) {


There seem to be some whitespace changes here that will fail clang-format
We use tabs for structural indentation but spaces for continuation indentation.

will revert this change

Source/engine/render/text_render.cpp

glebm · 2024-11-06T22:28:07Z

Source/engine/render/text_render.cpp

-	for (; !remaining.empty() && remaining[0] != '\0'
-	     && (next = DecodeFirstUtf8CodePoint(remaining, &cpLen)) != Utf8DecodeError;
-	     remaining.remove_prefix(cpLen)) {
+	const auto drawSingleLine = [&]() {


It's a little bit unclear at a glance what this lambda captures, consider extracting the contents of the lambda into a function.

glebm · 2024-11-06T22:29:34Z

Source/engine/render/text_render.cpp

-	char32_t next;
-	std::string_view remaining = text;
-	size_t cpLen;
+	std::u32string text32 = ConvertUtf8ToUtf32(text);


We like to avoid allocations in code that runs on every frame, is it possible to implement this in a streaming fashion?

It looks like fribidi only supports UTF-32 but perhaps there is another library?
A quick search reveals https://github.com/Tehreer/SheenBidi, which supports UTF-8, and provides a number of handy functions for this. Looks like we can also replace our UTF-8 decoding implementations with calls to theirs.

Thank you, 2 notes regarding this (from my point of view, please feel free to suggest otherwise)

Because I am trying to find and handle each line separately, working with UTF-32 actually greatly simplify the indexing of the string, allowing simpler logic to get each line, which I find harder doing on UTF-8.

I was having trouble setting up additional dependency as I am not yet experienced with cmake. I was able to set up linking with freebidi binary on linux because it is quite popular. I am open to alternative but I would need guidance for setting them up.

Because I am trying to find and handle each line separately, working with UTF-32 actually greatly simplify the indexing of the string, allowing simpler logic to get each line, which I find harder doing on UTF-8.

UTF-8 is a self-synchronizing encoding, which means explicit line breaks can be found simply via find('\n'), without the need for decoding. If you use UTF-8, you'd use code unit (byte) indices rather than code point indices throughout (which necessitates a BiDi library that supports UTF-8 natively).

I am open to alternative but I would need guidance for setting them up.

It's quite simple, add a file 3rdParty/SheenBidi/CMakeLists.txt with:

include(functions/FetchContent_MakeAvailableExcludeFromAll) include(FetchContent) FetchContent_Declare(SheenBidi URL https://github.com/glebm/SheenBidi/archive/3721411b5ec2862240f34aeeb3e8ba59283ec2ec.tar.gz URL_HASH MD5=7dc0138bda1e16b217cee66cfd95a6f9 ) FetchContent_MakeAvailableExcludeFromAll(SheenBidi)

The target to link to is SheenBidi::sheenbidi.

glebm · 2024-11-06T22:35:24Z

CMake/Assets.cmake

@@ -2,7 +2,7 @@ if(NOT DEFINED DEVILUTIONX_ASSETS_OUTPUT_DIRECTORY)
  set(DEVILUTIONX_ASSETS_OUTPUT_DIRECTORY "${CMAKE_CURRENT_BINARY_DIR}/assets")
 endif()

-set(devilutionx_langs bg cs da de el es fr hr hu it ja ko pl pt_BR ro ru uk sv tr zh_CN zh_TW)
+set(devilutionx_langs bg cs da de el es fr hr hu it ja ko pl pt_BR ro ru uk sv tr he zh_CN zh_TW)


Estonian has been added a month ago so there is now conflict in this line that will require rebasing

BLooperZ added 11 commits August 27, 2024 21:49

Add hebrew language option

5037c81

Convert Bidirectional text

2f19759

Add fribidi as dependency

3fcf4eb

Adjust utf-32 conversion

4ee21d7

Add hebrew translation

c281277

Optional fribidi dependency

68b0749

Simplify drawing by using UTF-32 string

3d42f4e

Draw text line by line

cc4ad48

Convert line to visual

f6fdb79

Fix USE_FRIBIDI

e84a5b4

Restore UTF-8 for cursor position

612b8b4

glebm reviewed Nov 6, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hebrew support #7512

Hebrew support #7512

BLooperZ commented Nov 5, 2024 •

edited

Loading

glebm left a comment

glebm Nov 6, 2024

BLooperZ Nov 6, 2024

glebm Nov 6, 2024

glebm Nov 6, 2024 •

edited

Loading

BLooperZ Nov 6, 2024

glebm Nov 7, 2024 •

edited

Loading

glebm Nov 6, 2024 •

edited

Loading

Hebrew support #7512

Are you sure you want to change the base?

Hebrew support #7512

Conversation

BLooperZ commented Nov 5, 2024 • edited Loading

glebm left a comment

Choose a reason for hiding this comment

glebm Nov 6, 2024

Choose a reason for hiding this comment

BLooperZ Nov 6, 2024

Choose a reason for hiding this comment

glebm Nov 6, 2024

Choose a reason for hiding this comment

glebm Nov 6, 2024 • edited Loading

Choose a reason for hiding this comment

BLooperZ Nov 6, 2024

Choose a reason for hiding this comment

glebm Nov 7, 2024 • edited Loading

Choose a reason for hiding this comment

glebm Nov 6, 2024 • edited Loading

Choose a reason for hiding this comment

BLooperZ commented Nov 5, 2024 •

edited

Loading

glebm Nov 6, 2024 •

edited

Loading

glebm Nov 7, 2024 •

edited

Loading

glebm Nov 6, 2024 •

edited

Loading