Skip to content

Commit

Permalink
Add Gothic to charset_table
Browse files Browse the repository at this point in the history
The characters used to write Gothic (got). The existing Gothic
sentences seem to use spaces between words, so using charset_table is
likely appropriate.
Unicode Block: https://www.unicode.org/charts/PDF/U10330.pdf

See issue Tatoeba#1970, section "Other Unsearchable Characters".
  • Loading branch information
Yorwba committed Nov 6, 2019
1 parent e2da475 commit 8ce6c2e
Showing 1 changed file with 4 additions and 2 deletions.
6 changes: 4 additions & 2 deletions src/Shell/SphinxConfShell.php
Original file line number Diff line number Diff line change
Expand Up @@ -175,8 +175,10 @@ class SphinxConfShell extends Shell {
'U+0C80..U+0C8C', 'U+0C8E..U+0C90', 'U+0C92..U+0CA8', 'U+0CAA..U+0CB3',
'U+0CB5..U+0CB9', 'U+0CBC..U+0CC4', 'U+0CC6..U+0CC8', 'U+0CCA..U+0CCD',
'U+0CD5..U+0CD6', 'U+0CDE', 'U+0CE0..U+0CE3', 'U+0CE6..U+0CEF', 'U+0CF1..U+0CF2',
# Dhivehi
'U+0780..U+07B1',
# Dhivehi
'U+0780..U+07B1',
# Gothic
'U+10330..U+1034A',
);

public $scriptsWithoutWordBoundaries = array(
Expand Down

0 comments on commit 8ce6c2e

Please sign in to comment.