Split kerning by script, not by direction #636

simoncozens · 2022-07-20T14:43:59Z

Currently we split kerning into lookups based on a single factor, horizontal direction. However, shaping engines will perform script segmentation and so there will never be any cross-script kerning. By splitting the kerning into lookups based on the script of the glyphs involved, we can produce smaller lookups for large multi-script fonts, hopefully causing less overflows (faster compilation) and reducing file space by giving the binary compiler a better starting point for splitting lookups into subtables.

There are a few failsafes, such as glyphs without identifiable scripts (as well as purely common-script glyphs) go into a "Common" pot which is added to DFLT/dflt.

This may be easiest to review commit by commit; the changes are fairly small and self-contained apart from f56eaf6 which is the big rewrite.

…re than one category

…just ast nodes

Given a pair and mapping between glyph names and a set of scripts, split the pair into (potentially) multiple pairs each with a dominant script.

…ripts defined

behdad · 2022-07-20T15:16:57Z

Lib/ufo2ft/featureWriters/kernFeatureWriter.py

+        allSecondScripts = {}
+        for g in self.firstGlyphs:
+            if g not in glyphScripts:
+                glyphScripts[g] = set(["Zyyy"])


Please use a named constant for "Zyyy".

behdad · 2022-07-20T15:21:35Z

There are a few failsafes, such as glyphs without identifiable scripts (as well as purely common-script glyphs) go into a "Common" pot which is added to DFLT/dflt.

They are added to all other lookups as well, right?

The main deficiency I see in this PR is lack of support for Script_Extensions data-file. That's not a huge deal though. If glyphs with script A and B are kerned and you insert that kern in both lookups for A and for B, that handles most of the cases already.

simoncozens · 2022-07-20T15:44:07Z

They are added to all other lookups as well, right?

No, but the common lookup is added to all script/language combinations

The main deficiency I see in this PR is lack of support for Script_Extensions data-file.

I think it does support this. knownScriptsPerCodepoint looks up all scripts for a codepoint using script_extensions. We then partition the kern pair by scripts, and then evaluate all script combinations. So for example the Arabic comma is included in both Arabic and NKo. A kerning file like

    <key>comma-ar</key>
    <dict>
        <key>gba-nko</key>
        <integer>-120</integer>
        <key>lam-ar</key>
        <integer>-30</integer>
    </dict>
    <key>gba-nko</key>
    <dict>
        <key>gba-nko</key>
        <integer>-20</integer>
    </dict>
    <key>lam-ar</key>
    <dict>
        <key>lam-ar</key>
        <integer>50</integer>
    </dict>
    <key>three</key>
    <dict>
        <key>three</key>
        <integer>-50</integer>
    </dict>

becomes

lookup kern_Arab {
    lookupflag IgnoreMarks;
    pos comma-ar lam-ar <-30 0 -30 0>;
    pos lam-ar lam-ar <50 0 50 0>;
} kern_Arab;

lookup kern_Nkoo {
    lookupflag IgnoreMarks;
    pos comma-ar gba-nko <-120 0 -120 0>;
    pos gba-nko gba-nko <-20 0 -20 0>;
} kern_Nkoo;

lookup kern_Common {
    lookupflag IgnoreMarks;
    pos three three -50;
} kern_Common;

(check out test_split_pair in the tests.)

behdad · 2022-07-20T16:29:01Z

Thanks for the explanation. This is neat. LGTM!

anthrotype

LGTM, with comments

thanks Simon for tacking this 👍

Lib/ufo2ft/util.py

Lib/ufo2ft/featureWriters/kernFeatureWriter.py

tests/featureWriters/kernFeatureWriter_test.py

This reverts commit bdac61e. # Conflicts: # Lib/ufo2ft/featureWriters/kernFeatureWriter.py # Lib/ufo2ft/util.py

This reverts commit bdac61e.

Revert "Split kerning by script, not by direction (#636)"

simoncozens added 8 commits July 20, 2022 13:55

Allow classify_glyphs to handle its classifying function returning mo…

c8941f6

…re than one category

Allow kerning pairs to be constructed from lists of glyph names, not …

75f8a32

…just ast nodes

Allow access to each element of a pair

1f48337

Add partitionByScript method

ac5ddb3

Given a pair and mapping between glyph names and a set of scripts, split the pair into (potentially) multiple pairs each with a dominant script.

A test for the partition-pair-by-script logic

e2a0cb1

Split kerning by script

f56eaf6

Only classify glyphs into known scripts, or common if there are no sc…

9a5023c

…ripts defined

Fix test expectations

c236de0

simoncozens marked this pull request as ready for review July 20, 2022 14:44

Flake fixes

650498b

behdad reviewed Jul 20, 2022

View reviewed changes

Fix bad feaLib AST usage

1e5a043

simoncozens force-pushed the main branch from 6c05d0c to 72b2459 Compare July 20, 2022 15:20

simoncozens force-pushed the split-kerning-by-script branch from 1bd006e to 1e5a043 Compare July 20, 2022 15:20

Not my lint

f8a688c

Use constant for Zyyy

f8f70f9

anthrotype approved these changes Jul 20, 2022

View reviewed changes

behdad reviewed Jul 20, 2022

View reviewed changes

tests/featureWriters/kernFeatureWriter_test.py Show resolved Hide resolved

simoncozens added 5 commits July 21, 2022 14:55

Address the easy bit of the feedback

4280397

Emit the lookup references in a friendlier order

33560c3

Fix test expectations (reordering)

69aa0a1

Check logging of mixed script pair problems

85003f4

Inheritable logger name

f97e791

anthrotype approved these changes Jul 21, 2022

View reviewed changes

simoncozens merged commit bdac61e into main Jul 21, 2022

simoncozens deleted the split-kerning-by-script branch July 21, 2022 16:13

madig mentioned this pull request Jul 22, 2022

KerningPair: add annotations #638

Merged

anthrotype mentioned this pull request Aug 16, 2022

version 4.35.0 breaks our tests in gftools fonttools/fonttools#2747

Closed

anthrotype mentioned this pull request Sep 15, 2022

RTL kerning missing from font #658

Closed

madig added a commit that referenced this pull request Oct 11, 2022

Revert "Split kerning by script, not by direction (#636)"

2de7e50

This reverts commit bdac61e. # Conflicts: # Lib/ufo2ft/featureWriters/kernFeatureWriter.py # Lib/ufo2ft/util.py

madig added a commit that referenced this pull request Oct 11, 2022

Revert "Split kerning by script, not by direction (#636)"

09062f0

This reverts commit bdac61e.

madig pushed a commit that referenced this pull request Oct 11, 2022

Split kerning by script, not by direction (#636)

911e20c

madig mentioned this pull request Oct 11, 2022

Split kerning by script, not by direction (second attempt) #667

Closed

3 tasks

anthrotype added a commit that referenced this pull request Oct 14, 2022

Merge pull request #666 from googlefonts/revert-kern-splitter

f8f6bac

Revert "Split kerning by script, not by direction (#636)"

madig pushed a commit that referenced this pull request Nov 4, 2022

Split kerning by script, not by direction (#636)

8315c29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split kerning by script, not by direction #636

Split kerning by script, not by direction #636

simoncozens commented Jul 20, 2022

behdad Jul 20, 2022

behdad commented Jul 20, 2022

simoncozens commented Jul 20, 2022 •

edited

Loading

behdad commented Jul 20, 2022

anthrotype left a comment

Split kerning by script, not by direction #636

Split kerning by script, not by direction #636

Conversation

simoncozens commented Jul 20, 2022

behdad Jul 20, 2022

Choose a reason for hiding this comment

behdad commented Jul 20, 2022

simoncozens commented Jul 20, 2022 • edited Loading

behdad commented Jul 20, 2022

anthrotype left a comment

Choose a reason for hiding this comment

simoncozens commented Jul 20, 2022 •

edited

Loading