UTF-8 Encoded Text #1
looks good to me
It's so great to have all these details in one place. Textual encoding is not an area I know much about, and so I haven't been able to follow this track well. Now, I see it all laid out. Thank you!
While I have a bunch of comments here, please do not see this as trying to defeat this proposal. I just want the text to be clear about the concrete wins for Haskell that we should expect after this is complete.
The proposal should mention what happens with …. If my memory is correct, HVR intended to add …. However, for use as "text symbols", e.g. representing identifiers in the programming language, I could argue that e.g. …
Also, I'd like to have comments on why we think UTF-8 is better, when e.g. …
This link might be useful: http://utf8everywhere.org
I can add an example motivation. The aeson package parses from a ByteString assumed to be UTF-8. Then for each string you get a Text. If you're using the fastest parsers you can, they will typically be e.g. attoparsec or flatparse, which work on a ByteString. These often take advantage of the byte-by-byte format, using fast memchr and similar functions from C. In order to use such parsers, you're going from UTF-8 to UTF-16, doubling the size, and then you have to encode back into UTF-8 again. I've measured the performance of encodeUtf8 and found the cost to be quite small: about 1 microsecond per kilobyte of Unicode on my laptop. So doing this extra step is "OK", but that depends on your app. I'll attach my benchmarks here. Another example: I had a client who was referring to genes, which are all ASCII, but was quite logically using Text. After loading millions of them from disk, this adds up to double the space, plus a decode step (from Latin-1 to UTF-16). A UTF-8 representation would require no decode step and no doubling of memory, and in fact would often let you alias pointers into the original buffer, avoiding a copy/alloc. My advice was to switch to …
-- Reconstructed to be self-contained: the imports and main wrapper are assumptions, matching the criterion-style bgroup/env/bench/whnf calls used below.
import Criterion.Main (bench, bgroup, defaultMain, env, whnf)
import Data.Text (Text)
import qualified Data.Text as T
import qualified Data.Text.Encoding as T

sampleUnicode :: Text
sampleUnicode = "! \" # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ \\ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ ǰ DZ Dz dz Ǵ ǵ Ǻ ǻ Ǽ ǽ Ǿ ǿ Ȁ ȁ Ȃ ȃ ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ"

main :: IO ()
main = defaultMain
  [ bgroup "encodeUtf8"
      [ env (pure (T.replicate i sampleUnicode)) $ \t ->
          bench ("T.encodeUtf8: " ++ show (i * T.length sampleUnicode) ++ " chars")
                (whnf T.encodeUtf8 t)
      | i <- [1, 10, 100]
      ]
  ]

Numbers:
So about 1 µs per kilobyte, pretty consistently, for a small subset of Unicode. For Chinese, the cost would be higher, but I haven't tested. Anyway, my point was just to bring some numbers to give people ballpark figures. I'd love to remove this redundant decode/encode step, and I fully support this initiative.
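For concreteness, here is a minimal sketch of the decode/encode detour described above, assuming the bytestring and text packages with today's UTF-16 internals; roundTrip is a hypothetical name, not code from the thread:

import qualified Data.ByteString as BS
import qualified Data.Text.Encoding as TE

-- Input arrives as UTF-8 bytes (a socket, a file, aeson input).
roundTrip :: BS.ByteString -> BS.ByteString
roundTrip utf8Input =
  let t = TE.decodeUtf8 utf8Input -- UTF-8 -> UTF-16: allocates, roughly doubles ASCII-heavy data
  in TE.encodeUtf8 t              -- UTF-16 -> UTF-8: a second pass and allocation on output
-- With a UTF-8-backed Text, both steps would reduce to validation plus at most a copy.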
Apologies if this was spelled out somewhere I haven't seen: was there a consideration and decision on having a package flag like "-futf8" or "-futf16", allowing authors to pick based on benchmarking their app? Web apps will naturally perform better with UTF-8, as most of everything web is UTF-8, whereas anything talking to ODBC or Windows may benefit from direct UTF-16 use. If I could just pick with a flag in my stack.yaml for the text package, I'd at least get to make a decision rather than having it hard-coded into the ecosystem, like it is in Rust. 🤔 Any opinions on that?
It will be tricky. It is possible to generate a header …
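For illustration, a minimal sketch of what such a flag could select at compile time; the TEXT_UTF8 define, the flag wiring, and the module name are all hypothetical:

{-# LANGUAGE CPP #-}
-- Hypothetical sketch: a Cabal flag would pass -DTEXT_UTF8 (via a generated
-- header or ghc-options), and internal modules would pick the code-unit type.
module Data.Text.Internal.CodeUnit (CodeUnit) where

import Data.Word (Word16, Word8)

#ifdef TEXT_UTF8
type CodeUnit = Word8   -- UTF-8 code units
#else
type CodeUnit = Word16  -- UTF-16 code units
#endif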
If …
Good points. Inclusion with GHC would be a good reason not to bother with a flag, assuming it'll be hard to build GHC packages for a long time. You're probably right about maintenance too: it's a lot to ask to maintain text already. Just moving it to use UTF-8 internally seems more achievable and long-term sustainable. 👍
I'm not sure how relevant this is, but this project is very thought-provoking: simdjson: Parsing gigabytes of JSON per second. As well as parsing JSON, it builds on earlier work using SIMD to validate UTF-8. The performance of both projects is astonishing, and the associated academic papers are very readable. I wonder how feasible it would be to use these libraries through the FFI, and whether that would be a justification for using an FFI-friendly memory representation. If it were possible to read/mmap an entire file and validate the UTF-8 in a fraction of a second, I think that would be a big win.
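As a rough sketch of the FFI-friendliness point: a contiguous UTF-8 buffer can be handed to C directly. The C symbol validate_utf8 below is hypothetical, standing in for a SIMD validator such as the one behind simdjson:

{-# LANGUAGE ForeignFunctionInterface #-}
import qualified Data.ByteString as BS
import Data.ByteString.Unsafe (unsafeUseAsCStringLen)
import Foreign.C.Types (CChar, CInt (..), CSize (..))
import Foreign.Ptr (Ptr)

-- Hypothetical C-side validator returning nonzero for valid UTF-8.
foreign import ccall unsafe "validate_utf8"
  c_validateUtf8 :: Ptr CChar -> CSize -> IO CInt

-- Hand the ByteString's bytes to C with no copying and no transcoding.
isValidUtf8 :: BS.ByteString -> IO Bool
isValidUtf8 bs = unsafeUseAsCStringLen bs $ \(ptr, len) -> do
  r <- c_validateUtf8 ptr (fromIntegral len)
  pure (r /= 0)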
Those uses of UTF-16 are mostly for historical reasons: the implementations were done before UTF-8 was common, when UCS-2 was still cool (and it was easiest just to upgrade them from UCS-2 to UTF-16 to keep them Unicode-compliant).
Review comment on proposals/002-text-utf-default.md (outdated):
- Ben Gamari: integration with GHC
- The Cabal maintainers (fgaz, emilypi, mikolaj): integration with Cabal
Neither Cabal nor cabal-install uses Text at the moment. It does use a (vendored) version of ShortText, though. IMO you can just drop Cabal from the list; it won't be affected unless there is some need to use Text in Cabal (which there isn't, as far as I can tell).
I don't know much about Cabal internals, but Cabal.cabal seems to include text: https://github.com/haskell/cabal/blob/4f8aeb2c8a0a3638e1af887dc869a17e291c8329/Cabal/Cabal.cabal#L272
It defines a few instances for Text; it's not really used.
cabal master % git grep 'Data.Text'
-- tests:
Cabal-tests/tests/UnitTests/Distribution/Utils/Generic.hs:import qualified Data.Text as T
Cabal-tests/tests/UnitTests/Distribution/Utils/Generic.hs:import qualified Data.Text.Encoding as T
example (note in comment)
Cabal/src/Distribution/Backpack.hs:-- >>> eitherParsec "foo[Str=text-1.2.3:Data.Text.Text]" :: Either String OpenUnitId
Cabal/src/Distribution/Backpack.hs:-- Right (IndefFullUnitId (ComponentId "foo") (fromList [(ModuleName "Str",OpenModule (DefiniteUnitId (DefUnitId {unDefUnitId = UnitId "text-1.2.3"})) (ModuleName "Data.Text.Text"))]))
-- CharParsing class has
-- text :: Text -> m Text
-- method, as it's in the upstream parsers package
Cabal/src/Distribution/Compat/CharParsing.hs:import Data.Text (Text, unpack)
-- these are used for debug, not part of proper build
Cabal/src/Distribution/Fields/Lexer.hs:import qualified Data.Text as T
Cabal/src/Distribution/Fields/Lexer.hs:import qualified Data.Text.Encoding as T
Cabal/src/Distribution/Fields/Lexer.hs:import qualified Data.Text.Encoding.Error as T
-- ditto
Cabal/src/Distribution/Fields/LexerMonad.hs:import qualified Data.Text as T
Cabal/src/Distribution/Fields/LexerMonad.hs:import qualified Data.Text.Encoding as T
Cabal/src/Distribution/Fields/Parser.hs:import qualified Data.Text as T
Cabal/src/Distribution/Fields/Parser.hs:import qualified Data.Text.Encoding as T
Cabal/src/Distribution/Fields/Parser.hs:import qualified Data.Text.Encoding.Error as T
-- instances
Cabal/src/Distribution/Utils/Structured.hs:import qualified Data.Text as T
Cabal/src/Distribution/Utils/Structured.hs:import qualified Data.Text.Lazy as LT
-- some dev stuff
bootstrap/src/Main.hs:import qualified Data.Text as T
cabal-dev-scripts/src/GenSPDX.hs:import Data.Text (Text)
cabal-dev-scripts/src/GenSPDX.hs:import qualified Data.Text as T
cabal-dev-scripts/src/GenSPDXExc.hs:import Data.Text (Text)
cabal-dev-scripts/src/GenSPDXExc.hs:import qualified Data.Text as T
cabal-dev-scripts/src/GenUtils.hs:import Data.Text (Text)
cabal-dev-scripts/src/GenUtils.hs:import qualified Data.Text as T
-- this is really fun, the usage is
-- let command' = command { commandName = T.unpack . T.replace "v1-" "" . T.pack . commandName $ command }
cabal-install/src/Distribution/Client/CmdLegacy.hs:import qualified Data.Text as T
cabal-testsuite/src/Test/Cabal/Plan.hs:import qualified Data.Text as Text
-- docs
doc/cabal-package.rst:a requirement ``Str`` and an implementation ``Data.Text``, you can
doc/cabal-package.rst: ``mixins: parametrized requires (Str as Data.Text)``
doc/cabal-package.rst: ``mixins: text (Data.Text as Str)``
doc/cabal-package.rst: parametrized (MyModule as MyModule.Text) requires (Str as Data.Text),
-- lexer debugging (same as above)
templates/Lexer.x:import qualified Data.Text as T
templates/Lexer.x:import qualified Data.Text.Encoding as T
templates/Lexer.x:import qualified Data.Text.Encoding.Error as T
If any change in text affects Cabal performance, that will be interesting.
TL;DR: the text dependency could be dropped quite easily, but there is really no point, as it's there via parsec.
Thinking of Cabal made me remember pandoc. It can be called the Haskell app, and it uses Text extensively. So if you want to have some real project on the list, I'd nominate it. The promise that the API doesn't change too much should make building pandoc, even with its dependency footprint, only a matter of CPU time.
Addition: pandoc is also as much a text-processing tool as a tool (written in Haskell) can be, so if it clearly benefits, that could be enough to declare the success of the whole proposal.
Ah, interesting, thanks.
@emilypi shall we remove Cabal from stakeholders?
This may be a little premature, but I'd like to make a plea for removing the BOM (byte order mark) that a lot of Windows software places at the beginning of UTF-8 files. It is used to declare the encoding of the file and isn't part of the meaning of the file. The logical place to do this would be in IO functions such as …. As a motivating example, the JSON spec explicitly says that a BOM should not be included in documents, but implementations may choose to skip one if it is present. A surprising number of systems do incorrectly include a BOM in JSON documents (e.g. Azure). It makes sense to elide it when reading a file or a stream that's already known or required to be UTF-8, because it doesn't contribute anything. Ban the BOM!
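A minimal sketch of what BOM elision could look like at the decoding boundary, assuming the bytestring and text packages; stripBom and decodeUtf8NoBom are hypothetical names, not a proposed API:

import qualified Data.ByteString as BS
import qualified Data.Text as T
import qualified Data.Text.Encoding as TE

-- The UTF-8 encoding of U+FEFF is the byte sequence 0xEF 0xBB 0xBF.
stripBom :: BS.ByteString -> BS.ByteString
stripBom bs
  | BS.pack [0xEF, 0xBB, 0xBF] `BS.isPrefixOf` bs = BS.drop 3 bs
  | otherwise = bs

-- Decode a stream already known to be UTF-8, eliding any leading BOM.
decodeUtf8NoBom :: BS.ByteString -> T.Text
decodeUtf8NoBom = TE.decodeUtf8 . stripBom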
@phadej Re: …
@emilypi the … I repeat: …
Very glad to see this happening!! I really hope the effort doesn't get stymied by endless bikeshedding. 😬 Can anyone explain the relationship with the text-utf8 project? (Edit: disregard, I just read the attached proposal!)
@pchiusano I'll clarify relations to prior art in an upcoming update. While elaborating the proposal, we came to a conclusion different from the current statement.
CC @jgm, any comments on behalf of pandoc?
No substantive comments, but I fully support this proposal! (Note added later: pandoc stores lots and lots of very short Texts in AST nodes. I've sometimes thought about using ShortText or ByteString instead to reduce memory use -- especially now that I know how big the constant overhead of Text is -- but the expense of converting between these formats and Text has made this seem unattractive. If Text used UTF-8, this sort of thing would be much more feasible.)
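For concreteness, a small sketch of the conversions in question, using fromText/toText from the text-short package; NodeLabel is a hypothetical AST type. With a UTF-8 Text these conversions become plain byte copies instead of transcoding passes:

import qualified Data.Text as T
import qualified Data.Text.Short as TS

-- Hypothetical compact AST field backed by ShortText.
newtype NodeLabel = NodeLabel TS.ShortText

mkLabel :: T.Text -> NodeLabel
mkLabel = NodeLabel . TS.fromText -- transcodes under UTF-16 Text; a copy under UTF-8

labelText :: NodeLabel -> T.Text
labelText (NodeLabel s) = TS.toText s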
Glad to see this taking shape.
I should have asked this at the very beginning: what is the goal of this PR? Is it to facilitate some discussion, which may or may not be incorporated into the proposal text, on the basis of which (alone, or including the discussion) the proposal will be brought to the HF technical group to decide on clearing funds for the implementation? On the other hand, this project is already on the https://haskell.foundation/projects/ page, …
@phadej I think that's a meta question that shouldn't be answered here, but I'm happy to have an out-of-band discussion about it elsewhere in more depth. Please focus on the proposal as a project specification for work that will happen if approved by the HF. The other proposals will operate similarly.
The critical point is if approved. Thank you. What is the status of the other projects on the https://haskell.foundation/projects/ page: are haskup, GHC Platform CI, Project Matchmaker, Performance Tuning Book, Vector Types Proposal, and GHC Performance Dashboards all also awaiting HF approval (and therefore not receiving funds)? The page gives the impression that all of these projects are in the same state, and the impression isn't that they are "awaiting HF approval", but rather that they are already ongoing and "blessed" by HF.
And as a separate comment: for me the … I hope that it's clear to everyone that it's not up to the HF technical board to decide (before or after the experiment is done) whether … Again, just give @Bodigrim what he needs to write the patch and run the benchmarks. Let's not make this more difficult than it needs to be.
Thank you all for the extremely productive feedback on this proposal; it has been great to see people interested and highly involved. As of the HF Board Meeting on May 20th, 2021, this proposal has been approved.
...or indirectly, via https://hackage.haskell.org/package/ghc-9.2.1 ...but removed by Matt Pickering for us when we hit that wall (https://hackage.haskell.org/package/ghc-9.2.2)
However, GHC is now reinstallable!
@Ericson2314 Where/when did this happen?
https://gitlab.haskell.org/ghc/ghc/-/merge_requests/5965. GHC itself, I suppose I should disclaim :). Stuff like https://gitlab.haskell.org/ghc/ghc/-/merge_requests/6803 is still needed if we are to be able to nicely rebuild all the libraries.
thanks!
This is old news, but the proposal has been completed successfully in haskell/text#365 and released as …
This proposal outlines a project plan for the migration of the text package from its current default encoding (UTF-16) to a new default of UTF-8.

The lack of UTF-8 as a default in the text package is a pain point raised by the Haskell Community and many of our industry partners for many years. We have done our homework in soliciting feedback from the broader community and industry, and have received positive affirmation of the following: … the text package name, and providing alternatives in the case that users require UTF-16 text.