Allow case insensitive matching of literals #34

dmajda · 2011-08-14T15:15:33Z

Right now, matching literals case-insensitively is hard and ugly. For example, the only way to match "select" case-insensitively is:

select = [Ss][Ee][Ll][Ee][Cc][Tt]

Having one global flag for case-insensitivity would create problems when parts of a language is case-sensitive and another case-insensitive. Also combining languages (a feature I am thinging about for later) would be harder. Better way would be to signify case insensitivity for each literal separately, e.g. like this:

select = "select"i

The text was updated successfully, but these errors were encountered:

izuzak · 2011-08-15T08:51:43Z

i really like this proposal ("select"i). however, limiting the flag to just literals will prove problematic in large grammar files which are used for case insensitive matching - adding "i" to the end of each literal is very error-prone, imo.

could the "i" flag work in a way that it may be applied not just to literals but to any expression? for example:

newRule = select#
select = "select1" / "select2"

where "#" is the flag for case insensitive matching and in this case it is applied to everything matched by the select rule (both "select1" and "select2"). of course, the "#" flag could also be applied to literals themselves, e.g. "select1"#.

dmajda · 2011-09-30T10:51:09Z

I implemented the i suffix proposal for literals and also for character classes.

I chose this solution because any solution that would mark whole parts of the grammar as case-insensitive would have problems with recursive rules (probably requiring forbidding in such cases) and generally introduce complexity and non-local effects. This is something I'd like to avoid.

An easy way to avoid forgetting i when writing literals is to extract all case-insensitive keywords into separate rules and group them together like this:

select = "select"i
from   = "from"i
where  = "where"i
...

s3u · 2011-12-06T01:45:15Z

Has this fix ever made it into the online site or npm modules?

dmajda · 2012-02-02T20:01:41Z

@s3u Not yet. Will be included in PEG.js 0.7.0.

JanSemorad · 2017-07-20T15:54:47Z

DOWNLOAD
= "DOWNLOAD"
/ D O W N L O A D {return "DOWNLOAD" }

D = [Dd]
O = [Oo]
W = [Ww]
N = [Nn]
L = [Ll]
A = [Aa]

jtenner · 2017-07-20T16:26:01Z

None of this change is documented. I've been using peg.js for years now and I had no idea this feature exists.

futagoza · 2017-07-20T23:04:10Z

@JanSemorad Is there a point to that post?

@jtenner Thank for pointing this out. I've been using PEG.js for 4 years now, and for some reason I always knew the feature was there but somehow never realised it wasn't documented 😆 Opened a new issue for this, see #518

ghost assigned dmajda Aug 14, 2011

dmajda closed this as completed Sep 30, 2011

futagoza mentioned this issue Jul 20, 2017

Document case insensitive matching of literals #518

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow case insensitive matching of literals #34

Allow case insensitive matching of literals #34

dmajda commented Aug 14, 2011

izuzak commented Aug 15, 2011

dmajda commented Sep 30, 2011

s3u commented Dec 6, 2011

dmajda commented Feb 2, 2012

JanSemorad commented Jul 20, 2017

jtenner commented Jul 20, 2017

futagoza commented Jul 20, 2017

Allow case insensitive matching of literals #34

Allow case insensitive matching of literals #34

Comments

dmajda commented Aug 14, 2011

izuzak commented Aug 15, 2011

dmajda commented Sep 30, 2011

s3u commented Dec 6, 2011

dmajda commented Feb 2, 2012

JanSemorad commented Jul 20, 2017

jtenner commented Jul 20, 2017

futagoza commented Jul 20, 2017