From 98f90c874a95d18c19195554a1a58f491e7097dd Mon Sep 17 00:00:00 2001 From: Justin Grant Date: Mon, 14 Feb 2022 17:08:12 -0800 Subject: [PATCH] Update string parsing/formatting documentation Expands "ISO Extensions" page in the docs into a more comprehensive overview of using strings in Temporal, including a new FAQ. Also renames the URL of that page to be more generic, and updates links to it. --- docs/README.md | 13 +- docs/iso-string-ext.md | 111 ---------- docs/ja/README.md | 5 +- docs/persistence-model.svg | 2 +- docs/plaindate.md | 2 +- docs/plaindatetime.md | 2 +- docs/plainmonthday.md | 2 +- docs/plainyearmonth.md | 2 +- docs/strings.md | 403 +++++++++++++++++++++++++++++++++++++ docs/zh_CN/README.md | 3 +- docs/zoneddatetime.md | 6 +- 11 files changed, 419 insertions(+), 132 deletions(-) delete mode 100644 docs/iso-string-ext.md create mode 100644 docs/strings.md diff --git a/docs/README.md b/docs/README.md index a3048e1762..002ab93fcb 100644 --- a/docs/README.md +++ b/docs/README.md @@ -233,24 +233,21 @@ date.daysInYear; // => 365 See [Temporal.Calendar Documentation](./calendar.md) for detailed documentation. -## Object Relationship +## Object relationship -## String Persistence +## String persistence, parsing, and formatting All `Temporal` types have a string representation for persistence and interoperability. -The correspondence between types and strings is shown below. -For more information about extensions to the ISO 8601 / RFC 3339 standards that are used by Temporal and which are intended to be put on a standards track, see [ISO string extensions](./iso-string-ext.md). +The correspondence between types and machine-readable strings is shown below. +For more information about string parsing, serialization, and formatting in `Temporal` (including how Temporal is using industry standards like ISO 8601 and RFC 3339), see [String Parsing, Serialization, and Formatting](./strings.md). -## Other Documentation - -### **Key Concepts** +## Other documentation - [Ambiguity](./ambiguity.md) — Explanation of missing times and double times due to daylight saving and time zone changes. - [Balancing](./balancing.md) — Explanation of when `Temporal.Duration` units wrap around to 0 and when they don't. -- [ISO String Extensions](./iso-string-ext.md) — Discussion of extensions to the ISO 8601 and/or RFC 3339 standards which are used by `Temporal`. - [Why do Temporal instances have a Calendar?](./calendar-review.md) — Background about why types like `Temporal.PlainDate` or `Temporal.PlainDate` contain a calendar. These extensions are being actively worked on with IETF to get them on a standards track. diff --git a/docs/iso-string-ext.md b/docs/iso-string-ext.md deleted file mode 100644 index 2491af57c3..0000000000 --- a/docs/iso-string-ext.md +++ /dev/null @@ -1,111 +0,0 @@ -# ECMAScript Extended ISO-8601 / RFC 3339 Strings - -[ISO-8601](https://en.wikipedia.org/wiki/ISO_8601) and [RFC 3339](https://tools.ietf.org/html/rfc3339) are the industry standards for machine-readable serialization of dates and times. -However, there are three key use cases that are helpful for ECMAScript users that are not covered in ISO-8601 or RFC 3339: - -1. IANA Time Zone Names -2. Month/Day Syntax -3. Calendar Systems - -This document addresses addenda to the ISO-8601 and/or RFC 3339 spec for string serialization in the ECMAScript `Temporal` proposal. - -## IANA Time Zone Names - -There is precedent in other popular datetime libraries, such as Joda Time (now java.time), for appending time zone names in brackets. -We intend to fully support this syntax in ECMAScript, both input and output. -For example: - -``` -2007-12-03T10:15:30+01:00[Europe/Paris] -``` - -Neither ISO-8601 nor RFC 3339 specifications mention this syntax, but it is adopted in `Temporal` as a de-facto industry standard. - -The time zone string is written according to the rules of the time zone database (see [timezone.md](timezone.md)). -These rules are documented [here](https://htmlpreview.github.io/?https://github.com/eggert/tz/blob/master/theory.html): - -> Use only valid POSIX file name components (i.e., the parts of names other than `'/'`). -> Do not use the file name components `'.'` and `'..'`. -> Within a file name component, use only ASCII letters, `'.'`, `'-'` and `'_'`. -> Do not use digits, as that might create an ambiguity with POSIX TZ strings. -> A file name component must not exceed 14 characters or start with `'-'`. - -## Month/Day Syntax - -ISO-8601 and RFC 3339 specify syntax appropriate for `Temporal.PlainDate`, `Temporal.PlainTime`, `Temporal.PlainDateTime`, and `Temporal.PlainYearMonth`, but not `Temporal.PlainMonthDay` except in the special case of a duration. -Therefore, `Temporal` defines the following convention for month/day strings: - -``` -07-04 -``` - -The above string would signify July 4 in the ISO calendar. - -RFC 3339 includes `--07-04` for a date without year component. -In the current spec, we accept this but don't emit it. - -## Calendar Systems - -In order to achieve round-trip persistence for `Temporal` objects using non-ISO calendar systems, it's needed to add a calendar system identifier to the string syntax. - -The authors of this proposal are unaware of existing precedent for expressing a calendar system in an ISO 8601 and/or RFC 3339 string. -Therefore, we are proposing the following extension: -_Calendar-specific dates are expressed as their equivalent date in the ISO calendar system, with a suffix signifying the calendar system into which the ISO date should be converted when read by a computer._ - -For example, when parsed, the following string would represent the date **28 Iyar 5780** in the Hebrew calendar: - -``` -2020-05-22[u-ca=hebrew] -``` - -The syntax of the calendar suffix is currently being proposed for standardization with the CalConnect and IETF Calsify standards bodies. - -The calendar identifiers [are defined by CLDR](http://unicode.org/reports/tr35/#UnicodeCalendarIdentifier) as [a sequence of 3-8 character BCP47 subtags](http://unicode.org/reports/tr35/#unicode_locale_extensions). -The list of calendar identifiers currently supported by CLDR is: - -- `buddhist` -- `chinese` -- `coptic` -- `dangi` -- `ethioaa` -- `ethiopic` -- `gregory` -- `hebrew` -- `indian` -- `islamic` -- `islamic-umalqura` -- `islamic-tbla` -- `islamic-civil` -- `islamic-rgsa` -- `iso8601` -- `japanese` -- `persian` -- `roc` -- `islamicc` (deprecated in favor of `islamic-civil`) - -Example of a maximal length string containing both an IANA time zone name and a calendar system: - -``` -2020-05-22T07:19:35.356-04:00[America/Indiana/Indianapolis][u-ca=islamic-umalqura] -``` - -### Calendar-dependent YearMonth and MonthDay - -Based on the data model discussion in [#391](https://github.com/tc39/proposal-temporal/issues/391), `Temporal.YearMonth` and `Temporal.MonthDay` use the `Temporal.PlainDate` data model when in a non-ISO calendar system. -For example, a `Temporal` object using the Hebrew calendar might store "Iyar 5780" in a `Temporal.YearMonth` using `2020-04-25` which is the first day of that month in the ISO calendar. - -When expressed as an ISO string, we would say: - -``` -2020-04-25[u-ca=hebrew] -``` - -Because it is ambiguous whether that string represents a `Temporal.PlainDate`, `Temporal.YearMonth`, or `Temporal.MonthDay`, the appropriate `Temporal` constructor must be chosen in order to get the expected data type back out. - -## String Persistence Overview - -All `Temporal` types have a string representation for persistence and interoperability. -Most of these types only require existing standards, with a few exceptions noted above. -The correspondence between types and strings is shown below. - - diff --git a/docs/ja/README.md b/docs/ja/README.md index 9c213732fd..2451f84a5c 100644 --- a/docs/ja/README.md +++ b/docs/ja/README.md @@ -80,7 +80,7 @@ const zonedDateTime = Temporal.ZonedDateTime.from({ }); // => 1995-12-07T03:24:30.0000035-08:00[America/Los_Angeles] ``` -これは Tempral において最も多くの情報を持つタイプであり、`Temporal.TimeZone`、`Temporal.Instant`、`Temporal.PlainDateTime`(これには `Temporal.Calendar` が含まれます)の組み合わせとしてみなせます。 +これは Temporal において最も多くの情報を持つタイプであり、`Temporal.TimeZone`、`Temporal.Instant`、`Temporal.PlainDateTime`(これには `Temporal.Calendar` が含まれます)の組み合わせとしてみなせます。 より詳しくは[Temporal.ZonedDateTime Documentation](../zoneddatetime.md)を参照してください。 @@ -224,7 +224,7 @@ date.daysInYear; // => 365 ## 文字列による永続性 -すべての`Temporal`タイプは、永続性や相互運用性のために文字列による表現を持っています。各タイプと文字列表現の対応を以下に示します。Temporal で使用されている ISO 8601 や RFC 3339 に関する、より詳細な情報と標準化に向けた取り組みに関しては[ECMAScript 拡張の ISO-8601 と RFC 3339](./iso-string-ext.md)を参照してください。 +すべての`Temporal`タイプは、永続性や相互運用性のために文字列による表現を持っています。各タイプと文字列表現の対応を以下に示します。Temporal で使用されている ISO 8601 や RFC 3339 に関する、より詳細な情報と標準化に向けた取り組みに関しては[ECMAScript 拡張の ISO-8601 と RFC 3339](./strings.md)を参照してください。 @@ -234,5 +234,4 @@ date.daysInYear; // => 365 - [曖昧性](./ambiguity.md) — 時間に関する情報が欠落して曖昧さを引き起こす仕組み、2 つのタイプの時間オブジェクトが必要な理由、サマータイムやタイムゾーンの定義変更による影響 - [バランシング](./balancing.md) — `Temporal.Duration`の単位が、いつ繰り上げされる(またはされない)のかについての説明 -- [ISO 文字列の拡張](./iso-string-ext.md) — `Temporal`で使用されている ISO 8601 および RFC 3339 の拡張に関する議論、IETF と連携した標準化への取り組み - [なぜ Temporal インスタンスはカレンダーの情報を持っているのか?](./calendar-review.md) — `Temporal.PlainDate`や`Temporal.PlainDate`がカレンダーシステムを保持している理由と背景 diff --git a/docs/persistence-model.svg b/docs/persistence-model.svg index 06733b9adc..d5f52bb0e0 100644 --- a/docs/persistence-model.svg +++ b/docs/persistence-model.svg @@ -1,6 +1,6 @@ + Table of Contents + + + +## Machine-readable vs. human-readable string formats + +There are two kinds of string representations for ECMAScript `Temporal` objects: + +- **"Machine-readable" formats** are based on industry standards and standards-track extensions. + As their name suggests, machine-readable formats are optimized for unambiguous and efficient parsing by computers, so they are not suitable for display to non-technical end users. + They are never localized. + Examples: + - `"2022-02-28"` + - `"2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai][u-ca=chinese]"` + - `"2022-02-28T03:06:00.092121729Z"` +- **"Human-readable" formats** - These formats are optimized for displaying to end users according to the language and conventions of users' locale, and the needs of specific UI applications. + Examples: + - `"2/28/2022, 11:06:00 AM PST"` + - `"Monday, February 28, 2022"` + - `"正月28日 GMT+8 11:06:00"` + +Temporal objects can be parsed from machine-readable formats and can also be serialized ("round-tripped") into these formats. + + +```javascript +zdt = Temporal.ZonedDateTime.from('2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai][u-ca=chinese]'); +zdt.toString(); + // => "2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai][u-ca=chinese]" + +date = Temporal.PlainDate.from('2022-02-28'); +date.toString(); + // => "2022-02-28" + +duration = Temporal.Duration.from('P1DT12H30M'); +duration.toString(); + // => "P1DT12H30M" +``` + + +Temporal can also produce localized, human-readable string representations. + + +```javascript +zdt = Temporal.ZonedDateTime.from('2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai][u-ca=chinese]'); +zdt.toLocaleString('zh-CN', { calendar: 'chinese' }); + // => "正月28日 GMT+8 11:06:00" + +// `toLocaleString` on `Temporal` objects works the same as `Intl.DateTimeFormat#format` +zdt = Temporal.ZonedDateTime.from('2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai][u-ca=chinese]'); +new Intl.DateTimeFormat('zh-CN', { calendar: 'chinese' }).format(zdt); + // => "正月28日 GMT+8 11:06:00" + +zdt = zdt.withCalendar('gregory'); +zdt.withTimeZone('America/Los_Angeles') + .toLocaleString('en-US', { calendar: 'gregory' }); + // => "2/27/2022, 7:06:00 PM PST" + +zdt.withTimeZone('Europe/Paris') + .toPlainDate() + .toLocaleString('fr-FR', { calendar: 'gregory', dateStyle: 'long' }); + // => "28 février 2022" + +zdt.withTimeZone('Europe/Paris') + .toPlainDate() + .subtract({ years: 3000 }) + .toLocaleString('fr-FR', { calendar: 'gregory', dateStyle: 'long' }); + // => "28 février 2022" + +zdt.withTimeZone('America/New_York') + .toPlainDate() + .subtract({ years: 3000 }) + .toLocaleString('en-US', { calendar: 'gregory', era: 'short', year: 'numeric', month: 'long', day: 'numeric' }); + // "February 27, 979 BC" +``` + + +Unlike machine-readable strings, human-readable strings cannot be parsed by Temporal. +Why not? + +## Machine-readable string persistence overview + +All `Temporal` types have a machine-readable string representation for persistence and interoperability. +These representations are based on existing industry standards, with a few standards-track extensions noted below. + + + +## Industry standards for machine-readable date/time strings + +[ISO-8601](https://en.wikipedia.org/wiki/ISO_8601) and [RFC 3339](https://tools.ietf.org/html/rfc3339) are the industry standards for machine-readable serialization of dates and times. +Temporal's string serialization is based on these existing standards, with exception of IANA Time Zones (like `Europe/Paris`) and non-Gregorian calendars like Hebrew or Chinese. +For IANA time zones and non-Gregorian calendars, a standards-track extension to RFC 3339 is being worked on as part of the [IETF SEDATE](https://datatracker.ietf.org/wg/sedate/about/) working group. + +Also, currently the default serialization for `Temporal.PlainMonthDay` (e.g. `12-25`) is also not covered by any current industry standard. +This [may change soon](https://github.com/tc39/proposal-temporal/issues/2049) by switching to the `XXXX-12-25` format from [ISO 8601-2](). + +The sections below explain the proposed extensions to RFC 3339 for time zones and calendars. + +### IANA time zone names + +For many years, a popular way to combine a timestamp and a time zone into a single string has been the format used in the [Java standard library](https://docs.oracle.com/javase/8/docs/api/java/time/ZonedDateTime.html#parse-java.lang.CharSequence-). +Before this format was adopted by Java in [JSR-310](https://jcp.org/en/jsr/detail?id=310) in 2014, other open-source libraries like [Joda Time](https://www.joda.org/joda-time/) (which JSR-310 was based on) used the same format. +For example: + +``` +2007-12-03T10:15:30+01:00[Europe/Paris] +``` + +Because of this format's long-term industry adoption, it was chosen for use in ECMAScript `Temporal` for both input and output. + +Although neither ISO-8601 nor RFC 3339 specifications currently use this syntax, it's on a standards track led by the [IETF SEDATE](https://datatracker.ietf.org/wg/sedate/about/) working group which includes ECMAScript `Temporal` champions as well as other industry participants. + +The time zone ID itself follows the rules of the time zone database, usually called TZDB. + +> Use only valid POSIX file name components (i.e., the parts of names other than `'/'`). +> Do not use the file name components `'.'` and `'..'`. +> Within a file name component, use only ASCII letters, `'.'`, `'-'` and `'_'`. +> Do not use digits, as that might create an ambiguity with POSIX TZ strings. +> A file name component must not exceed 14 characters or start with `'-'`. + +For more about TZDB, see the [`Temporal.TimeZone` documentation](timezone.md) or the [TZDB documentation](https://htmlpreview.github.io/?https://github.com/eggert/tz/blob/master/theory.html). + +### Calendar systems + +In order to achieve round-trip persistence for `Temporal` objects using non-ISO calendar systems, a calendar system identifier can be added. + +Therefore, we are proposing the following extension: +_Calendar-specific dates are expressed as their equivalent date in the ISO calendar system, with a suffix signifying the calendar system into which the ISO date should be converted when read by a computer._ + +For example, when parsed, the following string would represent the date **28 Iyar 5780** in the Hebrew calendar: + +``` +2020-05-22[u-ca=hebrew] +``` + +The syntax of the calendar suffix is currently on a standards track led by the [IETF SEDATE](https://datatracker.ietf.org/wg/sedate/about/) working group. + +The calendar identifiers [are defined by CLDR](http://unicode.org/reports/tr35/#UnicodeCalendarIdentifier) as [a sequence of 3-8 character BCP47 subtags](http://unicode.org/reports/tr35/#unicode_locale_extensions). +The list of calendar identifiers currently supported by CLDR is: + +- `buddhist` +- `chinese` +- `coptic` +- `dangi` +- `ethioaa` +- `ethiopic` +- `gregory` +- `hebrew` +- `indian` +- `islamic` +- `islamic-umalqura` +- `islamic-tbla` +- `islamic-civil` +- `islamic-rgsa` +- `iso8601` +- `japanese` +- `persian` +- `roc` +- `islamicc` (deprecated in favor of `islamic-civil`) + +Example of a maximal length string containing both an IANA time zone name and a calendar system: + +``` +2020-05-22T07:19:35.356-04:00[America/Indiana/Indianapolis][u-ca=islamic-umalqura] +``` + +### Calendar-dependent `Temporal.PlainYearMonth` and `Temporal.PlainMonthDay` + +Many `Temporal` types include a `calendar` property that controls the value of the `year`, `month` and `day` properties as well as other calendar-related behavior like adding months or calculating the number of number of days in a month. +But regardless of the calendar used, the string representation always uses the ISO 8601 calendar year, month, and day. +Doing this allows easier sorting and comparison of strings across different calendar systems, and it ensures better compatibility with other systems that are not aware of other calendar systems. + +For example, the following strings represent the same day in different calendar systems. + +```javascript +function localizedDate(s) { + date = Temporal.PlainDate.from(s); + calendar = Temporal.Calendar.from(s); + return date.toLocaleString('en-US', { calendar, dateStyle: 'long' }); +} +localizedDate('2020-04-25[u-ca=hebrew]'); // "1 Iyar 5780" +localizedDate('2020-04-25[u-ca=islamic]'); // "Ramadan 2, 1441 AH" +localizedDate('2020-04-25'); // "April 25, 2020" (ISO 8601 calendar is default) +``` + +In the ISO 8601 calendar, the string representation of `Temporal.PlainYearMonth` omits the day (e.g. `"2020-04"`) and the `Temporal.PlainMonthDay` representation omits the year (e.g. `"04-25"`). +But in other calendar systems, the year, month, and day are all needed in order to unambiguously determine the correct year and month (for `Temporal.PlainYearMonth`) or month and day (for `Temporal.PlainMonthDay`) in that calendar. +Therefore, the string representation of `Temporal.PlainYearMonth` and `Temporal.PlainMonthDay` in non-ISO calendar systems matches the representation of `Temporal.PlainDate`. +For example: + +```javascript +function localizedMonthDay(s) { + monthDay = Temporal.PlainMonthDay.from(s); + calendar = Temporal.Calendar.from(s); + return monthDay.toLocaleString('en-US', { calendar, month: 'long', day: 'numeric' }); +} +localizedMonthDay('2020-04-25[u-ca=hebrew]'); // "1 Iyar" +localizedMonthDay('2020-04-25[u-ca=islamic]'); // "Ramadan 2" +localizedMonthDay('04-25'); // "April 25" +``` + +## String parsing and formatting FAQ + +#### What is the right `Temporal` type to use for parsing a particular string? + +To determine the `Temporal` class that should be used to parse a string, it's important to pick the type whose data model matches the data in the string. + +- `Temporal.Instant` represents an exact time. + Its data model is the number of integer nanoseconds since January 1, 1970 UTC. + This type is unaware of time zones and does not provide a way to access calendar/clock units like month or hour. It's just a timestamp. +- `Temporal.ZonedDateTime` represents an exact time from the perspective of a specific time zone. + Its data model is a timestamp (like `Temporal.Instant`), a time zone (usually an IANA zone), and a calendar. + The time zone is needed because this `Temporal` type (and only this type) allows DST-safe creation of derived values. + When you add 1 day to a `Temporal.ZonedDateTime` instance, the exact time will usually be 24 hours later but it may be 23 or 25 if the addition crossed a DST transition. +- `Temporal.PlainDate` represents a timezone-less date: a local date without a time and without any reference to the time zone. + Its data model is a year/month/day and a calendar. +- `Temporal.PlainMonthDay` represents a birthday, anniversary, or other recurring event. + Its data model is a month, day, and calendar. +- `Temporal.PlainYearMonth` represents a month in a specific year, e.g. January 2022. + Its data model is a year, month, and calendar. +- `Temporal.PlainTime` represents a timezone-less time of day: a local time without reference to date, time zone, or calendar. + Its data model is an hour/minute/second, with seconds resolution down to 9 decimal digits (nanoseconds). +- `Temporal.PlainDateTime` represents a local date and time without any reference to the time zone. + Its data model is year/month/day/hour/minute/second and a calendar. + This type is rarely used, because the three types above cover most use cases. + If you only care about the exact timestamp and don't care about time zones, then use `Temporal.Instant`. + If you only care about the local date, then use `Temporal.PlainDate`. + If you care about the exact time and the time zone, then use `Temporal.ZonedDateTime`. + Other than the few use cases detailed in the [`Temporal.PlainDateTime` documentation](plaindatetime.html), most of the time it's better to use a different type. +- `Temporal.Duration` represents a period of time. + Its data model is a number of years, months, days, hours, minutes, seconds, milliseconds, microseconds, and nanoseconds. +- `Temporal.TimeZone` represents an IANA time zone like `Asia/Tokyo` or (rarely) an offset time zone like `+06:00`. + Its data model is the canonical ID of the time zone, e.g. `"Asia/Tokyo"` or `"+06:00"`. +- `Temporal.Calendar` represents a calendar like Hebrew, Chinese, or the default ISO 8601 calendar. + Its data model is the ID of the calendar, e.g. `"iso8601"` or `"hebrew"`. + +#### Is there one function I can call to parse any string into the appropriate `Temporal` type? + +No. +To parse a string into a `Temporal` type, you must know ahead of time what type to use, because the same string can be used to parse many different `Temporal` types. +For example, `2020-04-25[u-ca=hebrew]` can be successfully parsed by `Temporal.PlainDate.from`, `Temporal.PlainMonthDay.from`, `Temporal.PlainYearMonth.from`, or even `Temporal.PlainDateTime.from`. +This ambiguity requires choosing a `Temporal` type before parsing. + +#### When should I use `toString` vs. `toLocaleString` when converting `Temporal` types to a string? + +To display human-readable information to non-technical end users, use `toLocaleString()`. +To store data that can later be read by computers, use `toString()`. + +#### Why are IANA time zones needed? Why can't I just use timestamps like `2020-01-01T00:00-08:00`? + +Timestamps are good if you want to know the specific instant where something took place. +However, they're not good for: + +- Deriving new timestamps, e.g. "2 days later" where the new result needs to follow time zone rules +- Knowing what time zone was captured along with the data, which can be helpful later when deciding what time zone to show in a UI for that timestamp +- Solving [conflicts between time zone and UTC offset](ambiguity.html#ambiguity-caused-by-permanent-changes-to-a-time-zone-definition). + +For these reasons, it's helpful to capture and store the time zone (not just the offset) when strings are serialized. + +#### Why is a bracketed time zone annotation required to parse a string into a `Temporal.ZonedDateTime` object? + +To construct a `Temporal.ZonedDateTime`, three pieces of information are required: + +- A timestamp (which is the same data model as a `Temporal.Instant`) +- A calendar (which defaults to the `iso8601` calendar) +- A time zone, e.g. `America/Los_Angeles` + +A string like `"2022-02-28T03:06:00Z"` or `"2022-02-28T03:06:00+02:00"` lacks the time zone annotation (e.g. `[America/Los_Angeles]`), so it will throw an exception when parsed by `Temporal.ZonedDateTime.from`. + +A reasonable question is why `Temporal.ZonedDateTime` doesn't automatically use the offset (e.g. `Z` or `+02:00`) if there's no time zone suffix. +The answer is that `Temporal.ZonedDateTime` is designed to ensure DST-safe arithmetic, which in turn requires knowing when DST starts or stops in a particular time zone. +Here's a short example illustrating why this matters. + + +```javascript +function oneDayLaterInstant(s) { + return Temporal.ZonedDateTime.from(s).add({ days: 1 }).toInstant(); +} +oneDayLaterInstant('2021-03-28T00:00+01:00[Europe/Paris]'); + // => 2021-03-28T22:00:00Z +oneDayLaterInstant('2021-03-28T00:00+01:00[+01:00]'); + // => 2021-03-28T23:00:00Z (one hour later!) +oneDayLaterInstant('2021-03-28T00:00+01:00'); + // throws, because there's no time zone suffix +``` + + +The calls above use the same timestamp, but return a different result when called with a day that happens to contain a DST transition. +As the second call above shows, if you really do want to do arithmetic or otherwise use `Temporal.ZonedDateTime` functionality, it can be done using an offset time zone suffix like `[+01:00]`. + +But if there's no time zone suffix, then `Temporal.ZonedDateTime.from` will throw an exception. +This exception has the effect of forcing developers to provide the time zone in a different way, e.g. by parsing the string into a `Temporal.Instant` and using its `toZonedDateTimeISO()` method to convert to `Temporal.ZonedDateTime`. +This is a good outcome because whether the time zone is part of the string or called separately, requiring it enables developers to know that any `Temporal.ZonedDateTime` APIs will always return time-zone-aware results. +If timezone-aware results are not needed, then use another type like `Temporal.Instant`. + +#### How can I parse the offset of timestamp strings like `2022-02-28T03:06:00+02:00` or `2022-02-28T03:06:00Z`? + +Timestamp strings like `2022-02-28T03:06:00+02:00` or `2022-02-28T03:06:00Z` are normally parsed by `Temporal.Instant`. +Because the data model of `Temporal.Instant` is limited to the number of nanoseconds since January 1, 1970 UTC), the offset is not stored when parsing a string into a `Temporal.Instant`. +If the offset of the string is needed, use `Temporal.TimeZone.from`: + +```javascript +s = `2022-02-28T03:06:00Z`; +offset = Temporal.TimeZone.from(s); // => UTC + +s = `2022-02-28T03:06:00+02:00`; +offset = Temporal.TimeZone.from(s); // => +02:00 +``` + +#### Why can't I parse a UTC "Z" string using `Temporal.PlainXxx` types? + +One of the most common date/time-related bugs is treating a UTC timestamp as if it were a local date and time. +For example, `2022-02-01T00:00Z` is 1:00AM on February 1 in Paris but 7:00PM on January 31 in New York. +If a program assumes that `2022-02-01` is a local-timezone date, then that program will be wrong in half the world. + +Another, more subtle version of this bug is when an external data source switches its format for storing or sending timestamps. +For example, imagine a remote web service or database that provides a date/time string in this format: `2022-01-31T19:00-05:00`. +Developers can safely assume that `2022-01-31T19:00` is the local time in the time zone where this data was recorded, and so can parse this string with `Temporal.PlainDate.from`, `Temporal.PlainTime.from`, etc. +But now assume that the remote service or database is refactored to emit strings using a "Z" suffix like `2022-02-01T00:00Z`. +Code that only parses these strings into a `Temporal.Instant` won't break because the timestamps represent the same instant. +But code using `Temporal.PlainDate.from` will encounter off-by-one-day bugs, which can be maddening to track down because they're intermittent (only show up in some time zones and only at some times of day) and aren't attributable to code changes in the client app. + +To prevent these types of bugs, an exception will be thrown if a `Temporal.PlainXxx` type is asked to parse a timestamp string with a `Z` suffix. + +#### In the unusual case where I want to get the calendar/clock units from a UTC "Z" string, how can I do that? + +In rare cases, developers may want to extract the date and time units from a timestamp string that uses a `Z` suffix. +The answer above explains why this is usually a bad idea, but sometimes it's needed. +For example, code that stores a new log file every day may need to flip over to a new file at midnight UTC. +To handle cases like this, first parse the string into a `Temporal.Instant` and then convert to a `Temporal.ZonedDateTime` using the `UTC` time zone. + +```javascript +s = `2022-02-01T00:00Z`; +Temporal.PlainDate.from(s); // throws +zdt = Temporal.Instant.from(s).toZonedDateTimeISO(s).toPlainDate(); // => 2022-02-01 +``` + +#### Can I parse a subset of data in a string? + +Yes! +Temporal parsing methods accept strings that have more information than the type needs. +Information not needed for that type's data model is ignored. +This works for all `Temporal.PlainXxx` types, `Temporal.Instant`, and even `Temporal.TimeZone` and `Temporal.Calendar`. + +```javascript +s = '2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai]'; +Temporal.ZonedDateTime.from(s); // => 2022-02-28T11:06:00.092121729+08:00[Asia/Shanghai] +Temporal.PlainTime.from(s); // => 11:06:00.092121729 +Temporal.PlainDate.from(s); // => 2022-02-28 +Temporal.PlainDateTime.from(s); // => 2022-02-28T11:06:00.092121729 +Temporal.PlainYearMonth.from(s); // => 2022-02 +Temporal.PlainMonthDay.from(s); // => 02-28 (note: may change to "XXXX-02-28") +Temporal.Instant.from(s); // => 2022-02-28T03:06:00.092121729Z +Temporal.TimeZone.from(s); // => Asia/Shanghai +Temporal.Calendar.from(s); // => iso8601 (the default calendar when parsing strings) +``` + + + +#### Why doesn't `Temporal` support parsing localized formats like `MM/DD/YY`? + +Parsing of localized date/time formats is notoriously difficult and brittle. +For example, here's what MDN has to say about [`Date.parse`]() + +> It is not recommended to use `Date.parse` as until ES5, parsing of strings was entirely implementation dependent. +> There are still many differences in how different hosts parse date strings, therefore date strings should be manually parsed (a library can help if many different formats are to be accommodated). + +Because of the complexity involved in standardizing and implementing localized date parsing, TC39 (the standards committee responsible for ECMAScript) has learned from the painful experience of `Date.parse` that localized parsing is best left to userland libraries, not platform specifications. + +For more background, see [this discussion](https://github.com/tc39/ecma402/issues/342#issuecomment-486857111) about localized parsing in `Intl.DateTimeFormat`. + +For future proposals that extend the Temporal API, there's a [feature request](https://github.com/js-temporal/proposal-temporal-v2/issues/2) for a limited form of parsing. +Feel free to add feedback to that issue. + +#### What industry standards apply to the string formats that `Temporal` uses? + +See the [standards](#industry-standards-for-machine-readable-datetime-strings) section above. + +#### When and how will `Temporal`'s string extensions be standardized? + +For IANA time zones and non-Gregorian calendars, a standards-track extension to RFC 3339 is being worked on as part of the [IETF SEDATE](https://datatracker.ietf.org/wg/sedate/about/) working group. +A draft RFC is available and is currently on the path to standardization. + +#### Can I use a string in place of an object in `Temporal` APIs? + +Yes! +Any ECMAScript `Temporal` method that accepts a `Temporal` object parameter will also accept a string or property-bag representation of that object. + +```javascript +newYear = Temporal.PlainDate.from('2022-01-01'); +newYear.until(Temporal.PlainDate.from('2022-01-15')); // => P14D +newYear.until('2022-01-15'); // => P14D +newYear.until({ year: 2022, month: 1, day: 15 }); // => P14D +``` + +#### Will `Temporal` parse ISO 8601 year-week-day strings, e.g. `2020-W13-5`? + +No. +Year-week-day strings like `2020-W13-5` are not parsed by the initial `Temporal` API. +This is a fairly common request for a future proposal that extends `Temporal`. +See https://github.com/js-temporal/proposal-temporal-v2/issues/11 for more details. diff --git a/docs/zh_CN/README.md b/docs/zh_CN/README.md index de8e82be91..df09804a41 100644 --- a/docs/zh_CN/README.md +++ b/docs/zh_CN/README.md @@ -234,7 +234,7 @@ date.daysInYear; // => 365 ## 字符串持久化 为了持久性和互操作性,所有的 `Temporal` 类型都有一个字符串表示。类型和字符串之间的对应关系如下所示。 -关于 Temporal 中使用的 ISO 8601 和 RFC 3339 的更多信息和标准化工作,请参考 [ISO 字符串扩展](../iso-string-ext.md)。 +关于 Temporal 中使用的 ISO 8601 和 RFC 3339 的更多信息和标准化工作,请参考 [ISO 字符串扩展](../strings.md)。 @@ -244,5 +244,4 @@ date.daysInYear; // => 365 - [Ambiguity](../ambiguity.md) — 解释由于夏令时和时区变化造成的时间缺失和时间重复。 - [Balancing](../balancing.md) — 解释什么时候 `Temporal.Duration` 的单位会自然转换,什么时候不会。 -- [ISO String Extensions](../iso-string-ext.md) — 对 `Temporal` 使用的 ISO 8601 和/或 RFC 3339 标准的扩展讨论。 - [Why do Temporal instances have a Calendar?](../calendar-review.md) — 关于为什么像 `Temporal.PlainDate` 或 `Temporal.PlainDate` 类型包含日历系统。这些扩展正在与 IETF 积极合作以使其进入标准化流程。 diff --git a/docs/zoneddatetime.md b/docs/zoneddatetime.md index c257f9499a..150026c60c 100644 --- a/docs/zoneddatetime.md +++ b/docs/zoneddatetime.md @@ -96,7 +96,7 @@ For example: If the string isn't valid, then a `RangeError` will be thrown regardless of the value of `overflow`. Note that this string format (albeit limited to the ISO calendar system) is also used by `java.time` and some other time-zone-aware libraries. -For more information on `Temporal`'s extensions to the ISO string format and the progress towards becoming a published standard, see [ISO standard extensions](./iso-string-ext.md). +For more information on `Temporal`'s extensions to the ISO 8601 / RFC 3339 string format and the progress towards becoming a published standard, see [String Parsing, Serialization, and Formatting](./strings.md). The time zone ID is always required. `2020-08-05T20:06:13+09:00` and `2020-08-05T11:06:13Z` are not valid inputs to this method because they don't include a time zone ID in square brackets. @@ -1280,13 +1280,13 @@ Note that rounding may change the value of other units as well. Normally, a calendar annotation is shown when `zonedDateTime`'s calendar is not the ISO 8601 calendar. By setting the `calendarName` option to `'always'` or `'never'` this can be overridden to always or never show the annotation, respectively. -For more information on the calendar annotation, see [ISO string extensions](./iso-string-ext.md#calendar-systems). +For more information on the calendar annotation, see [ISO string extensions](./strings.md#calendar-systems). Likewise, passing `'never'` to the `timeZoneName` or `offset` options controls whether the time zone offset (`+01:00`) or name annotation (`[Europe/Paris]`) are shown. If the time zone offset is shown, it is always shown rounded to the nearest minute. The string format output by this method can be parsed by [`java.time.ZonedDateTime`](https://docs.oracle.com/javase/8/docs/api/java/time/ZonedDateTime.html) as long as the calendar annotation is not output. -For more information on `Temporal`'s extensions to the ISO string format and the progress towards becoming a published standard, see [ISO standard extensions](./iso-string-ext.md). +For more information on `Temporal`'s extensions to the ISO 8601 / RFC 3339 string format and the progress towards becoming a published standard, see [String Parsing, Serialization, and Formatting](./strings.md). Example usage: