GH-41238: [Release] Use UTF-8 as the default encoding to upload binary #41242
… binary We may have non ASCII characters in the process. For example, PGP uid may include non ASCII characters.
Makes sense 👍
My favorite `LANG`.
I used this for 16.0.0 RC0 and confirmed via debug print that UTF-8 is used.
export LC_ADDRESS=C.UTF-8
export LC_CTYPE=C.UTF-8
export LC_IDENTIFICATION=C.UTF-8
export LC_MEASUREMENT=C.UTF-8
export LC_MONETARY=C.UTF-8
export LC_NAME=C.UTF-8
export LC_NUMERIC=C.UTF-8
export LC_PAPER=C.UTF-8
export LC_TELEPHONE=C.UTF-8
export LC_TIME=C.UTF-8
You could set LC_ALL=C.UTF-8
Ah, I misunderstood `LC_ALL`. I thought it could be overridden by other `LC_*` variables, the way `LANG` is.
I'll simplify the code. Thanks.
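For reference, the lookup order discussed above can be sketched in plain shell. This is only an illustration: `effective_locale` is a hypothetical helper, not part of the release scripts, and the final fallback to `C` is an assumption (POSIX leaves the default implementation-defined when none of the variables is set).

```shell
#!/bin/sh
# effective_locale CATEGORY
# Hypothetical helper: prints the value that the POSIX lookup order would
# select for CATEGORY (for example LC_TIME).
#   1. LC_ALL, if set and non-empty
#   2. the category's own LC_* variable, if set and non-empty
#   3. LANG, if set and non-empty
#   4. otherwise "C" (assumed default; implementation-defined in POSIX)
effective_locale() {
  category=$1
  # Read the variable whose name is stored in $category.
  eval "specific=\${$category:-}"
  if [ -n "${LC_ALL:-}" ]; then
    printf '%s\n' "$LC_ALL"
  elif [ -n "$specific" ]; then
    printf '%s\n' "$specific"
  elif [ -n "${LANG:-}" ]; then
    printf '%s\n' "$LANG"
  else
    printf '%s\n' "C"
  fi
}

# LC_ALL takes precedence over both LC_TIME and LANG.
(
  LANG=C; LC_TIME=POSIX; LC_ALL=C.UTF-8
  effective_locale LC_TIME   # prints C.UTF-8
)

# Without LC_ALL, the category variable takes precedence over LANG.
(
  unset LC_ALL
  LANG=C; LC_TIME=POSIX
  effective_locale LC_TIME   # prints POSIX
)
```

Because `LC_ALL` is consulted first, a single `export LC_ALL=C.UTF-8` covers every category that the individual `LC_*` exports would set.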
After merging your PR, Conbench analyzed the 7 benchmarking runs that have been run so far on merge-commit d49b62d. There were no benchmark performance regressions. 🎉 The full Conbench report has more details. It also includes information about 10 possible false positives for unstable benchmarks that are known to sometimes produce them.
#41242)

### Rationale for this change

We may have non ASCII characters in the process. For example, PGP uid may include non ASCII characters.

### What changes are included in this PR?

Use `LANG=C.UTF-8` and `LC_*=C.UTF-8` to use UTF-8 as the default encoding.

### Are these changes tested?

Yes. I used this for 16.0.0 RC0.

### Are there any user-facing changes?

No.

* GitHub Issue: #41238

Authored-by: Sutou Kouhei <[email protected]>
Signed-off-by: Sutou Kouhei <[email protected]>
Rationale for this change
We may have non ASCII characters in the process. For example, PGP uid may include non ASCII characters.
What changes are included in this PR?
Use `LANG=C.UTF-8` and `LC_*=C.UTF-8` to use UTF-8 as the default encoding.
Are these changes tested?
Yes. I used this for 16.0.0 RC0.
Are there any user-facing changes?
No.
ArgumentError: invalid byte sequence in US-ASCII when signing binaries for RC0 #41238