I have a new post on Hackaday: Nic Barker Explains ASCII, Unicode, And UTF-8.
Tag Archives: unicode
JavaScript strings
I learned some things I didn’t know about strings in JavaScript over here: How big is a JavaScript string?
Unicode Normalization Forms
Today’s reading: Unicode Normalization Forms.
Plain Text
This was a quick romp around “plain text”: Plain Text – Dylan Beattie – NDC Oslo 2021. Would recommend if you don’t already know about such things.
Unicode Utilities: Confusables
Today I was referred to: Unicode Utilities: Confusables.
Unicode bidirectional override hack
Interesting: Security advisory for rustc (CVE-2021-42574).
As an example, the following snippet (with {U+NNNN} replaced with the Unicode codepoint NNNN):
if access_level != "user{U+202E} {U+2066}// Check if admin{U+2069} {U+2066}" {
...would be rendered by bidirectional-aware tools as:
if access_level != "user" { // Check if admin
Unicode problem in Windows 11
Today via Slashdot: Microsoft Warns of New Windows 11 Problems With Apps Using Unusual Registry Keys.
Unicode 14.0.0
So Unicode 14.0.0 is out.
Unicode Normalization FAQ
Some good info over in the Unicode Normalization FAQ.
Unicode Regular Expressions
Found some good info on Unicode in regular expressions: Unicode Regular Expressions. Particularly some good doco about Unicode Categories and how to indicate them in regular expressions.