From 798cb382e2a6f774df25a5ad7161f511c9713933 Mon Sep 17 00:00:00 2001
From: RGBCube
Date: Thu, 3 Jul 2025 18:43:29 +0300
Subject: [PATCH] dump(unicode.utf-8-bypass): update

---
 apply.nu                          | 2 +-
 site/dump/unicode/utf-8-bypass.md | 3 ++-
 2 files changed, 3 insertions(+), 2 deletions(-)

diff --git a/apply.nu b/apply.nu
index 0762013..5b8b229 100755
--- a/apply.nu
+++ b/apply.nu
@@ -7,7 +7,7 @@ def --wrapped sync [...arguments] {
 --delete --recursive --force
 --delete-excluded
- --delete-missing-arguments
+ --delete-missing-args
 --human-readable
 --delay-updates
diff --git a/site/dump/unicode/utf-8-bypass.md b/site/dump/unicode/utf-8-bypass.md
index e1ce72e..a310e31 100644
--- a/site/dump/unicode/utf-8-bypass.md
+++ b/site/dump/unicode/utf-8-bypass.md
@@ -8,7 +8,8 @@ slash) character in UTF-8 in 3 different ways?
 
 These were `0x2F`, or `0xC0 0xAF`, or `0xE0 0x80 0xAF`.
 
-This led to security issues and let attackers bypass validation logic.
+This led to [security issues](https://capec.mitre.org/data/definitions/80.html)
+and let attackers bypass validation logic.
 
 The Unicode specification later was revised to say that a UTF-8 encoder must
 produce the shortest possible sequence that can represent a codepoint, and a