mirror of
https://github.com/RGBCube/serenity
synced 2025-10-28 19:12:33 +00:00
The UTF-8 encoding of U+00A0 (NBSP) is the two bytes 0xc2 0xa0. By looping over the string byte-by-byte while escaping, we replace the second byte with "&nbsp;" but leave the first byte in the resulting text. This creates an invalid UTF-8 string containing a lone leading byte.

| Name | | |
|---|---|---|
| Entities.cpp | ||
| Entities.h | ||
| HTMLEncodingDetection.cpp | ||
| HTMLEncodingDetection.h | ||
| HTMLParser.cpp | ||
| HTMLParser.h | ||
| HTMLToken.cpp | ||
| HTMLToken.h | ||
| HTMLTokenizer.cpp | ||
| HTMLTokenizer.h | ||
| ListOfActiveFormattingElements.cpp | ||
| ListOfActiveFormattingElements.h | ||
| StackOfOpenElements.cpp | ||
| StackOfOpenElements.h | ||
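
The byte-wise escaping problem described in the commit message above can be illustrated with a minimal standalone sketch. It uses only the C++ standard library, not the AK or LibWeb types from this repository, and the function names `escape_bytewise` and `escape_nbsp` are hypothetical, chosen for illustration. The assumption that the escape replaces NBSP with `&nbsp;` follows the usual HTML serialization rules; the actual fix in the repository may be implemented differently (for example by iterating over code points).

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical reproduction of the bug: every byte is inspected on its own,
// so only the continuation byte 0xA0 of the NBSP sequence gets replaced and
// the leading byte 0xC2 is copied through unchanged.
std::string escape_bytewise(std::string_view input)
{
    std::string out;
    for (unsigned char byte : input) {
        if (byte == 0xA0)
            out += "&nbsp;";
        else
            out += static_cast<char>(byte);
    }
    return out;
}

// Hypothetical fix: match the complete two-byte sequence 0xC2 0xA0 and
// consume both bytes. In valid UTF-8, 0xC2 only ever appears as the leading
// byte of a two-byte sequence, so this pair always encodes U+00A0.
std::string escape_nbsp(std::string_view input)
{
    std::string out;
    for (std::size_t i = 0; i < input.size(); ++i) {
        auto byte = static_cast<unsigned char>(input[i]);
        bool has_next = i + 1 < input.size();
        if (byte == 0xC2 && has_next
            && static_cast<unsigned char>(input[i + 1]) == 0xA0) {
            out += "&nbsp;";
            ++i; // skip the continuation byte as well
        } else {
            out += input[i];
        }
    }
    return out;
}
```

For the input bytes `a 0xC2 0xA0 b`, the byte-wise version keeps the stray `0xC2` in front of `&nbsp;`, while the sequence-aware version produces `a&nbsp;b`.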