• (nodebb)

    It's interesting how the UTF-8 encoding of U+00A9 COPYRIGHT SIGN (C2 A9), when interpreted as Windows-1252, decodes as the same character, but with a U+00C2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX in front of it.
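
    A minimal Python sketch of that round trip, assuming Python's built-in "cp1252" codec is an acceptable stand-in for Windows-1252 (the variable names are just for illustration):

```python
# Encode the copyright sign as UTF-8, then misread those bytes as Windows-1252.
copyright_sign = "\u00a9"                   # U+00A9 COPYRIGHT SIGN
utf8_bytes = copyright_sign.encode("utf-8")  # two bytes: C2 A9
mojibake = utf8_bytes.decode("cp1252")       # each byte becomes one character

print(utf8_bytes.hex(" ").upper())
print([f"U+{ord(ch):04X}" for ch in mojibake])
```

    The second character of the result is still U+00A9; the extra U+00C2 in front of it comes from the lead byte C2.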

  • Rob (unregistered)

    According to Google Translate, the series of Chinese characters Cole T. found in the Amazon search box translates as "Enameled Cast Iron Dutch Skillet". Are we sure it's mojibake and not a legitimate Amazon search that somehow got transferred from a different domain?

  • Anonymous') OR 1=1; DROP TABLE wtf; -- (unregistered) in reply to colejohnson66

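    Yup, that's the case for all characters in the range U+0080 to U+00BF (strictly, only U+00A0 to U+00BF under Windows-1252, which assigns other characters to bytes 80 through 9F; the full range works under ISO-8859-1). In binary, the 8-bit code point 10abcdef gets encoded in UTF-8 as the 2-byte sequence 11000010 10abcdef. Fun little consequence of how UTF-8 works.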
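
    For anyone who wants to check the bit pattern, here is a rough Python sketch. Both function names are hypothetical helpers written for this comment, not part of any library. The first builds the two-byte form by hand (lead byte 110xxxxx, continuation byte 10xxxxxx); the second asks whether a character comes back as itself with U+00C2 in front when its UTF-8 bytes are decoded with a single-byte codec.

```python
def encode_two_byte(code_point: int) -> bytes:
    """Hand-rolled UTF-8 for U+0080..U+07FF (illustrative helper)."""
    if not 0x80 <= code_point <= 0x7FF:
        raise ValueError("only the two-byte range is handled here")
    lead = 0b11000000 | (code_point >> 6)           # 110xxxxx: top 5 bits
    cont = 0b10000000 | (code_point & 0b00111111)   # 10xxxxxx: low 6 bits
    return bytes([lead, cont])


def survives_as_mojibake(code_point: int, codec: str = "cp1252") -> bool:
    """True if the UTF-8 bytes, misread as `codec`, give U+00C2 + the same char."""
    ch = chr(code_point)
    try:
        return ch.encode("utf-8").decode(codec) == "\u00c2" + ch
    except UnicodeDecodeError:
        # Python's cp1252 codec leaves a few bytes (such as 0x81) undefined.
        return False


# For U+0080..U+00BF the top bits are 00010, so the lead byte is always C2.
print({encode_two_byte(cp)[0] for cp in range(0x80, 0xC0)} == {0xC2})
# Under cp1252 the effect holds for U+00A0..U+00BF; under latin-1, for the whole range.
print(all(survives_as_mojibake(cp) for cp in range(0xA0, 0xC0)))
print(all(survives_as_mojibake(cp, "latin-1") for cp in range(0x80, 0xC0)))
```

    The continuation byte has the same value as the original code point in this range, which is why the second character looks untouched.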

  • Someone (unregistered)

    Well, Romans did not generally use spaces, so maybe spaces aren't Latin characters, in some sense of the word 'Latin' at least.

  • Foo AKA Fooo (unregistered) in reply to Anonymous') OR 1=1; DROP TABLE wtf; --

    Indeed, and had I not seen a lot of mojibake before in my life, I probably wouldn't even have noticed this one. But the A with circumflex was a telltale.

Leave a comment on “Boy Howdy”
