- Feature Articles
- CodeSOD
- Error'd
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
More to the point, they don't no how ignerant they our. And so they dont' realise that they're grammer isnt' perfict in the frist plaice.
Dunning and Kruger ride again.
Admin
Why should we? :-D
Admin
The book Bilbo is writing IS "The Hobbit". "There and Back Again, a Hobbit's Tale" is its alternate (in Universe) title.
Admin
Admin
The name "Hohlbein" is special as well: "Hollow leg". I bet he had to endure quite some mocking as a kid.
Admin
Huh? The site loads and renders fine for me. Are you sure you have a Japanese font installed on your computer?
CAPTCHA: facilisi - Word of advice: don't be a fascist sissy
Admin
He was different/special in the sense that he could fix the bug.
Admin
Its not an axiom if you have to prove it.
Admin
Slashdot prime doesn't support Unicode, but clearly someone has been able to make it work with the same code base. Ergo, Slashdot must be willfully not supporting Unicode.
Admin
Actually, Slashdot DOES support Unicode. In fact, it always had.
It's just that a bunch of random idiots decided to break the page layout playing with the various Unicode codepoints that they implemented a rather strict WHITELIST of allowed codepoints.
And the whitelist excludes anything that has a HTML entity representation as well, and of course, excludes the generic HTML codepoint implementation as well.
There was a pile of it involving the force right-to-left character that screwed up page layouts and other things that people used to fake moderation scores and other such crap.
tl;dr version - It supports Unicode, but idiots being idiots forced there to be a whitelist of codepoints.
Captcha: damnum. Damnum idiot /. posters abusing Unicode forcing this crap on the rest of the people.
Admin
tl;dr: Yes, it has a whitelist, which sucks because it's too small.
However, if you're using the D1 system (middle-click a post title to open it in a new tab and you'll find yourself using the D1 reply form), you get to see what it originally did, which is convert UTF-8 to ANSI with what appears to be this exact routine:
In other words, you get funny accented characters for all of the code points that require more than 1 byte in UTF-8.Admin
That whitelist is apparently so strict that "smart quotes" are always messed up. The UKP symbol is usually broken, but sometimes a poster figures out how to make it work, probably by doing something unusual like using entities.
I know about the problem with direction reversal characters, and it's not rocket science to simply filter them. You don't need to blacklist 99% of Unicode for the <1% of character codes (mostly in the 20xx range) that cause problems. That is much more of a WTF than simply not properly supporting UTF-8.
Admin
Admin
TRWTF is that PHP (which this appears to be) has built-in functions for handling UTF-8:
http://php.net/manual/en/function.utf8-encode.php
(The second is at function.utf8-decode.php - I guess I can't post two links.)
Admin
Three cheers for Muphry's Law!
Admin
Admin
Replacement characters used to come from web sites that used Canadian (accented French Canadian alphabetic) and other characters that didn't represent valid EUC or ShiftJIS encodings, where the webmasters forgot to declare that they used a foreign encoding.
Admin
Admin
Admin
Then again, that's a WTF system of versioning. So never mind.
Admin
Anders is a very common Swedish name. Compare Andersson, what means the son of Anders
Admin
"Might" refers to something in the past did not happen but could have. "May" refers to something which has a possibility of happening in the future, or something in the past or present about whose occurrence is uncertain.
"Please don't commit your code changes yet, you may break the application and now would not be a good time."
"I told you not to commit your code changes. You may have broken the application."
"I specifically asked you not to commit your changes. You might have broken the application. Fortunately it seems okay this time, but please don't do it again when asked not to."
Admin
Thanks Lorne, that was a good read.
Admin
Admin
Admin
"Before leaving the aircraft, please check around your immediate seating area for any personal belongings you might have brought on board."
Admin
By UKP, do you mean GBP?
Admin
Maybe it's because British English and American English have drifted apart in meaning, but to us elderly limeys this would have the meaning of "... any personal belongings that, if you had at any time had the opportunity of bringing on board, you would have, but didn't." (Apologies for the clumsy construction of this sentence, crafting it was more difficult than I thought it would be.)
As such, this message comes across as amusingly whimsical.
"Hmm, what can I look around for? I know, my golf clubs, which I left in the conservatory. No, they're not here. What about my cat, whom I left sunning herself in the lounge? No, she's not here either ..."
And so on.
(Whatever she herself thinks about the ownership situation, my cat most definitely is one of my personal possessions.)
Admin
That's what George Carlin said as well: "Well... I MIGHT have brought my arrowhead collection! But I didn't. SO I'M NOT GOING TO LOOK FOR IT! I'm going to look for things I BROUGHT on board! It would seem to increase the likelihood of my finding something, wouldn't you say?"
Admin
Still, it's not as bad as "We will take off momentarily".
Admin
I wouldn't bother with UTF-8. If ASCII is good enough for jesus it's good enough for me.
Admin
Or referring to the room where one rids oneself of body waste as the "bathroom".
Admin
Because slowly, but surely, it's gathering up enough momentum, to the point where it kicks your arse.
Then the question will become "why didn't someone make us pay attention?"
Admin
You don't need functions to encode and decode UTF8. You need functions to encode and decode other character sets into UTF.
Admin
Using count in for loop condition is plain stupid if there is no way array size can change during looping.
Admin
Come on! The real WTF here is that there is a webservice to encode text to and from UTF-8.
That's like seriously having a web service in production that adds numbers for you. Need to add two numbers? Don't use the + operator, but to a webservice call...
Admin
If Unicode had been made incompatible with ASCII, I think that would have been great. Most Americans (get over it, that's what we call ourselves) would not want the expense and pain of converting, so I could ignore it completely.
Admin
whoa!
Admin
pssht, I won't be alive for that.
Admin
Still confused how a web startup has a System that has been around since "time out of mind". Well, it was a web startup cubicle anyway, so what was it doing in Anders' company?
Admin
You never know what might happen in the future.
Admin
Admin
In all fairness, it does work for a certain subset of UTF-8 :-)
Admin
Shouldn't that be "Grammar Nazgul" (rather than Nazi)? Give or take a few special characters I didn't encode in my comment. (I'm gonna do it later)
Admin
Shouldn't that be "Grammar Nazgûl" (rather than Nazi)?FTFY.
Admin
Both in the article & in this example, shouldn't the function return something too?
function _utf8Encode(&$arr){ // insert real utf8 encode function here return &$arr; } function _utf8Decode(&$arr){ // insert real utf8 decode function here return &$arr; }
Admin
Admin
I got hung up on the phrase "well-to-do programmer".
"Well-to-do" means "rich". What is a rich programmer doing working in a hole, er, cubicle?
I don' think tha' word means what you think it means...