- Feature Articles
- CodeSOD
-
Error'd
- Most Recent Articles
- Monkeys
- Killing Time
- Hypersensitive
- Infallabella
- Doubled Daniel
- It Figures
- Three Little Nyms
- Tangled Up In Blue
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
Strings suck. We should all use integers.
Admin
Admin
Booleans suck, we should all use arrays of bits
Admin
I agree, and in fact I always use that approach. So, instead of having the string "FOO" in my code, I have a little array of integers: 70, 79, 79.
To save on memory, I make each integer only 8 bits wide.
I also find it's convenient to mark the end of the list of integers by having an array element containing the integer 0.
Admin
Why use bits? Use unary digits...
ok.
CAPTCHA: ewww! (as in C)
Admin
C-strings are, in fact, a very good representation for string data in those situations where your program is not actually, you know, manipulating string data. They're great for printf() formatting messages, error messages, interface messages, and such, though.
Admin
Admin
I'm ok with C strings.
Looks like our WTF today is using strlen instead of sizeof. A pretty common mistake...
Admin
captcha: tacos - mmmmm....
Admin
Admin
Admin
because printf doesn't need to manipulate anything...
if memcpy() is involved (even hidden away in a standard library function like printf()), nul-terminated strings are not the best option. ie: not the best option, even if standards make them the only option.
but think about the alternative: VCHAR8, VCHAR16, VCHAR32, VCHAR64, VCHAR128? "How much space do you really need. Choose now!"
And then you might want an extra byte at the beginning to tell you how many lead bytes you're using.
Admin
That's right!
The fun is kind of ruined by the explanation of the WTF :(
Admin
Strings suck we should all use yarn.
Admin
If only there were some way to represent lengths of strings before interrogating the string (char array) itself.
I'm thinking something that could possibly hold different variables with a structure. I'll call it struct, for short.
You could call on a variable like "string_length" within this struct.
You could then use the second item in your struct which would point to the string itself which resides in memory. I'm thinking of calling this a pointer.
All I need to do is add some looping and conditional constructs.
Man, I think I'm on to something.
Look out world, here I come!
Admin
Admin
Admin
sizeof() would not return the character length of a C String. Assuming a "C string" is a char* memory buffer, the variable itself is a pointer.
Psuedo code:
char* mystring; mystring = somethingThatReturnsAstring(); int nLen = sizeof(mystring); // nLen would be 4 on a 32-bit system
Admin
Yeah, but string theory is supposed to explain everything, isn't it?
Admin
True...
(False? File Not Found?)
Admin
It is also a pretty common mistake to think that sizeof() returns the size of an array...
Admin
Replying to myself.
sizeof() will return a string length if you did:
char mystring[16] = "hello world"; int nLen = sizeof(mystring);
I just don't think it's practical using fixed length buffers like that.
captcha: doom (yes we all are)
Admin
g-string I like
Admin
Yes! Three-state logic for booleans!
Admin
String theoretic booleans!
enum boolean { false = 0, true = 1, not_even_wrong = 2, file_not_found = 3 };
-j
Admin
To be pedantic, the sizeof in your second example will return the size of the array used to hold the string and not the length of the string itself. The string, as defined in C, does not have to consume the entire array.
Insidious coders have been known to store more than one string in an array. Not that I condone that in regular practice of course. However, you are entirely correct that sizeof should not be used to determine the length of a string as it is not a fundamental type in C.
Admin
This could could be perfectly valid really.
I'll add comments to show what could be happening...
// set char* key_p to the key offset in buf string. key_p = &buf[buf_i]; // get the key_len by reading to the next null key_len = strlen(key_p);
// print key length, for kicks i suppose printf("key_len<%d>\n", key_len);
// if for some reason this thing isn't null terminated // then badness. Of course, we just did a strlen on it, // so i don't know that we'll ever go DOWN this code... // but its not really invalid of itself... if (key_p[key_len] != 0x00) { fprintf(stderr, "key termination error %02x\n", key_p[key_len]); close(key_fd); return; } // print the key for more debug fun. if (debug) { printf("len<%d> key<%s>\n", key_len, key_p); }
I'm sorry, but this just isn't all that WTFable. And null termination is a good thing, especially when you consider that it is very portable amongst disparate languages, operating systems, etc...
Admin
Also, I was incorrect. sizeof would be badness. Sorry, hadn't had my coffee yet. :-(
Admin
Admin
Sure it is very practical. Why use somestring = mystring when you can use strcpy(somestring,mystring); every time.
And then when you want to always have the same data in somestring and mystring then you can just call strcpy(mystring,somestring); every time you make a change. BRILLIANT!
Admin
There are no "strings" in C. Only arrays of char.
Admin
Admin
Null terminated strings make sense when you realise then a string is really just a pointer to memory, and having it null terminated means you don't need additional temporary variables while scanning it (just update the pointer; *(p+1) is a perfectly valid substring of a non-null string *p and is quicker than sub$() or right$() or whatever other function might be required to copy memory blocks around (which a string format with length indicator would require).
The problem is that they're not too intuitive to deal with (remember to allocate an extra byte; if you want to store 100 characters then you need 101 bytes allocated and the offsets range from 0 to 99... eep!) and you can easily start overlapping strings (*p and *(p+1) in this case; modify *p and you've automatically changes *(p+1) which may not be what you want) and this easily leads to bugs. Ah well.
All programming languages suck. All programmers suck.
Rgds Stephen (a programmer)
Admin
Mav, what point is a condition that is never true?
A slightly simplified variant would be the following:
char x = 0; if (x != 0) { ... }
Admin
I find your ideas intriguing and would like to subscribe to your newsletter.
Admin
Admin
while (!true) { // do a shot and then loosen your tie // things are only going to get worse from here on out }
Admin
I agree with you, its a dumb thing to do, but I've seen worse things... I'm sure I've made mistakes similar to this without knowing it, I imagine that every programmer has. So I hate to call down hellfire on this guy for doing it, especially when you consider the fact that it really doesn't adversely affect anything. It certainly isn't worse than failure. Its merely worse than no failure. So really its a WTNF.
Admin
Umm, strlen Got Lost
Admin
I have the thing just for you http://www.tcl.tk/
Admin
The real WTF is using strlen. Everyone nowdays should know, that only strnlen can save this world from Annihilation.
Admin
LOL, now THAT was FUNNY!!!
Admin
Ye Gods, I think he's got it. (0x00)
Admin
Ok, what about if he's worried about concurrent access? Maybe he's just trying to make his code super-robust?
To wit:
key_p = &buf[buf_i]; key_len = strlen(key_p);
printf("key_len<%d>\n", key_len);
// RIGHT HERE another thread comes in and modifies the // contents of key_p! if (key_p[key_len] != 0x00) { fprintf(stderr, "key termination error %02x\n", key_p[key_len]); close(key_fd); return; } if (debug) { printf("len<%d> key<%s>\n", key_len, key_p); }
A bit of a stretch I know...
Or maybe he's worried about some malicious process coming in and modifying the strlen function in memory, so that it appears to work, but is subtly broken... the nefarious scheme of some cartoon supervillain-esque hacker?
Admin
Er... strcpy?
Admin
integers suck too, and so do bits, I use arrays of dual-phase silicon semiconductor relays
Admin
char mystring[16] = "hello world"; Will allocate on the stack, so will never segfault, until the function returns anyway.
CAPTCHA: doom (it likes doom today)
Admin
But then again, you can just #define 0 std::rand(), so I guess my point is moot.
Admin
The only case you can use sizeof() as equivalent of strlen() is when you let the compiler figure out how much space do you need:
char mystring[] = "hello"; printf("%d %d\n", sizeof(mystring), strlen(mystring));
6 5
Notice that they are off by 1, because sizeof() counts the zero terminating byte, and strlen() doesn't. It's not C strings that suck, it's "arrays == pointers" paradigm and the rest of the language, too, thanks, K&R, great job!
Admin
I don't think there's nothing funny that can be said about null-terminated strings. I rather like this [paraphrased] quote:
One of the major reasons for the downfall of the Roman Empire was, lacking zero, they had no way to indicate termination of their C strings.