- Feature Articles
- CodeSOD
- Error'd
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
Looks like a Markov Chain Generator gone wrong. Oh, and Frist!
Admin
Theres something missing from the list of things spammers say, and it begins with F, and rhymes with thirst.
Admin
Looks like Markov chains: http://www.codinghorror.com/blog/2008/06/markov-and-you.html
Admin
The spammers are trying to poison Bayesian style spam filters with random text. They've been doing it for years both via email and on forums, so I'm surprised it's worth an article this late in the day.
Admin
Awesome is was the I in catsup
Admin
Pretty weird indeed :)
Admin
No quack.
Admin
Maybe they're testing their poetry generators?
Admin
The first one just needs a little editing to become Haiku
I thought about know I thought by themselves things up I still living still there
Admin
:) one of them was mine :)
Admin
Witty, I was going to be but then it happened ...
Admin
Maybe skynet is starting to wake up and testing its vocabulary. Still has a lot to learn, obviously. Who is going to help?
Admin
No quack.
Admin
Hmm, so would marking those emails as "not spam" tend to defeat that purpose?
Admin
Scoff, scoff.
I wrote a random sentence generator when I was 11 that worked better than that - and that was a pencil-and-paper emulator from a before cheap computers were invented.
(Okay I'll come clean: I borrowed the technique from an article in an old Scientific American from sometime in the mid-60's.)
The point being: if random sentence generators were known to computer scientists that long ago, how come spambotters haven't caught up?
Oh, and (scoff, scoff) the Markov Chain generator I programmed some time ago produces out that is miles better than any of those outputs. Mine actually outputs plausible English, not like the output of a resident of (insert country you don't like) on scopolamine.
Conclusion: spammers are of subhuman intelligence and deserve to be exterminated as vermin.
Admin
Some of these look like something Nagesh would write.
If were not careful these spam messages can accidently become Internet memes (and then they will become Skynet).
Admin
Shaka, when the walls fell.
Admin
Admin
This issue has puzzled me too. I read an blog article about that and the conclusion is, that the spammer tries to establish an account. This account - once it was not blocked for spamming and deemed valid - should be used for spamming and circumvent blocking, deletion or reports.
I guess that this practice rarely works on any blog.
Admin
Perfect! Best comment possible.
Admin
TRWTF is PHP amiright?
Admin
My favorite ones are the ones that insult your blog.
No, really. They say things like "This was really poorly written, and your arguments contradict one another" or "I think you stole this from someone else." Then, in the link bar... a link to whatever sketchy pharmacy or dating site they're pushing.
I guess the logic is that we'll be too hot under the collar and ready to respond to notice that they're a bot.
Admin
Also, +1 for the kitten mitten reference :)
I'd link it on youtube but I can't find one that isn't a copy recorded from somebody's shitty cell phone.
Admin
They serve as markers, they allow them to search with Google and discover "vulnerable" comment forms. These ramblings are pretty unique across the web.
Get rid of them ASAP, or they will come back in a few weeks and put you on your "can spam" list.
The motivation is simple: if you don't remove the random gibberish, it probably means you will also not remove their actual spam. For example, because this is an old and unmaintained blog. It doesn't pay off to solve a captcha to have a post deleted within minutes, so you have to identify blogs where stuff does not get removed.
Admin
This is actually really insightful poetic commentary about the greatness that is the show "Community." It points out its difficulty getting the network to commit to more than one season and how it limps along each year until it's finally renewed, and yet, it's still the king of comedies.
Admin
Someone set up us the bomb!
Admin
Maybe the puppet master is trying to hack the ghost in this machine?
Admin
Vogon poetry I think.
Admin
Admin
Back in 2003 I was noticing some strange subject lines in the spam I was getting. A very large number of spams started with
but then continued on with a few words that were all different. I posted a bunch of the subject lines on my blog, and somebody managed to identify them as lines from an English translation of Russian book called "The Master and the Margarita", which is available on-line.Admin
Suscipit: I suscipit is not easy for computers to understand poetry
Admin
I loved your writing about seemingly profound random messages from spammers, it was a good point. I agree that poisoning bayesian filters is important.
If you like this article, you should check out Horse_ebooks on Twitter, they've been using a similar method to avoid Twitter's spam filters where they pull a random phrase or line from (presumably) their books. The resulting feed is some kind of demented dadaist poetry.
For example, here's a few recent posts:
"This Is The Floating Dock You Would Own If You Had Unlimited Cash and Access" "orchids for years you will now discover everything" "Looking back, I realize that it wasn t the tree houses that made me happy." "Enjoy wines of "$50 quality""
Admin
Admin
Admin
Admin
Sounds like something one of the Nagesh (Nageshen?) would say.
Admin
Does anyone know of an on-line English prose de-obfuscator?
Have you ever visited that portion of Erin's plot that offers its sympathetic soil for the minute survey and scrutinous examination of those in political power, whose decision has wisely been the means before now of converting the stern and prejudiced, and reaching the hand of slight aid to share its strength in augmenting its agricultural richness?
Admin
As for spammers, I'm surprised that domain knowledge bot filters aren't more popular. That is, very simple questions that would easily be known by anyone visiting the site, but almost certainly not by some third-world CAPTCHA farmer. They seem to work really well.
Admin
M-x dissociated-press
Admin
Some social media "experts" and wannabe comedians hate horse_ebooks. I think it's because they can't come up with anything more interesting than what a spam-bot can, and it burns them up.
Admin
Guessing they're looking for some legitimacy by adding comments around. Some blogs don't let you post links or will scrutinize your post more when you're new.
Admin
Admin
Spammers are of average intelligence, which is genius-level intelligence compared to the vermin who click spam and make it worth their while. Exterminate the latter and the former will disappear.
Admin
Roffle. Best episode of TNG.
Admin
I don't know what's so hard to understand.
youtube.com/watch?v=G7RgN9ijwE4
Maybe locking yourselves to your desks has disconnected you with the youth of today?
Admin
Admin
As for some kind of emails as markers, they will become Skynet). M-x dissociated-press Some of things up I thought about know I loved your blog. No, really. They also smelt a link bar... a random sentence generators were invented. (Okay I'll come back over the walls fell. Perfect! Best comment forms. These ramblings are unavoidable. Add a captcha to hack the vast managed to become more frustrated at them ASAP, or line from the subject lines on any of an on-line English translation of a couple days. yahoo house (from gl): because We need took cutting off
Admin
Admin
No seriously, the problem with these "trivia question" captchas is that it's very easy to create dictionaries for them. It's hard to come up with many questions which are complex enough to stop a sweat shop worker, not too complex for an inexperienced user seeking for help and which have answers which can be verified with a string comparison.
It may not be worth to create a dictionary for a small blog or forum, but when one tries to spam a specific website (like an webmail provider to send spam) it would be worth the hassle.
Admin
It looks like they're getting all these words and phrases from a fairly small corpus. I tried Googling around to see if it was a common piece of literature, but when I started getting results like "Bodies of Missing Family Found in Hollow Tree," I stopped.