• Hannes (unregistered)

    "This IP range has been banned (DDoS attacks)."

    This reminds me of how my friends and I got our Uni IP-range-banned from a forum. All that happended was a few double posts, just because the F5-Key got stuck and the browser kept reloading the "post successful" page... it was an accident, I swear!

    After 2 years or so, the Uni got unbanned.

  • BillR (unregistered) in reply to ANON
    ANON:
    They developed an Android application and he's talking about "years after his graduation"? Not bad for an OS which exists less than 5 years.

    And it was running both perl and PHP?

    I call bullshit on this one.

  • jay (unregistered) in reply to chubertdev
    chubertdev:
    jay:
    Why did he use a cron job to query at all? Why not just query when the user made the request to select a restaurant? Presumably if he's using a cron job than he must be saving the data somewhere, and retrieving it when a request is made. This adds a bunch more work that wouldn't be necessary if he just queried when a request hit.

    Query on demand is a bad, bad idea. I could just hold down the F5 key (or something similar, depending on whether or not this really is an Android app), and cause a DoS. I'm assuming that the scraping job is pretty heavy, so even running it ten times in a minute probably causes quite the load.

    ...

    Going through every page one-by-one for every restaurant every five minutes is one hell of a load.

    If this was a production app that would have many users, I could see querying the ultimate data source periodically and caching the results. Again, given the static nature of the data, once a day would surely be plenty often. But for a class project this seemed to me to add more work than necessary to meet the requirements.

    I guess a lot depends on how much work it really had to do each time it hit CityEats. I hadn't heard of this site before and was thinking it was part of the anonymization, but now I see it's real. So okay, it displays a list of all the restaurants meeting the search criteria. If he just relates user location to which restaurants he will choose from based on "neighborhood", then he doesn't need to access the detail pages, so there's just one page to hit and scrape. (Assuming he can figure out which neighborhood to use from the user location without hitting CityEats.) If he wants a more precise location, well, hmm, I see that on the restaurant detail page the little GoogleMap image actually uses the lat/long to select the display area, he could scrape that for the location info. In that case he'd have to hit each detail page. Still, I guess it depends what neighborhood you specify, how many restaurants will show up? Typical looks like half a dozen to a dozen. So each scraping visit he'd hit the search results page plus maybe a dozen detail pages. That doesn't seem very heavy.

    Of course it's possible that he didn't try to narrow the search, but on each scraping visit he hit every restaurant on the site. I'm not sure how many total they have, probably at least hundreds. I could see that getting heavy.

  • jay (unregistered) in reply to no laughing matter
    no laughing matter:
    dpb:
    xaade:
    trwtf:
    Right or wrong, this is simply how restaurants have always been chosen in large universities.

    Ok.... ?

    Moving on.

    Attempt at starting a meme from that stupid sexual harassment TDWTF from a week ago or so. (or I should say the stupid overreaction comment thread)

    Starting????

    Too late now, another stupid meme was bred in thedailywtf-comments-dungeon!

    Please, try to show a little sensitivity. Right or wrong, the president's daughter was chained in the comments dungeon, and I assure you it was no laughing matter. Fortunately Paula Bean came up with a brillant way to get her out. Okay, that's not true, but it's not false either. It's file_not_found.

  • (cs)

    i love this story.

  • Barf 4Eva (unregistered)

    Brian fails...for responding "Yeah, sure, I leave the project in the hand of That Guy"

    You deserve what ya get, sorry man. :)

  • (cs) in reply to jay
    jay:
    Please, try to show a little sensitivity. Right or wrong, the president's daughter was chained in the comments dungeon, and I assure you it was no laughing matter. Fortunately Paula Bean came up with a brillant way to get her out. Okay, that's not true, but it's not false either. It's file_not_found.
    I don't see what's wrong with this meme!

    Does anyone of you bitches know the specs?

  • ZoomST (unregistered) in reply to no laughing matter
    no laughing matter:
    jay:
    Please, try to show a little sensitivity. Right or wrong, the president's daughter was chained in the comments dungeon, and I assure you it was no laughing matter. Fortunately Paula Bean came up with a brillant way to get her out. Okay, that's not true, but it's not false either. It's file_not_found.
    I don't see what's wrong with this meme!

    Does anyone of you bitches know the specs?

    To see what's wrong, just print it out, put over a wooden table, take a picture and send it by e-mail.

  • Darth Matter (unregistered) in reply to ZoomST
    ZoomST:
    no laughing matter:
    jay:
    Please, try to show a little sensitivity. Right or wrong, the president's daughter was chained in the comments dungeon, and I assure you it was no laughing matter. Fortunately Paula Bean came up with a brillant way to get her out. Okay, that's not true, but it's not false either. It's file_not_found.
    I don't see what's wrong with this meme! Does anyone of you bitches know the specs?
    To see what's wrong, just print it out, put over a wooden table, take a picture and send it by e-mail.
    I've followed your process. Pray that i don't process your fellows any further!
  • greenpepper (unregistered) in reply to Ed

    Exactly my thought!!

  • greenpepper (unregistered) in reply to greenpepper

    Forget about 'That guy' for a moment. What about the others in the group? what were they doing for 3 weeks?!!

  • (cs)

    I was totally That Guy, except I managed to get my entire university banned from all nih.gov sub domains (including pubmed).

Leave a comment on “1 or 2 or 3 or 4”

Log In or post as a guest

Replying to comment #:

« Return to Article