Opening The Backdoor To The Google Platform

from the worth-the-effort? dept

In a move that looks much more designed to be about getting publicity than actually being useful, someone has released, as open source, code needed to scrape Google, and present a Google clone -- sans ads. There's a bunch of blather from the guy who did this about how he's trying to take back the internet from commercial interests or something and he fully expects Google to sue him, but it does raise some interesting legal questions. Part of his claim is that Google is making money by scraping other sites without permission and putting ads on it. So, considering that it was legal for them to scrape other sites, his belief is that it's perfectly legal to scrape Google's site and take the ads away. Of course, Google does let those who request it be removed from their index. Either way, this does highlight the legal question of whether or not compiling a database of publicly available info is copyrightable itself. It's probably in Google's best interest to simply let this one lie. The guy is obviously looking for a fight, and honestly, the number of people likely to use such a thing is going to be tiny compared to how many will continue to just use Google. The argument that it's a "loss of advertising revenue" isn't going to hold much weight -- as anyone who goes through the trouble of using this instead of Google directly is unlikely to click on the ads anyway. Also, Google could let this thing run its course to see how many people use it as a backdoor into using Google as a platform to design more compelling applications. If people do, that could give Google some direction in how to push forward with their own open API plans.
Hide this

Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community.

Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis.

While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.

–The Techdirt Team


Reader Comments

Subscribe: RSS

View by: Time | Thread


  1. identicon
    Brandon, 11 Jan 2005 @ 5:13pm

    Oh, those kooks?

    Ah, Orlowski and Daniel Brandt, hadn't heard from them in a while.

    link to this | view in thread ]

  2. identicon
    sfb, 12 Jan 2005 @ 3:56pm

    robots.txt

    Google's robots.txt pretty clearly forbids this, so there isn't any comparison to google crawling other web sites, since google DOES obey robots.txt.

    http://www.google.com/robots.txt

    link to this | view in thread ]

  3. identicon
    Tim, 13 Jan 2005 @ 12:09am

    No Subject Given

    The site looks at lot like Google as seen by Firefox with adblocking turned on. Why bother?

    link to this | view in thread ]

  4. identicon
    Scrape Google, 21 Nov 2009 @ 6:35am

    Scraping Google for Fun and Profit

    A very detailed article including very advanced PHP source code about Google scraping can be found here: http://google-scraper.squabbel.com

    I've used it to scrape 300,000 results about a niche in Internet Marketing I was researching about.

    Very powerful and a good read!

    link to this | view in thread ]


Follow Techdirt
Essential Reading
Techdirt Deals
Report this ad  |  Hide Techdirt ads
Techdirt Insider Discord

The latest chatter on the Techdirt Insider Discord channel...

Loading...
Recent Stories

This site, like most other sites on the web, uses cookies. For more information, see our privacy policy. Got it
Close

Email This

This feature is only available to registered users. Register or sign in to use it.