Why Google Isn't Stealing Newspaper Content
from the make-it-stop dept
This is just getting ridiculous. Google may have signaled its willingness to pay up with its deal with AFP, and now it seems that newspaper publishers are interested in taking them up on the offer. OJR reports that Sam Zell, who is in the process of buying the Tribune Company, has lashed out at publishers for letting Google "steal" their content: "If all the newspapers in America did not allow Google to steal their content for nothing, what would Google do, and how profitable would Google be?" This sounds quite similar to columnist David Lazarus' "plan" to save the newspaper industry. Unfortunately, they've got the situation completely backwards.
Google is not "stealing" content. They're also not making their money off of others' content. What they're doing is making that content a lot more valuable by making it much easier to find. Google isn't making money on the content -- but on driving more people to that content (and on the news side, they don't make any money directly, since they don't run ads on Google News).
It's bizarre that this is so difficult for those in the publishing industry to understand. You don't yell at the phone book for "making money" off of your contact information. You don't yell at tour books for "making money" off of other people's locations. You recognize that they make money by being a guide or a directory -- just like Google. Either way, it doesn't bode well that the guy who's taking over the Tribune Company doesn't seem to have the slightest clue how Google works or how it's helping, not hurting, the business he's in the process of buying.
Reader Comments
I have arrived to an equal yet opposite conclusion.
Either way, it doesn't bode well that the guy who's taking over the Tribune Company seems to understand how to extort money from Google by threatening to ruin it for everybody unless he gets the % of profits he is accustomed to.
[ link to this | view in thread ]
1. Google will stop running any site that protests its being run.
2. People that are running sites, and threatening Google, are money grubbing morons.
3. Money grubbing morons are concerned, mainly, with grubbing money, not with actually working on quality news programming.
4. Quality news programming, with intelligent, considerate, non-moron people running it, will float to the top.
Everybody wins, except the morons.
[ link to this | view in thread ]
Flawed Analogies
In both of those cases, though, the stuff being "stolen" are facts (contact information, addresses/historical information), which aren't copyrightable.
In a "strict constructionist" sense, AFP and Mr. Zell are correct: Google is committing copyright infringement. However, so is much of the technology of the Internet (e.g., proxy servers, routers, content filters, search engines, hosted feed-readers). That's why things like robots.txt are opt-out; if they were opt-in, the Internet would fall apart. It also suggests that there's an implied extension of the fair use doctrine (right to copy as part of a requested point-to-point delivery of information, and the right to develop indices of information published publicly) that really needs to be codified.
Mr. Zell should be able to do a financial analysis comparing what he's making off search engine traffic today vs. what he'd make if search engines were blocked. I suspect the math works in favor of him keeping the search engine traffic, but who knows?
[ link to this | view in thread ]
Oh really?
[ link to this | view in thread ]
The analogy about yelling at the phone book is perfect for describing this. Plus I'm cracking up at the mental image of a guy yelling at a phone book.
[ link to this | view in thread ]
Question
Newspapers make money from advertising and subscriptions. How exactly does Google help that?
If I am a writer for my local paper that is then indexed and shown on Google news.... personally I may not care (more exposure for me) but in terms of running a business, this does nothing for the paper. In fact, I agree with the earlier comment.. the paper owns the content, not Google.
That being said, I do feel news should be free for all but that is just not how our capitalistic society is constructed unless the owner(s) of the content (the papers) care to share it.
[ link to this | view in thread ]
xenophobic media
Those that want to maintain control of readership will put subscriptions on their website content (i.e. NYTimes) and have to decide if the revenue from subscriptions offsets the loss of revenue from advertising hits.
by the way, Mr. Zell, I don't read the NYTimes anymore. hint, hint
[ link to this | view in thread ]
Then when their traffic drops through the floor and their web advertising revenue drops to zero, they'll come back and beg to be included.
And if their traffic doesn't drop, they didn't need search engines anyway.
Search engines are providing content owners a service, making their content findable. They should have to pay content owners as well?
[ link to this | view in thread ]
From TFA...
The Sun is prominent on my Google News page because of its good local coverage and lively letters page. When I find an interesting local article, I often read it on the paper's site, look there for background information, and visit the letters page to see what's being said about it.
The Sun is a Tribune paper however, so if Zell implements this policy, I'll get my local news from one of the local TV station sites (not the FOX station!).
[ link to this | view in thread ]
Robots.txt
If you want your content to be spidered:
User-agent: *
Allow:
If not, then nothing.
I know it isn't the way the web traditionally works, and I more than most despise over-legislation, but, if the new guys want to get into the game with the old guys, they are going to have to play by their rules. Right?
If robots.txt were opt-in and the papers don't want their sites to be indexed, go ahead. It will only result in their faster demise.
Please correct me if I'm wrong.
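For anyone who wants to see the difference, here is a rough sketch of the two conventions. The opt-out check uses Python's standard urllib.robotparser; the opt-in rule is purely hypothetical (no crawler is known to work this way), and the bot name and URL are placeholders.

```python
from urllib.robotparser import RobotFileParser

URL = "http://example.com/news/story.html"  # placeholder URL, not a real site


def allowed_opt_out(robots_txt: str, agent: str = "ExampleBot") -> bool:
    """Today's convention: anything not explicitly disallowed may be crawled."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, URL)


def allowed_opt_in(robots_txt: str, agent: str = "ExampleBot") -> bool:
    """Hypothetical opt-in rule: crawl only if the file has an Allow line
    and the standard parser does not forbid the URL."""
    has_allow = any(
        line.strip().lower().startswith("allow:")
        for line in robots_txt.splitlines()
    )
    return has_allow and allowed_opt_out(robots_txt, agent)


# A site that never wrote a robots.txt at all (empty file contents).
print(allowed_opt_out(""))                         # True
print(allowed_opt_in(""))                          # False
print(allowed_opt_in("User-agent: *\nAllow: /"))   # True
```

Under the hypothetical opt-in rule, every site that never thought about robots.txt simply drops out of the index.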
[ link to this | view in thread ]
Personally
[ link to this | view in thread ]
Funny isn't it
[ link to this | view in thread ]
re: Robots.txt
Murphy's point is that if permission to crawl was required, most of the indexed content would never have been found. Further, I contend that even those sites in the opt-in robots.txt model with admins smart enough to be aware that (a) yes, they do want to be crawled, and (b) how to enable exposure... many of these admins would blow it anyway: improper syntax, complicated rules, ...
Murphy's comment is the most cogent one so far: The Internet's beauty lies in its openness. Put something up that people find interesting, and they'll find a way to get to it, and to look at it, over and over. If you shut off access, or just plain go under, someone will have mirrored it.
It's a (mostly) merit-based popularity contest. The winners get the most eyeballs.
[ link to this | view in thread ]
In my own opinion, I think Google should charge the newspapers for listing their content on their site so that it gets more coverage. I know that goes against what Google stands for, but seriously, enough is enough, people: it's time for the new people to start telling the older people to wake up (something akin to EMI opening up its digital music catalog = good move).
[ link to this | view in thread ]
Get over it, dudes
then again, that's entirely understandable given your business model. 4 yuppies farting in a one bedroom in belmont who produce NOTHING in the way of real info themselves. nope. you just rely on what others produce and then stick a finger in the wind and rant.
[ link to this | view in thread ]
Re: Get over it, dudes
Ketle, I have someone I'd like you to meet. His name is Pott. I think you two will get along quite nicely.
[ link to this | view in thread ]
Google's Copying
Who-hoo! I'm cogent! :-)
Fair Use is reserved for "purposes such as criticism, comment, news reporting, teaching (including multiple copies for classroom use), scholarship, or research" per the Copyright Act of 1976. Google News is none of these, unless it's ruled that news aggregation qualifies as "news reporting". Google's cache is also none of these. It's possible that Google News and/or Google Cache would qualify as fair use under an extended test involving a set of criteria (see the Act, or Wikipedia's quote from the Act), but that's not nearly as obvious to armchair attorneys like yours truly. It's even possible that there's case law that already holds that aggregation or caching qualifies as fair use, but the Act doesn't specifically call for it.
In theory, either the courts will clarify whether Google's techniques are fair use, or Congress will pass legislation one way or the other. Even though we're talking about them in the context of Google, aggregation and caching are two fundamental constructs of the Internet experience, so here's hoping that the decision-makers don't throw out the baby (a reasonable working Internet) with the bath water (protecting dewy-eyed newspapers from the rapacious greed of Google, as Mr. Zell might perceive it).
[ link to this | view in thread ]
Headline Theft
[ link to this | view in thread ]
Re: Question
Advertising relies on people actually seeing the ads. If Google drives more traffic, more people see the ads. Pretty simple.
If I am a writer for my local paper that is then indexed and shown on Google news.... personally I may not care (more exposure for me) but in terms of running a business, this does nothing for the paper. In fact, I agree with the earlier comment.. the paper owns the content, not Google.
I'm afraid you may be confused about how Google News works. It only shows a headline and a short snippet and links to the actual newspaper site. Just like a regular Google search... So, the content isn't being shown on Google News. Google is just sending people to the actual newspaper website.
[ link to this | view in thread ]
Re: Google's Copying
1) The purpose and character of the use, including whether such use is of a commercial nature or is for nonprofit educational purposes?
The purpose is to display relevant links to news based on generic terms such as "top stories" or specific search terms. The character of the use is the display of links, with a brief clip (usually one sentence) and a possible picture. There are no ads on the page, so the 'commercial nature' aspect does not come into play. Clicking on the link takes the user to the originating website - the Google News site drives traffic to originating websites.
2) The nature of the copyrighted work?
The copyrighted works are published news articles, many based on publicly available facts, typically available through various media outlets (tv, radio, newspapers, blogs, websites, etc). We are not talking government or trade secrets here - we are talking information that is widely available.
3) The amount and substantiality of the portion used in relation to the copyrighted work as a whole?
Google only displays the title of the story, maybe a one or two sentence clip from the story and/or an image, and the name of the source. The entire article is not reprinted on the Google News page and there are no cached versions. The link points to the originating website.
4) The effect of the use upon the potential market for or value of the copyrighted work.
Google News drives traffic to originating websites, therefore, its effect upon the potential market or value of the copyrighted work is a positive one. Typically, in the business world, the more positive exposure your product or service has, the better the likelihood that your product or service will succeed.
This armchair attorney believes that Google News is fair use that only increases the value of the original content.
[ link to this | view in thread ]
Of course Google is monetizing their news channel
If Google didn't aggregate other people's news on their site, fewer users would visit Google. Google is using copyrighted content to create a critical mass of users. Google should come to some arrangement with these content owners or can the news site and just advertise their search feature.
[ link to this | view in thread ]
Might be useful to you...
Mind you, Google News is not just gearing traffic towards major newspapers.
On the contrary, a couple of times I have looked for information using Google News, and I have been steered to some obscure newspaper in Ohio or, more often than not, India. Google's ranking algorithm for news articles that relate to the same story often places "reliable" and well-informed sources well below small-circulation papers that simply copy/paste AP news.
If I managed a major brand like NYT or Tribune, why would I rely on Google News to bring me traffic? That is putting my brand in direct comparison with smallish newspapers with little editorial value added. Besides, my brand alone gets me good amounts of traffic, so it doesn't interest me to get Google News traffic.
However, the value to Google of letting people access my well-researched, well-informed articles through Google News interface is high; Google may not derive money directly from the news searches, but it promotes the Google brand by making people spend more time with Google.
It's the same thing with Apple and Wal-Mart. Why aren't Macs sold at Wal-Mart or Best Buy? Those places are moving a lot of boxes, so one could say that Apple are shooting themselves in the foot, but why would Apple bother appearing in those stores next to no-brand PCs? The Apple retailers are few and far between, but they do draw crowds... and there, Apple controls the buying experience, have an opportunity to up-sell high-margin accessories, etc.
Of course Techdirt is in the opposite situation: you guys probably benefit a lot more from Google News, comparatively, than major publications...
[ link to this | view in thread ]
[ link to this | view in thread ]
I pay to have my information in the phone book
The real question is would Google still be making so much money if no one was allowing them to put their news stories on Google News?
I think a lot of this is a pre-emptive strike. As long as Google News never carries Google Ads, I don't believe any newspaper would mind having the stories listed. Google is doing the newspaper a favor.
What the newspapers don't want is for Google to start putting their own ads on Google News because then Google IS using the newspaper content to make money for Google.
[ link to this | view in thread ]
[ link to this | view in thread ]
Just some thoughts
Post #6 As an individual, not a company, I yell at the phone book companies for making money off of my information...charging me $5 a month for...not having my information sold
Yes you are right - that sucks and if it is true then you should be seeking legal advice as I'm pretty sure that's blackmail. However this is not what Google is doing, it does not charge money for listings and does not charge money for removal - indeed this can be achieved by the sites themselves simply by putting 2 very short lines into a file 'robots.txt'
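For reference, a minimal sketch of what those two lines look like and how a well-behaved crawler would read them, using Python's standard urllib.robotparser. The URL and the "OtherBot" name are placeholders, and robots.txt is only a request: it works because crawlers choose to honour it.

```python
from urllib.robotparser import RobotFileParser

# The two-line opt-out: every crawler is asked to stay away from every path.
BLOCK_ALL = "User-agent: *\nDisallow: /"
# A narrower variant that only asks Google's crawler to stay away.
BLOCK_GOOGLEBOT = "User-agent: Googlebot\nDisallow: /"


def can_crawl(robots_txt: str, agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


story = "http://example.com/news/story.html"  # placeholder URL
print(can_crawl(BLOCK_ALL, "Googlebot", story))        # False
print(can_crawl(BLOCK_GOOGLEBOT, "Googlebot", story))  # False
print(can_crawl(BLOCK_GOOGLEBOT, "OtherBot", story))   # True
```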
Post #8 Newspapers make money from advertising and subscriptions. How exactly does Google help that?
If I am a writer for my local paper that is then indexed and shown on Google news.... personally I may not care (more exposure for me) but in terms of running a business, this does nothing for the paper
So you make money from advertising, the amount of money you make is directly related to the number of visits to your site, but you cannot see how a page that links relevant searches to your site helps you?
Post #12
...I don't see what the big problem is with making robots.txt [an] opt-in technology...if the new guys want to get into the game with the old guys, they are going to have to play by their rules. Right?...
Making the web opt in would be a very Bad Plan for all the reasons that Brad Eleven points out in Post #15. However you are right about new guys getting into the game with old guys, except you seem to fail to realise that the new guys here are the papers. They have only really embraced the internet relatively recently - robots.txt and the opt-out methodology have been around since 1994, so why should us old guys change the way the rest of the internet works just to allow a few newcomers to keep their outdated business models?
You may have been drinking since you were 18, but don't come to my bar and expect to just take Old Jake's stool on your first visit ;0)
Post # 17
time to get over it. you boys sound like little whiners. doesn't really make a darn bit of difference what happens in the real world -- in this case google agreeing to compensate the papers
...and the horse you rode in on... My concern here is not just Google's acquiescing and what it means to Google, but the wider precedent. Which is that some old duffers from the press are riding in and effectively changing the way the internet works, by abusing the courts' own lack of knowledge. If I want to whine I will whine; if you don't like reading other people's opinions, stop going to sites where they express them.
Post #19
Remember this is in France so the copyright act doesn't apply and neither does the 'fair use' clause which is American - sorry
I think the reason this is fair though is that what are published are small extracts, which not only state the source but provide a mechanism to visit it (more than papers do when they quote each other). This combined with the fact that the method to stop what the papers are wrongly viewing as infringement exists, is public knowledge, and is easily implemented. These few papers CHOSE to go to court and argue this out in long-winded expensive legal actions, rather than ask their web developers to write two lines in a file...
Post #24
Google News is not just gearing traffic towards major newspapers...a couple of times I have looked for information using Google News, and I have been steered to some obscure newspaper...
So the solution to this competition is to make sure your paper NEVER appears? Genius
Post #27
But if Google agree to pay - [as] it did with Afp - what's the problem?
The problem with this is it kind of sets a precedent that this is the way to do business, which could in turn remove the ethos of the robots.txt file and opt-out, which in turn is damaging to the way the rest of the internet works. Wherever this type of boardroom idiocy makes its way into our community it should be shouted at just so we remember what is at stake.
In closing people seem to have a problem that Google make money from this venture - they do. What exactly is wrong with someone making money from sending business your way? Anyone who wants to advertise my business for free should feel free to do so - if you can find a way to make money for yourself whilst doing this, go for it with my blessing!
[ link to this | view in thread ]
Re: Question
[ link to this | view in thread ]
driving ms. daily
And I access Google News at least 4 times each and every day.
Now, if I'm a newspaper publisher in one of the top markets, and Google News never drives traffic my way, I'm not happy, as my competition is getting the traffic for stories that might very well be better (more thorough, more engaging, etc.) reported in my paper.
[ link to this | view in thread ]
I'll keep it simple...
[ link to this | view in thread ]