stories filed under: "google drive"

Google Drive's Autodetector For Copyright Infringement Is Locking Up Nearly Empty Files

from the whoopsie dept

Wed, Jan 26th 2022 12:12pm — Timothy Geigner

We've talked at length about the issues surrounding automated copyright infringement "bots" and how often those bots get the primary question they're tagged with wrong. Examples of this are legion: Viacom's bot takes down a Star Trek panel discussion, all kinds of bots disrupted the DNC's livestream of its convention, and one music distributor's bot firing off DMCA notices to, well, everyone. Google itself has reported that nearly 100% of the DMCA notices it gets are just bot-generated buckshot.

But Google isn't the savior here either. The company also uses automated systems for detecting copyright infringement and, at least in the case of Google Drive, those automated systems occasionally suck out loud at their job.

This week, Assistant Professor at Michigan State University, Dr. Emily Dolson, Ph.D. reported seeing some odd behavior when using Google Drive. One of the files in Dolson's Google Drive, 'output04.txt' was nearly empty—with nothing other than the digit '1' inside it.

But according to Google, this file violated the company's "Copyright Infringement policy" and was hence flagged. And what's worse is, the warning sent to the professor ended with "A review cannot be requeste for this restriction."

If your bot thinks a single digit is somehow copyright infringement, then your bot is a bad bot and should be taken behind the woodshed and humanely sent to bot-heaven where it can run and frolic with all the other bots. Now, to be fair, there is an open question in this case as to whether the filepath names that were chosen somehow were what was getting flagged. And, sure, maybe that happened. But it doesn't really change the point: a bot thought a file that contained a single integer was copyright infringement.

That being said, other Drive users have reproduced this, calling into the question the filepath theory.

Dr. Chris Jefferson, Ph.D., an AI and mathematics researcher at the University of St Andrews, was also able to reproduce the issue when uploading multiple computer-generated files to Drive. Jefferson generated over 2,000 files, each containing just a number between -1000 and 1000.

The files containing the digits 173, 174, 186, 266, 285, 302, 336, 451, 500, and 833 were shortly flagged by Google Drive for copyright infringement.

Again, this sucks. For what it's worth, Google has finally responded and, despite the notices indicating there was no way to dispute the bot's findings, has been sharing out links to do exactly that. But that isn't really the point. This is base-level stuff here: having a system that operates this poorly means you have a system that never should have been in production to begin with. Particularly, frankly, when that system is operating as personal file storage for many, many people.

Filed Under: automated filters, censorship, copyright, dmca, filters, google drive, hard drives, upload filters
Companies: google

32 Comments

The Crackdown On Torrent Sites Has Produced Many More Moles To Whac

from the getting-creative dept

Fri, Sep 8th 2017 11:51am — Timothy Geigner

If the ongoing battle between copyright infringers and copyright holders could be described in any simple term, that term would have to be whac-a-mole. Since the early days of piracy on the internet, the copyright industries have used their legal mallets to smack down any site or service whose head managed to rise out of obscurity. Napster was pushed into irrelevance, as were other similar apps. Then websites that hosted infringing files were slammed. At present, we are in the midst of a crackdown on torrent sites, with the copyright industries blaming them for widespread infringement.

However, those who are dedicated to sharing content illicitly are indeed dedicated. And so the game will continue into avenues of piracy that are fairly creative.

As crackdown on torrent sites continues around the world, people who are pirating TV shows and movies are having to get a little more creative. Cloud storage services such as Google Drive, Dropbox, and Kim Dotcom's Mega are some of the popular ones that are being used to distribute copyrighted content, according to DMCA takedown requests reviewed by Gadgets 360.

Google Drive seems most popular among such users, with nearly five thousand DMCA takedown requests filed by Hollywood studios and other copyright holders just last month. Each DMCA requests had listed a few hundred Google Drive links that the content owners wanted pulled.

But what's notable about many of these DMCA takedown requests is that they target Google Drive links that don't actually host any content themselves, but instead have embedded YouTube videos within them. YouTube has long been accused of hosting copyright infringing content, but few people consider it a serious vector for pirating movies or television shows. That's because YouTube cracks down on piracy itself, and it is easily searchable, meaning that copyright holders can find their content and send takedown requests. Most infringing content is taken down quickly because of this, so what would be the point of these embedded videos?

It turns out that the pirates found a simple workaround - the videos are simply uploaded as unlisted, so they don't turn up in search results. The links to these videos are then shared as Google Drive links through discussion forums and other channels so it's difficult for the content owners to find the videos and get them taken down.

Popular video sites YouTube, Vimeo, and Dailymotion are also being abused by distributing and hosting illicit content, DMCA takedown requests reveal, but the volume of such requests again implies that they are not being as widely used. Some pirates, getting creative, also turned to another streaming venue which is not used as widely - porn sites. For example, last year, news outlets reported an instance where all the songs of Kanye West's The Life of Pablo album were uploaded as a video to the popular website PornHub. You can still find a number of movies on the site, and oddly enough, also things like game trailers and music videos that could safely be posted on other sites as well.

While nobody would want to cheer this sort of infringement on, there is a certain aspect of creativity to it. That creativity nicely demonstrates the axiom: the internet is designed to route around obstructions. So too, it seems, are the communities dedicated to sharing copyrighted content. It seems that this war on piracy is whac-a-mole by nature, but it's actually worse than that.

What if the moles were hydras and every time you hit one on the head, two or more heads sprouted out as a result? Because it should be noted that the above strategy using Google Drive and YouTube to distribute infringing content isn't the only creative strategy that's sprouted out of the crackdown on torrent sites.

The most unusual service that is being abused for distributing content that we came across is My Maps. It's a feature Google introduced in 2007 to enable users to create custom maps. Anyone can visit the My Maps website, and create a custom map by pointing to a location on the map, adding a title, and filling up a description box. Google doesn't verify what kind of information users are sharing in description, so you can again easily share links to unlisted YouTube streams, or Google Drive files to download. What this means is that people can then share locations on maps, which lead to the pirated movies.

While Google's services are only the most abused of many for this sort of thing, you can already hear the content industries warming up their voices to sing a tune of how Evil Google is the pirate's tool of choice for copyright infringement. It's worth noting that all of this, however, has emerged despite Google's efforts at complying with copyright laws. It's also emerged as a result of this ongoing arms race waged primarily by the content industries, who could have expended this effort in figuring out new business models on which to make money from their content. Instead, we can mark time in the modern era by what the "piracy threat vector" du jour is. It seems tomorrow it may become Google Drive. Or My Maps. More years on it will be something we haven't even thought of yet.

Them moles keep coming, after all.

Filed Under: cloud, copyright, google drive, hosting sites, piracy, streaming, torrent sites

35 Comments

Google Drive Barely Launched... And Google's Already Hit With Patent Infringement Lawsuit

Patents

from the but,-of-course dept

Thu, Jun 7th 2012 3:46pm — Mike Masnick

It's almost becoming a rule in the tech industry, that actually doing something that people want to use absolutely guarantees that you're going to get sued for patent infringement. It's pretty clear that the current patent system is acting as a massive tax/tollbooth on innovation. The latest in a long line of examples: just as Google has been rolling out its Google Drive offering to users, it's been hit with a patent infringement lawsuit from a company with a patent (5,918,244) that covers a "method and system for coherently caching I/O devices across a network." As the lawsuit notes, the technology behind the patent is to enable the ability of "multiple computers [to] all communicate with each other and... all access data from the same data storage device or devices, such as hard disk." Basically, the patent describes a system of RAM caching. Because I'm sure no one ever would have figured out how to do that without the patent system... So, rather than just allowing the technology to progress in the market as new products are developed, we're left with legal fights and a tollbooth on innovation.

Filed Under: caching, google drive, patent troll
Companies: google

Calm Down Internet: Google Drive's Terms Are The Standard For Countless Websites, Including Gmail

Overhype

from the oh-good,-this-again... dept

Wed, Apr 25th 2012 7:58am — Leigh Beadon

Remember when everyone freaked out about parts of Pinterest's terms of service? And how, slowly but surely, word got out that the same terms can be found on virtually every website and are mostly harmless? And then everyone learned a lesson and calmed down, and would approach future terms of service with new knowledge and understanding?

Wait, scratch that last part. TNW reports that the terms of Google's much-anticipated Drive service, which launched this week, have been treated to the same warm welcome from the Twitterverse. Someone spotted yet another variant of the "worldwide license" clause that all websites include, and before long the freakout flag was flying.

The clause in question, though admittedly scary-sounding, is routine:

When you upload or otherwise submit content to our Services, you give Google (and those we work with) a worldwide license to use, host, store, reproduce, modify, create derivative works (such as those resulting from translations, adaptations or other changes we make so that your content works better with our Services), communicate, publish, publicly perform, publicly display and distribute such content.

I hate to break it to the panicking masses, but Google is not planning on turning your spreadsheets into a touring art exhibit. A broad license like this is necessary to allow Google to operate such a service, permitting them to move the data around freely on their many servers all over the world, and display it to you (or the people you share it with) through a variety of devices and interfaces. The nightmare-labyrinth of international copyright law means that the most Google could do without such a clause is accept your data then immediately delete it—and even then someone would probably try to claim they made five unauthorized copies en route to the trash bin.

Perhaps most amusing is the fact that this piece of legal lingo doesn't come from the Google Drive terms of service, but from Google's overall terms for all their services. Meaning it already applies to everything from Gmail to Google Mars—so this might just be getting started. At this point, I suspect every social network and user content website online is waiting for the hammer to fall, since any one of them could be singled out at any time for yet another round. Oh well, I guess nothing beats a good freakout.

Filed Under: google drive, license, terms of service
Companies: google, pinterest

73 Comments

Follow Techdirt

Essential Reading

The Techdirt Greenhouse

Read the latest posts:

read all »

Techdirt Deals

Report this ad | Hide Techdirt ads

Techdirt Insider Discord

The latest chatter on the Techdirt Insider Discord channel...

Older Stuff

Thursday
13:33	Former Employees Say Mossad Members Dropped By NSO Officers To Run Off-The-Books Phone Hacks (2)
12:01	No, Creating An NFT Of The Video Of A Horrific Shooting Will Not Get It Removed From The Internet (18)
10:49	San Francisco Cops Are Running Rape Victims' DNA Through Criminal Databases Because What Even The Fuck (18)
10:44	Daily Deal: The Complete 2022 Java Coder Bundle (0)
09:31	As Expected, Trump's Social Network Is Rapidly Banning Users It Doesn't Like, Without Telling Them Why (44)
06:30	Comcast Continues To Bleed Olympics Viewers After Years Of Bumbling (19)
Wednesday
20:42	Apple Finally Defeats Dumb Diverse Emoji Lawsuit One Year Later (6)
15:39	Clearview Pitch Deck Says It's Aiming For A 100 Billion Image Database, Restarting Sales To The Private Sector (10)
13:41	Peloton Outage Prevents Customers From Using $2,500 Exercise Bikes (16)
12:09	The GOP Knows That The Dem's Antitrust Efforts Have A Content Moderation Trojan Horse; Why Don't The Dems? (16)
10:51	Hertz Ordered To Tell Court How Many Thousands Of Renters It Falsely Accuses Of Theft Every Year (24)
09:21	Even As Trump Relies On Section 230 For Truth Social, He's Claiming In Lawsuits That It's Unconstitutional (34)
06:16	Medical, Home Alarm Industries Warn Of Major Outages As AT&T Shuts Down 3G Network (25)
Tuesday
20:37	Video Game History Foundation: Nintendo Actions 'Actively Destructive To Video Game History' (29)
15:35	Massachusetts Court Says No Expectation Of Privacy In Social Media Posts Unwittingly Shared With An Undercover Cop (17)
13:30	Techdirt Podcast Episode 312: Regulating The Internet (2)
12:03	US Copyright Office Gets It Right (Again): AI-Generated Works Do Not Get A Copyright Monopoly (60)
10:42	LA Sheriff Threatens To 'Subject' City Council To 'Defamation Law' If They Won't Stop Calling His Deputies 'Gang Members' (20)
10:37	Daily Deal: codeSpark Academy Sibling Bundle (0)
09:25	Trump's Truth Social Bakes Section 230 Directly Into Its Terms, So Apparently Trump Now Likes Section 230 (128)
06:22	15 Years Late, The FCC Cracks Down On Broadband Apartment Monopolies (31)
Sunday
12:05	Funniest/Most Insightful Comments Of The Week At Techdirt (11)
Saturday
12:00	This Week In Techdirt History: February 13th - 19th (1)
Friday
19:39	Letter From High-Ranking FBI Lawyer Tells Prosecutors How To Avoid Court Scrutiny Of Firearms Analysis Junk Science (25)
15:52	Nintendo Is Beginning To Look Like The Disney Of The Video Game Industry (44)
13:49	Seattle Public Radio Station Manages To Partially Brick Area Mazdas Using Nothing More Than Some Image Files (44)
12:13	Thankfully, Jay Inslee's Unconstitutional Bill To Criminalize Political Speech Dies In The Washington Senate (8)
10:52	How Our Convoluted Copyright Regime Explains Why Spotify Chose Joe Rogan Over Neil Young (136)
10:47	Daily Deal: The Complete Blocs Website Builder Bundle (0)
09:33	Arizona Prosecutor Who Brought Bogus Gang Charges Against Protesters Files Ridiculous Defamation Suit Against Her Boss (12)

Google Drive's Autodetector For Copyright Infringement Is Locking Up Nearly Empty Files

from the whoopsie dept

The Crackdown On Torrent Sites Has Produced Many More Moles To Whac

from the getting-creative dept

Google Drive Barely Launched... And Google's Already Hit With Patent Infringement Lawsuit

from the but,-of-course dept

Calm Down Internet: Google Drive's Terms Are The Standard For Countless Websites, Including Gmail

from the oh-good,-this-again... dept

The Techdirt Greenhouse

Thursday

Wednesday

Tuesday

Sunday

Saturday

Friday

More

Tools & Services

Company

Contact

More

from the whoopsie dept

from the getting-creative dept

from the but,-of-course dept

from the oh-good,-this-again... dept

Techdirt Daily Newsletter

The Techdirt Greenhouse

Tools & Services

Company

Contact

More