Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July (www.theverge.com)
from simplejack@lemmy.world to technology@lemmy.world on 05 Jun 02:06
https://lemmy.world/post/30827322

#technology

threaded - newest

tal@lemmy.today on 05 Jun 02:11 next collapse

In the filing, Reddit calls Anthropic a “late-blooming artificial intelligence (‘AI’) company that bills itself as the white knight of the AI industry,” alleging that “it is anything but.”

“This case is about the two faces of Anthropic: the public face that attempts to ingratiate itself into the consumer’s consciousness with claims of righteousness and respect for boundaries and the law, and the private face that ignores any rules that interfere with its attempts to further line its pockets.”

I mean, Reddit’s objection is that they want to sell the same data to Google to do the same training.

paraphrand@lemmy.world on 05 Jun 02:25 next collapse

I dunno, it just reads like a reddit comment to me. 🤣

NegentropicBoy@lemmy.world on 05 Jun 02:48 next collapse

So who owns the data?

Grimy@lemmy.world on 05 Jun 02:56 next collapse

Well they already sold it for 60 million and I didnt get a dime, so not me apparently.

ag10n@lemmy.world on 05 Jun 02:56 collapse

Reddit, created by users and Russian bots

wise_pancake@lemmy.ca on 05 Jun 03:59 collapse

They actually quite that in a real legal filing?

Jesus.

Did they ask /r/pettyrevenge to write that?

Deceptichum@quokk.au on 05 Jun 02:15 next collapse

Suck shit reddit.

pastermil@sh.itjust.works on 05 Jun 02:22 next collapse

Thought they signed up for that.

KingThrillgore@lemmy.ml on 05 Jun 02:35 next collapse

Reddit is just mad that Anthropic didn’t pay them

catloaf@lemm.ee on 05 Jun 02:56 next collapse

Yes that is how capitalism works

Melvin_Ferd@lemmy.world on 05 Jun 03:36 collapse

The Users posted for free. They didn’t get paid. They should be publicly available for scraping

datavoid@lemmy.ml on 05 Jun 10:02 next collapse

As the fediverse gods intended

[deleted] on 05 Jun 16:55 next collapse

.

Melvin_Ferd@lemmy.world on 05 Jun 18:45 collapse

Nah which is why I give a shit about the capitalist argument. Steal everything

KingThrillgore@lemmy.ml on 05 Jun 19:36 collapse

I’m a reddit user, where’s my share?!?!

jol@discuss.tchncs.de on 05 Jun 03:04 collapse

… Yes?

Kichae@lemmy.ca on 05 Jun 02:39 next collapse

“We’re the front page of the Internet!”

“No, not like that…”

4uffin@lemmy.world on 05 Jun 10:16 collapse

“We’re the front page of the Internet! …as long as the front page isn’t scraped…” >:(

g0nz0li0@lemmy.world on 05 Jun 02:39 next collapse

I hope they lose this case badly.

For the concerns I have about AI and stealing others work, I want to see Reddit burn for pretending that they are all about community and connection, while actively harming their users’ experience on the platform and attempting to profit off their content.

NocturnalMorning@lemmy.world on 05 Jun 02:44 next collapse

Yeah, something about a company making billions of dollars off completely user generated content and moderation just runs me the wrong way. As much as I hate Facebook, they at least pay people to do moderation there, and regularly update their site (as shitty as it is). I dont use either anymore, and I hope they die in a pit of flames owing billions to their shareholders.

ryannathans@aussie.zone on 05 Jun 03:31 next collapse

Shareholders of these companies are likely you or I, as they are so big they are significant parts of index funds purchased by retirement funds and the like

NocturnalMorning@lemmy.world on 05 Jun 03:41 next collapse

Maybe, I still hope the go belly up. My 401k isn’t worth supporting companies that spread deliberate disinformation. (Looking at you facebook). Hell, when the stock market took a nosedive over tarrifs, I pulled my entire 401k and put it into foreign investments to try to further the crash. Literally the only thing these dipsticks understand is money, and if they’re losing it, thats when they pay attention.

dan@upvote.au on 05 Jun 04:00 collapse

A lot of people don’t realise that around 40% of the value of the S&P 500, and the majority of the Nasdaq 100 (i.e. QQQM) is big tech companies.

You could always build a portfolio that excludes companies you feel are unethical (for example, exclude oil and gas companies, exclude big tech, etc), but if you were to exclude all companies that have done something unethical then you’d probably end up with the S&P 0 (an empty list)

WhatAmLemmy@lemmy.world on 05 Jun 06:54 collapse

Maybe our quality of life, livelihood, and retirements should not be bound to the success of for-profit corporations?

This is the greatest grift of all time. Binding the average citizen’s, and governmental, wealth to the success of private corporations means that the economic success of those corporations, and the oligarchs who own them, become equal to “national security”; thus they are violently protected by the state, even when their actions and success are the antithesis of democracy.

dan@upvote.au on 05 Jun 07:50 collapse

I agree, but unfortunately it’s a reality of a capitalist society that large private companies have a lot of the wealth, and so people set themselves up for retirement by owning a very tiny part of those companies.

NocturnalMorning@lemmy.world on 05 Jun 11:59 collapse

Our retirement plans didn’t used to be tied to the stock market. So clearly there’s a way to have retirement plans that don’t tie the entire middle class to the success of every large corporation.

Disaster@sh.itjust.works on 05 Jun 13:11 collapse

Well, there’s the Defined Benefit pension, however typically these pension funds then become institutional investors who seek to own shares in… you guessed it - stocks.

At least those institutional investors are at least somewhat responsive to public pressure campaigns, as the state/local comptrollers are a politically appointed position.

When you give your money to a 401k, the fund manager gets all the voting rights on the corporate board and is generally only accountable to “A reasonable rate of return”

dan@upvote.au on 05 Jun 03:54 collapse

As much as I hate Facebook, they at least pay people to do moderation there, and regularly update their site

Facebook pays content creators too (creators.facebook.com/earn-money ), including for things other than videos (like photo/image posts). Platforms like YouTube do too, but as far as I know, Reddit doesn’t.

spankmonkey@lemmy.world on 05 Jun 05:39 collapse

No matter who wins, everyone loses.

ReallyActuallyFrankenstein@lemmynsfw.com on 05 Jun 02:42 next collapse

<img alt="" src="https://lemmynsfw.com/pictrs/image/3b265cd4-48ec-4c87-bc23-3fc73f003d5a.jpeg">

gedaliyah@lemmy.world on 05 Jun 03:51 next collapse

I can’t believe you beat me to this. Well done.

FenrirIII@lemmy.world on 05 Jun 12:56 next collapse

Inconceivable!

ReallyActuallyFrankenstein@lemmynsfw.com on 05 Jun 21:40 collapse

You’ve fallen for one of the classic blunders!

Fizz@lemmy.nz on 05 Jun 06:53 collapse

I just watched this movie last week. Its got so many good lines.

Hobbes_Dent@lemmy.world on 05 Jun 02:44 next collapse

Oof owie my hemochromatosis.

goferking0@lemmy.sdf.org on 05 Jun 03:05 next collapse

pay us or we sue

Isn’t this just blackmail by reddit?

bulwark@lemmy.world on 05 Jun 03:20 next collapse

I hope they both choke on their own bots.

sexy_peach@feddit.org on 05 Jun 03:25 next collapse

100.000 accesses isn’t that much, right?

Imgonnatrythis@sh.itjust.works on 05 Jun 04:07 next collapse

Back in the day that’s about how many times I accessed reddit a week.

kate@lemmy.uhhoh.com on 05 Jun 05:51 collapse

100,000 requests in 11 months? That’s about 12.5 requests an hour

grysbok@lemmy.sdf.org on 05 Jun 05:56 next collapse

That’s hardly anything. Facebook has a bot accessing my server’s robots.txt multiple times a second. (My robots.txt used to say “Facebook bot go away” but now I just respond 404 to any requests from the Facebook bot. Pretend I said that all technical and stuff, it’s 2 am and I ought to go to sleep.)

kate@lemmy.uhhoh.com on 05 Jun 06:30 collapse

That’s what I’m sayin! I make more than 12.5 requests an hour to lemmy and I’m a human I’m 90.00000000000001% sure

grysbok@lemmy.sdf.org on 05 Jun 18:43 collapse

I wonder if I can get Facebook to give me some of that sweet, sweet cash for the inconvenience of telling them to bugger off…

echodot@feddit.uk on 05 Jun 06:53 collapse

Some legitimate users probably submit more requests than that

gedaliyah@lemmy.world on 05 Jun 03:50 next collapse

<img alt="" src="https://lemmy.world/pictrs/image/8a876354-50a3-4ca9-ad4c-719daaab29ad.gif">

Edit: I guess great minds think alike.

PattyMcB@lemmy.world on 05 Jun 04:13 next collapse

Ehh… fuck reddit. They made the bed

venusaur@lemmy.world on 05 Jun 04:24 next collapse

Fuck Reddit

Ledericas@lemm.ee on 05 Jun 05:14 next collapse

while half of reddit is infested with propaganda bots from russia.

Saleh@feddit.org on 05 Jun 07:17 collapse

Not just Russia.

Israel, US, China, North Korea, India and other countries… Nuclear Lobby, Fossil Fuel Lobby and countless other industry lobbyists… Private companies advertising their products…

lemming741@lemmy.world on 05 Jun 11:08 collapse

But have you seen Rampart?

theangryseal@lemmy.world on 05 Jun 12:21 collapse

First of all, I have not seen it, and second off, I don’t want to see it. Lets focus on the Reddit lawsuit.

HootinNHollerin@lemmy.dbzer0.com on 05 Jun 06:00 next collapse

Spez can forever get fucked

FeelThePower@lemmy.dbzer0.com on 05 Jun 06:49 next collapse

hope reddit loses

echodot@feddit.uk on 05 Jun 06:50 next collapse

This is like one of those cases where I’m kind of hoping they both lose somehow. Neither party are right in this case, Reddit is trying to claim copyright over content they have no rights to, and anthropic shouldn’t be violating copyright without a licence.

But apparently you are actually allowed to violate copyright without a licence if you’re an AI company because apparently llms are the future? So I guess Reddit are going to lose, which will be funny.

rikudou@lemmings.world on 05 Jun 08:00 next collapse

You mean Reddit, the company that would be very happy if Anthropic did the exact same thing, but paid Reddit first?

FiskFisk33@startrek.website on 05 Jun 08:25 next collapse

I am squarely on the reddit should lose this side.

Anthropic may be breaking copyright, but not Reddit’s copyright. Sure maybe Anthropic should be sued, but not by Reddit.

echodot@feddit.uk on 05 Jun 10:53 collapse

Actually this case could be a good thing. The whole question of who owns user generated content needs hashing out, because no one seems to actually know.

Obviously the logical answer would be that the people who created it own the content, but that’s never been officially decided.

Alaik@lemmy.zip on 05 Jun 11:42 next collapse

Because that’s the only common sense conclusion to make, but that doesnt make rich fucks more money

echodot@feddit.uk on 06 Jun 08:50 collapse

Yeah maybe we shouldn’t have the case in the US where money rules everything.

EU get on it.

pinball_wizard@lemmy.zip on 05 Jun 12:25 collapse

The whole question of who owns user generated content needs hashing out, because no one seems to actually know.

It’s billionaires. They know. They just sometimes squable over it like two year olds. But they know. They pay lawyers to make it clear in thousand page terms of service documents.

Saledovil@sh.itjust.works on 05 Jun 10:20 next collapse

Judge finds that anthropic has to pay restitution to the reddit users. Affirms that posts belong to users.

Well, I can dream.

General_Effort@lemmy.world on 05 Jun 15:39 next collapse

“Violating copyright without a licence” is a lovely turn of phrase. You must be the valedictorian of the Lemmy School of Copyright.

Cocodapuf@lemmy.world on 05 Jun 20:45 collapse

Is it violating copyright to browse the web?

echodot@feddit.uk on 05 Jun 22:30 collapse

I think it’s acceptable as long as you don’t learn anything.

diffusive@lemmy.world on 05 Jun 07:50 next collapse

Aren’t the copyrights still belonging to the original authors? What is Reddit suing for? The header and the footer? 🤔

tourist@lemmy.world on 05 Jun 08:00 next collapse

Bots? On Reddit!?

Birch@sh.itjust.works on 05 Jun 11:36 collapse

It’s more likely than you think!

rosco385@lemmy.wtf on 05 Jun 08:06 next collapse

“Reddit’s humanity is uniquely valuable in a world flattened by AI,“ Lee said. ”Now more than ever, people are seeking authentic human-to-human conversation. Reddit hosts nearly 20 years of rich, human discussion on virtually every topic imaginable. These conversations don’t happen anywhere else—and they’re central to training language models like Claude.”

LMAO, reddit’s days of genuine conversations between humans is long gone.

[deleted] on 05 Jun 09:55 next collapse

.

Leet@lemmy.zip on 05 Jun 12:06 next collapse

So if reddit wins, that means the content is theirs. So if the content is theirs, they are liable for any content that is illegal. Is that true?

Almacca@aussie.zone on 05 Jun 12:40 next collapse

The content’s theirs whether they win or not, isn’t it? It’s in the EULA when you sign up.

Edit: Here’s the clause.

You retain any ownership rights you have in Your Content, but you grant Reddit the following license to use that Content:

When Your Content is created with or submitted to the Services, you grant us a worldwide, royalty-free, perpetual, irrevocable, non-exclusive, transferable, and sublicensable license to use, copy, modify, adapt, prepare derivative works of, distribute, store, perform, and display Your Content and any name, username, voice, or likeness provided in connection with Your Content in all media formats and channels now known or later developed anywhere in the world. This license includes the right for us to make Your Content available for syndication, broadcast, distribution, or publication by other companies, organizations, or individuals who partner with Reddit. For example, this license includes the right to use Your Content to train AI and machine learning models, as further described in our Public Content Policy. You also agree that we may remove metadata associated with Your Content, and you irrevocably waive any claims and assertions of moral rights or attribution with respect to Your Content.

JonsJava@lemmy.world on 05 Jun 12:56 collapse

non-exclusive

That means we can license all our content to another company, and Reddit would be forced to allow them to fetch it, as we still own it, right?

Almacca@aussie.zone on 05 Jun 12:57 next collapse

It certainly reads that way. Gonna start a Reddit User Collective? Licence it to Anthropic at a discount to undercut Reddit? That could be pretty funny.

General_Effort@lemmy.world on 05 Jun 15:18 collapse

That would be legally possible, though, obviously, you would have to pay for your own servers.

In practice, it wouldn’t be worth anyone’s time.

Almacca@aussie.zone on 05 Jun 15:45 collapse

I don’t see why. Users own the content wherever it’s located. Reddit, of course, would be free to remove that content, but that would be cutting off their own nose to spite their face and is also acceptable.

General_Effort@lemmy.world on 05 Jun 16:06 collapse

You don’t see why you would need your own servers? Do you see why unauthorized access to a computer system might be illegal?

General_Effort@lemmy.world on 05 Jun 15:11 next collapse

No. Just because you own a copyright, doesn’t mean that you are entitled to free network services. If you owned the copyright to a movie, would you expect free tickets for any cinema showing it?

xthexder@l.sw0.com on 05 Jun 19:30 collapse

Non-exclusive just means you’re free to give a copy of your content to whoever you want. It doesn’t mean Reddit is obligated to distribute it for you.

General_Effort@lemmy.world on 05 Jun 15:24 next collapse

No. I am not aware of any law that makes you liable by holding or claiming the copyright to some content. EG you may have to pay damages for libel, but not because you have copyright to the libelous statement.

Jax@sh.itjust.works on 05 Jun 16:39 collapse

Doesn’t quite make sense.

You’re telling me that someone can get popped for mistakenly visiting the dark side of the internet and having whatever-the-fuck horrible shit put on their machine, but owning the content and hosting it on your servers results in nothing?

General_Effort@lemmy.world on 05 Jun 18:57 collapse

Not quite.

Generally, sites aren’t liable for user generated content as long as they follow some rules. They need to take down illegal content and provide some way of reporting such content. In the US, that’s the whole DMCA takedown thing. The whole content ID thing, that YouTube does, might not be strictly necessary, but it was rolled out in response to a high-stakes lawsuit. The EU is, as always, more strict in these matters.

People are not punished for things beyond their control (but mind that a fine is not the same as damages). If you are sent illegal content, that you have not requested, you shouldn’t expect formal punishment, though the investigation may be punishing in itself. If you simply don’t know how caching works, you’re probably in trouble.

But this was about copyright. I don’t think you get punished anywhere for holding some copyright. Say some Japanese Manga artist travels to some European state where some of their works are illegal. They’re not going to get arrested for that. Anyone who brings such illegal works into the country will not be so lucky, regardless of copyright.

Jax@sh.itjust.works on 06 Jun 16:31 collapse

Copyright law is messy. Thank you for the elaboration.

ILikeBoobies@lemmy.ca on 05 Jun 16:39 collapse

yes to both regardless of this lawsuit

The wiggle room for large businesses is that they remove content that violates local laws when notified of it

raltoid@lemmy.world on 05 Jun 12:31 next collapse

Long story short: They are not combatting bots on their platform. They sold training data to google and these guys aren’t paying, that’s why they’re suing.

Almacca@aussie.zone on 05 Jun 12:36 next collapse

Obviously Reddit isn’t averse to bots scraping the site for data, just ones that aren’t paying them. I’m regretting not going through and systematically deleting all my posts and comments before deleting my account, but I thought that happened automatically.

FreedomAdvocate@lemmy.net.au on 05 Jun 16:12 next collapse

I deleted all my posts and my account and they restored them all and then permanently banned me. They can recover anything they want to.

hightrix@lemmy.world on 05 Jun 16:24 collapse

Basically nowhere on the internet does delete mean delete. Nearly everywhere it means archive or hide.

ulu_mulu@lemmy.zip on 05 Jun 16:28 next collapse

Do you really believe they don’t have backups? Especially since it seems selling content for AI training was their plan for quite a while?

Or that they didn’t make full backups a couple years ago before the protest, anticipating a lot of users would try to delete their comments?

I think the only way to truly delete anything from reddit would be living in EU and enforcing a GDPR request, but even in that case, I believe it would be very difficult to check they actually comply.

General_Effort@lemmy.world on 05 Jun 22:58 collapse

I think the only way to truly delete anything from reddit would be living in EU and enforcing a GDPR request, but even in that case, I believe it would be very difficult to check they actually comply.

Wouldn’t work. GDPR is not copyright. Deleting the username is enough, unless you have doxed yourself in some post.

Rather, it can be argued that GDPR requires restoring comments at least in some situations. Comments may be necessary context to understand replies or even other posts.

Pika@sh.itjust.works on 05 Jun 19:11 next collapse

it wouldnt have mattered anyway if you left during the time most people did. Reddit rolled back mass deleted data and manually deleted accounts during that duration so that comments remained without usernames.

Almacca@aussie.zone on 05 Jun 20:30 collapse

The fuckers!

yardratianSoma@lemmy.ca on 05 Jun 21:50 collapse

I don’t regret not deleting all my comments. For me, It’s a mishmash of helpful/comedic/observational comments that I don’t care that they have sold off for use as training data.

But, I just got shadowbanned, because of my VPN or something, so they aren’t getting any more!

dan1101@lemm.ee on 05 Jun 12:47 next collapse

So Reddit serves free data but Anthropic took too much?

Cort@lemmy.world on 05 Jun 21:12 collapse

They give out free license for their data, but require following their terms of service.

chunes@lemmy.world on 05 Jun 15:23 next collapse

Only 100,000 times? Shit, do I need to be worried about getting sued too?

Tronn4@lemmy.world on 05 Jun 21:00 collapse

All porn subreddits are exempted

[deleted] on 05 Jun 20:38 next collapse

.

LaterRedditor@lemmy.world on 05 Jun 22:46 collapse

Jokes on you for crawling mostly synthetic text?