Google is now watermarking its AI-generated text. (spectrum.ieee.org)
from Dot@feddit.org to technology@lemmy.world on 24 Oct 2024 01:00
https://feddit.org/post/4055206

#technology

threaded - newest

over_clox@lemmy.world on 24 Oct 2024 01:32 next collapse

Did you know, 23% of social media users don’t know how to sharpen a pencil?

True story, I wrote it on the internet somewhere, so it must be true by now…

[deleted] on 24 Oct 2024 02:52 collapse

.

over_clox@lemmy.world on 24 Oct 2024 02:56 next collapse

If I declare that 100% of everything I’ve ever typed online might be false, will AI delete my shit?

dharmacurious@slrpnk.net on 24 Oct 2024 04:58 collapse

[youtu.be/IUK6zjtUj00?si=C-GAe_wXBW-jWV_q](I think you might enjoy this song)

j4k3@lemmy.world on 24 Oct 2024 03:31 next collapse

Hold up, let me ban a couple hundred tokens in the reply. Pattern fixed. Watermarking only works for the most ignorant surface level users.

Mac@mander.xyz on 24 Oct 2024 03:44 collapse

“most ignorant, surface lvl users” so 80% of users?

j4k3@lemmy.world on 24 Oct 2024 04:06 next collapse

Yeah but not the bad actors this is primarily targeting and will create further issues. There are likely 3 keyword tokens used in a pattern. The most adept of humans should learn these and be damn sure to never use that pattern in any natural way.

Ilovethebomb@lemm.ee on 24 Oct 2024 06:43 next collapse

I’d make a point of using them for the fun of it.

jungle@lemmy.world on 24 Oct 2024 15:20 collapse

That’s not how it works though.

ravhall@discuss.online on 24 Oct 2024 05:21 collapse

You’re being generous

sunzu2@thebrainbin.org on 24 Oct 2024 03:43 next collapse

They want us reposting it to feed their ai?

tal@lemmy.today on 24 Oct 2024 05:28 collapse

Other than as a mind game, I don’t see the point.

Google provides a centralized service. They own the generator system.

You could solve the whole problem much more simply and reliably by just retaining a copy of all generated text at Google – the quantities of data will be miniscule compared to what Google regularly deals with – and then just indexing it and letting someone do a fuzzy search for a given passage of text to see whether it’s been generated. Hell, Google probably already retains a copy to data-mine what people are doing anyway, and they know how to do search. And then they could even tell you who generated the text and when.

unexposedhazard@discuss.tchncs.de on 24 Oct 2024 07:24 collapse

You/They cant claim copyright on LLM generated text. So its purely for analysis and statistics i would presume. But its odd because if you change the text too much the system will fail.