Open-R1: a fully open reproduction of DeepSeek-R1 (huggingface.co)
from sirico@feddit.uk to technology@lemmy.world on 28 Jan 2025 08:15
https://feddit.uk/post/23315079

#technology

threaded - newest

Linearity@infosec.pub on 28 Jan 2025 09:57 next collapse

Is there a tech focused summary on everything about DeepSeek and the situation with OpenAI?

kiagam@lemmy.world on 28 Jan 2025 11:13 collapse

youtu.be/Nl7aCUsWykg

Fireship maybe? It is not that complicated, they just make a good cheap AI and big tech is panicking because they can only make good expensive AI

land@lemmy.ml on 28 Jan 2025 10:18 collapse

<img alt="" src="https://lemmy.ml/pictrs/image/286410bd-c509-44ed-86c5-0756e043f7a4.png">

Can someone explain this? Keyloggers???

neon_nova@lemmy.dbzer0.com on 28 Jan 2025 10:57 next collapse

I wouldn’t necessarily call it key logging but all these services are going to store anything you search.

[deleted] on 28 Jan 2025 11:06 next collapse

.

[deleted] on 28 Jan 2025 15:22 next collapse

.

wreckedcarzz@lemmy.world on 28 Jan 2025 16:25 collapse

“B-b-but look, they are doing it too!”

Yes, and we hate them, too. What’s your point?

homesweethomeMrL@lemmy.world on 28 Jan 2025 13:56 collapse

“keystroke patterns or rhythms”???

Fuckin’ hell.

Kornblumenratte@feddit.org on 28 Jan 2025 11:27 next collapse

Of course – if the AI is supposed to give you an answer, they have to know what you are writing, so yes, logging your keystrokes is quintessential for every online service you interact with. You cannot get an answer without asking.

The wording is strange, though, and I’m not sure whether this ToS allows them to collect and process what you are typing while using their service, or all your typing.

MagicShel@lemmy.zip on 28 Jan 2025 11:43 next collapse

Keystroke patterns and rhythms is above and beyond, though. That’s not remotely necessary and the kind of thing that can only be used to track an individual across multiple platforms and attempts at anonymity. I don’t know how effective it is at that, but that is the sole purpose unless maybe they are training a better autocorrect tool and think that would be helpful.

At any rate, that’s the point where I noped out. They are completely honest about putting every effort into identifying users and associating them with real identity. Such a system would be quite capable of de-anonymizing marketing profiles, health data, etc. by correlating vast amounts of data.

ICastFist@programming.dev on 28 Jan 2025 12:15 next collapse

logging your keystrokes is quintessential for every online service you interact with

No, it is not. Services expect the “complete” payload, whether a prompt, a text message, or whatever, it doesn’t matter if you typed it, if you copy-pasted it or something else. None of them need to analyze stuff you’ve typed, deleted and never sent.

lapping6596@lemmy.world on 28 Jan 2025 15:43 collapse

Generally yes, but there is one use case where every key stroke is often recorded and analyzed, a search bar. If it’s trying to fill out suggestions as you type, every keystroke is recorded as you go.

ICastFist@programming.dev on 29 Jan 2025 02:04 collapse

They don’t need to read the keystrokes, they need to read what’s in the input box. In programming terms, you’re evaluating the field in real time, you’re not waiting for the “send request”, nor are you keylogging, otherwise the existence of the field would be irrelevant.

Tramort@programming.dev on 28 Jan 2025 12:38 collapse

Quintessential does not mean “really essential”, and does not make sense in this context.

You can’t really be quintessential “for” something; only quintessential of something.

Kornblumenratte@feddit.org on 30 Jan 2025 07:32 collapse

Ethymologically you are right, I wasn’t really aware of the alchemical background of five rounds of destillation when I wrote my comment.

Nonetheless, “quintessential for” is not unheard (or rather unread?) of:

<img alt="" src="https://feddit.org/pictrs/image/b55d97f6-d46a-45f1-84eb-e5f7990324fa.png">

It will take another generation or two until this usage becomes normalized, so thank you for pointing me to a better style.

eggymachus@sh.itjust.works on 28 Jan 2025 12:19 next collapse

Whose ToS is this?

land@lemmy.ml on 28 Jan 2025 12:21 collapse

DeepSeek.

SeekPie@lemm.ee on 30 Jan 2025 08:21 collapse

So the Open-R1 wouldn’t be doing this?

Robin@lemmy.world on 28 Jan 2025 13:00 next collapse

Playing devil’s advocate here. Mouse movements and key presses have been commonly used as bot detection method for a decade now. Like that captcha service that is just a checkbox, that’s part of how they guessed that you are not a bot.

homesweethomeMrL@lemmy.world on 28 Jan 2025 13:58 collapse

Yeah, no. I mean yes - that’s true, and yes it’s a way to detect bots, and no I’m not going to allow that wherever possible.

zerozaku@lemmy.world on 28 Jan 2025 13:18 next collapse

Damn it we need Private-R1 now

[deleted] on 29 Jan 2025 01:50 next collapse

.

cyberpunk007@lemmy.ca on 29 Jan 2025 04:52 next collapse

Fuuuuck that.

Gladaed@feddit.org on 30 Jan 2025 07:57 next collapse

This is true for the deep seek app, not the published network.

ChogChog@lemmy.world on 30 Jan 2025 10:22 collapse

Expect to see this in more applications, especially when dealing with AI. Why do you feel like you’ve noticed an uptick in having to complete captchas on every website you visit?

It’s an easy way for them to validate if you’re human or some competitor AI/scraper bot that’s trying to train on their data.

OpenAI is so scared about the possibility of DeepSeek distilling their model, I guarantee they are adding a keystroke/key pattern recognition system into their own front ends to combat it. If it’s not there already which would surprise me.

Expect your privacy to continue to be eroded in the name of profit technological progress.

5gruel@lemmy.world on 30 Jan 2025 13:42 collapse

Wait but distillers will surely usw the API instead oft the Frontend, right?