DeepSeek claims its [Open Source] reasoning model beats OpenAI's o1 on certain benchmarks | TechCrunch (techcrunch.com)
from cm0002@lemmy.world to technology@lemmy.world on 20 Jan 18:53
https://lemmy.world/post/24509009

Chinese AI lab DeepSeek has released an open version of DeepSeek-R1, its so-called reasoning model, that it claims performs as well as OpenAI’s o1 on certain AI benchmarks.

R1 is available from the AI dev platform Hugging Face under an MIT license, meaning it can be used commercially without restrictions. According to DeepSeek, R1 beats o1 on the benchmarks AIME, MATH-500, and SWE-bench Verified. AIME employs other models to evaluate a model’s performance, while MATH-500 is a collection of word problems. SWE-bench Verified, meanwhile, focuses on programming tasks.

#technology

M33@lemmy.sdf.org on 20 Jan 19:06

As for the full R1, it requires beefier hardware, but it is available through DeepSeek’s API at prices 90%-95% cheaper than OpenAI’s o1.

Guys, don’t send your data overseas because it’s cheap… 🙄

[deleted] on 20 Jan 20:01

.