azorius.net

DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI (venturebeat.com)
from yogthos@lemmy.ml to technology@lemmy.ml on 25 Mar 03:16
https://lemmy.ml/post/27647712

#technology

threaded - newest

yogthos@lemmy.ml on 25 Mar 03:27 collapse

the key bit

This represents a potentially significant shift in AI deployment. While traditional AI infrastructure typically relies on multiple Nvidia GPUs consuming several kilowatts of power, the Mac Studio draws less than 200 watts during inference. This efficiency gap suggests the AI industry may need to rethink assumptions about infrastructure requirements for top-tier model performance.