Zotac accidentally lists RTX 5090, RTX 5080, and RTX 5070 family weeks before launch — accidental listing seemingly confirms the RTX 5090 with 32GB of GDDR7 VRAM (www.tomshardware.com)
from Xatolos@reddthat.com to technology@lemmy.world on 17 Dec 11:48
https://reddthat.com/post/31252494

#technology

threaded - newest

Kyrgizion@lemmy.world on 17 Dec 12:03 next collapse

Doesn’t matter, I couldn’t afford a 5090 anyway.

SkaveRat@discuss.tchncs.de on 17 Dec 12:17 next collapse

going by trend, I don’t think my apartment could fit a 5090

Drunemeton@lemmy.world on 17 Dec 14:50 next collapse

On the plus side you no longer need heating.

mightyfoolish@lemmy.world on 18 Dec 18:06 collapse

Or you have to keep it in your other apartment to avoid heat stroke. If you can’t afford another apartment, you probably can’t afford this card.

FireWire400@lemmy.world on 17 Dec 17:26 collapse

You’d need a separate apartment just to keep it from sagging…

kn33@lemmy.world on 17 Dec 13:33 next collapse

I got my 3080 and I plan on keeping using it for a good while still.

fuckwit_mcbumcrumble@lemmy.dbzer0.com on 17 Dec 19:06 next collapse

Nvidia doesn’t care. Their margins will be high enough, and there will be enough buyers that they’ll be able to print money at whatever absurd price point they choose.

UltraGiGaGigantic@lemmy.ml on 18 Dec 00:45 collapse

It’s gonna be over $2,000. (Uneducated caveman ignorant guess)

SoleInvictus@lemmy.blahaj.zone on 18 Dec 03:02 next collapse

I’m seeing $1900, but guessing $2500+ once the scalpers get to scalping. I don’t intend to buy one, but I hope those assholes get fucked over.

mitrosus@discuss.tchncs.de on 18 Dec 03:41 next collapse

Close It will actually be ~$8000

Raxiel@lemmy.world on 18 Dec 09:31 collapse

If “Learning from history” counts as education, I wouldn’t call it an uneducated guess.

Psythik@lemmy.world on 18 Dec 19:03 collapse

I got lucky and scored a 4090 at a discount on launch day. I think I’ll just hang onto it for a few more generations. It still has plenty of power to run everything at 4K 120Hz especially with DLSS Balanced, (except Indiana Jones; that game is extremely demanding), which I use anyway cause IMO it looks better than AA.

TheHobbyist@lemmy.zip on 17 Dec 12:09 next collapse

Allegedly, the 5090 would have 32GB and the 5080 16GB, I don’t see much room for the 5060 to have more than 8GB if the 5070 itself has 12GB?

I would have loved to see the 5080 with 24GB, the 5070 with 16GB and the 5060 with 12GB (at least). And for the 5060 to drop the 128 bit bus…

KoalaUnknown@lemmy.world on 17 Dec 13:37 next collapse

I would have loved to see the 5080 with 24GB

They wouldn’t do this because it would undercut their 5090 sales.

CosmoNova@lemmy.world on 17 Dec 14:16 collapse

Sales and probably pricing itself. With a gap this large, the price of the 5090 might be more than twice as high as that of the 5080 and people would still buy it.

fuckwit_mcbumcrumble@lemmy.dbzer0.com on 17 Dec 18:45 collapse

If the value proposition continues, and based on the leak it seems like it will, the 5090 would be a better value FPS per dollar than the 5080 and anything below it. The 4090 cost like 40% more than the 4080, but gave 60% better performance. The 5090 looks like it will be well over twice the performance of the 5080, so I’d expect twice the price.

Raxiel@lemmy.world on 18 Dec 09:38 collapse

A 5060 with 8gb will have Intel wringing their hands with glee. Whatever performance gains the ($300-350) 5060 GPU has (and I don’t doubt they’ll be notable) will be choked off by an 8gb framebuffer. They might do a 16gb clamshell like they did with the 4060ti but like you said, where does that leave the 12gb 70 card?

BaroqueInMind@lemmy.one on 17 Dec 12:28 next collapse

I just want one to self host a 70B LLM model for fuck’s sake. I don’t want to be forced to take out a god damned mortgage/personal loan to buy one.

Ragdoll_X@lemmy.world on 17 Dec 12:57 next collapse

You might just want to use Kaggle tbh

BaroqueInMind@lemmy.one on 17 Dec 19:12 collapse

Never heard of it.

Speculater@lemmy.world on 17 Dec 13:36 next collapse

Damn, they require like 50GB vram, that’s nuts.

brucethemoose@lemmy.world on 17 Dec 17:13 next collapse

Qwen 2.5 32B is where it’s at now. 24GB is affordable, and it fits perfectly.

Otherwise, stay on the lookout for AMD Strix Halo, which can reportedly allocate up to 96GB on its IGP, and you can run faster backends like vllm or exllama.

BaroqueInMind@lemmy.one on 17 Dec 19:13 collapse

What’s up with Qwen that makes it better than anything else?

brucethemoose@lemmy.world on 17 Dec 19:21 collapse

It’s just smarter with the same number of parameters. Try Qwen QwQ or Qwen coder 32B, see for yourself… it stacks up well against huge models like the 123B Mistral Large, or even GPT-4.

Why? Alibaba trained it well, presumably with better data than OpenAI or whomever else, though specifics are up for debate. Some suggests that bilingual training on English/Chinese (aka the two largest text corpuses in existance) significantly helps the model over mostly english. Some say the government just gave them better data. There’s also suggestions that having so few GPUs compared to American AI companies made the Chinese “thrifty,” and gave them far more incentive to be innovative rather than brute forcing models (which has diminishing returns).

teuto@lemmy.teuto.icu on 17 Dec 18:31 collapse

I picked up a pair of old Tesla P40s. Right now I’m running a Q4 quant of Qwen 2.5 72B that fits in the combined 48GB of VRAM with 12k context. They aren’t as fast as newer consumer cards, but it generates as fast as I can read while costing less than a used 3080.

BatrickPateman@lemmy.world on 18 Dec 09:11 collapse

interesting. They are cooled passively, right? What’s your case and cooling setup?

teuto@lemmy.teuto.icu on 18 Dec 17:55 collapse

I have a dell power edge 730, which was about $200. It’s CPU shrouds perfectly match the GPU intakes so air just flows through both from the server fans. I’ve seen a few 3d printable fan mounts for jury rigging them into a regular tower too.

Ragdoll_X@lemmy.world on 17 Dec 13:00 next collapse

Knowing Nvidia’s exorbitant pricing, I think I’ll keep Intel’s Arc B580 in my wishlist.

Blackmist@feddit.uk on 18 Dec 09:36 collapse

They need a Steamdeck type list of games that will actually run on it though.

AhismaMiasma@lemm.ee on 17 Dec 13:21 next collapse

This is why I love Zotac. We’ll, not this reason, but it adds to the small pile of smiles they’ve given me.

That being said I’m skipping this Nvidia gen and might break for AMD next. My 3090 is still trucking fine and I feel like Nvidia has lost their value after the debacles of the 40 series.

bruhduh@lemmy.world on 17 Dec 14:25 next collapse

skipping nvidia since gtx750ti, AMD all the way up

circuitfarmer@lemmy.sdf.org on 17 Dec 17:08 next collapse

AMD is by far the price-per-unit-performance leader. Same is true on the CPU side. Intel and Nvidia aren’t even on the chart tbh.

brucethemoose@lemmy.world on 17 Dec 17:11 next collapse

You should think about selling it TBH. 3090 prices are shooting up like crazy, and may be at a peak, because they are the last affordable card to self host LLMs.

WolfLink@sh.itjust.works on 17 Dec 18:51 next collapse

Can’t you run LLMs on 4090/5090 maybe 5080? Basically any Nvidia card with 24GB+ of VRAM?

brucethemoose@lemmy.world on 17 Dec 19:03 collapse

Yeah, but they not worth it.

The 4090 is basically just as good as the 3090 because it has the same amount of vram, but twice the price… so you mind as well get 2x 3090s.

The 5090 will be hilariously expensive, and 24GB -> 32GB is not that great, as you still can’t run 70B class models in that pool… again, mind as well get 2x 3090s. I would not even bother trading my single 3090 for 5090.

If AMD sold a 48GB consumer card, you would see them dominate the open source LLM space in a month, because every single backend dev would buy one and get their projects working on them. Same with Intel. VRAM is basically the only thing that matters, and 24GB is kinda pitiful at a 4090’s price.

DarkThoughts@fedia.io on 17 Dec 19:54 next collapse

I'd already be happy if AMD goes with 24 GB on their upper midrange cards, but I would not be surprised if they stick with 16 GB. 48 GB seems extremely unlikely, unfortunately.

Doing LLMs with 8 GB is not fun, especially not with RDNA 2 which has so many issues with ROCm.

CheeseNoodle@lemmy.world on 18 Dec 18:13 collapse

Halo has me hopeful that AMD are going to continue down this idea of having APUs that can use onboard RAM instead of requiring it to be built in. It’d be great to just be able to upgrade my RAM rather than replace a whole ass GPU.

brucethemoose@lemmy.world on 18 Dec 20:08 collapse

It uses embedded LPDDR5X, so it will not be upgradeable unless the mobo/laptop maker uses LPCAMMs.

And… that’s kinda how it has to be. Laptop SO-DIMMs are super slow due to the design of the DIMMs, and they need crazy voltages to even hit the speeds/timings they run at now.

AhismaMiasma@lemm.ee on 17 Dec 23:17 collapse

Never even thought of that, is there a good website to sell a GPU on or is it pretty much just eBay?

I just don’t play games like I used to, just videos now. Poor thing hardly gets any use.

brucethemoose@lemmy.world on 18 Dec 02:11 collapse

You could list it locally depending on where you are, through FB marketplace or Craigslist.

Otherwise, yeah, eBay.

mightyfoolish@lemmy.world on 18 Dec 18:09 collapse

Did you hear about the news that AMD is skipping out on “high-end” market segment next generation? No real concrete numbers to this statement. I have heard speculation that they plan on making cards just as powerful as last gen except with better ray tracing and CHEAPER.

AhismaMiasma@lemm.ee on 18 Dec 18:33 collapse

I have heard this, and to be honest, I don’t think I need the highest power GPU anymore. I mostly got the 3090 for VR and I seldom use my kit anymore. Upper-mid range AMD might be the answer for me.

billwashere@lemmy.world on 17 Dec 13:29 next collapse

Is 5090 the card model number or the price?

JaddedFauceet@lemmy.world on 17 Dec 13:38 next collapse

Uh it is both, for convenience, TLDR 5090

KITA@lemmy.sdf.org on 17 Dec 17:35 collapse

This joke is the millennial version of boomers asking if something is free when it doesn’t scan.

emiellr@lemm.ee on 18 Dec 00:23 collapse

Braindead take

FireWire400@lemmy.world on 17 Dec 17:26 collapse

Oh no, I’m sure no one could’ve guessed the names…

Vash63@lemmy.world on 17 Dec 18:39 collapse

32GB is news. It confirms either a 512 or (more likely) 256 bit bus, which would be a significant drop from the 384 bit on the 4090.

I’m sure the increased perf of G7 would fully offset that, but this means without some larger caches it will be difficult for this to be a massive performance jump from last gen.

Unless they’re going 512 bit, in which case ignore all that and wow is this a monster.

fuckwit_mcbumcrumble@lemmy.dbzer0.com on 17 Dec 18:42 collapse

More likely a 256 bit bus? Not with the monster of a card the 5090 would be. 256 bit on that massive of a card would be ridiculous. The card is set to draw 600 watts.

UltraGiGaGigantic@lemmy.ml on 18 Dec 00:45 next collapse

sad PSU noises

Artyom@lemm.ee on 18 Dec 02:44 collapse

For the record, in case anyone cared, I audibly laughed when I read 600 watts.