Has anyone made Linux Reddit Archives?
from electricprism@lemmy.ml to linux@lemmy.ml on 01 Jun 22:58
https://lemmy.ml/post/16360457

I think it would be great to have a archive so that the various documentation, comments and hacks / workarounds could be searched.

The reason I ask is because they block VPN traffic, restrict some content behind a login wall and I have blacklisted them from my DNS so I plan on never returning.

But I find myself lacking odd tips from the Sway community and other communities.

#linux

threaded - newest

smeg@feddit.uk on 01 Jun 23:18 next collapse

It’s not an archive but RedLib provides an alternative frontend which deals with most of the hostile design

governorkeagan@lemdro.id on 01 Jun 23:23 collapse

Which you can use in conjunction with LibRedirect

smeg@feddit.uk on 02 Jun 08:57 collapse

Nice, there’s also UntrackMe on Android

boredsquirrel@slrpnk.net on 01 Jun 23:32 next collapse

There is lemmit.online which does this purely.

It is pretty busy but may already do some of it. You could request a nieche and very useful community like the sway one. Or use your own server to fetch these results and post to the open Web.

Dave@lemmy.nz on 01 Jun 23:55 collapse

Unfortunately they don’t take requests for new subreddits anymore. In addition, they don’t mirror comments so in terms of answers to questions it’s probably not that helpful.

boredsquirrel@slrpnk.net on 02 Jun 10:26 collapse

True. But that may be due to how the bot works?

StrangeAstronomer@lemmy.ml on 02 Jun 01:09 next collapse

so just use chatgpt or gemini - pretty sure they sucked in all of reddit to form their KB

nublug@lemmy.blahaj.zone on 02 Jun 01:13 next collapse

using llm ai for tech support is monumentally stupid lmao

possiblylinux127@lemmy.zip on 02 Jun 02:22 collapse

How is it worse than taking advise off of the Internet? At the end of the day you need to be aware of what you are doing.

Mistral has helped me with a variety of tasks such as finding tools and choosing ZFS geometry

StrangeAstronomer@lemmy.ml on 02 Jun 03:49 next collapse

Quite right!

You need to take it all (AI or internet searches) with a huge pinch of salt. Even ye olde text books were not infallible and often out of date, so sodium chloride was also required even then.

The code either works or it doesn’t - it’s all in the testing. If you deploy AI suggestions without thought you deserve the consequences.

possiblylinux127@lemmy.zip on 02 Jun 13:36 collapse

I think the reliability of the response also depends on the prompt. Certain prompts decrease the reliability issues.

StrangeAstronomer@lemmy.ml on 02 Jun 03:53 collapse

BTW - thanks for Mistral. Another tool in the box!

possiblylinux127@lemmy.zip on 02 Jun 02:20 next collapse

I mostly use Mistral personally. You also can use llava for image analysis

theshatterstone54@feddit.uk on 02 Jun 06:31 collapse

Even if that’s so, I have had many occasions where I thought that for something simple, ChatGPT could do the job. I ended up having a back and forth for hours (last case of that being yesterday) until I got it fixed. For most cases (but not yesterday’s) I found it much faster by looking it up online.

gitamar@feddit.de on 02 Jun 01:32 next collapse

The archive warriors are downloading Reddit for a while already. 15.6 billion items and counting. You can help too:

tracker.archiveteam.org/reddit/

kionite231@lemmy.ca on 02 Jun 15:29 collapse

It just lists name of people archiving reddit. where can I get the archived data. do I have to ask one of those people to send me a zip file?

gitamar@feddit.de on 02 Jun 19:19 collapse

The data is integrated into the Internet archive and available e.g. via the way back machine. Not sure if you can get the whole reddit dataset.

helenslunch@feddit.nl on 02 Jun 01:42 next collapse

Archive.org

AliOski@feddit.nl on 02 Jun 11:12 collapse

You forgot the ‘s’ after http. Your link currently opens a blank for me until I put an ‘s’ behind it.

helenslunch@feddit.nl on 02 Jun 11:27 collapse

i didn’t put an http either. I just assume people know what it is.

eveninghere@beehaw.org on 02 Jun 02:51 next collapse

I mean, if people here don’t like how Reddit took advantage of user comment data, why should we archive the same without consent from the people who wrote them? Legally speaking Reddit holds the copyright also.

CrabAndBroom@lemmy.ml on 02 Jun 15:33 collapse

As a sidenote, you can get around the VPN block with Redlib by just adding safe- to the start of most reddit URLs. So like instead of reddit.com/r/linux or whatever you can do safereddit.com/r/linux and it should work without needing a login.