A look at search engines with their own indexes (seirdy.one)
from paequ2@lemmy.today to technology@lemmy.world on 29 Aug 07:02
https://lemmy.today/post/36640053

#technology

threaded - newest

oce@jlai.lu on 29 Aug 08:02 next collapse

It should mention that Qwant (France) and Ecosia (Germany), announced last year a joint project for an independent European search index, although only for French and German according to this publication. …qwant.com/…/ecosia-and-qwant-join-forces-to-deve…

grue@lemmy.world on 29 Aug 10:31 collapse

On doit apprendre le français (ou l’allemand) pour rechercher le web sans indices Américains? Défi accepté!

Zagorath@aussie.zone on 29 Aug 10:50 collapse

On doit apprendre le français (ou l’allemand) pour rechercher le web sans indices Américains? Défi accepté!

<img alt="grue@lemmy.world 🔗 English" src="https://aussie.zone/pictrs/image/fd19c640-8ae2-4bdb-8512-b65978033721.png"> 🤨

grue@lemmy.world on 29 Aug 11:08 collapse

Right, I speak English natively but have been learning French.

Blisterexe@lemmy.zip on 29 Aug 11:30 next collapse

Bonne chance avec le français!

Thedogdrinkscoffee@lemmy.ca on 29 Aug 13:37 collapse

Je voudrais un croissant

akwd169@lemmy.sdf.org on 29 Aug 20:28 next collapse

Already knew what it would be before I clicked the link 🤣

Zagorath@aussie.zone on 30 Aug 03:56 collapse

Voudrai? Will like?

Thedogdrinkscoffee@lemmy.ca on 30 Aug 05:08 collapse

Je voudrais, conditionel future. “I would like.”

Not “Je voudrai” Indicatif future.

Je suis le dumb-ass. De temps en temps.

Zagorath@aussie.zone on 30 Aug 10:39 collapse

Ne t’inquiete pas, je suis un dumb-ass la plupart de temps.

Je suis en train de apprendre aussi le français, la même que @grue@lemmy.world.

sorghum@sh.itjust.works on 29 Aug 08:29 next collapse

Looks like for the hardware requirements for self hosting some of the open source options, I’ll be saving up quite a bit for SSDs.

fmstrat@lemmy.nowsci.com on 29 Aug 12:05 next collapse

Consider self-hosting SearXNG, which can aggregate results and filter.

sorghum@sh.itjust.works on 29 Aug 17:24 collapse

Already am, but it still pulls results from the companies I want to separate myself from. I’d rather see what it takes/how well it performs to have my own indexer.

fmstrat@lemmy.nowsci.com on 29 Aug 20:05 collapse

Good luck with that one. It takes a lot of resources from my limited and aged experience, and I’m sure it’s more now. Might be worth focusing the indexer on a topic area to start, just to get a feel for sizing (if your chosen solution supports that).

sorghum@sh.itjust.works on 30 Aug 02:19 collapse

Yeah, long term goal is a self reliant total internet experience. I figure at best, I’ll still have to rely on a handful of more trustworthy companies to do some things like search.

frezik@lemmy.blahaj.zone on 29 Aug 12:33 collapse

FWIW, I gave YaCy a try a while back, and I agree with the article on that one. Shit tier results that make ancient AltaVista look good. Might be fine for intranet search. I like the idea of its distributed hosting, but pass on this one.

Other poster mentioned SearXNG, and while I haven’t delved into that too much, it’s probably worth a check. Pass on YaCy.

whaleross@lemmy.world on 29 Aug 13:26 next collapse

SearXNG is a meta search entirely reliant on other services.

swelter_spark@reddthat.com on 31 Aug 07:24 collapse

I really like Yacy’s results, personally. It seems good for the kinds of sites I care about. My biggest problem with it is that the newest version is so memory-hungry.

Glitchvid@lemmy.world on 29 Aug 11:30 next collapse

Great article, appreciate that I’m not the only one concerned around some of the ethical choices Kagi has been making.

Jack_Burton@lemmy.ca on 29 Aug 12:26 next collapse

I got a sub to Kagi a few months ago. It seems pretty good but I’m behind on the news. I’ve read a few things here and there, but can you explain a bit about the ethics?

Glitchvid@lemmy.world on 29 Aug 13:30 collapse

Last I saw they still paid Yandex for access to that index (weigh how important that is yourself), they also pushed back on suicide warnings if you ask Kagi how to kill yourself, and I learned from this article that they may be using additional data sources that contain higher levels of homophobic sentiment.

Basically, the company’s tagline is “Humanize the Web”, but I don’t think their actions thus far show we agree on what Humanize means.

Jack_Burton@lemmy.ca on 29 Aug 14:39 collapse

Yeah, definitely something to keep an eye on. Might not renew and just start using Qwant.

isaaclyman@lemmy.world on 30 Aug 23:40 collapse

Same. I use Kagi because search is an essential function of my job and I can’t extract decent results from Google anymore, but if there were another engine with equally good results and a better ethical track record I’d switch.

(There isn’t. I’ve tried Qwant, Ecosia, DuckDuckGo and a handful of others. Was not impressed.)

Glitchvid@lemmy.world on 30 Aug 23:51 collapse

I share the sentiment. For me the results are acceptable, and being able to custom rank sites in results is very useful, but the killer feature is not having ads or forcing AI down my throat.

frongt@lemmy.zip on 29 Aug 14:25 next collapse

The plural of index is indices.

paequ2@lemmy.today on 29 Aug 14:37 next collapse
aBundleOfFerrets@sh.itjust.works on 30 Aug 03:57 collapse

The article addresses that in a footnote

RecursiveParadox@lemmy.world on 29 Aug 17:30 next collapse

That was a good read, well done! I’d be interested if you ever reconsider Startpage. I’ve had good success with that on my work computer.

tuckerm@feddit.online on 29 Aug 22:12 collapse

Great list! I've been wanting to start using a smaller search engine with its own index, just for the sake of making sure there's an alternative to GBY. (Also, there's a new and useful acronym, haha.) Mojeek was the only one I was aware of before today.

BTW, exalead.com doesn't seem to be a search engine anymore. I recognized that one because I remember discovering it a while ago...in 2007, maybe? But it looks like it's not available as a regular search engine now.

edit: Also, this is the first really useful page I've read via the Gemini protocol, so thanks for that!