Decentralization Scoring System (v1.3)
from AnonomousWolf@lemm.ee to technology@lemmy.world on 22 Apr 12:17
https://lemm.ee/post/62072035

🧮 Decentralization Scoring System (v1.3)

This scoring system evaluates how decentralized and self-hostable a platform is, based on four core metrics.

📊 Scoring Metrics (Total: 100 Points)

Metric Weight Description
Top Provider User Share 30 Measures how many users are on the largest instance. Full points if <20%; 0 if >80%.
Top Provider Content Share 30 Measures how much content is hosted by the largest instance. Full points if <20%; 0 if >80%.
Ease of Self-Hosting: Server 20 Technical ease of running your own backend. Full points for simple setup with good docs.
Ease of Self-Hosting: User Interface 20 Availability and usability of clients. Full points for accessible, FOSS, multi-platform clients.

📋 Example Breakdown (Estimates)

Platform Score Visualization
📧 Email 95 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩
🐹 Lemmy 79 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩
🐘 Mastodon 74 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩
🟣 PeerTube 94 🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩🟩
🖼 Pixelfed 42 🟧🟧🟧🟧🟧🟧🟧🟧
🔵 Bluesky 14 🟥🟥🟥
🟥 Reddit 3 🟥

📧 Email

Total: 95/100


🐹 Lemmy

Total: 79/100


🐘 Mastodon

Total: 74/100


🟣 PeerTube

Total: 94/100


🖼 Pixelfed

Total: 42/100


🔵 Bluesky

Total: 14/100


🟠 Reddit

Total: 3/100


How Scores are Calculated

🧑‍🤝‍🧑 How User/Content Share Scores Work

This measures how many users are on the largest provider (or instance).

📊 Formula:

Score = 30 × (1 - (TopProviderShare - 20) / 60)
…but only if TopProviderShare is between 20% and 80%.
If below 20%, full 30. If above 80%, zero.

📌 Example:

If one provider has 40% of all users:
Score = 30 × (1 - (40 - 20) / 60) = 30 × (1 - 0.43) = 17.1 points

🖥️ How Ease of Self-Hosting Scores Work

These scores measure how easy it is for individuals or communities to run their own servers or use clients.

This looks at how technically easy it is to run your own backend (e.g., email server, Mastodon server) or User Interface (e.g., web-interface or mobile-app)


📚 Sources

Footnotes

This is a work in progress and may contain mistakes. If you have ideas or suggestions for improvement, feel free to let me know.

Source: github.com/…/decentralization_score_2025.04.md

#technology

threaded - newest

demesisx@infosec.pub on 22 Apr 12:43 next collapse

This is really cool.

It reminds me of the Edinburgh Decentralisation Index: an academically rigorous decentralization index that the university of Glasgow school of informatics devised to quanitfy the decentralization of cryptocurrencies:

The Edinburgh Decentralisation Index (EDI) studies blockchain decentralisation from first principles, archives relevant datasets, develops metrics, and offers a dashboard to track decentralisation trends over time and across systems.

informatics.ed.ac.uk/blockchain/edi

You should give it a serious look. IMO, it would offer some insight into academically peer-reviewed ways of quantifying this kind of thing.

AnonomousWolf@lemm.ee on 22 Apr 12:49 collapse

That’s fascinating, Hopefully we can eventually get such a thorough breakdown of the decentralization of online services.

jabathekek@sopuli.xyz on 22 Apr 13:08 next collapse

Nice work.

Zwuzelmaus@feddit.org on 22 Apr 13:30 next collapse

Very good ideas, but I don’t agree at all with your estimations.

For example it is terribly difficult to self host email, and very few people actually do it. Contrary to your estimation. It’s not because of the server software, but because of the fight against spam etc. that costs so much.

You are focusing on only one “top” and so you can’t see the reality. You are scoring centralization. Not decentralization.

Better if you look at the share of hosters at the “lower end”, the ones that actually do self hosting, like:

% of servers with up to 10 users (counting only natural persons).

% of users on servers with up to 10 users (counting only natural persons).

SufferingSteve@feddit.nu on 22 Apr 16:29 next collapse

I like the idea, but I don’t really feel this is scoring decentralization at all.

Yeah,looking through this again, it really scores centralization, and also focuses way to much on ease of setup. Which honestly has nothing to do with it. If it’s super hard to setup, but every participant is hosting their own node. And producing the same amount of content. That would be max decentralization.

I would also argue that a requirement for decentralization is that the service keeps working even if a large portion of all nodes goes down, the remaining nodes are still operational and keep delivering the value.

Zak@lemmy.world on 23 Apr 01:07 collapse

For example it is terribly difficult to self host email, and very few people actually do it.

I read this a bunch of times and put off trying it because it sounded like such a hassle. Eventually I did and… it wasn’t bad at all. I just had to add a few extra DNS entries. I haven’t had any delivery problems.

ThanksForAllTheFish@sh.itjust.works on 23 Apr 06:45 collapse

Is it self hosted or do you pay someone else to host it? Do you have a fixed IP? I’ve always wanted to try and set it up but it definitely seems like one of the riskier ones if you then use it to sign up to a lot of things.

Zak@lemmy.world on 23 Apr 14:55 collapse

It’s on a VPS. Whether that’s really self-hosted may depend on how much of a purist you are, but it’s fully self-managed, not SAAS.

It’s recommended to have a PTR record mapping your IP address to your domain, which you wouldn’t be able to do with a residential connection from a typical ISP. I do send mail from multiple domains though and I haven’t had issues with deliverability. What I do not send is any kind of high-volume mail, which would likely attract a different kind of scrutiny.

pcouy@lemmy.pierre-couy.fr on 22 Apr 18:32 collapse

There are a few things I don’t like about this scoring system :

  • Why is there a “Top Provider Content Share” metric if its gonna score the same as the “Top Provider User Share” every time ?
  • Why is the Top Provider Content Share not higher than the user share ? For instance, emails usually have at least one sender and one recipient, making it twice as likely that at least one of them is using gmail. If an email has 10 recipients across 10 different providers, each provider has a copy of the data
  • Why is ease of hosting a mail server rated so well ? How is “leveraging email hosting services” decentralized in any way ?
  • Why are we using a random repo created a few hours ago by a random github user as a reference ?
rottingleaf@lemmy.world on 22 Apr 19:20 next collapse

Why is ease of hosting a mail server rated so well ? How is “leveraging email hosting services” decentralized in any way ?

Because somebody there doesn’t even understand you’re supposed to host an email server as easily as hosting a web server with a website. While in reality you’ll learn all the Satanish obscene lexicon before making big email providers accept your mail.

I’m exaggerating probably, but ahem.

That said, in my humble careless incompetent “let’s-go-back-to-year-2005” opinion we need a new email standard, spiritually same, but qualitatively different, like the upgrade from prehistoric email with UUCP paths to something more modern, only this time cutting down all the DMARC and DKIM bullshit and simply using pubkeys in To: and From: headers, with the email itself signed by the author, by mail server and maybe by something else. One can make encryption of email content the baseline norm while we’re at it. One can even get rid of the attachment of identities to mail servers and use servers similar to how NOSTR has relays. I mean, what I described already is just NOSTR with nostalgic aesthetics. Maybe also similar to some kind of Fidonet reimagined.

palitu@aussie.zone on 22 Apr 19:36 next collapse

In regards to email hosting.

It is not about hosting the server on your own infrastructure, it is about having there code to host it out have another provider.

There are a lot of email providers!

Also, if you look at the way Lemmy works, it is the same as emails. If they federate a community, the data is kept on both the original server and the federated one, so you duplicate to m the data, similar to an email server.

It is an interesting concept, and maybe the score should just reduce down to brush categories, like fully decentralized, potential decentralized, neither, partially centralized, fully centralized.

Then we won’t nitpick on the school l score too much?

AnonomousWolf@lemm.ee on 23 Apr 09:01 collapse

Why is there a “Top Provider Content Share” metric if its gonna score the same as the “Top Provider User Share” every time ?

As said in the footer, this is a work in progress, I’m posting it to get input and still refining sources

Why is the Top Provider Content Share not higher than the user share ? For instance, emails usually have at least one sender and one recipient, making it twice as likely that at least one of them is using gmail. If an email has 10 recipients across 10 different providers, each provider has a copy of the data

I’d love to get better data on this, I’ve looked but not yet found better data than what I included in the source

Why is ease of hosting a mail server rated so well ? How is “leveraging email hosting services” decentralized in any way ?

Here I’m a bit in two minds, sure it’s difficult to SELF host email, but in practice it isn’t because there are hundreds (Thousands?) of hosting options to choose from where you can choose your own domain etc. for the low price of basically-free

Why are we using a random repo created a few hours ago by a random github user as a reference ?

It’s my repo, it’s to keep track of the versions and so that others can copy, edit and share it if they like.