"But #Bluesky is also federated, decentralised!1!!" Well, no. https://arewedecentralizedyet.online by @ricci
UPDATE: @ricci has moved the code powering https://arewedecentralizedyet.online to codeberg (yay!) and added a third gauge to visualise how centralised the decentralised git based ecosystem is :)
@jwildeboer @ricci I don't know how feasible it is to do one for matrix but it would be really interesting

@thibaultmol @jwildeboer Yeah, I would be very interested in doing this for matrix but so far I have not found a dataset that would enable me to do so - and I don't know matrix well enough myself to try to collect one.

I am also interested in doing this for email, but that is thornier - in part because the data is valuable to marketers and therefore the sources I've found that *might* work are expensive, and I'd have to pay to even find out for sure

@jwildeboer @ricci 24% for the biggest server is way too much …. We can do better ( and yes it’s ironic from me to say that as I use mastodon.social 😋 )

@kingmoth @jwildeboer @ricci I don't like federated systems where their apps (at least the ones on F-Droid) recommend a single server, it tends to cause this effect.

That being said, if a dev wants to add a server recommendation system to make it easy for new users to not burden them by having to choose a server it could choose a general server randomly from a list which would not include too popular servers (to avoid the centralisation issue) nor too small servers. Well, what if server owners could opt in such a list? 

@qgustavor @kingmoth @jwildeboer @ricci the real issue is there is no full portability. People will trust the big server more because it will have less chance of dying and losing all your posts
@jwildeboer @ricci If Hetzner goes down, ~70% of the fediverse goes down.

@Commander1024 what's your source for that? https://fedidb.com/stats suggests Hetzner is responsible for about 9.3% of the fediverse.

I would say AUTOMATTIC is overrepresented because Wordpress.com blogs aren't as important for the fediverse as the number of them would suggest but still it would be nowhere near 70% even if you just removed AUTOMATTIC. DigitalOcean seems to be ahead of Hetzner too.

@Commander1024 @jwildeboer @ricci

Hetzner has 9% of all instances and 36% of all accounts.

ref: https://fedi.wrm.sr/

FediIndex

Detailed Fediverse statistics

@jwildeboer off-topic re: visualization
The gauge might be easier to interpret if divided by labeled areas with colors and the number being the same color as the section it's in.
Something like
0 "perfect competition"
1 - 1499 "competitive" (blue/green)
1500 - 2499 "moderately concentrated" (yellow/orange)
2500 - 9999 "highly concentrated" (red/purple)
10000 "monopoly"
@ricci

@Flo_Rian @jwildeboer That's an interesting suggestion, and I'm divided on it. On the one hand, I don't think it's 100% clear that the breakpoints used in antitrust are necessarily the right ones to use in this context, so staying fuzzy on the actual gauge might avoid suggesting sharp cutoffs where they don't really exist. (I did feel the need to include them in the text to give *some* context.)

On the other hand, I do think it would be good to show that there is still a benchmark for the fediverse to hit, that it can still get better

@ricci
I just had a hard time reading the graphs at first. Gauges (in reality) usually have a scale or zones, so we expect to be able to tell if we are in a certain range or hit "the red zone". In this case, one seems to have zones (green area, with the needle just on the edge), while the other one is all red.

To avoid sharp cutoffs, it could also be one large gradient. Alternatively, not using a gauge graph would be an option, maybe a partially filled bar chart.

@jwildeboer

@Flo_Rian @jwildeboer @ricci I'll repeat here what I posted elsewhere a few days ago regarding the 30 August graphs:
'I think the semicircle graphs are ambiguous. At first I interpreted the Fediverse graph as "very concentrated" because I thought the white area meant "concentrated". But of course, Fediverse was actually "very competitive" because the arrow is close to the "competitive" end of the scale.'
@0x10f @Flo_Rian @jwildeboer I realize that is an issue, but I wanted to use the actual metric, where lower is less concentrated, because I didn't want to go off modifying stats. I also thought doing the gauges backwards was likely to be more confusing.

@jwildeboer

If Train AP is travelling to Destination D and left the station July 18, 2017, and Train AT is also travelling to Destination D and left the station February 22, 2024, is Train AT allowed to even call itself a train?

Calling AT out for not being decentralised enough, yet, distracts from the shared mission IMO.

The AT Protocol has left the station. It could easily overtake ActivityPub on a parallel track. I think we have more to gain by focussing on the common good.

provide content-type matching ability for activitypub (15e836b7) · Commits · hubzilla / core · GitLab

build community websites that can interact with one another

GitLab

@jwildeboer I think it's important to note that this debate has alienated the entire ActivityPub community from one of the most successful digital communities for marginalised people we've seen in quite some time.

Blacksky is accomplishing tremendous things on AT, and AP has a lot to learn from that endeavour.

Rudy deserves better from the ActivityPub community.

https://www.blackskyweb.xyz/#01-read-me

Blacksky Algorithms | Decentralized Tools for Online Communities

Empowering communities with self-hosted infrastructure for decentralized social networks. Blacksky enables safer, independent online spaces.

@jaz @jwildeboer I think working with blacksky to federate with them directly is a good idea. I think federating with bluesky is not.
@jwildeboer I certainly hope that the technical mechanisms we see in the #atprotocol for decentralization will get deployed more widely in practice, and we'll get to watch #bluesky get more decentralized over time

@jwildeboer @ricci

Do I read this right that Bluesky is about 12x bigger? And I assume we're counting active users?

@iju @jwildeboer

This is complicated.

What I'm counting is the PDSes that the bluesky relay reports knowing about, and how many repos (user data) they report having.

The number this returns is different from what you may see in other places. You'll see 38M for the number of total accounts. You'll see a few million listed for Monthly Active Users. My understanding is that what the relay reports is *some* measure of active users (it's the number of accounts the relay is currently maintaining data for) but it's not exactly MAU.

The Fediverse numbers are MAU, but it's also notorious that you can't get a fully accurate number for this because not all instances choose to opt in to such data collection, often by misreporting numbers.

@ricci @jwildeboer

Thank you for explaining this to me. As might be evident from my profile, I'm not technically more that passingly literate.

@iju @ricci @jwildeboer FWIW this tracks BSky monthly/weekly/daily active users and puts the MAU number currently at around 5.65 million: https://bskycharts.edavis.dev/edavis.dev/bskycharts.edavis.dev/bsky_users_total.html
bsky users total (Munin :: edavis.dev :: bskycharts.edavis.dev :: bsky users total)

@ikuturso @iju @jwildeboer Ah, very nice, thanks, I may be able to use some of this data!

@jwildeboer i agree with the sentiment 100%, but to be clear Bluesky isn't any more or less decentralized than mastodon.social. Bluesky is a service that runs on atproto. Just as mastodon.social is a service that runs on ActivityPub.

atproto is theoretically decentralized in a similar way to ActivityPub, but there's just no one else of any size building on it. atproto is decentralized in theory, but not in practice. Hence the terrible graph you posted.

@ricci

@stinerman @jwildeboer There are some initiatives going on right now that *might* make a difference in this, blacksky now runs its own instances (using its own implementation) of most of the pieces. It's still small in terms of the userbase on its PDS, but this could be an inflection point. Or not. Which is why I think this is an interesting time to start watching.
@ricci @stinerman I am mostly saddened that the VC driven Bluesky folks decided that the NIH (Not Invented Here) syndrome led them to creating a new protocol instead of adding their weight to the ActivityPub/Stream protocol family. Now we have again a fractured ecosystem and factions. Le sigh.

@jwildeboer I agree with that 100%. At this point all we can hope for is good interoperability between the protocols.

@ricci

@jwildeboer heya @ricci uhhhh, question: the link to Software Heritage is mistyped as https://softwarehertigate.org/, could you please fix it?

@xerz @jwildeboer Oops! Thanks for pointing this out!

Fixed

@jwildeboer @ricci I love how the website is complaining about Git being concentrated on GitHub, and then the project itself proceeds to be… on GitHub.
@can @jwildeboer @ricci seems as it's on codeberg now. nice
@soop @jwildeboer @ricci very cool! Love it!

@can @soop @jwildeboer One of the interesting federating aspects of git is that it can be (and is :) ) in both places at once - I set it up on github because I figured that'd maximize the chance of getting pull requests, but I've now moved my primary copy to codeberg. I can accept PRs from either place and push to both, which is cool.

git is also a good cautionary tale, though, of the fact that just because the protocol itself is radically decentralized doesn't mean the deployments will inevitably be so. A lot of us are (rightly, imo) worried about the dominance of github even though we can take our repos anywhere we want with relative ease.

I don't really think PDSes are the ultimate measure of decentralisation, the benefits of decentralisation show at the appView layer more.

PDSes are just dumb data stores. Admins of them can't do much bar removing posts or controlling your identity.

@irelephant @jwildeboer In a recent interview with Rudy of blacksky, he described the PDS as the most complicated part (and they should know, they have re-implemented or forked basically all the parts now). Yeah this surprised me too.

The anecdotal information I've found on the appview itself suggests that it follows a similar distribution to the PDS, but with far fewer of them so far. I'm looking for a data source, though.

Above the appview layer, I do think feeds and labelers are likely to show interesting decentralization; labelers are probably possible to get actual usage data on, but AFAICT only bluesky can provide data about the use of feeds, and they don't (all that's available is the number of likes, which is not the same thing.)

bsky is a lot newer and has x10 times the users of fedi.
Atproto is decentralised, and if you tell me it isnt please explain wafrn


#yeah-another-protocol-war-thats-what-we-need #it-went-great-against-twitter!

@gabboman @jwildeboer

There is a difference between the protocol enabling decentralization and decentralization actually happening in practice. atproto is designed to be decentralized (well, with the exception of did:plc); the question of course naturally arises whether it is getting used that way or not. Clearly, *at the level of where user data is stored*, that is not currently how the vast majority of its deployment has been done in practice. This very well could change, and the protocol is (again, with the exception of did:plc) designed to facilitate that change. Will it change? I don't know, let's watch and see; that's what the site is for.

But I also want to say that being new and big is actually a potential impediment to atproto becoming more decentralized. Most systems *tend* to trend towards more centralization, and network effects *tend* to keep people in the places where lots of other people are. So, while atproto is *designed* to permit decentralization that does not guarantee that it will *happen*, and starting as one big ecosystem controlled by a single entity makes it *less likely* that it will achieve meaningful decentralization. There are people trying, though - blacksky being the clearest example. Is this the start of an inflection point where more communities spring up and migrate off the centralized infrastructure? Maybe! Maybe not! Let's watch - that's what the site is for.

I was gona say smugly something about matrix but actualy matrix has the same problem.

yet everyone calls it fucking decentralized.

I mean, matrix proves that what you say can become a problem.


#anyway-im-trying-to-fix-that-with-wafrn-as-example
Ooooh I'm posting to you from both an ATproto and ActivityPub service ooooh its my own deployment of wafrn for me and some friends ooooooh I'm so scary ooooooh
@jwildeboer @ricci is SourceHut considered centralised or decentralised to you? To me it’s decentralised as anyone can contribute by just having any email address but I’m sure there are nuances.

@hadronized @jwildeboer In this context, the question I'm asking is whether the *ecosystem* is decentralized with respect to where data is stored.

So, in this case, sourcehut is one participant, as are github, codeberg, gltlab, all the forgejo instances that my data source (Software Heritage) knows about. The question being looked at here is whether user data (in the git world, repos) are concentrated on a few participants or spread out across many.

@jwildeboer @ricci at least I can transfer the user data within the ATProto-Universe, while my ActivityPub data/profile is trapped at the instance I'm using
@jwildeboer @ricci Since this post so far fedi got more decentralized and bluesky slightly less so. Amazing