So lets say lemmy.world and then lets take beehaw.org (I know they are defederated lets assume they are not) for example. All the posts and comments which are hosted by lemmy.world on hard drives or servers, are also hosted by beehaw.org and vice versa? So the amount of data is actually doubled in size?
Yep. Add in a 3rd instance and now you have 3 copies of the database, essentially. It’s just that each instance is responsible about telling the fediverse when updates occur to communities on their instance.
Thanks. Wonder how much it would be uncompressed but I guess that doesnt matter as you can probably compress it on your fediverse instance until it is required to be accessed by a user
So lets say lemmy.world and then lets take beehaw.org (I know they are defederated lets assume they are not) for example. All the posts and comments which are hosted by lemmy.world on hard drives or servers, are also hosted by beehaw.org and vice versa? So the amount of data is actually doubled in size?
Yep. Add in a 3rd instance and now you have 3 copies of the database, essentially. It’s just that each instance is responsible about telling the fediverse when updates occur to communities on their instance.
If the fediverse gets really big, lets say the size of reddit, it may be hard for all the different instances to store all that data on their servers
Ya, ActivityPub isn’t without it’s issues… but luckily it’s all just text. Much of that can be compressed significantly.
I wonder what the total data storage size is for all the publicly viewable content on reddit. I find it hard to even guess lol. 100TB? 10,000TB?
The compressed archive of reddit from 2005.5 until 2022 is 2 TB: https://academictorrents.com/details/7c0645c94321311bb05bd879ddee4d0eba08aaee
Uncompressed it is likely way larger though.
Thanks. Wonder how much it would be uncompressed but I guess that doesnt matter as you can probably compress it on your fediverse instance until it is required to be accessed by a user