I had been having trouble getting meaningful results from the fediverse on Google, and after seeing this post, it seems I’m not the only one. So, I created a site that helps search the fediverse in your search engine of choice (it currently supports Google, Bing, Yahoo, DuckDuckGo, and Dogpile).
Due to query limitations with most search engines, it currently only searches the top 15 lemmy/kbin instances, but I’ve tested it and it seems to provide access to a good chunk of fediverse content. The exception is Google, which should be far more reliable overall as well as providing the ability to search Mastodon and PeerTube.
If you have contributions or ideas for improvement, feel free to check out the project here or shoot me a message. Hope this helps people! :)
Edit: Update in progress including improved search queries and support for Mastodon/PeerTube (Google only, unfortunately)
Edit 2: Update is live, along with a dedicated domain name. If the website doesn’t look any different for you, try Ctrl+F5 or clearing site data - it seems some browsers are caching the old page.
In all seriousness, Google needs to get on providing an easier way to specify that a search should hit the Fediverse.
site:reddit.com
works for Reddit, but there is presently no analogous operator on Google’s search for a distributed system that spans many domains.I mean, it’s great that you’ve made this, don’t get me wrong, but they really should do that as well.
but there is presently no analogous operator on Google’s search for a distributed system that spans many domains.
Because that’s just a basic search. A search engine searches across multiple domains by default. If you’re specifically looking for only results from ActivityPub enabled services, that’s pretty much an impossibility since there’s no way to know (from a web crawl) if a page is served by a server that supports ActivityPub. Another problem is that a lot of fediverse instances purposefully block search engine crawlers because they don’t want to appear in search results.
I like the idea of scrapping Google altogether, and just having “better” search engines here that account for federated decouplings/distributions
Not entirely the same, but I switched over to Presearch a year or so ago, just to get away from Google and the “big tech” corporations
Search Term intext:“Powered by kbin”
maybe?
have you considered redirecting to whoogle or searx ? you can use farside.link which automatically redirects to a working instance. it would be helpful for people who want to avoid using Google
I will absolutely check this out, thanks for the suggestion!
Edit: I can’t get searx working at all for me, but I went ahead and implemented whoogle support
deleted by creator
Simple and handy, thanks!
very nicely done!
Won’t let me upvote so I’m commenting to show love instead.
Hm, I find it somewhat annoying that right now, this is not really searching the Fediverse, but rather what we’ve come to call “the Threadiverse”, which is all about Reddit-like content aggregators.
In other words, I’d love an option to search different kinds of content, like instead of Threadiverse-stuff searching the most popular mastodon, misskey, or pleroma instances just to name a few.
I’m hoping to expand the project to hopefully be a bit more robust - I’ll definitely keep this on my radar
Searching Mastodon is a bit of a… contentious issue. A lot of smaller Mastodon-based sites are full of traumatized vulnerable people who really just want to do their own thing, and they’ll rattle cages if they find out someone’s indexing their sites or posts. If anyone’s making third party search tools, it’s best to be careful to respect discoverability and indexing flags.
I find this to be incredibly fair, but also makes it much harder to dive into the fediverse. Where is the middle ground do you think?
Mastodon has flags for opting in to discoverability features (being featured in the profile directory, and having posts be searchable via Mastodon’s search bar) and for search engine indexing (for Google, bing, etc.).
Just don’t return posts from users that have opted out of those, and things should be mostly ok.
just use kbin’s native search
Each one has its upsides and downsides. Kbin’s search is more convenient for certain use cases, but it suffers from the same problem as Reddit’s search - it always sorts by new no matter what. In addition, it can only index instances with which kbin has federated. FediSearch should in theory index every instance, regardless of federation status, and do so in such a way that the most useful posts show first (assuming Google does its job).
google is crap at searching the fediverse