203

One of the best things about reddit was looking for answers or other users with the same problem as you, and since Google didn't really help with that anymore and instead insisted on giving you business results, the best practice was to put your search terms in followed by 'reddit' and you'd find your answer.

top 50 comments
sorted by: hot top controversial new old
[-] marsara9@lemmy.world 91 points 1 year ago* (last edited 1 year ago)

I'm working on a specialized search engine just for the fediverse. https://github.com/marsara9/lemmy-search

If anyone wants to help out, feel free to reach out, but I hope to have something ready to release soon.

The idea with my version is that it'll search as much of Lemmy / the fediverse as it can and you can select the preferred instance that you want to open any link with.

[-] qisope@lemmy.world 16 points 1 year ago* (last edited 1 year ago)

If you are looking to return relevant, well ranked results based on freeform queries you'd be better indexing into something like elasticsearch. Otherwise you'll be reinventing solutions to well understood problems, like stemming as a very basic example.

[-] marsara9@lemmy.world 8 points 1 year ago

For the initial release the search is still fairly basic, but A LOT better than the built in search here.

Right now I just look for IF the individual words match ANY of the words in the post title or body and then rank based on the number of upvotes that the post has.

Future versions may look at using elastic search, etc... But for MVP it just looks for the number of hits + the score of the post as I assume the higher the score the more trustworthy the post, and obviously the more matches that to your query the more relevant the post is.

[-] mcmxci@mimiclem.me 6 points 1 year ago

I’ve been testing this and it’s the real deal!

[-] excel@lemmy.megumin.org 6 points 1 year ago

How is this different from just searching for posts on the original "seed instance"? Presumably you're crawling through everything on all of the instances that it's aware of, as opposed to the Lemmy built-in search which would only search communities that have a subscriber?

[-] Obi@sopuli.xyz 2 points 1 year ago

Search isn't working well for me at all, I never find anything.

[-] marsara9@lemmy.world 1 points 1 year ago

So the built in search here is VERY basic and slow. For example if I search for "How this is" it wouldn't find your comment here as the word order has to match as well.

One of my main goals is that you'll be able to use my search engine like you would Google's + adding reddit to the end of the query. Then from the search results the link you open would open in your preferred instance instead of the instance Google happened to crawl. Lastly if you want to Google Lemmy posts today you have to add every known Lemmy instance to your search query and even then Google still will open the link on whichever instance it happened to find it on rather than the instance you have an account on.

load more comments (2 replies)
[-] rtxn@lemmy.world 50 points 1 year ago* (last edited 1 year ago)

I recently saw that someone was making a keyword search engine that works across the fediverse. I'll try to find the project.

edit: found it, unsurprisingly it's called lemmy-search. Although it only seems to work on Lemmy instances.

[-] falconfetus8@lemmy.world 20 points 1 year ago

That's cool, but OP is specifically asking about finding things on Google.

[-] czech@no.faux.moe 11 points 1 year ago

"...or other search engines?"

[-] GammaScorpii@lemmy.world 3 points 1 year ago

Yeah I meant just a neat way of searching for user posts.

Google probably ingrained a little so that's what I gravitate to, but if there are better ways of searching that's helpful too. Having a search engine do it is good because you might come across old forum posts as well as Reddit, but over the years Reddit just became more prevalent. Obviously these new federated sites won't yet have all the usable content, but I'm wondering if the process will be the same once they do.

[-] Corkyskog@sh.itjust.works 2 points 1 year ago

Honestly if this gets as popular as reddit, it would be better than Google eventually.

[-] Knightfall@lemmy.ca 5 points 1 year ago

True. But I appreciated the FYI.

[-] sadreality@kbin.social 2 points 1 year ago

Google gutted their search over last few years.

No, sandar I don't want 69 pages of SEO optimized trash

[-] rm_dash_r_star@lemm.ee 49 points 1 year ago

They're getting indexed, but search rankings are so low they're buried. If you put <search term> site:<server> you get post results. For example lemmy site:lemmy.world

[-] RoyalEngineering@lemmy.world 4 points 1 year ago

You can also put site:lemmy.* and it’ll pick up some too.

[-] Candelestine@lemmy.world 36 points 1 year ago

First we need to offer google competitive answers to reddits. Then it will consider indexing us.

[-] RightHandOfIkaros@lemmy.world 6 points 1 year ago* (last edited 1 year ago)

We could just pay Google. How else do you think scam sites keep appearing at the top of Google searches

[-] Phrodo_00@kbin.social 4 points 1 year ago

Not the way Google works. It's probably indexing most medium size instances and up. They just need to get better Pagerank along with the other metrics google uses now to show up more prominently

For example https://www.google.com/search?hl=en&q=reddit%20site%3Akbin.social

[-] albert180@feddit.de 26 points 1 year ago* (last edited 1 year ago)

Yes, but Google annoyingly "corrects" every feddit.* searchterm to reddit

[-] Elkaki123@vlemmy.net 11 points 1 year ago

Would this be corrected naturally by people using feddit as a search term more or does google have to manually patch this things?

[-] albert180@feddit.de 6 points 1 year ago

I guess, but it's highly unlikely that "vlemmy.net" or "feddit.de" will be more searched than Reddit.com in the next time

[-] EnglishMobster@kbin.social 2 points 1 year ago

It will be corrected over time, I presume automatically. I was one of the first people with a Steam Deck and when I searched for things Google would "helpfully" autocorrect to StreamDeck. But eventually Google figured it out.

[-] MinusPi@pawb.social 9 points 1 year ago

I don't know about others but I used to just add "reddit" to each of my searches. Wouldn't adding "Lemmy" instead do the same thing eventually?

[-] Spacemanspliff@midwest.social 12 points 1 year ago

The problem is I don't know if it would pull instances that don't contain Lemmy in the name.

[-] tqgibtngo@kbin.social 2 points 1 year ago

As a newcomer, I've visited 3 Lemmy sites: Beehaw. Lemmy.world, and a custom instance. I noticed that they each have page footers that contain: Join Lemmy. If the same is true of many Lemmy instances, I can add Lemmy (or, with quotation marks, "Join Lemmy") in a Google query. — (Note: Top matches might not always be best matches on the originating instance, or sometimes the best matches might be hidden until I click "repeat the search with the omitted results included." And of course sometimes I won't get any match because the target hasn't been indexed by Google.)

[-] RoyalEngineering@lemmy.world 2 points 1 year ago

Adding “join lemmy” is really smart.

[-] AntennaRover@lemmy.one 2 points 1 year ago

If it became a common enough thing to search for, Google would correct for that and start ranking Lemmy instances higher, regardless of what’s in the name.

load more comments (1 replies)
[-] albert180@feddit.de 3 points 1 year ago

I guess you would need the name of the instance where the community resides. But usually if you search about specific questions the site with the information will appear (be it reddit or some lemmy instances) without adding it to the search term

[-] GammaScorpii@lemmy.world 3 points 1 year ago

The issue is google for the last few years has been prioritising businesses and services with really good SEO. And ads.

So in order to find helpful user content I always had to add Reddit to the search query.

[-] chaorace@lemmy.sdf.org 5 points 1 year ago

Is there any reason the site: syntax can't be used? For example: Musk site:feddit.de

[-] albert180@feddit.de 2 points 1 year ago

That's what I did, see the search term in the search field

[-] chaorace@lemmy.sdf.org 4 points 1 year ago

I don't understand. I looked at your screenshot again and the search field seems to show feddit.de: Musk. This is not the site: syntax. What I suggested was Musk site:feddit.de. Am I missing something?

[-] albert180@feddit.de 1 points 1 year ago* (last edited 1 year ago)

The site: is feddit.de: and after that follows the search query. It works that way too, and it's less work to type. Try it out by yourself

[-] chaorace@lemmy.sdf.org 4 points 1 year ago

I tried it myself and they're not similar at all. site: is handled specially through Google's advanced search syntax while the other approach is no different from a normal keyword. Please refer to the below images with attention to the result counts:

It's fine if you don't want to use the syntax, but using it would solve your problem with keyword autocorrect and properly filter your results to only the website you've asked for.

[-] albert180@feddit.de 4 points 1 year ago

You are right. My apologies

load more comments (1 replies)
[-] yaomtc@kbin.social 2 points 1 year ago

Why are people using a site named after the place they purposefully left with just one letter changed?

[-] Moka@kbin.social 2 points 1 year ago* (last edited 1 year ago)

Presumably because reddit itself has a lot of positivity and memories attached to it for a lot of people - it wasn't the site that people wanted to leave, but rather the ceo and staff behind it.

[-] albert180@feddit.de 1 points 1 year ago

Because the f stands for federated and it's the biggest German node?

load more comments (1 replies)
[-] marcar@lemmy.world 21 points 1 year ago

This would obviously be good for promoting Lemmy which I’m 100% all for.

But from a privacy point of view, I also feel mods should be able to stop indexing or choose which engines can index for their specific communities and also users at a user should be able to control it. I understand that engines could ignore this, but I doubt the big ones would..

I think I read that individual instances already can choose whether to be indexed or not, I could be wrong there

[-] hawkwind@lemmy.management 15 points 1 year ago* (last edited 1 year ago)

Being decentralized will make it harder to just use "search + reddit" because you don't know if it's "search + lemmy.world" or "search + beehaw" or "search + kbin." Also, each admin is in charge of their own Robots.txt.

[-] sadreality@kbin.social 2 points 1 year ago

Does fediverse instances even show up in search engines?

There is no money for parasites here so I wouldn't expect them to send anyone here.

[-] xmrscott@kbin.social 2 points 1 year ago

They can. For Mastodon there's an option for "Opt-out of search engine indexing" which adds a "noindex" tag to your profile which Google respects. Not all indexers respect the tag however and "noindex" data isn't federated which can cause your replies to show up: https://github.com/mastodon/mastodon/issues/22047

[-] meyotch@lemmy.mitchday.com 2 points 1 year ago

I’ve been experimenting with my instance and google does index my lemmy.mitchday.com page. If you search ‘lemmy mitch day’, you get my page right up top and a few motorhead fans named mitch down below.

My experiment involves trying to SEO a vanity domain I’ve had for years and only used for email. Since July 4, the page ranking for my general site has steadily climbed. The little bit of traffic from other instances and a handful of subscriptions seems to be impacting the ranking.

load more comments
view more: next ›
this post was submitted on 02 Jul 2023
203 points (98.1% liked)

No Stupid Questions

35694 readers
1018 users here now

No such thing. Ask away!

!nostupidquestions is a community dedicated to being helpful and answering each others' questions on various topics.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules (interactive)


Rule 1- All posts must be legitimate questions. All post titles must include a question.

All posts must be legitimate questions, and all post titles must include a question. Questions that are joke or trolling questions, memes, song lyrics as title, etc. are not allowed here. See Rule 6 for all exceptions.



Rule 2- Your question subject cannot be illegal or NSFW material.

Your question subject cannot be illegal or NSFW material. You will be warned first, banned second.



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Questions which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts and joke questions.

Provided it is about the community itself, you may post non-question posts using the [META] tag on your post title.

On fridays, you are allowed to post meme and troll questions, on the condition that it's in text format only, and conforms with our other rules. These posts MUST include the [NSQ Friday] tag in their title.

If you post a serious question on friday and are looking only for legitimate answers, then please include the [Serious] tag on your post. Irrelevant replies will then be removed by moderators.



Rule 7- You can't intentionally annoy, mock, or harass other members.

If you intentionally annoy, mock, harass, or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



Rule 10- Majority of bots aren't allowed to participate here.



Credits

Our breathtaking icon was bestowed upon us by @Cevilia!

The greatest banner of all time: by @TheOneWithTheHair!

founded 1 year ago
MODERATORS