submitted 1 year ago* (last edited 1 year ago) by QuinicV@lemmy.world to c/nostupidquestions@lemmy.world

A great use for reddit is the ability to search posts and opinions about any niche topic. Will that be possible with Lemmy as it grows? Will I be able to Google "instant rice Lemmy" and get a comprehensive tier list of each brand?

I imagine search engines will have trouble with all the different instances(?). EDIT: Especially with instances that don't have Lemmy in their name, I don't think search engines would return them for Lemmy searches?

[-] marsara9@lemmy.world 41 points 1 year ago

So I've been working on a solution for this.

As I see it, Google and others are going to have a hard, if not impossible, time incorporating the fediverse, given that the same content can exist on multiple servers.

So I'm working on a search engine built specifically for this, for Lemmy at least. It'll take you to whatever your preferred instance is when you tap a search result.

I hope to have an MVP up and running in a few more days.

[-] mookulator@lemmy.world 5 points 1 year ago

Can’t emphasize enough how important this is for the growth of Lemmy. Many people I know only access Reddit through google searches.

[-] marsara9@lemmy.world 5 points 1 year ago

Yep, and I'm one of them. Go look me up on Reddit and I think I have maybe 20 posts over the 14+ years I was on the site. I joined Lemmy and immediately got frustrated that I couldn't find anything, so I figured I'd take a crack at it. Especially since I couldn't see how Google would ever be able to link me to my instance, let alone make it easy to search the entire fediverse without having to write out every possible site, with new ones popping up every day.

[-] teuast@lemmy.world 1 points 1 year ago

Easier to find a Reddit post through Google than by Reddit search.

[-] PotjiePig@lemmy.world 5 points 1 year ago

Please pop a reminder here. Commenting for a bump.

Search their name on GitHub and you'll find it. Star it to follow.

Can the reminder bots be migrated too?

[-] QuinicV@lemmy.world 2 points 1 year ago

Interesting. I hadn't even thought about the fact that instance1.[post] and instance2.[post@instance1] are essentially the same thing, or how search engines would handle that. Interested in what you come up with!
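
To make the duplicate problem concrete, here is a minimal sketch of how a search index might collapse federated copies of one post. This is not how any existing project does it. The record fields `canonical_id` and `host` and the hostnames are hypothetical; the only assumption is that every copy of a post carries the URL of the original.

```python
from typing import Iterable


def dedupe_by_canonical_id(results: Iterable[dict], preferred_host: str) -> list[dict]:
    """Keep one result per original post, preferring the copy on preferred_host."""
    chosen: dict[str, dict] = {}
    for result in results:
        key = result["canonical_id"]  # hypothetical field: URL of the original post
        current = chosen.get(key)
        # Take the first copy seen, but swap it out if a later copy lives on
        # the user's preferred instance and the stored one does not.
        if current is None or (
            result["host"] == preferred_host and current["host"] != preferred_host
        ):
            chosen[key] = result
    return list(chosen.values())


# Hypothetical data: post 1 is visible on two instances, post 2 on one.
results = [
    {"canonical_id": "https://instance1.example/post/1", "host": "instance1.example"},
    {"canonical_id": "https://instance1.example/post/1", "host": "instance2.example"},
    {"canonical_id": "https://instance1.example/post/2", "host": "instance1.example"},
]
unique = dedupe_by_canonical_id(results, preferred_host="instance2.example")
```

With this data, `unique` holds two entries, and the copy of post 1 comes from instance2.example because that is the preferred host.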

[-] marsara9@lemmy.world 4 points 1 year ago

Thanks. If you do some digging you can find the project on GitHub but note that it's a work in progress still. The UI is lacking and it's rough around the edges but it's "working". And I still need to do some optimizations on the crawler itself, etc....

It's also going to be completely self-hostable just like Lemmy, etc...

[-] chainsawrobot@kbin.social 2 points 1 year ago

If this guy changes the internet include me in the screenshot.

[-] sgtlighttree@kbin.social 2 points 1 year ago

I can see this being helpful

(commenting so I can bookmark)

[-] Ahhh_Jaysus@lemmy.world 1 points 1 year ago

Shit dude, that'd be a sweet little tool.

[-] meekah@lemmy.world 1 points 1 year ago

IDK, isn't it the same for reddit? It also encourages crossposting, so the same content is on there several times. Maybe I don't understand the fediverse well enough yet, so please correct me if I'm wrong.

[-] jakakatune@lemmy.world 8 points 1 year ago

I am surprised no one mentioned https://fedi-search.com . It's working pretty well. Full credit to Benjamin Pryor for this.

[-] OsakaWilson@lemmy.world 5 points 1 year ago

Digg.com was the big thing with Reddit trailing. Digg began tweaking the experience toward a more profitable model. I had already come to Reddit when they went too far and there was a sudden enormous migration from Digg to Reddit. Digg went from being THE social media aggregator to being nothing in a matter of weeks.

Reddit is more deeply rooted, so I think it will stick around. I'm cool if Reddit keeps those who are happy with the corporate model busy so we can do our thing here.

[-] linearchaos@lemmy.world 1 points 1 year ago

It's certainly not going anywhere unless they end up selling it to someone who shuts it down and uses the posts and links as SEO boosting.

[-] OsakaWilson@lemmy.world 1 points 1 year ago

Well, Digg.com still exists. It's just that no one cares.

[-] linearchaos@lemmy.world 1 points 1 year ago

When you loaded their site just now to test, you doubled their monthly active users.

[-] Draconic_NEO@lemmy.world 2 points 1 year ago

In the future they eventually might be, for some instances. Though definitely not for all of them, since some of the instances might disable indexing.

I've actually already seen a few Lemmy results (lemmy.ml) in Google searches. The trouble is that they don't link to individual posts, just to the community, so they're not particularly useful. So it definitely is possible; it just needs to be improved so that individual posts get indexed.
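
One common way for a site to opt out of indexing is robots.txt. Here is a minimal sketch that checks a robots.txt body with Python's standard `urllib.robotparser`. The two sample files and the `lemmy.example` URL are made up for illustration, and whether a given instance actually serves rules like these is an assumption.

```python
from urllib.robotparser import RobotFileParser


def allows_indexing(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())  # parse text directly, no network access
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt bodies: one blocks all crawlers, one allows them.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
ALLOW_ALL = "User-agent: *\nDisallow:\n"

post_url = "https://lemmy.example/post/1"  # hypothetical instance and post
blocked = allows_indexing(BLOCK_ALL, post_url)
allowed = allows_indexing(ALLOW_ALL, post_url)
```

A well-behaved crawler would skip the first instance entirely, which is why some instances may never show up in search results.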

[-] qwamqwamqwam@sh.itjust.works 2 points 1 year ago

I have seen at least one user claim they got a result from lemmy when searching a question on google. YMMV though. Lemmy is a fraction of the size of reddit, it will take time for posts to reach the level that google starts indexing them specifically.

[-] IMongoose@lemmy.world 1 points 1 year ago

I got one. The Google link brought me to the instance though and not the thread. I was able to find the thread though, so it kinda worked.

[-] Fer24@lemmy.world 2 points 1 year ago

Maybe, but Google will probably try to kill us.

[-] krigo666@lemmy.world 2 points 1 year ago

I think it is preferable to ask other search engines like DuckDuckGo to index Lemmy info. Google is full of garbage.

[-] Anarch157a@lemmy.world 1 points 1 year ago

Brave Search would be better, they have a dedicated section on the results page for discussions.

[-] bizzle@lemmy.world 1 points 1 year ago

Brave is an advertising company and should not be preferred.

[-] static@kbin.social 2 points 1 year ago

Reddit did not start out as the thing to google. It's 15+ years old, and only in the last 5 years did I start prefixing my Google searches with "reddit".

[-] BrerChicken@lemmy.world 1 points 1 year ago

I actually found Reddit by googling things. I had seen it 5 or 6 times over a few years, and eventually I just went to the main site. I might have even used Reddit in the search before I joined. Regardless, I had recognized that all the best answers for tricky problems that I had were coming from Reddit before I even joined 11 years ago.

[-] smilepenguin@lemmy.one 1 points 1 year ago

Lemmy is not as unique as Reddit as a word. I get a lot of Lemmy Kilmister matches. But still hopeful

[-] neblem@lemmy.world 1 points 1 year ago

Use the exclusion keyword for your search provider. For example, on Google, "lemmy -kilmister -motorhead" will get you only Lemmy software results by excluding pages with "kilmister" or "motorhead" in their contents.

[-] snailwizard@lemmy.world 1 points 1 year ago

Correct me if I’m wrong, but if individual admins allow their instances to be indexed, wouldn’t the instance itself have some sort of metadata identifying it as a Lemmy branch?
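
One existing form of such metadata is NodeInfo, a fediverse convention in which a server publishes a small JSON document naming the software it runs. Here is a minimal sketch that reads the software name from such a document. The sample JSON is hand-written for illustration, not fetched from a real instance, and it is an assumption that a search engine would consult this data at all.

```python
import json
from typing import Optional


def software_name(nodeinfo_json: str) -> Optional[str]:
    """Return the lower-cased software name from a NodeInfo document, if present."""
    doc = json.loads(nodeinfo_json)
    if not isinstance(doc, dict):
        return None
    software = doc.get("software")
    if not isinstance(software, dict):
        return None
    name = software.get("name")
    return name.lower() if isinstance(name, str) else None


def is_lemmy(nodeinfo_json: str) -> bool:
    """True when the NodeInfo document names Lemmy as the server software."""
    return software_name(nodeinfo_json) == "lemmy"


# Hand-written sample in the NodeInfo 2.0 shape (illustrative values only).
SAMPLE = '{"version": "2.0", "software": {"name": "lemmy", "version": "0.18.0"}}'
sample_is_lemmy = is_lemmy(SAMPLE)
```

The same check would identify an instance whose domain has no "lemmy" in it, since it looks at the published software name and not the hostname.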

[-] fossilesque@mander.xyz 1 points 1 year ago

I have been finding some!

[-] speaker_hat@lemmy.one 1 points 1 year ago

You just need to backlink to Lemmy from already-known websites and the crawlers will find their way.

See: https://backlinko.com/link-building-strategies

As for getting the search engine to display the correct instance, I don't think that works now, and it would require optimizations.

[-] RIotingPacifist@lemmy.world 1 points 1 year ago

Respectfully: Fuck that.

If you want to find the best instant rice recommendations on Lemmy, Lemmy should have a functional post search function, rather than me relying on a malevolent corporate entity like google to index all the content.

Search has gone to shit as the Internet has embraced social media sites. An upside of this is that Wikipedia + Lemmy + keyword search may be as accurate as asking Google Bard or Bing, and they can be built on entirely open tech.

[-] drmoose@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

Cool rage but you dismissing search indexing is kinda hilarious. It's not going away and it's what makes the web. Would you rather have 3 big websites instead of indexed web?

[-] neblem@lemmy.world 1 points 1 year ago

Basically use <query> site:lemmy.world OR site:lemmy.ml OR site:beehaw.org OR site:kbin.social (or whatever main instances you want to hit)

You can also use this for custom browser search keys like the following: https://duckduckgo.com/?q=%s+site%3Alemmy.world+OR+site%3Alemmy.ml+OR+site%3Abeehaw.org+OR+site%3Akbin.social
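
If you would rather generate that URL than type it by hand, here is a minimal sketch using Python's standard `urllib.parse.urlencode`. The function names are made up for this example, and the instance list is just whatever you choose to pass in.

```python
from urllib.parse import urlencode


def site_filter(instances: list[str]) -> str:
    """Join instance hostnames into a 'site:a OR site:b' search filter."""
    return " OR ".join(f"site:{host}" for host in instances)


def duckduckgo_url(query: str, instances: list[str]) -> str:
    """Build a DuckDuckGo search URL restricted to the given instances."""
    full_query = f"{query} {site_filter(instances)}"
    # urlencode turns spaces into '+' and ':' into '%3A'.
    return "https://duckduckgo.com/?" + urlencode({"q": full_query})


url = duckduckgo_url("instant rice", ["lemmy.world", "lemmy.ml"])
# url == "https://duckduckgo.com/?q=instant+rice+site%3Alemmy.world+OR+site%3Alemmy.ml"
```

Passing "%s" as the query will not give you a browser keyword template, because the percent sign gets encoded too. For that, keep the literal %s in a hand-written URL like the one above.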

[-] QuinicV@lemmy.world 1 points 1 year ago

I imagine that would be quite inconvenient... Especially as Lemmy grows and has potentially many more instances.

[-] subtext@lemmy.world 1 points 1 year ago

I believe that DDG has a shorthand for site:Reddit (without the .com). If lemmy gets popular enough DDG may implement a similar shorthand that incorporates the fediverse without us having to use a massive string. Like if it gets big enough, we may not have to solve this problem because others will see the value in making it easy.

That’s my hope at least.

[-] CascadeDismayed@lemmy.world 1 points 1 year ago

I would argue that eventually, yes, one will be able to google search Lemmy just like Reddit.

[-] meekah@lemmy.world 1 points 1 year ago

Only if we make sure the tech giants don't kill this platform

[-] Secret300@lemmy.world 1 points 1 year ago

How would they? It's all decentralized

[-] 2pt_perversion@lemmy.world 1 points 1 year ago

I wish there was a way to get an entire Reddit archive over here. Realistically I'm still going to have to search Reddit because it has 10+ years of answers to obscure questions.

[-] Ghostalmedia@lemmy.world 1 points 1 year ago

I was searching for the “3 days no poop” meme. Lots of Lemmy stuff showed up.

[-] Jozzo@lemmy.world 1 points 1 year ago

You can use a search query to include only results with Lemmy's footer, which is consistent across all Lemmy instances. I made a post about it here: https://lemmy.world/post/342365

[-] Kururin@lemmy.ml 0 points 1 year ago

It’s up to the individual instance owner, and to Lemmy the software itself enabling SEO. It’s just getting started now, so it will be a long time before that happens.

[-] thingsiplay@kbin.social 0 points 1 year ago

@QuinicV Why would it not be possible? It depends on the software and on whether all text is open to being indexed. Kbin and Lemmy instances are basically open forum software and are indexed by search engines. You can test it in Google or other engines by restricting the search to a single site, for example with the query "site:lemmy.world are posts indexed?", which would return an empty result if they were locked down like Discord content.

[-] QuinicV@lemmy.world 2 points 1 year ago

But what if the post I'm searching for is not on lemmy.world? Say the instance doesn't even have Lemmy in their name, like beehaw.org. How would a search engine index it? How would it know it's part of Lemmy?

[-] linearchaos@lemmy.world 1 points 1 year ago

There will be links to everything somewhere. The same way you knew to get the cave in the same way you know to get to Lemmy. There are already links posted to Reddit, sitting in archives, that are easy to follow. Google doesn't just search one or two things; it searches all the links to those things, and then the links from those things to other things. If Google can't figure out how to get to it, chances are you don't know it's there either.

this post was submitted on 27 Jun 2023
42 points (100.0% liked)
