533

Google's AI made up a fake cheese fact that wound up in an ad for Google's AI, perfectly highlighting why relying on AI is a bad idea (www.pcgamer.com)

submitted 6 months ago by cm0002@lemmy.world to c/technology@lemmy.world

53 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[-] MudMan@fedia.io 156 points 6 months ago

I keep having to repeat this, but the conversation does keep going on a loop: LLMs aren't entirely useless and they're not search engines. You shouldn't ask it any questions you don't already know the answer to (or have the tools to verify, at least).

[-] chemical_cutthroat@lemmy.world 81 points 6 months ago

Yeah. Everyone forgot the second half of "Trust, but Verify". If I ask an LLM a question, I'm only doing it because I'm not 100% sure how to look up the info. Once it gives me the answer, I'm checking that answer with sources because it has given me a better ability to find what I was looking for. Trusting an LLM blindly is just as bad as going on Facebook for healthcare advice.

[-] MudMan@fedia.io 30 points 6 months ago

Yep. Or because you can recognize the answer but can't remember it off the top of my head. Or to check for errors on a piece of text or code or a translation, or...

It's not "trust but verify", which I hate as a concept. It's just what the tech can and cannot do. It's not a search engine finding matches to a query inside a large set of content. It's a stochastic text generator giving you the most likely follow up based on its training dataset. It's very good autocorrect, not mediocre search.

[-] eronth@lemmy.world 18 points 6 months ago

I find LLMs very useful for setting up tech stuff. "How do I xyz in docker?" It does a great job of boiling together several disjointed How Tos that don't quite get me there into one actually usable one. I use it when googling and following articles isn't getting me anywhere, and it's often saved so much time.

[-] jmcs@discuss.tchncs.de 15 points 6 months ago

They are also amazing at generating configuration that's subtly wrong.

For example, if the bad LLM generated configurations I caught during pull requests reviews are any example, there are plenty of people with less experienced teams running broken kubernetes deployments.

Now, to be fair, inexperienced people would make similar mistakes, but inexperienced people are capable of learning with their mistakes.

[-] danc4498@lemmy.world 2 points 6 months ago

I thought it was “butt verify” whoops

[-] bingbong@lemmy.dbzer0.com 5 points 6 months ago

Trust butt, verify

[-] danc4498@lemmy.world 4 points 6 months ago

✅ Verified

[-] catloaf@lemm.ee 29 points 6 months ago

Or if you're fine with non-factual answers. I've used chatgpt various times for different kinds of writing, and it's great for that. It can give you ideas, it can rephrase, it can generate lists, it can help you find the word you're trying to think of (usually).

But it's not magic. It's a text generator on steroids.

[-] MudMan@fedia.io 10 points 6 months ago

Sure! Used as... you know, what it is, there's a lot of fun/useful stuff you can do. It's just both AIbro shills and people who have decided to make hating on this tech a core part of their personality have misrepresented that.

It's indeed very, very good text generation/text parsing. It is not a search engine, the signularity, Skynet or a replacement for human labor in the vast majority of use cases.

[-] faythofdragons@slrpnk.net 3 points 6 months ago

I had to tell DDG to not give me an AI summary of my search, so its clearly intended to be used as a search engine.

[-] MudMan@fedia.io 14 points 6 months ago

"Intended" is a weird choice there. Certainly the people selling them are selling them as search engines, even though they aren't one.

On DDG's implementation, though, you're just wrong. The search engine is still the search engine. They are using an LLM as a summary of the results. Which is also a bad implementation, because it will do a bad job at something you can do by just... looking down. But, crucially, the LLM is neither doing the searching nor generating the results themselves.

[-] faythofdragons@slrpnk.net 1 points 6 months ago

What do you mean its not generating the results? If the summation isn't generated, wheres it come from?

[-] r4venw@sh.itjust.works 4 points 6 months ago

I dont want to speak for OP but I think they meant its not generating the search results using an LLM

[-] faythofdragons@slrpnk.net 1 points 6 months ago

Maybe I just don't know what "generating results" means. You query a search engine, and it generates results as a page of links. I don't understand how generating a page of links is fundamentally different from generating a summation of the results?

[-] r4venw@sh.itjust.works 6 points 6 months ago

Its a very different process. Having work on search engines before, I can tell you that the word generate means something different in this context. It means, in simple terms, to match your search query with a bunch of results, gather links to said results, and then send them to the user to be displayed

[-] faythofdragons@slrpnk.net 2 points 6 months ago

then send them to the user to be displayed

This is where my understanding breaks. Why would displaying it as a summary mean the backend process is no longer a search engine?

[-] MudMan@fedia.io 3 points 6 months ago

The LLM is going over the search results, taking them as a prompt and then generating a summary of the results as an output.

The search results are generated by the good old search engine, the "AI summary" option at the top is just doing the reading for you.

And of course if the answer isn't trivial, very likely generating an inaccurate or incorrect output from the inputs.

But none of that changes how the underlying search engine works. It's just doing additional work on the same results the same search engine generates.

EDIT: Just to clarify, DDG also has a "chat" service that, as far as I can tell, is just an UI overlay over whatever model you select. That just works the same way as all the AI chatbots you can use online or host locally and I presume it's not what we're talking about.

[-] faythofdragons@slrpnk.net 2 points 6 months ago

I see, you're splitting the UI from the backend as two different things, and Im seeing them as parts to a whole.

[-] MudMan@fedia.io 3 points 6 months ago

Well, yeah, there are multiple things feeding into the results page they generate for you. Not just two. There's the search results, there's an algorithmic widget that shows different things (so a calculator if you input some math, a translation box if you input a translation request, a summary of Wikipedia or IMDB if you search for a movie or a performer, that type of thing). And there is a pop-up window with an LLM-generated summary of the search results now.

Those are all different pieces. Your search resutls for "3 divided by 7" aren't different because they also pop up a calculator for you at the top of the page.

[-] faythofdragons@slrpnk.net 3 points 6 months ago

Yeah, for some reason I was thinking you were trying to say that bolting on widgets made it no longer a search engine.

[-] TriflingToad@sh.itjust.works 2 points 6 months ago

LLMs are good for some searches or clarification that the original website doesn't say. Ex the "BY" attribute in creative commons being acronymed to "BY" (by John Doe) and not "AT" (attributed to John Doe)

load more comments (10 replies)

[-] cyrano@lemmy.dbzer0.com 40 points 6 months ago

The weirdness came partway through, when the ad actually showed Google Gemini in action. It told the cheese vendor that Gouda accounts for "50 to 60 percent of the world's cheese consumption." Now, Gouda's hardly a hardcore real head pick like Roquefort or BellaVitano, but there's also no way it's pulling in cheddar or mozzarella numbers. Travel blogger Nate Hake and Google-focused Twitter account Goog Enough documented the erroneous initial version of the ad, but Google responded by quietly swapping in a more accurate Gemini-suggested blurb in all live versions of the ad, including the one that aired during the Super Bowl.

[-] SmackemWittadic@lemmy.world 10 points 6 months ago

They should have kept quiet and let Google show how shit they are on live TV

[-] CosmoNova@lemmy.world 39 points 6 months ago

This is like the dozenth time Google put hallucinations in their AI presentation/AD. They just don‘t care.

[-] Gormadt@lemmy.blahaj.zone 14 points 6 months ago

Especially considering that the "pointing out of said hallucinations" comes much later than when they're shared. And NEVER made it as far and wide as the initial bullshit.

[-] Dkarma@lemmy.world 19 points 6 months ago

Stop calling gpt ai

[-] samus12345@lemm.ee 12 points 6 months ago

That's the inaccurate name everyone's settled on. Kinda like how "sentient" is widely used to mean "sapient" despite being two different things.

[-] Hackworth@lemmy.world 6 points 6 months ago

I made a smartass comment earlier comparing AI to fire, but it's really my favorite metaphor for it - and it extends to this issue. Depending on how you define it, fire seems to meet the requirements for being alive. It tends to come up in the same conversations that question whether a virus is alive. I think it's fair to think of LLMs (particularly the current implementations) as intelligent - just in the same way we think of fire or a virus as alive. Having many of the characteristics of it, but being a step removed.

[-] dream_weasel@sh.itjust.works 5 points 6 months ago

That is an extremely apt parallel!

(I'm stealing it)

[-] dan@upvote.au 6 points 6 months ago* (last edited 6 months ago)

How is it not AI? Just because it's not AGI doesn't mean it's not AI. AI encompasses a lot of things.

[-] absentbird@lemm.ee 6 points 6 months ago

This article is about Gemini, not GPT. The generic term is LLM: Large Language Model.

[-] cyd@lemmy.world 15 points 6 months ago

Slightly off topic, but the writing on this article is horrible. Optimizing for Google engagement, it seems. Ironically, an AI would probably have produced something vastly more readable.

[-] corsicanguppy@lemmy.ca 5 points 6 months ago

begs the question

Not it doesn't. Did an Ai slop this story too?

[-] nyan@lemmy.cafe 9 points 6 months ago

It's an obsolete usage of "beg" that's now preserved only in that particular set phrase. One of English's many linguistic fossils, which you should learn more about before trying to critique anyone's language use.

[-] futatorius@lemm.ee 1 points 6 months ago

It’s an obsolete usage of “beg”

It's a misuse of the cliche "begs the question" (which goes back to medieval Latin petitio principii) which is used to call out a form of fallacious reasoning where the desired answer is smuggled into the assumptions. And yeah, that use of "beg" is obsolete, but even worse, the whole phrase is now misused to mean "prompts the question."

[-] jaek@lemmy.world 2 points 6 months ago

the whole phrase is now misused

[-] TheGrandNagus@lemmy.world 7 points 6 months ago

Not it doesn't. Did an Ai slop this story too?

No it doesn't. Did an AI slop this story too?

[-] 7toed@midwest.social 2 points 6 months ago

Why post the same comment?

[-] TheGrandNagus@lemmy.world 12 points 6 months ago* (last edited 6 months ago)

Fair question.

That user goes around issuing weird and pointless corrections to other people's comments, even sometimes to the point of personally insulting people who make grammatical or spelling errors – often common ones that non-native speakers make, so I thought it'd be funny to do the same in turn, since their comment history is filled with much of the same.

I wouldn't usually do it, it's a pointless exercise IMO.

[-] 7toed@midwest.social 2 points 6 months ago

Can take the user off reddit, but the reddit never leaves the user

[-] Hackworth@lemmy.world 4 points 6 months ago* (last edited 6 months ago)

Flames burn and smoke asphyxiates, perfectly highlighting why relying on fire is a bad idea.

[-] JesusTheCarpenter@feddit.uk 4 points 6 months ago

I totally get all the concerns related to AI. However, the bandwagon of: "look it made a mistake, it's useless!" is a bit silly.

First of all, AI is constantly improving. Remember everyone laughing at AI's mangled fingers? Well, that has been fixed some time ago. Now pictures of people are pretty much indistinguishable from real ones.

Second, people also make critical mistakes, plenty at that. The question is not whether AI can be absolutely accurate. The question is whether AI can make on average fewer mistakes than human.

I hate the idea of AI replacing everything and everyone. However, pretending that AI will not be eventually faster, better, cheeper and more accurate that most humans is wishful thinking. I honestly think that our only hope is legislation, not the desperate wish that AI will always need human supervision and input to be correct.

load more comments

this post was submitted on 11 Feb 2025

533 points (98.9% liked)

Technology

74348 readers

324 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws