1300
you are viewing a single comment's thread
view the rest of the comments
[-] A_Porcupine@lemmy.world 21 points 2 years ago

The saying "ask a stupid question, get a stupid answer" comes to mind here.

[-] UnderpantsWeevil@lemmy.world 39 points 2 years ago

This is more an issue of the LLM not being able to parse simple conjunctions when evaluating a statement. The software is taking shortcuts when analyzing logically complex statements and producing answers that are obviously wrong to an actual intelligent individual.

These questions serve as a litmus test to the system's general function. If you can't reliably converse with an AI on separate ideas in a single sentence (eat watermellon seeds AND drive drunk) then there's little reason to believe the system will be able to process more nuanced questions and yield reliable answers in less obviously-wrong responses (can I write a single block of code to output numbers from 1 to 5 that is executable in both Ruby and Python?)

The primary utility of the system is bound up in the reliability of its responses. Examples like this degrade trust in the AI as a reliable responder and discourage engineers from incorporating the features into their next line of computer-integrated systems.

[-] TheGreenGolem@lemmy.dbzer0.com 5 points 2 years ago

Unfortunately that ship has sailed but this is what I say from the start of these: don't call them Artificial Intelligence. There is absolutely zero intelligence there.

[-] Even_Adder@lemmy.dbzer0.com 2 points 2 years ago

They didn't use Bing Chat, which is the actual AI powered search.

[-] Ultraviolet@lemmy.world 6 points 2 years ago

If a search engine is going to put a One True Answer in a massive font above all other results, they should be pretty confident in it. Yes, tech-literate people know the "featured snippet" thing is dogshit and to ignore it, but there are a lot of people that just look at that and think they have their answer.

[-] Even_Adder@lemmy.dbzer0.com 1 points 2 years ago

That's a completely separate problem from confusing two different products.

[-] Chunk@lemmy.world 1 points 2 years ago

We have a new technology that is extremely impressive and is getting better very quickly. It was the fastest growing product ever. So in this case you cannot dismiss the technology because it doesn't understand trick questions yet.

[-] UnderpantsWeevil@lemmy.world 1 points 2 years ago

new technology that is extremely impressive

Language graphs are a very old technology. What OpenAI and other firms have done is to drastically increase the processing power and disk space allocated to pre-processing. Far from cutting edge, this is a heavy handed brute force approach that can only happen with billions in private lending to prop it up.

It was the fastest growing product ever

this post was submitted on 27 Dec 2023
1300 points (96.0% liked)

Microblog Memes

10402 readers
675 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

RULES:

  1. Your post must be a screen capture of a microblog-type post that includes the UI of the site it came from, preferably also including the avatar and username of the original poster. Including relevant comments made to the original post is encouraged.
  2. Your post, included comments, or your title/comment should include some kind of commentary or remark on the subject of the screen capture. Your title must include at least one word relevant to your post.
  3. You are encouraged to provide a link back to the source of your screen capture in the body of your post.
  4. Current politics and news are allowed, but discouraged. There MUST be some kind of human commentary/reaction included (either by the original poster or you). Just news articles or headlines will be deleted.
  5. Doctored posts/images and AI are allowed, but discouraged. You MUST indicate this in your post (even if you didn't originally know). If a post is found to be fabricated or edited in any way and it is not properly labeled, it will be deleted.
  6. Be nice. Take political debates to the appropriate communities. Take personal disagreements to private messages.
  7. No advertising, brand promotion, or guerrilla marketing.

Related communities:

founded 2 years ago
MODERATORS