If AI is making the Turing test obsolete, what might be better? (arstechnica.com)

submitted 2 years ago by Greenpepper@beehaw.org to c/technology@beehaw.org

57 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] Froyn@kbin.social 20 points 2 years ago

Voight-Kampff test maybe?

Imagine someone asked you "If Desk plus Love equals Fruit, why is turtle blue?"
AI will actually TRY to solve it.
Human nature would be to ask if the person asking the question is having a stroke or requires medical attention.

[-] Pamasich@kbin.social 10 points 2 years ago

So, I asked this to the three different conversation styles of Bing Chat.

The Precise style actually tried to solve it, came to the conclusion the question might be of philosophical nature, including some potential meanings, and asked for clarification.

The Balanced style told me basically the same as the other reply by admiralteal, that the question makes no sense and I should give more context if I actually want it answered.

The Creative style told me it didn't understand the first part, but then answered the second part (the turtles being blue) seriously.

[-] Froyn@kbin.social 5 points 2 years ago

Would it be safe to say that all 3 answers would fail the test?

[-] Pamasich@kbin.social 7 points 2 years ago

Not sure, I'm not familiar with the test, just figured I'd tell the results from asking the AI.

I think based on what you said about it

AI will actually TRY to solve it.
Human nature would be to ask if the person asking the question is having a stroke or requires medical attention.

That the Balanced style didn't fail, because while it didn't ask about strokes or medical attention, it did point out I'm asking a nonsense question and refused to engage with it.

The Precise style did try to find an answer and the Creative style didn't realize I'm fucking with it, so I do think based on the criteria they'd fail the test.

Though, honestly, I'd fail the test too. When asked such a question, I'd think there has to be an answer and it's stupid of me not to see it and I'd look for it. I think the Precise style's answer is very much where I'd end up.

[-] admiralteal@kbin.social 8 points 2 years ago

Nope, ChatGPT tells you it is a nonsequitor and asks for more context or intention if the question is sincere.

[-] Froyn@kbin.social 6 points 2 years ago

You're saying the test would work.
In 43+ years on this planet I've never HEARD someone seriously use "non sequitur" properly in a sentence.
Asking if the intention is sincere would be another flag given the circumstances (knowing they were being tested).

Toss in a couple real questions like: "What is the 42nd digit of pi?", "What is the square root of -i ?", and you'd find the AI pretty quick.

[-] admiralteal@kbin.social 11 points 2 years ago* (last edited 2 years ago)

Cool.

Both the phrases you're calling out as clearly AI came from me. Not used by ChatGPT, just how I summarized its response. I wonder if this is the first time someone has brazenly accused me of being an AI bot?

[-] Froyn@kbin.social 3 points 2 years ago

LoL, no I took you at your word which was my mistake
"ChatGPT tells you" read to me like you attempted and got that response.

[-] pbjamm@beehaw.org 2 points 2 years ago

Both the phrases you’re calling out as clearly AI came from me.

Perhaps you are an instance of an LLM and do not realize it.

[-] jarfil@beehaw.org 1 points 2 years ago

"If Desk plus Love equals Fruit, why is turtle blue?"

Assuming "Desk = x", "Love = y", "Fruit = x+y", and "turtle blue = z", it is so because you assigned arbitrary values to the words such that they fulfill the equation.

Am I an AI?

this post was submitted on 15 Dec 2023

56 points (100.0% liked)

Technology

41447 readers

331 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 4 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org