836

submitted 4 months ago by cm0002@suppo.fi to c/programmer_humor@programming.dev

59 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] Tetragrade@leminal.space 2 points 4 months ago* (last edited 4 months ago)

I’m not sure I understand what you’re saying. By “the commenter”

I was talking about you, but not /srs, that was an attempt @ satire. I'm dismissing the results by appealing to the fact that there's a process.

negative reward

Reward is an AI maths term. It's the value according to which the neurons are updated, similar to "loss" or "error", if you've heard those.

I don’t believe this makes sense either way because if the model was producing garbage tokens, it would be obvious and caught during training.

Yes this is also possible, it depends on minute details of the training set, which we don't know.

Edit: As I understand, these models are trained in multiple modes, one where they're trying to predict text (supervised learning), but there are also others where it's given a prompt, and the response is sent to another system to be graded i.e. for factual accuracy. It could learn to identify which "training mode" it's in and behave differently. Although, I'm sure the ML guys have already thought of that & tried to prevent it.

it still does not make it sentient (or even close).

I agree, noted this in my comment. Just saying, this isn't evidence either way.

[-] MadhuGururajan@programming.dev 0 points 4 months ago

I'm sure the ML Guys thought of that & tried to prevent it.

Deferring to authority is fine as long as you don't make assumptions about what happened or didn't happen.

[-] Tetragrade@leminal.space 1 points 4 months ago* (last edited 4 months ago)

Because it's an obvious possibility even to me, I mean. I guess they could just be stupid. 🤷

[-] MadhuGururajan@programming.dev 1 points 4 months ago

I feel like it is not wise to discard the opinion of a layperson with this reasoning. Sure experts have been working on it as their day job vs. Us just looking at the fruits of their labour. But that doesn't justify the assumption that they are infallible. Don't you agree in our own areas of supposed expertise we are often corrected or get inspiration from supposed laymen simply because we have been too myopic about solving the problem ahead of us?

this post was submitted on 25 Nov 2025

836 points (98.7% liked)

Programmer Humor

30816 readers

468 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

Keep content in english
No advertisements
Posts must be related to programming or programmer topics

founded 2 years ago

MODERATORS

Feyter@programming.dev

anzo@programming.dev

BurningTurtle@programming.dev

pylapp@programming.dev