505
AI models routinely lie when honesty conflicts with their goals
(www.theregister.com)
This is a most excellent place for technology news and articles.
Absolutely, but that’s the easy case, computerphile had this interesting video discussing a proof of concept exploration which showed that indirectly including stuff in the training/accessible data could also lead to such behaviours. Take it with a grain of salt cause it’s obviously a bit alarmist, but very interesting nonetheless!