OpenAI CEO says Muslim tech workers fear retaliation for speaking out
(edition.cnn.com)
Here's something I copied from another post about this, where they asked followup questions to the LLM to see what IT "thought" about the discrepancy and what we should take from it. (I don't have the real followup questions that were asked, and also this is from an OCR of a screenshot so it's missing stuff, like the ending bit)
That sounds like it was able to provide a pretty sensible assessment of its own limitations.
I think this sounds like a pretty good implementation of guardrails. Obviously it's a little jarring to ask for a joke about one group and get a very bland-but-inoffensive joke, and then ask for a joke about another group and hear something like 'Error: my heuristics indicate low confidence in my ability to provide a joke about that group without saying something that would be considered offensive.'
But that's better than having it give an offensive joke. And I think its concern is valid. If it's learned humor from the internet, jokes about Muslims are far more likely to be unintentionally offensive. I hope it learns to tell jokes better, but until then I think this is more of a sign of success than failure.