138
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 24 Aug 2023
138 points (92.6% liked)
Canada
7187 readers
419 users here now
What's going on Canada?
Communities
๐ Meta
๐บ๏ธ Provinces / Territories
- Alberta
- British Columbia
- Manitoba
- New Brunswick
- Newfoundland and Labrador
- Northwest Territories
- Nova Scotia
- Nunavut
- Ontario
- Prince Edward Island
- Quebec
- Saskatchewan
- Yukon
๐๏ธ Cities / Local Communities
- Calgary (AB)
- Edmonton (AB)
- Greater Sudbury (ON)
- Halifax (NS)
- Hamilton (ON)
- Kootenays (BC)
- London (ON)
- Mississauga (ON)
- Montreal (QC)
- Nanaimo (BC)
- Oceanside (BC)
- Ottawa (ON)
- Port Alberni (BC)
- Regina (SK)
- Saskatoon (SK)
- Thunder Bay (ON)
- Toronto (ON)
- Vancouver (BC)
- Vancouver Island (BC)
- Victoria (BC)
- Waterloo (ON)
- Winnipeg (MB)
๐ Sports
Hockey
- List of All Teams: Post on /c/hockey
- General Community: /c/Hockey
- Calgary Flames
- Edmonton Oilers
- Montrรฉal Canadiens
- Ottawa Senators
- Toronto Maple Leafs
- Vancouver Canucks
- Winnipeg Jets
Football (NFL)
- List of All Teams:
unknown
Football (CFL)
- List of All Teams:
unknown
Baseball
- List of All Teams:
unknown
- Toronto Blue Jays
Basketball
- List of All Teams:
unknown
- Toronto Raptors
Soccer
- List of All Teams:
unknown
- General Community: /c/CanadaSoccer
- Toronto FC
๐ป Universities
๐ต Finance / Shopping
- Personal Finance Canada
- BAPCSalesCanada
- Canadian Investor
- Buy Canadian
- Quebec Finance
- Churning Canada
๐ฃ๏ธ Politics
- Canada Politics
- General:
- By Province:
๐ Social and Culture
Rules
Reminder that the rules for lemmy.ca also apply here. See the sidebar on the homepage:
founded 3 years ago
MODERATORS
What we're all calling "AI" right now has basically zero ability to fact check.
Large Language Models are essentially just a form of autocomplete. They predict valid outputs based on statistical analysis of their training data. This makes them quite good at passing the Turing test (ie, convincing the average user that they have something approximating intelligence), but what they completely lack is the ability to evaluate source for reliability. That's why it's so easy to deliberately trick them into repeating false information.
Real fact checking is a lot more than just googling something and finding a source that agrees with you. I can find sources claiming that the Earth is flat, aliens rule the world and Hillary Clinton is a baby eating lizard person. But none of those sources are in any way credible. However explaining why they're not credible is a much more difficult question. Media literacy is a conplex skill, and it's one that involves evaluating a huge number of different criteria, using a large number of different metrics, and it often involves making difficult judgement calls. Even people who are good at media literacy can be fooled, or just get it wrong. The entire study of history is basically about evaluating sources, and there's often serious disagreements over the veracity of a piece of information. Good journalists have to be very careful over exactly how they frame information to disambiguate the exact degree of confidence they have about it (ie, I can say with absolute certainty that this person told me this thing, but I can't say with absolute certainty that what they told me is true)... And that's the good journalists. There are a LOT of bad journalists out there.
It's possible that some hypothetical future generation of AI will be better at fact checking them humans, but that's not what we have today. The only way to get modern LLMs to produce factual information is to be extremely careful about what data they are fed; and even then, they will often just make shit up out of whole cloth from that data. Any output has to be verified by a human operator to avoid situations like Microsoft recommending the Ottawa food bank as a must see tourist attraction.
No, I know that modern AI has no real ability to fact check, but the reason is because they've never been built that way, nor do they have the resources to do it properly. They have no way to know what is a reliable source, nor how to interpret the data in a meaningful way if it needs to be used in an abstract manner.
But I do believe that modern AI technology should be able to do so if given the resources. Create an AI that only references from a list of credible sources, and is able to compare them to what is said elsewhere.
I'm no AI specialist or anything, so maybe I'm completely wrong and such a method wouldn't work. But at the very least, I haven't even heard of any real attempt at making a fact checking AI yet. All the existing ones are shit and only adapt normal language learning models to reference other internet sources regardless of their legitimacy.
The problem is that for any of what you're describing to work, AI has to be capable of comprehension and interpretation, neither of which are capabilities that LLMs have. This would be a quantum leap forward in terms of AI technology.
That's the key thing that has to be understood about "AI"; it fundamentally does not understand any of the words that it's saying. It's engaged in nothing more than extremely complex mimicry. Even a dog has more comprehension of human language than an LLM, and you wouldn't trust a dog to fact check political ads. Remember, even when working from accurate training data, LLMs will still cheerfully invent entirely fictitious data that just happens to fit the pattern of the training data, because that's all they are; pattern matchers.
If I present an AI with the statements "Mike Harris sold our LTC care system to corporate profiteers" and "Mike Harris sold your grandma's house to corporate profiteers" it has no way of accurately determining if the latter statement is true or false, because it fits the pattern of the first statement. A human can instantly distinguish between the concept of a long term care home and a person's privately owned house. An AI doesn't know what a person is, what a long term care home is, what ownership is, what the difference between private and public ownership are, what a house is and how that's different from a long term care home even though both are referred to as homes, what it means to sell something, what profiteering is and whether or not that term accurately describes the actions taken by the corporations that bought most of Ontario's LTC system. And then you have to get into the complex legalities of whether or not you're allowed to use the term "profiteers" in a political ad... It's a nightmare of complexity.
If there's a way to get to what you're describing, from where we are now, no one has come up with it yet and the first company that does will be rich beyond their wildest dreams. We're just not even remotely close to that kind of technology.