[-] localhost@beehaw.org 5 points 12 hours ago* (last edited 12 hours ago)

This feels to me like the LLM misinterpreted it as some kind of fictional villain talk and started to autocomplete it.

Could also be the model simply breaking. There was a time when Sydney (Bing AI, or whatever they call it now) had to be constrained to 10 messages per conversation and given some sort of supervisor on top of itself, because it would occasionally throw a fit or start threatening the user for no reason.

[-] localhost@beehaw.org 22 points 4 months ago

The paracausal tarrasque seems like a genuinely interesting concept. Gives me False Hydra vibes.

[-] localhost@beehaw.org 7 points 4 months ago

Both threads appeared on my feed near one another, and I figured it was on topic given that the other one is directly referenced in the main post here. If OP can reference another post to complain about hate, I think it's fair game for me to truthfully add that their conduct in that very same thread was also excessively hateful - how else are we supposed to discuss the main subject of this post at all?

[-] localhost@beehaw.org 7 points 4 months ago

I have read the blog post that you've linked, and it is full of exaggeration.

The developer rejected a PR that changed one instance of he/him to they/them in the documentation, responded with "This project is not an appropriate arena to advertise your personal politics.", and then promptly got brigaded. Similar PRs kept appearing and getting closed from time to time.

A satirical PR was opened and closed for being spam - despite the blogger's commentary, it's abundantly clear that the developer didn't call the person opening the PR "spam" (what would that even mean?).

The project also had its code of conduct modified, probably in response to the brigading, to essentially include the aforementioned "not an appropriate arena to advertise your personal politics or religious beliefs" line - I don't know which part of this the blogger considers "white supremacist" language.

From what I can tell, this is all they've done. No racism, no sexism, no white supremacy. Would it have been better if they had just accepted the PR? Yes. Does it make the developer part of one of the worst groups of people that ever existed? No.

[-] localhost@beehaw.org 12 points 4 months ago

When I created an account here, I thought Beehaw was specifically a platform where unnecessarily throwing vitriol is discouraged.

A non-native speaker being stubborn about not using "they/them" in gender-neutral contexts (especially when most if not all of these weren't even about people) is not enough to label them an incel, a transphobe, or a racist.

Intentionally mischaracterizing other human beings and calling them derogatory names that they don't deserve is, in my opinion, against the spirit of the platform.

[-] localhost@beehaw.org 13 points 4 months ago

> The most recent example I’ve noticed is around the stuff with the Ladybird devs being weird about being asked to use inclusive pronouns, but it seems like a pattern.

You mean the thread where you, out of nowhere, called the maintainers "incels, transphobes, and racists" over a singular instance of them using "he/him" as a gender-neutral pronoun in the documentation and refusing to change it?

[-] localhost@beehaw.org 7 points 5 months ago

I don't think your assumption holds. Corporations are not, as a rule, incompetent - in fact, they tend to be really competent at squeezing profit out of anything. They are misaligned, which is much more dangerous.

I think the more likely scenario is also more grim:

AI actually does continue to advance and gets better and better, displacing more and more jobs. It doesn't happen instantly, so barely anything gets done. Some half-assed regulations are attempted but predictably end up either not doing anything, postponing the inevitable by a small amount of time, or causing more damage than doing nothing would. Corporations grow in power, build their own autonomous armies, and exert pressure on governments to leave them unregulated. Eventually all resources are managed by and for a few rich assholes, while the rest of the world tries to survive without angering them.
If we're unlucky, some of those corporations end up being managed by a maximizer AGI with no human supervision, and then the Earth pretty much becomes an abstract game with a scoreboard, where money (or whatever the equivalent is) is the score.

The limitations of the human body act as an important balancing factor in keeping democracies from collapsing. No human can rule a nation alone - they need armies and workers. Intellectual work is especially important (unless you have some other source of income to outsource it), but it requires good living conditions to develop and sustain. Once intellectual work is automated, infrastructure like schools, roads, hospitals, and housing ceases to be important to the rulers - they can hand it to the army as a reward and make the rest of the population do manual work. Then, if manual work and policing through force also become automated, there is no need even for those slivers of decency.
Once a single human can rule a nation, there are enough rich psychopaths for one of them to attempt it.

There are also other AI-related pitfalls that humanity may fall into in the meantime - automated terrorism (e.g. swarms of autonomous small drones with explosive charges using face recognition to target entire ideologies by tracking social media), misaligned AGI going rogue (e.g. the famous paperclip maximizer, although probably not exactly this scenario), collapse of the internet due to propaganda bots using next-gen generative AI... I'm sure there's more.

[-] localhost@beehaw.org 5 points 7 months ago

I'd honestly go one step further and say that the problem cannot be fully solved, period.

There are limited uses for voice cloning: commercial (voice acting), malicious (impersonation), accessibility (TTS readers), and entertainment (porn, non-commercial voice acting, etc.).

Out of all of these, only commercial use can really be regulated away, as corporations tend to be risk-averse. Accessibility use is mostly not an issue, since it usually doesn't matter whose voice is being used as long as it's clear and understandable. Then there's entertainment. This one is both the most visible and arguably the least likely to disappear. Long story short, convincing enough voice cloning is easy - there are cutting-edge projects for it on GitHub, written by a single person and trained on a single PC, capable of being run locally on average hardware. People are going to keep using it, just like they used Photoshop to swap faces and manual audio editing software to mimic voices in the past. We're probably better off just accepting that this usage is here to stay.

And lastly, malicious usage - in courts, in scam calls, in defamation campaigns, etc. There's a strong incentive for malicious actors to develop and improve these technologies. We should absolutely try to find ways to limit their usage, but this will be an eternal cat-and-mouse game. Our best bet is to minimize how much we trust voice recordings as a society and, for legal stuff, to develop some kind of cryptographic signature that would confirm whether or not a recording was taken using a certified device - these are bound to be tampered with, especially in high-profile cases, but it should hopefully somewhat limit the damage.
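To illustrate what I mean by a signature scheme, here's a minimal sketch of the idea using Ed25519 via Python's `cryptography` library. The function names and the key handling are made up for illustration - this isn't any real standard, and a real scheme would involve secure hardware and signed metadata:

```python
# Minimal sketch of the "certified device" idea: the device signs each
# recording with a private key it holds, and anyone with the published
# public key can later check that the file wasn't swapped out.
# (Illustrative only - not a real standard or product.)
from cryptography.hazmat.primitives.asymmetric.ed25519 import (
    Ed25519PrivateKey,
    Ed25519PublicKey,
)
from cryptography.exceptions import InvalidSignature


def sign_recording(device_key: Ed25519PrivateKey, audio: bytes) -> bytes:
    # Sign the raw audio bytes; a real scheme would also cover metadata
    # such as timestamp and device ID to prevent replay.
    return device_key.sign(audio)


def verify_recording(public_key: Ed25519PublicKey, audio: bytes, signature: bytes) -> bool:
    try:
        public_key.verify(signature, audio)
        return True
    except InvalidSignature:
        return False


if __name__ == "__main__":
    device_key = Ed25519PrivateKey.generate()  # stands in for a key kept in secure hardware
    public_key = device_key.public_key()

    recording = b"...raw audio samples..."
    signature = sign_recording(device_key, recording)

    print(verify_recording(public_key, recording, signature))             # True
    print(verify_recording(public_key, recording + b"edit", signature))   # False: tampered
```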

[-] localhost@beehaw.org 6 points 1 year ago

GPT-3 is pretty bad at it compared to alternatives (although it's hard to compete with calculators in that field), but if it were just repeating after the training dataset, it would be way worse. From the study I've linked in my other comment (https://arxiv.org/pdf/2005.14165.pdf):

> On addition and subtraction, GPT-3 displays strong proficiency when the number of digits is small, achieving 100% accuracy on 2 digit addition, 98.9% at 2 digit subtraction, 80.2% at 3 digit addition, and 94.2% at 3-digit subtraction. Performance decreases as the number of digits increases, but GPT-3 still achieves 25-26% accuracy on four digit operations and 9-10% accuracy on five digit operations, suggesting at least some capacity to generalize to larger numbers of digits.

> To spot-check whether the model is simply memorizing specific arithmetic problems, we took the 3-digit arithmetic problems in our test set and searched for them in our training data in both the forms "<NUM1> + <NUM2> =" and "<NUM1> plus <NUM2>". Out of 2,000 addition problems we found only 17 matches (0.8%) and out of 2,000 subtraction problems we found only 2 matches (0.1%), suggesting that only a trivial fraction of the correct answers could have been memorized. In addition, inspection of incorrect answers reveals that the model often makes mistakes such as not carrying a "1", suggesting it is actually attempting to perform the relevant computation rather than memorizing a table.
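For anyone curious what such a spot-check looks like in practice, here's a rough toy version of the idea (my own sketch with made-up data, not the paper's actual code):

```python
# Rough sketch of the kind of contamination spot-check described above:
# take the test problems and count how many appear verbatim in the training
# text in either the "a + b =" or "a plus b" form. (Toy data only.)
import random


def contamination_rate(training_text: str, problems: list[tuple[int, int]]) -> float:
    hits = 0
    for a, b in problems:
        if f"{a} + {b} =" in training_text or f"{a} plus {b}" in training_text:
            hits += 1
    return hits / len(problems)


if __name__ == "__main__":
    random.seed(0)
    # 2,000 random 3-digit addition problems, mirroring the setup in the quote.
    problems = [(random.randint(100, 999), random.randint(100, 999)) for _ in range(2000)]
    # Stand-in for the (vastly larger) real training corpus.
    training_text = "the report said 214 plus 387 people attended, and 100 + 100 = 200"
    rate = contamination_rate(training_text, problems)
    print(f"{rate:.2%} of test problems found verbatim in training text")
```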

[-] localhost@beehaw.org 7 points 1 year ago

In my comment I was referencing https://arxiv.org/pdf/2005.14165.pdf, specifically section 3.9.1, where they summarize the results of the arithmetic tasks.

[-] localhost@beehaw.org 46 points 1 year ago* (last edited 1 year ago)

That's not entirely true.

LLMs are trained to predict the next word given context, yes. But in order to do that, they develop an internal model that minimizes error across a wide range of contexts - and an emergent feature of this process is that the model DOES do more than pure compression of the training data.

For example, GPT-3 is able to solve addition and subtraction problems that didn't appear in the training dataset. This would suggest that the model learned how to perform addition and subtraction, likely because it was easier or more efficient than storing all of the examples from the training data separately.

This is a simple-to-measure example, but it's enough to suggest that LLMs are able to extrapolate from the training data and do more than just stitch relevant parts of the dataset together.
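If you wanted to probe this yourself, the rough idea would be to generate random multi-digit problems (very unlikely to appear verbatim in any training corpus) and score the model's answers against the true results - something like this sketch, where `ask_model` is just a placeholder for whatever model or API you have access to:

```python
# Sketch of probing arithmetic generalization: generate random multi-digit
# problems and score the model's replies against the true sums.
# `ask_model` is a placeholder, not a real API - plug in your own model call.
import random
import re


def ask_model(prompt: str) -> str:
    raise NotImplementedError("plug in your model/API call here")


def arithmetic_accuracy(n_problems: int = 100, digits: int = 4) -> float:
    lo, hi = 10 ** (digits - 1), 10 ** digits - 1
    correct = 0
    for _ in range(n_problems):
        a, b = random.randint(lo, hi), random.randint(lo, hi)
        reply = ask_model(f"Q: What is {a} plus {b}?\nA:")
        match = re.search(r"-?\d+", reply)  # take the first number in the reply
        if match and int(match.group()) == a + b:
            correct += 1
    return correct / n_problems
```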
