148

Safety and Research were Sacrificed for Profit under Altman (www.theatlantic.com)

submitted 2 years ago by sculd@beehaw.org to c/technology@beehaw.org

40 comments fedilink hide all child comments

Article from The Atlantic, archive link: https://archive.ph/Vqjpr

Some important quotes:

The tensions boiled over at the top. As Altman and OpenAI President Greg Brockman encouraged more commercialization, the company’s chief scientist, Ilya Sutskever, grew more concerned about whether OpenAI was upholding the governing nonprofit’s mission to create beneficial AGI.

The release of GPT-4 also frustrated the alignment team, which was focused on further-upstream AI-safety challenges, such as developing various techniques to get the model to follow user instructions and prevent it from spewing toxic speech or “hallucinating”—confidently presenting misinformation as fact. Many members of the team, including a growing contingent fearful of the existential risk of more-advanced AI models, felt uncomfortable with how quickly GPT-4 had been launched and integrated widely into other products. They believed that the AI safety work they had done was insufficient.

Employees from an already small trust-and-safety staff were reassigned from other abuse areas to focus on this issue. Under the increasing strain, some employees struggled with mental-health issues. Communication was poor. Co-workers would find out that colleagues had been fired only after noticing them disappear on Slack.

Summary: Tech bros want money, tech bros want speed, tech bros want products.

Scientists want safety, researchers want to research...

all 43 comments

sorted by: hot top controversial new old

[-] sonori@beehaw.org 47 points 2 years ago* (last edited 2 years ago)

The best part of Open AI’s self professed goal to make an AGI is that the more we learn about LLM’s the more it becomes clear that they inherently can never bridge the gap to AGI.

One would almost think the constant complaining about mythical dangers of AGI might be a distraction from the real more mundane dangers LLM’s pose here and now like exasperating bias, making mass misinformation easy, and of course shielding major companies from accountability.

Or the other option is that it’s just marketing, look at how scary our totally real product is, look how fast it improved when we went from a medium sized dataset to the largest that will ever be possible, don’t ask questions like why would a autocomplete that has been feed the entire internet actually help our business, just pay us and bolt it on to whatever you can.

[-] sculd@beehaw.org 19 points 2 years ago

LLM's ability to replace jobs is honestly more terrifying than so called AGI.

At least with AGI, if they really can think like human, is that they may actually think about the implications of their actions....

[-] AlternateRoute@lemmy.ca 12 points 2 years ago

Robots / automation have replaced so many human physical labor jobs, even large dumb heavy machinery.

Language models replacing mundane human language tasks is hardly surprising.

I have replaced entire employee jobs with scrips / code, there are a lot of very basic jobs out there.

[-] sonori@beehaw.org 11 points 2 years ago

Scripts and automation do what thier programmed to. There are bugs and mistakes, but you can theoretically get something programmed right. LLM’s generate text that looks like a human language. If they were just getting used to make up random bullshit it wouldn’t be a problem, but there are few applications where random bullshit is actually beneficial.

[-] AlternateRoute@lemmy.ca 3 points 2 years ago

Just like the executives assist that was tasked with scanning documents. And LLM can likely safely and quickly do many people tasks:

summarize meeting transcripts
highlight nest steps
take an auto line and some data and turn it into words

There are a lot of human language job tasks that have zero imagination required just the ability to read summarize and write some proper English.

[-] sonori@beehaw.org 13 points 2 years ago

Thouse all sound like things where it might be really bad if it injects untrue information, and with an LLM, by definition it has no understanding of what it’s summarizing. It could be especially bad if the people useing it actually trust what it outputs as facts about what was fed into it, but if they don’t and still check the source than what’s the point.

[-] brothershamus@kbin.social 3 points 2 years ago

"That's not writing, that's just typing!"

[-] AlternateRoute@lemmy.ca 1 points 2 years ago

If I hand someone a set of bullet notes and ask them to send out a notice in writing to the company. They are going to convert those notes into paragraphs and sentences.. Not just send out the notes.

Also MS already has a module for teams that will take the conversation transcript, and output action items based on the conversation.. It is like having a note taker during the meeting. https://www.youtube.com/watch?v=N1gpkk-MwpY

[-] HopeOfTheGunblade@kbin.social 8 points 2 years ago

Oh, I'm sure they will. That is not, in the slightest, the same as caring about said implications in ways that mean that the species won't get murked, though.

[-] SnotFlickerman@lemmy.blahaj.zone 23 points 2 years ago* (last edited 2 years ago)

I expected as much, I had this feeling about Altman, too. The draw of profit became too much for him, and the board called him on it and let him go.

Which makes it even worse that now they're groveling at his feet to return.

Ugh.

[-] Monument@lemmy.sdf.org 10 points 2 years ago

I just saw a headline that he’s going to work for Microsoft now.

My employer heavily uses Microsoft, and I’m in IT.

Since June, Microsoft eliminated all their training staff - the folks who show others how to use their software, reclassified their customer experience staff to eliminate the role - these folks met with customers to solicit product feedback and find out what people actually want, made unilateral and poorly communicated changes to security policies that impact hundreds of our users, turned on beta (preview) features for end users without testing - in some cases rendering software inoperable in our environment, and is disabling or limiting features that work(ed) in software covered under our enterprise license end is encouraging people to purchase entirely new software systems from Microsoft to regain the lost functionality.

Honestly, if he was fired for pursuing profits over quality, then he’ll fit right in.

[-] sculd@beehaw.org 6 points 2 years ago

Well the idea to ask for him to return came from MS and not from the board themselves. At least that plan failed according to media report.

[-] canis_majoris@lemmy.ca 2 points 2 years ago

His return deal totally capsized, he's out as CEO still. The old CEO of Twitter, Emmet Shear, is now in charge.

[-] Mac@mander.xyz 21 points 2 years ago

[Resource] sacrificed for profit under [CEO].

[-] canis_majoris@lemmy.ca 18 points 2 years ago

Nothing about this is safe. It's easily the worst misinformation tool in decades. I've used it to help me at work, GPT-4 is built into O365 corp plans, but all the jailbroken shit scares the hell out of me.

Between making propaganda and deepfakes this shit is already way out of hand.

[-] sylverstream@lemmy.nz 5 points 2 years ago

What do you mean by jailbroken stuff?

We've recently got copilot at M365 and so far it's been a mixed bag. Some handy things but also some completely wrong information.

[-] canis_majoris@lemmy.ca 10 points 2 years ago* (last edited 2 years ago)

Stuff without the guardrails, stuff that's been designed to produce porn, or totally answer truthfully to queries such as "how do I build a bomb" or "how do I make napalm" which are common tests to see how jailbroken any LLM is. When you feed something the entire internet, or even subsections of the internet, it tends to find both legal and illegal information. Also the ones designed to generate porn have gone beyond that boring shitty AI art style and now people are generating human being deepfakes, and it's become a common tactic to spam places with artificial CSAM to cause problems with services. It's been a recent and long-standing issue with Lemmy - people like Exploding Heads or Hexbear will get defederated and then out of retaliation will spam the servers that defederated from them with said artificial CSAM.

I like copilot but that's because I'm fine with the guardrails and I'm not trying to make it do anything out of its general scope. I also like how it's covered by an enterprise privacy agreement which was a huge issue with people using ChatGPT and feeding it all kinds of private info.

[-] abhibeckert@beehaw.org 16 points 2 years ago

"how do I build a bomb” or “how do I make napalm"

... or you could just look them up on wikipedia.

[-] DaDragon@kbin.social 7 points 2 years ago

Almost everything you said, with the exception of AI CSAM and suicide prevention, can hardly be considered a serious issue.

What’s wrong with searching for how to make a bomb? If you have the wish to research it, you can probably make a bomb just by going to a public library and reading enough. The knowledge is out there anyway

[-] tal@lemmy.today 16 points 2 years ago

Many members of the team, including a growing contingent fearful of the existential risk of more-advanced AI models, felt uncomfortable with how quickly GPT-4 had been launched and integrated widely into other products.

GPT-4 and anything similar isn't going to pose an existential threat to humanity.

Eventually, yeah, there is probably a possibility of existential risk from AI. I don't know where that line ultimately is, and getting an idea of that might be something important for humanity to figure out, but I am pretty confident that whatever OpenAI is presently doing isn't it.

Same reason that Musk and his six month moratorium on AI work doesn't make much sense. We're not six months away from an existential threat to humanity.

I think that funding efforts to have people in the field working on the Friendly AI problem is a good idea. But that's another story.

[-] Quasari@programming.dev 15 points 2 years ago

The apps using GPT4 without regards to safety can be though. Example: replacing human with chatbot for suicide prevention.

[-] tal@lemmy.today 7 points 2 years ago

Being an existential threat is a much higher bar -- that's where humanity's continued existence is at threat.

There are plenty of technologies that you could hypothetically put somewhere where a life might be at stake, but very few that could put humanity's existence on the line.

[-] brothershamus@kbin.social 4 points 2 years ago

It's the same situation, just writ large. Dumb human decisions to put AI where it shouldn't be. Heck, you can put it in charge of the nuclear missles now if you want to. Don't. Though. That'd be really, really stupid.

Part of my knee-jerk dislike of the AI hype is that it's glorified text completion. It doesn't know shit. It only knows the % chance of your saying the next word. AGI is not happening anytime soon and all this is techbro theatre for the sake of money.

Anyone who reads a wall of bland generated text and thinks we're about to talk to god is seriously mistaken.

[-] jcarax@beehaw.org 15 points 2 years ago* (last edited 2 years ago)

I'm much more worried about the social implications. Namely, the displacement of workers and introduction of new efficiencies to workflows, continuing to benefit only those who are rich and in power, and driving more of us towards poverty.

It's not an immediate existential threat, but it's absolutely a serious issue that we aren't paying enough attention to.

[-] cwagner@beehaw.org 14 points 2 years ago

They believed that the AI safety work they had done was insufficient.

Considering that every new model seems to be getting worse for anything but highly sanitized corporate usage, I’m not sure that I want more AI safety …

For my usage, I use Chat GPT 3.5 turbo with the march checkpoint because I can’t get the current one to stop moralizing about bullshit instead of doing what it’s supposed to (I run two twitch bots with it). GPT4 used to be okay there, but the new preview is now starting to have the same issue with more frequent "I can’t do that Dave"-style answers, though it’s still mostly circumventable with enough prompt massaging, but it is getting harder.

In a year, I don’t see anything but self-hosted models usable for anything not corporate glitz if trajectories hold, so fuck all that AI safety.

[-] CosmoNova@feddit.de 5 points 2 years ago* (last edited 2 years ago)

On top of all of this, those efforts to tame and control outputs from the developer side could be abused to simply appease investors or totalitarian markets. So we might see a Disneyfication like we‘re seeing on other platforms like Youtube with their horrendous filters, spawning ridiculous terms like „unlifed“. And just imagine the level of censorship we‘d see if they ever try to get into the Chinese market because clearly, the ‚non‘ in non-profit is becoming more and more silent.

[-] canis_majoris@lemmy.ca 4 points 2 years ago

It's already easy to self host and we've optimized LLMs to run locally on not much serious hardware after we've trained them; I have GPT4ALL set up on my machine and it runs everything locally with my processor, no GPU or anything. Some of those datasets are uncensored, and I've seen what Stable Diffusion can do for image generation.

I tend to use the GPT-4 built into Edge with my O365 corporate plan, because it suits my needs better for day-to-day challenges. It can still audit code and summarize things, which is all I really need it to do here and there.

[-] cwagner@beehaw.org 5 points 2 years ago

Nothing that runs on my GPU / CPU comes even close to GPT 3.5, GPT4 is not even in the same universe, and that’s with them running far more slowly.

[-] RandoCalrandian@kbin.social 1 points 2 years ago

In my tests, the self hosted options that have access to a 30xx or 40xx graphics card return results far faster than gpt4

[-] cwagner@beehaw.org 1 points 2 years ago

Which model are you talking about?

[-] RandoCalrandian@kbin.social 1 points 2 years ago

Mistral for chatgpt, and i'm not saying it gives better answers, just that it's much faster than my web portal to gpt4

[-] cwagner@beehaw.org 1 points 2 years ago

Oh, faster is easy. GPT 3.5 is also far faster than GPT 4. Faster at quality replies is the issue.

[-] RandoCalrandian@kbin.social 4 points 2 years ago

Pulled up a self hosted option last week to try it out. It’s not gpt4 level, but it’s damn close and I don’t worry giving access to my local documents

PrivateGPT for anyone interested

[-] cwagner@beehaw.org 3 points 2 years ago

That’s an interface for models. Which model did you use?

[-] RandoCalrandian@kbin.social 3 points 2 years ago

Mistral-7B-Instruct-v0.1 is the default, i'm downloading the Llama2 model to test it with now, but many models on HuggingFace should still work

[-] cwagner@beehaw.org 1 points 2 years ago

I do not believe any 7B model comes even close to 3.5 in quality. I used LLama V1 64B, and it was horrible in comparison. Are you really telling me that this tiny model gives better general answers? Or am I just misunderstanding what you are saying?

[-] RandoCalrandian@kbin.social 1 points 2 years ago

I didn’t say better, I said comparable
And faster, without handing over my data and conversations for monetization

Given the locally hosted benefits, and the ability to go to chatgpt for any answer minstrel gives that doesn’t satisfy you, makes it strong competition to chatgpt as the default tool

Hosting it yourself also means you can swap llm’s out based on context and what they’re trained on. Highly tuned models perform better than chatgpt at the things they are meant to excel in.

[-] cwagner@beehaw.org 1 points 2 years ago* (last edited 2 years ago)

Prompt:

I’m currently trying to show on the Website Beehaw, that certain LLMs are far superior in writing than others. Examples of what bigger models do better than smaller ones: *

Mistral-7B-Instruct-v0.1

ntire articles* vs Headlines Descriptions vs Product titles *Bul

GPT 3.5-Turbo doesn’t support completion as it’s for chat, so I used an even worse one, text-davinci-003 which is far behind state of the art.

Bigger models are able to handle more complex and detailed tasks with ease

Bigger models are better suited for natural language understanding and text processing

Bigger models are able to learn more accurate representations of context, thus improving the precision of the output

Bigger models can process data more quickly and efficiently, saving time and processing power when large volumes of data are used

Bigger models can better recognize more subtle nuances in language, which allows them to produce more accurate results

Bigger models are able to use more sophisticated algorithms, resulting in a more comprehensive and deeper understanding of the data being used

Mistral 7B might be okay for some very specific cases, but it’s not comparable to proper models at all.

edit: gave it a second chance, it’s a bit better (at least no complete nonsense anymore), but still terrible writing and doesn’t make much sense

Paraphrasing The ability of a language model to generate text that has a similar meaning to the original text is called paraphrasing. This is a very common problem in natural language processing, and many LLMs are designed to be able to paraphrase text. However, there are some LLMs that are particularly good at paraphrasing, and these models are often preferred over smaller models because of their ability to generate more varied and unique text. Examples of LLMs that are known for their paraphrasing abilities include GPT-2 and transformers. These models

this post was submitted on 20 Nov 2023

148 points (100.0% liked)

Technology

40941 readers

503 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org