
Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench

Agent Mode in Word turns document creation into vibe writing

“Productivity is our DNA, we’re Office,” Chauhan says. “While others will try to replicate us, there is no substitute for the real thing.”

Somebody vibe kill me please.

[-] Des@hexbear.net 43 points 6 months ago

accuracy rate of 57.2 percent

woaw a coin flip weighted in your favor every 10 flips or so

[-] mayakovsky@hexbear.net 39 points 6 months ago

What if =SUM(A1:A4) was more like gambling than addition

[-] hotcouchguy@hexbear.net 22 points 6 months ago

=SUM(A1:A4)

Well, I've parsed a billion spreadsheets, and usually it's like 12.3 or something

[-] SchillMenaker@hexbear.net 8 points 6 months ago

Starts playing =SUM(41)

[-] RedWizard@hexbear.net 6 points 6 months ago

=SUM(BODY(1,TOLD(ME)))

[-] Beaver@hexbear.net 35 points 6 months ago* (last edited 6 months ago)

Excel and other spreadsheet software are probably the most important pieces of software in the world, responsible for managing the commanding heights of the world economy. They're all getting AI integration, even WPS Office, the domestic Chinese equivalent. And the kind of people who make a lot of spreadsheets are absolutely going to use the fuck out of the AI. So this will have... consequences.

[-] kadu@scribe.disroot.org 19 points 6 months ago

They're all getting AI integration

LibreOffice is not.

[-] greenbit@lemmy.zip 12 points 6 months ago

Maybe this is how capitalism finally crumbles to its faults

[-] ProletarianDictator@hexbear.net 6 points 6 months ago

Can't wait for accountants everywhere to be hallucinating a few missing cells here and there.

[-] BodyBySisyphus@hexbear.net 28 points 6 months ago

Why would you deliberately ship a feature that has a 57% accuracy rate? Are the engineers desperate to appease Ed Zitron's Business Idiots, or have they awakened to the realization that most people with computers don't do productive labor, and anyone who embraces these features probably had a worse-than-57% accuracy rate to begin with?

[-] SupFBI@hexbear.net 14 points 6 months ago

Gotta pump those usage numbers up. Who cares about how useful it actually is?

[-] BodyBySisyphus@hexbear.net 8 points 6 months ago

The Cult of the Next Quarter has finally succeeded in summoning its deity

[-] FloridaBoi@hexbear.net 2 points 6 months ago

You also gotta imagine that they tailored the implementation to succeed in known benchmarks and they still got 57%

[-] BodyBySisyphus@hexbear.net 2 points 6 months ago* (last edited 6 months ago)

Evidently, the benchmarking is based on a set of 912 user questions covering a range of scenarios. I have to assume that a decent chunk of these cover situations where the original spreadsheet was set up poorly or Excel isn't the ideal platform for the desired output. Given that, I wonder if the accuracy metrics even mean anything or if it's just a measurement of how well the software pounds square pegs into round holes.
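
For scale, here is a rough back-of-the-envelope sketch of what those two numbers imply together. It assumes the 57.2 percent figure applies to the full set of 912 questions, which the thread does not confirm; the function name `estimate_counts` is made up for illustration.

```python
# Figures quoted in the thread. Applying the accuracy rate to the
# full question set is an assumption, not something the source states.
TOTAL_QUESTIONS = 912
ACCURACY = 0.572


def estimate_counts(total: int, accuracy: float) -> tuple[int, int]:
    """Return (approx. correct, approx. incorrect) for a given accuracy rate."""
    correct = round(total * accuracy)
    return correct, total - correct


correct, incorrect = estimate_counts(TOTAL_QUESTIONS, ACCURACY)
print(f"~{correct} correct, ~{incorrect} incorrect out of {TOTAL_QUESTIONS}")
```

Under that assumption, roughly 522 questions would be answered correctly and about 390 would not.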

[-] Azarova@hexbear.net 23 points 6 months ago

critical support to microsoft in their efforts to accidentally sabotage businesses around the world with ai slop

[-] chgxvjh@hexbear.net 21 points 6 months ago

Basically large scale industrial sabotage lmao

[-] plinky@hexbear.net 20 points 6 months ago

There have been funny screenshots of Excel floating around; hopefully they will bankrupt a couple of businesses

[-] Are_Euclidding_Me@hexbear.net 20 points 6 months ago

That article reads like satire to me, but I don't think it's supposed to be satire.

Like, the term "vibe coding" is derogatory, right? I certainly thought it was. But the article is using the phrase "vibe coding" as though it's a positive descriptor of a good thing. I don't think I've seen that anywhere else.

Am I wrong? I must be. There must be large numbers of people who use the phrase "vibe coding" and don't mean it to be derogatory. I simply can't fathom that, honestly.

[-] machinya@hexbear.net 2 points 6 months ago

maybe it was coined as derogatory, but it has been a while since the media started taking it seriously, and many corpos even have "vibe coding interviews" or something similar

[-] Pavlichenko_Fan_Club@hexbear.net 20 points 6 months ago

Hey so I tried out this whole 'vibe accounting' thing and now I'm getting a lot of threatening letters in the mail. What gives?

[-] Belly_Beanis@hexbear.net 10 points 6 months ago

"Vibe accounting" just sounds to me like a money laundering scheme lmao

[-] FloridaBoi@hexbear.net 1 points 6 months ago

Every small business is just vibe accounting

[-] DogThatWentGorp@hexbear.net 18 points 6 months ago

I look forward to the carnage reports ig. Let's see this bad boy make datasets unusable because it stealthily reformatted something while doing a simple operation.

[-] FloridaBoi@hexbear.net 2 points 6 months ago

Randomly decides to flip the sign in a single cell causing an airplane to crash 4 years later

[-] LaGG_3@hexbear.net 15 points 6 months ago

What if that one annoying coworker who copies and pastes fucked up versions of formulas from your report drafts was a computer program

[-] RedWizard@hexbear.net 8 points 6 months ago

I'm getting closer and closer to advocating for a full departure from Windows at my job. I might have the clout to pull it off.

this post was submitted on 07 Oct 2025

technology