-11
  • In two hacker competitions run by Palisade Research, autonomous AI systems matched or outperformed human professionals in demanding security challenges.
  • In the first contest, four out of seven AI teams scored 19 out of 20 points, ranking among the top five percent of all participants, while in the second competition, the leading AI team reached the top ten percent despite facing structural disadvantages.
  • According to Palisade Research, these outcomes suggest that the abilities of AI agents in cybersecurity have been underestimated, largely due to shortcomings in earlier evaluation methods.
you are viewing a single comment's thread
view the rest of the comments
[-] Speiser0@feddit.org 2 points 1 week ago

An AI agent is just an intelligent agent, see https://en.wikipedia.org/wiki/Intelligent_agent.

Or do you mean that the things they call AI agents aren't actually AI agents?

[-] Tar_alcaran@sh.itjust.works 3 points 6 days ago

I mean, technically, you can call any controlling sensor an "agent". Any if-then loop can be an "agent".

But AI bros mean "A piece of software that can autonomously perform any broadly stated task", and those don't exist in real life. An "AI Agent" is software you can tell to "Order me a pizza", and it will do it to your satisfaction.

An AI agent is software you can tell "Hack that system and retrieve the flag". And it's not that.

this post was submitted on 01 Jun 2025
-11 points (31.0% liked)

Cybersecurity

7419 readers
8 users here now

c/cybersecurity is a community centered on the cybersecurity and information security profession. You can come here to discuss news, post something interesting, or just chat with others.

THE RULES

Instance Rules

Community Rules

If you ask someone to hack your "friends" socials you're just going to get banned so don't do that.

Learn about hacking

Hack the Box

Try Hack Me

Pico Capture the flag

Other security-related communities !databreaches@lemmy.zip !netsec@lemmy.world !securitynews@infosec.pub !cybersecurity@infosec.pub !pulse_of_truth@infosec.pub

Notable mention to !cybersecuritymemes@lemmy.world

founded 2 years ago
MODERATORS