55
submitted 2 months ago by jsomae@lemmy.ml to c/technology@lemmy.world

The experience seemed roughly on par with trying to advise a mediocre, but not completely incompetent, graduate student. However, this was an improvement over previous models, whose capability was closer to an actually incompetent graduate student. It may only take one or two further iterations of improved capability (and integration with other tools, such as computer algebra packages and proof assistants) until the level of "competent graduate student" is reached, at which point I could see this tool being of significant use in research level tasks.

you are viewing a single comment's thread
view the rest of the comments
[-] NegentropicBoy@lemmy.world 24 points 2 months ago

O1 is (apparently) different according to some videos I watched, as it pulls apart the question and does some reasoning steps.

[-] aodhsishaj@lemmy.world 16 points 2 months ago

I'd love to see one of those videos

[-] jsomae@lemmy.ml 2 points 2 months ago

like, a video of Tao giving a demonstration?

[-] aodhsishaj@lemmy.world 2 points 2 months ago

@NegentropicBoy English20•

O1 is (apparently) different according to some videos I watched, as it pulls apart the question ...

Yes

[-] technocrit@lemmy.dbzer0.com 2 points 2 months ago

does some reasoning steps.

The people who believe in "AI" say the wackiest things.

[-] jsomae@lemmy.ml 3 points 2 months ago* (last edited 2 months ago)

LLMs are basically just good pattern matchers. But just like how A* search can find a better path than a human can by breaking the problem down into simple steps, so too can an LLM make progress on an unsolved problem if it's used properly and combined with a formal reasoning engine.

I'm going to be real with you: the big insight behind almost all new mathematical ideas is based on the math that came before. Nothing is truly original the way AI detractors seem to believe.

By "does some reasoning steps," OpenAI presumably are just invoking the LLM iteratively so that it can review its own output before providing a final answer. It's not a new idea.

[-] tee9000@lemmy.world 1 points 2 months ago

Its what chaptgpt calls it.

this post was submitted on 14 Sep 2024
55 points (80.2% liked)

Technology

59414 readers
1000 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS