Terence Tao: GPT-O1 nearing "competent grad student" usefulness (mathstodon.xyz)
from jsomae@lemmy.ml to technology@lemmy.world on 14 Sep 2024 23:31
https://lemmy.ml/post/20306353

The experience seemed roughly on par with trying to advise a mediocre, but not completely incompetent, graduate student. However, this was an improvement over previous models, whose capability was closer to an actually incompetent graduate student. It may only take one or two further iterations of improved capability (and integration with other tools, such as computer algebra packages and proof assistants) until the level of “competent graduate student” is reached, at which point I could see this tool being of significant use in research level tasks.

#technology

qooqie@lemmy.world on 14 Sep 2024 23:47

Using GPT without appearing like an idiot takes a competent grad student

jsomae@lemmy.ml on 15 Sep 2024 16:03

This I can believe tbh. It’s a very useful tool in the hands of an expert. Otherwise it’s like giving a chimp a gun.

Maybe this is why I am surprised at people’s hatred of ChatGPT. It’s born of misuse of a tool for experts, like newcomers struggling with a C++ compiler error.

jdeath@lemm.ee on 15 Sep 2024 17:15

hey now let’s be fair here, people hate C++ too

dinckelman@lemmy.world on 15 Sep 2024 00:12

I genuinely hate this statement. A competent grad student can solve problems. GPT cannot solve anything, as all it does is put together the shit it stole from somewhere before

NegentropicBoy@lemmy.world on 15 Sep 2024 00:42

O1 is (apparently) different according to some videos I watched, as it pulls apart the question and does some reasoning steps.

aodhsishaj@lemmy.world on 15 Sep 2024 01:39

I’d love to see one of those videos

jsomae@lemmy.ml on 15 Sep 2024 15:58

like, a video of Tao giving a demonstration?

aodhsishaj@lemmy.world on 15 Sep 2024 16:30

@NegentropicBoy

O1 is (apparently) different according to some videos I watched, as it pulls apart the question …

Yes

technocrit@lemmy.dbzer0.com on 15 Sep 2024 15:41

does some reasoning steps.

The people who believe in “AI” say the wackiest things.

jsomae@lemmy.ml on 15 Sep 2024 15:55

LLMs are basically just good pattern matchers. But just like how A* search can find a better path than a human can by breaking the problem down into simple steps, so too can an LLM make progress on an unsolved problem if it’s used properly and combined with a formal reasoning engine.
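For anyone who hasn't seen A* before, here is a minimal sketch of the "simple steps" idea on a small grid. The grid layout, the `a_star` name, and the Manhattan-distance heuristic are illustrative assumptions, not something taken from the linked post.

```python
import heapq


def a_star(grid, start, goal):
    """Return a shortest 4-connected path on a grid of 0 (free) / 1 (wall) cells.

    Returns a list of (row, col) cells from start to goal, or None if the
    goal is unreachable. Illustrative sketch only.
    """
    def heuristic(cell):
        # Manhattan distance never overestimates unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_heap = [(heuristic(start), 0, start)]  # (estimated total, cost so far, cell)
    came_from = {}
    best_cost = {start: 0}

    while open_heap:
        _, cost, cell = heapq.heappop(open_heap)
        if cell == goal:
            # Walk the parent links back to the start.
            path = [cell]
            while cell in came_from:
                cell = came_from[cell]
                path.append(cell)
            return path[::-1]
        if cost > best_cost[cell]:
            continue  # stale heap entry; a cheaper route was already found
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best_cost.get(nxt, float("inf")):
                    best_cost[nxt] = new_cost
                    came_from[nxt] = cell
                    heapq.heappush(open_heap, (new_cost + heuristic(nxt), new_cost, nxt))
    return None
```

Each iteration only does something trivial (pop the most promising cell, look at its neighbours), yet the loop as a whole is guaranteed to find a shortest path when the heuristic is admissible.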

I’m going to be real with you: the big insight behind almost all new mathematical ideas is based on the math that came before. Nothing is truly original the way AI detractors seem to believe.

By “does some reasoning steps,” OpenAI presumably are just invoking the LLM iteratively so that it can review its own output before providing a final answer. It’s not a new idea.
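A rough sketch of what that kind of loop could look like. This is a guess at the general draft, critique, revise pattern and not OpenAI's actual implementation; `refine`, `toy_model`, and the prompt strings are all made up for illustration, and `toy_model` is a stand-in for a real model call.

```python
from typing import Callable


def refine(question: str, model: Callable[[str], str], max_rounds: int = 3) -> str:
    """Draft an answer, then let the same model critique and revise it.

    `model` is any function mapping a prompt string to a reply string
    (hypothetical interface, chosen for this sketch).
    """
    draft = model(f"Question: {question}\nAnswer:")
    for _ in range(max_rounds):
        critique = model(
            f"Question: {question}\nDraft: {draft}\nList any errors, or reply OK."
        )
        if critique.strip() == "OK":
            break  # the model found nothing left to fix
        draft = model(
            f"Question: {question}\nDraft: {draft}\n"
            f"Critique: {critique}\nRevised answer:"
        )
    return draft


def toy_model(prompt: str) -> str:
    """Hard-coded stand-in for an LLM so the loop can run offline."""
    if prompt.endswith("Revised answer:"):
        return "4"
    if "reply OK" in prompt:
        return "OK" if "Draft: 4" in prompt else "2 + 2 is 4, not 5"
    return "5"  # deliberately wrong first draft


if __name__ == "__main__":
    print(refine("What is 2 + 2?", toy_model))
```

The loop stops as soon as the critique pass comes back clean, or after `max_rounds` attempts, so a model that never approves its own answer can't spin forever.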

tee9000@lemmy.world on 16 Sep 2024 13:05

It’s what ChatGPT calls it.

ContrarianTrail@lemm.ee on 15 Sep 2024 04:44

Aren’t the grad students similarly trained on books that other people wrote?

werefreeatlast@lemmy.world on 16 Sep 2024 11:45

Didn’t you steal great students before from somewhere?

magic_lobster_party@fedia.io on 15 Sep 2024 09:20

Isn’t problem solving mostly putting together things you’ve learned before?

technocrit@lemmy.dbzer0.com on 15 Sep 2024 15:41

This tells you much much more about how graduate students are treated in academia than anything about “AI”.

jsomae@lemmy.ml on 15 Sep 2024 15:52

I do agree that grad students don’t exactly live in luxury, and frequently develop mental health crises. But their contributions and insight are what power their labs. Profs often have to spend so much time teaching and chasing grants that they can’t do much real research. Academia overall is in a sad state.

But Tao is a superstar, and a charismatic blogger. I’d be disappointed to learn he mistreats his grad students. (I don’t know if he even has any tbh)