Some AI models get more accurate at maths if you ask them to respond as if they are a Star Trek character, ML engineers say (www.businessinsider.com)
from L4s@lemmy.world to technology@lemmy.world on 01 Mar 2024 18:00
https://lemmy.world/post/12604937

Some AI models get more accurate at maths if you ask them to respond as if they are a Star Trek character, ML engineers say::Researchers asking a chatbot to optimize its own prompts found it was best at solving grade-school math when acting like it was on Star Trek.

#technology

threaded - newest

autotldr@lemmings.world on 01 Mar 2024 18:00 next collapse

This is the best summary I could come up with:


A study attempting to fine-tune prompts fed into a chatbot model found that, in one instance, asking it to speak as if it were on Star Trek dramatically improved its ability to solve grade-school-level math problems.

“It’s both surprising and irritating that trivial modifications to the prompt can exhibit such dramatic swings in performance,” the study authors Rick Battle and Teja Gollapudi at software firm VMware in California said in their paper.

“Among the myriad factors influencing the performance of language models, the concept of ‘positive thinking’ has emerged as a fascinating and surprisingly influential dimension,” Battle and Gollapudi said in their paper.

Their study found that in almost every instance, automatic optimization always surpassed hand-written attempts to nudge the AI with positive thinking, suggesting machine learning models are still better at writing prompts for themselves than humans are.

The prompt then asked the AI to include these words in its answer: “Captain’s Log, Stardate [insert date here]: We have successfully plotted a course through the turbulence and are now approaching the source of the anomaly.”

Axel Springer, Business Insider’s parent company, has a global deal to allow OpenAI to train its models on its media brands’ reporting.


The original article contains 820 words, the summary contains 198 words. Saved 76%. I’m a bot and I’m open source!

veeesix@lemmy.ca on 01 Mar 2024 18:09 next collapse

*Picks up wireless mouse* Hello, computer.

AllonzeeLV@lemmy.world on 01 Mar 2024 19:03 next collapse

<img alt="" src="https://lemmy.world/pictrs/image/2235df5f-7f68-4be6-a7fe-52f4dfa1f76f.png">

Thank you Scotty!

en.m.wikipedia.org/wiki/Aluminium_oxynitride

antidote101@lemmy.world on 01 Mar 2024 19:12 next collapse

Can you explain this reference for me? I do not understand.

SpaceNoodle@lemmy.world on 01 Mar 2024 19:17 next collapse

Go watch Star Trek IV first

antidote101@lemmy.world on 01 Mar 2024 20:44 collapse

I’ve seen it, but it was a while ago.

Undiscovered Countries was my favourite TOS movie, because it covered the historically important Khitomer Accords.

SpaceNoodle@lemmy.world on 01 Mar 2024 21:19 collapse

Then go watch it second

antidote101@lemmy.world on 01 Mar 2024 21:24 collapse

I will, but only for Spock’s beanie.

[deleted] on 01 Mar 2024 19:28 next collapse

.

betterdeadthanreddit@lemmy.world on 01 Mar 2024 19:34 collapse

It’s a reference to this scene from Star Trek IV: The Voyage Home (1986).

Explanation without video

Scotty, having traveled back in time to the year 1986 as part of a mission to rescue some whales, attempts to use a computer by speaking to it and then mistakenly tries to use the mouse as a microphone when the machine does not respond. He is prompted to use the keyboard instead of verbal commands and gives information on how to manufacture transparent aluminum. This material was not invented until about 150 years later according to the pre-trip history of the Star Trek future but Scotty has given it a head start.

SlopppyEngineer@lemmy.world on 01 Mar 2024 20:34 next collapse

Helping people with their work through teams has taught me that voice control is a disaster to get anything done for anything other than just dictating text.

antidote101@lemmy.world on 01 Mar 2024 20:41 collapse

Oh, thank you for the lengthy explainer.

All I have in return is this fairly interesting video detailing one of the ways we’ve already found transparent metals. Perhaps over the next 150 years we’ll be able to stabilise the material structure.

Thanks again for explaining.

PipedLinkBot@feddit.rocks on 01 Mar 2024 20:41 collapse

Here is an alternative Piped link(s):

this fairly interesting video

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I’m open-source; check me out at GitHub.

ComplexLotus@lemmy.world on 01 Mar 2024 19:19 next collapse

is this … transparent Aluminium?

AllonzeeLV@lemmy.world on 01 Mar 2024 19:23 collapse

They did it. The crazy son’s of bitches did it! Quite awhile ago, it’s commercially available.

davidgro@lemmy.world on 01 Mar 2024 19:35 collapse

There is also This transparent aluminum (linked in that same article) and it’s been used in phone/watch screens also.

gravitas_deficiency@sh.itjust.works on 01 Mar 2024 20:54 collapse

Ah yes. Star Trek: The One With The Whales.

vvv@programming.dev on 01 Mar 2024 19:43 next collapse

<img alt="" src="https://programming.dev/pictrs/image/d2269913-1b64-4a0b-9274-a6c10bde90cf.webm">

Wogi@lemmy.world on 02 Mar 2024 00:41 collapse

Are you sure?!

ech@lemm.ee on 01 Mar 2024 20:46 next collapse

“It might just take a while to get the pitch right” www.youtube.com/watch?v=uyV0IVItlM4

PipedLinkBot@feddit.rocks on 01 Mar 2024 20:47 next collapse

Here is an alternative Piped link(s):

https://www.piped.video/watch?v=uyV0IVItlM4

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I’m open-source; check me out at GitHub.

veeesix@lemmy.ca on 01 Mar 2024 23:57 collapse

LOL I had completely forgotten about this bit!

gravitas_deficiency@sh.itjust.works on 01 Mar 2024 20:54 collapse

wessels

zerofk@lemm.ee on 01 Mar 2024 18:10 next collapse

“Answer as if you’re a tribble.“

SpaceNoodle@lemmy.world on 01 Mar 2024 19:18 collapse

But then all it can do is multiply.

Rai@lemmy.dbzer0.com on 01 Mar 2024 22:45 collapse

That’s troubling.

driving_crooner@lemmy.eco.br on 01 Mar 2024 18:33 next collapse

Did they try to ask the models to act as Euler, Gauss or Tao?

baseless_discourse@mander.xyz on 02 Mar 2024 12:49 collapse

Tao said he would ask the AI model to pretend to be a colleague before asking it anything. So your suggestion works!

After a long dig: mathstodon.xyz/@tao/110601051375142142

Merlin404@lemmy.world on 01 Mar 2024 19:37 next collapse

Its because it dosent try to answer right, but more what it thinks toy want to see/read 🤷

tomjuggler@lemmy.world on 01 Mar 2024 21:57 next collapse

AI models get more dishonest if you ask them to respond as if they were a pirate: circusscientist.com/…/i-hired-a-pirate-to-take-or…

TrainsAreCool@lemmy.one on 02 Mar 2024 00:39 collapse

If you ask them to respond like a politician they answer all your questions with something completely different.

BigMikeInAustin@lemmy.world on 01 Mar 2024 22:42 next collapse

Doh. This says to have the AI write the prompt for you, but it doesn’t give any examples of doing that.

I don’t want to get into a rabbit hole looking up examples from the wide internet.

FrostyCaveman@lemm.ee on 01 Mar 2024 23:52 next collapse

Reverse the polarity!

cypherpunks@lemmy.ml on 02 Mar 2024 00:32 next collapse

LLM detractors hate this one weird trick

meeeeetch@lemmy.world on 02 Mar 2024 02:03 collapse

We’ve finally figured out how to trick the computer that’s bad at math into being less bad at math.

Rednax@lemmy.world on 02 Mar 2024 10:35 collapse

It is only logical that an algorithm trained on the ways of a Vulcan, is precise and accurate in it operation and communication. Vastly more fascinating are the result when you ask it to behave like a human.