OopsGPT - OpenAI just announced a new search tool. Its demo already got something wrong. (www.theatlantic.com)
from jeffw@lemmy.world to technology@lemmy.world on 26 Jul 2024 04:07
https://lemmy.world/post/17961641

#technology

threaded - newest

Bell@lemmy.world on 26 Jul 2024 04:29 next collapse

The hallucinations will continue until the training data is absolutely perfect

NeoNachtwaechter@lemmy.world on 26 Jul 2024 05:04 next collapse

In order to get perfect training data, they cannot use any human output.

I’m afraid it is not going to happen anytime soon :)

Orbituary@lemmy.world on 26 Jul 2024 06:07 next collapse

What other output do you propose?

NeoNachtwaechter@lemmy.world on 26 Jul 2024 07:07 collapse

What other output do you propose?

I do not propose, and it is not neccessarily any output.

Their first question is, what do they want the AI to do. And if they want it to be perfect, then they need to use perfect training data, not human output.

Petter1@lemm.ee on 26 Jul 2024 08:17 next collapse

This is exactly why apple uses API for apps to give well structured data as context instead of random screenshot data

[deleted] on 26 Jul 2024 13:04 collapse

.

kokesh@lemmy.world on 26 Jul 2024 06:27 collapse

I’ve started editing my answers/questions on StackExchange. Few characters at a time. I’m doing my part.

NeoNachtwaechter@lemmy.world on 26 Jul 2024 07:08 next collapse

Are you improving it, or do you create new errors? ;-)

metaStatic@kbin.earth on 26 Jul 2024 07:21 next collapse

yes

kokesh@lemmy.world on 26 Jul 2024 07:31 collapse

Errors. I rewrote all my stuff up there with Fuck you OpenAI or something pike that in spring and got banned fo 3 months. So after I got a reminder in my calendar that 3 months are up I got to work a little bit more sophisticatedly.

NeoNachtwaechter@lemmy.world on 26 Jul 2024 09:20 collapse

LOL I like that :-)

otp@sh.itjust.works on 26 Jul 2024 11:58 collapse

Won’t that also make things worse for people looking for answers?

superkret@feddit.org on 26 Jul 2024 18:37 collapse

marked as duplicate, closed

magic_lobster_party@kbin.run on 26 Jul 2024 06:29 next collapse

Most improvements in machine learning has been made by increasing the data (and by using models that can generalize larger data better).

Perfect data isn’t needed as the errors will “even out”. Although now there’s the problem that most new content on the Internet is low quality AI garbage.

NeoNachtwaechter@lemmy.world on 26 Jul 2024 07:10 collapse

Perfect data isn’t needed as the errors will “even out”.

That is an assumption.

I do not think that it is a correct assumption.

now there’s the problem that most new content on the Internet is low quality AI garbage.

This reminds me about a recommendation from some philosopher - I forgot who it was - he said that you should read only such books that are at least 100 years old.

magic_lobster_party@kbin.run on 26 Jul 2024 07:43 collapse

I’m extrapolating from history.

15 years ago people made fun of AI models because they could mistake some detail in a bush for a dog. Over time the models became more resistant against those kinds of errors. The change was more data and better models.

It’s the same type of error as hallucination. The model is overly confident about a thing it’s wrong about. I don’t see why these types of errors would be any different.

NeoNachtwaechter@lemmy.world on 26 Jul 2024 09:44 collapse

I don’t see why these types of errors would be any different.

Well it is easy to see when you understand what LLMs actually do and how it is different from what humans do. Humans have multiple ways to correct errors and we do it all the time, intuitively. LLMs have none of these ways, they can only repeat their training (and not even hope for the best, because to hope is human again)

hendrik@palaver.p3x.de on 26 Jul 2024 08:00 next collapse

That's not correct btw. AI is supposed to be creative and come up with new text/images/ideas. Even with perfect training data. That creativity means creativity. We want it to come up with new text out of thin air. And perfect training data is not going to change anything about it. We'd need to remove the ability to generate fictional stories and lots of other answers, too. Or come up with an entirely different approach.

bjorney@lemmy.ca on 26 Jul 2024 20:38 collapse

AI isn’t supposed to be creative, it’s isn’t even capable of that. It’s meant to min/max it’s evaluation criterion against a test dataset

It does this by regurgitating the training data associated with a given input as closely as possible

hendrik@palaver.p3x.de on 27 Jul 2024 07:09 collapse

I've heard people saying that before. But it's not true. You can ask an AI to draw you an astronaut on a horse and it'll do it despite never having seen such picture. (Now it has.) Same applies to LLMs. They come up with an answer to your exact question. Not a similar one it saw on Reddit before. That answer might be wrong (which is my point) but if you try it, you'll regularly find it tries answering your questions and not different ones.

I've also tried some scifi storywriting with AI and there it becomes quite obvious that it's able to apply things it knows from different contexts and apply that to my setting. Like ethics questions, basic physics and what character can and cannot do. Rough knowledge about how stories are written. You can tell it to do a plot twist an an arbitrary point and it'll do. All of that is knowledge about (abstract) concepts and the ability to apply it to different contexts. Which is an important part of creativity.

And I've read papers where the scientists try to look inside of AI and they are able to spot abstract concepts like what a cat is in the weights. It's fascinating how it works. And it turns out it's not just regurgitating it's training data. Which isn't surprising because a lot of effort has been put into the computer science behind it to make AI more than that. And it's also why they're useful in the first place.

LunarLoony@lemmy.sdf.org on 27 Jul 2024 08:02 collapse

It’s able to apply those things because it’s read millions of sci-fi stories, and can make an educated guess. It’s also able to produce an image od an astronaut on a horse because it’s seen lots of images of astronauts and horses, and people sitting on horses, so it can once again make an educated guess. I don’t think it’s right to call that creativity.

hendrik@palaver.p3x.de on 27 Jul 2024 08:46 collapse

Isn't that like 60% of what creativity is? Art sometimes is about combining things in a new way. I mean it's rare anyways that one genius comes up with an entirely new concept like scifi stories or pop art and invents that genre out of thin air. Most of the times also humans take something that already exists and build upon that. It's not that far off here. And I doubt a human can draw a "rkbvrpoi" on a "wuqrkah" and not take inspiration from ...anything.

I mean obviously there is something missing. Some human told it to draw that astronaut. So the whole artwork contains that original creativity that didn't come from the AI. But I think it's debatable wheter it could do it. This is only one specific example

LunarLoony@lemmy.sdf.org on 27 Jul 2024 10:46 collapse

Yes, a lot of creativity is fuelled by inspiration. One can’t create much in a bubble. However: I could draw a rkbvrpoi, and my human intuition enables me to consider what such a thing might look like. I can make it up and make something feasible of it. I can give it a history, I can place it in culture (or maybe it is itself a culture), and I can do whatever I want. Yes, that requires some level of inspiration, and drawing from what I’ve observed and experienced would make the rkbvrpoi a lot more believable - if that’s my goal.

A so-called AI can’t do any of that, and even if it could, it would be meaningless and soulless.

hedgehog@ttrpg.network on 26 Jul 2024 18:23 collapse

Hallucinations are an unavoidable part of LLMs, and are just as present in the human mind. Training data isn’t the issue. The issue is that the design of the systems that leverage LLMs uses them to do more than they should be doing.

I don’t think that anything short of being able to validate an LLM’s output without running it through another LLM will be able to fully prevent hallucinations.

[deleted] on 26 Jul 2024 06:48 next collapse

.

noodlejetski@lemm.ee on 26 Jul 2024 07:46 collapse

what does a web browser have to do with a search engine?

Warl0k3@lemmy.world on 26 Jul 2024 09:08 next collapse

Edge comes pre-enabled with a ton of microsoft’s crappy AI - Bing chat, copilot, etc.

[deleted] on 26 Jul 2024 09:48 collapse

.

dabster291@lemmy.zip on 26 Jul 2024 17:58 next collapse

me when paywalls

dumbass@leminal.space on 27 Jul 2024 07:29 collapse

archive.is/mC8kA

dabster291@lemmy.zip on 27 Jul 2024 08:22 collapse

👍

riodoro1@lemmy.world on 27 Jul 2024 11:12 collapse

And yet people still use those bullshit generators and call their bullshit „hallucinations”.

We broke the internet for this.

notastatist@feddit.org on 28 Jul 2024 06:27 collapse

The internet was broke before.