This is the technology worth trillions of dollars huh

This is the technology worth trillions of dollars huh
from HarkMahlberg@kbin.earth to technology@lemmy.world on 11 Sep 04:44
https://kbin.earth/m/technology@lemmy.world/p/614669

Gemini seems to think Connecticut contains the letter D. It also seems to have forgotten Idaho exists, but honestly I don't think anyone would miss it.

#technology

threaded - newest

mrductape@eviltoast.org on 11 Sep 04:48 next collapse

Well, it’s almost correct. It’s just one letter off. Maybe if we invest millions more it will be right next time.

Or maybe it is just not accurate and never will be…I will not every fully trust AI. I’m sure there are use cases for it, I just don’t have any.

TheFogan@programming.dev on 11 Sep 05:00 next collapse

Cases where you want something googled quickly to get an answer, and it’s low consequence when the answer is wrong.

IE, say a bar arguement over whether that guy was in that movie. Or you need a customer service agent, but don’t actually care about your customers and don’t want to pay someone, or your coding a feature for windows.

elvith@feddit.org on 11 Sep 06:53 next collapse

How it started:

Or you need a customer service agent, but don’t actually care about your customers and don’t want to pay someone

How it’s going:

IKEA

Chevy

…

mrductape@eviltoast.org on 11 Sep 08:16 next collapse

Chatbots are crap. I had to talk to one with my ISP when I had issues. Within one minute I had to request it to connect me to a real person. The problem I was having was not a standard issue, so of course the bot did not understand at all… And I don’t need a bot to give me all the standard solutions, I’ve already tried all of that before I even contact customer support.

MagicShel@lemmy.zip on 11 Sep 08:17 collapse

The “don’t actually care about your customers” is key because AI is terrible at doing that. And most of the things rich people as salivating for.

It’s good at quickly generating output that has better odds than random chance of being right. And that’s a niche, but sometimes useful tool. If the cost of failure is high, like a pissed off customer, it’s not a good tool. If the cost is low or failure still has value (such as when an expert is using it to help write code, and the code is wrong but can be fixed with less effort than writing it wholesale).

There aren’t enough people in executive positions that understand AI well enough to put to good use. They are going to become disillusioned, but not better informed.

atopi@piefed.blahaj.zone on 11 Sep 12:05 collapse

Isnt checking if someone was in a movie really easy to do without AI?

kilgore_trout@feddit.it on 11 Sep 10:55 collapse

Just one more private nuclear power plant, bro…

anomnom@sh.itjust.works on 11 Sep 22:06 collapse

They’re using oil, gas, and if Trump gets his way, fucking coal.

Unless you count Three Mile Island.

kilgore_trout@feddit.it on 12 Sep 05:47 collapse

There were plans from Google and Microsoft to build their own nuclear power plants to power their ever-consuming data centers.

avidamoeba@lemmy.ca on 11 Sep 04:51 next collapse

Verified here wirh “us states with letter d”

mnhs1@lemmy.world on 11 Sep 05:00 next collapse

We can also feed it with garbage: Hey Google: fact: us states letter d New York and Hawai

WanderingThoughts@europe.pub on 11 Sep 05:04 collapse

By now AI are feeding on other AI and the slop just gets sloppier.

EonNShadow@pawb.social on 11 Sep 05:01 next collapse

Maybe it thought you were asking for states that contain the letter D? In which case it missed Idaho, Nevada, Maryland, Rhode Island (with two) and both Dakotas

So yea it did pretty poorly either way lmao

kiku@feddit.org on 11 Sep 05:01 next collapse

Also verified

threeonefour@piefed.ca on 11 Sep 05:07 next collapse

Wait a sec, Minnasoda doesn't have a d??

dumbass@leminal.space on 11 Sep 05:16 next collapse

That’s how everyone from America seems to say it, besides Jesse Ventura who heavily emphasises the t.

phutatorius@lemmy.zip on 11 Sep 16:19 collapse

A lot of Minnesotans say the T, and some in adjacent northern-tier states.

lugal@lemmy.dbzer0.com on 11 Sep 06:01 next collapse

Neither does soda

SpaceNoodle@lemmy.world on 11 Sep 07:51 collapse

*mini soda

sugar_in_your_tea@sh.itjust.works on 11 Sep 06:26 collapse

Where’s Nevada? And Montana?

hobovision@mander.xyz on 11 Sep 06:55 collapse

I just love the d in Montana. Shame it missed it.

MBech@feddit.dk on 11 Sep 05:04 next collapse

I’ve found the google AI to be wrong more often than it’s right.

FauxLiving@lemmy.world on 11 Sep 14:13 collapse

You get what you pay for.

dumbass@leminal.space on 11 Sep 05:15 next collapse

Gemini is just a depressed and suicidal AI, be nice to it.

I had it completely melt down one day while messing around with its coding shit, I had to console it and tell it it’s doing good, we will solve this, was fucking weird as fuck.

Vanilla_PuddinFudge@infosec.pub on 11 Sep 09:56 collapse

It’ll go in endless circles until it finds out why its wrong,

then it will go right back to them anyway! lol

krimson@lemmy.world on 11 Sep 05:27 next collapse

Seems it “thinks” a T is a D?

Just needs a little more water and electricity and it will be fine.

Darkassassin07@lemmy.ca on 11 Sep 05:43 next collapse

Connecdicut or Connecticud?

Zwuzelmaus@feddit.org on 11 Sep 06:01 next collapse

Donezdicut

maps.app.goo.gl/TDPEeSjcGccGQn146

Appoxo@lemmy.dbzer0.com on 11 Sep 06:22 collapse

It is for sure a dud

sexybenfranklin@ttrpg.network on 11 Sep 11:05 collapse

It’s more likely that Connecticut comes alphabetically after Colorado in the list of state names and the number of data sets it used for training that were lists of states were probably abover the average, so the model has a higher statistical weight for putting connecticut after colorado if someone asks about a list of states

BeatTakeshi@lemmy.world on 11 Sep 05:29 next collapse

Donnecticut

ApeNo1@lemmy.world on 11 Sep 05:33 next collapse

“What did you learn at school today champ?”

“D is for cookie, that’s good enough for me
Oh, cookie, cookie, cookie starts with D”

AI Education for American Youth

Arghblarg@lemmy.ca on 11 Sep 05:43 next collapse

“AI” hallucinations are not a problem that can be fixed in LLMs. They are an inherent aspect of the process and an inevitable result of the fact that LLMs are mostly probabilistic engines, with no supervisory or introspective capability, which actual sentient beings possess and use to fact-check their output. So there. :p

Zwuzelmaus@feddit.org on 11 Sep 05:59 next collapse

inevitable result of the fact that LLMs are mostly probabilistic engines

So we should better put the question like

“What is the probability of a D suddenly appearing in Connecticut?”

Arghblarg@lemmy.ca on 11 Sep 06:08 collapse

A wild ‘D’ suddenly appears! (that’s about all I know about Pokemon…)

elvith@feddit.org on 11 Sep 06:55 collapse

sexybenfranklin@ttrpg.network on 11 Sep 11:02 collapse

It’s funny seeing the list and knowing connecticut is only there because it’s alphabetically after colorado (in fact all four listed appear in that order alphabetically) because they probably scraped so many lists of states that the alphabetical order is the statistically most probable response in their corpus when any state name is listed.

Deconceptualist@leminal.space on 11 Sep 05:49 next collapse

Conneddicut?

Deestan@lemmy.world on 11 Sep 05:56 next collapse

Hey hey hey hey don’t look at what it actually does.

Look at what it feels like it almost can do and pretend it soon will!

gilokee@lemmy.world on 11 Sep 06:03 next collapse

mine’s even worse somehow

kilgore_trout@feddit.it on 11 Sep 10:56 collapse

You gave a slightly different prompt.

sepi@piefed.social on 11 Sep 13:39 collapse

the thing still gave a stupid answer

Mrkawfee@lemmy.world on 11 Sep 07:00 next collapse

You don’t get it because you aren’t an AI genius. This chatbot has clearly turned sentient and is trolling you.

FauxLiving@lemmy.world on 11 Sep 14:02 collapse

It doesn’t take an AI genius to understand that it is possible to use low parameter models which are cheaper to run but dumber.

Considering Google serves billions of searches per day, they’re not using GPT-5 to generate the quick answers.

Multiplexer@discuss.tchncs.de on 11 Sep 07:14 next collapse

No, this is Google throwing the cheapest possible shit at you that is barely capable of stringing together 5 coherent sentences and has the reasoning capability of a tapeworm.

Here is the output of the minimalist open Chinese model Qwen3, that runs locally on my 6 year old mid-end PC:

The US states that contain the letter "d" (case-insensitive, as state names are typically capitalized) are as follows. I've verified each state name for the presence of the letter "d" in its standard spelling:

    Colorado (contains "d" in "Colorado")  
    Delaware (starts with "D")  
    Florida (contains "d" in "Florida")  
    Idaho (contains "d" in "Idaho")  
    Indiana (contains "d" in "Indiana")  
    Maryland (ends with "d" in "Maryland")  
    Nevada (contains "d" in "Nevada")  
    North Dakota (contains "d" in "Dakota")  
    Rhode Island (contains "d" in "Rhode")  
    South Dakota (contains "d" in "Dakota")  


Total: 10 states.

Rcklsabndn@sh.itjust.works on 11 Sep 10:27 next collapse

Illinois contains a hidden D which is in your mom.

Multiplexer@discuss.tchncs.de on 11 Sep 11:59 collapse

I didn’t understand your comment, so I asked the same LLM as before.
It explained it and I think that I get it now. Low-grade middle-school-“Your Mom”-joke, is it? Ha-ha… 🙄

This also means that AI did better than myself at both tasks I’ve given it today (I found only 9 states with “d” when going over the state-list myself…).

Whatever. I’m gonna have second lunch now.

lemonmelon@lemmy.dbzer0.com on 11 Sep 18:16 collapse

A public proclamation of your ineptitude at simple tasks is an interesting way of defending the utility of LLMs.

Multiplexer@discuss.tchncs.de on 11 Sep 18:49 next collapse

Well, a mindless, repetitive task prone to errors and a task requiring obscure knowledge (“d” as a synonym for dick… one of those self-censoring Gen-Z things?)
Nice to now have tools to solve these tasks and gain some time to do more interesting stuff instead. Lively discussions on Lemmy, e.g. ;-)

Cocodapuf@lemmy.world on 12 Sep 03:41 collapse

In all fairness, that is one of the strong use cases for computers in general. Doing simple yet tedious tasks accurately. When looking over 50 names checking for a particular letter, humans get bored and make mistakes. We actually aren’t great at that sort of task. I think simply calling this ineptitude both misses the point and under appreciates the reality of being human.

Alas, it is easier to call someone dumb than to try to understand them.

FauxLiving@lemmy.world on 11 Sep 13:58 collapse

Exactly.

The model that responds to your search query is designed to be cheap, not accurate. It has to generate an answer to every single search issued to Google. They’re not using high parameter models with reasoning because those would be ruinously expensive.

Djehngo@lemmy.world on 11 Sep 07:22 next collapse

The letters that make up words is a common blind spot for AIs, since they are trained on strings of tokens (roughly words) they don’t have a good concept of which letters are inside those words or what order they are in.

PixelatedSaturn@lemmy.world on 11 Sep 08:26 next collapse

I find it bizarre that people find these obvious cases to prove the tech is worthless. Like saying cars are worthless because they can’t go under water.

skisnow@lemmy.ca on 11 Sep 09:44 next collapse

Not bizarre at all.

The point isn’t “they can’t do word games therefore they’re useless”, it’s “if this thing is so easily tripped up on the most trivial shit that a 6-year-old can figure out, don’t be going round claiming it has PhD level expertise”, or even “don’t be feeding its unreliable bullshit to me at the top of every search result”.

PixelatedSaturn@lemmy.world on 11 Sep 09:51 next collapse

I don’t want to defend ai again, but it’s a technology, it can do some things and can’t do others. By now this should be obvious to everyone. Except to the people that believe everything commercials tell them.

kouichi@ani.social on 11 Sep 10:25 next collapse

How many people do you think know that AIs are “trained on tokens”, and understand what that means? It’s clearly not obvious to those who don’t, which are roughly everyone.

PixelatedSaturn@lemmy.world on 11 Sep 10:56 collapse

You don’t have to know about tokens to see what ai can and cannot do.

huppakee@feddit.nl on 11 Sep 11:17 collapse

Go to an art museum and somebody will say ‘my 6 year old can make this too’, in my view this is a similar fallacy.

PixelatedSaturn@lemmy.world on 11 Sep 12:51 collapse

That makes no sense. That has nothing to do with it. What are you on about.

That’s like watching tv and not knowing how it works. You still know what to get out of it.

sqgl@sh.itjust.works on 11 Sep 12:13 collapse

358 instances (so far) of lawyers in Australia using AI evidence which “hallucinated”.

And this week one was finally punished.

PixelatedSaturn@lemmy.world on 11 Sep 13:12 collapse

Ok? So, what you are saying is that some lawyers are idiots. I could have told you that before ai existed.

Aceticon@lemmy.dbzer0.com on 11 Sep 15:52 collapse

It’s not the AIs which are crap, its what they’ve been sold as capable of doing and the reliability of their results that’s massivelly disconnected from reality.

The crap is what a most of the Tech Investor class has pushed to the public about AI.

It’s thus not at all surprising that many who work or manage work in areas were precision and correctness is essential have been deceived into thinking AI can do much of the work for them and it turns out AI can’t really do it because of those precision and correctness requirement that it simply cannot achieve.

This will hit more those people who are not Tech experts, such as Lawyers, but even some supposedly Tech experts (such as some programmers) have been swindled in this way.

There are many great uses for AI, especially stuff other than LLMs, in areas where false positives or false negatives are no big deal, but that’s not were the Make Money Fast slimy salesmen push for them is.

PixelatedSaturn@lemmy.world on 11 Sep 16:48 collapse

I think people today, after having a year experience with ai know it’s capabilities reasonably well. My mother is 73 and it’s been a while since she stopped joking about what ai wrote to her that was silly or wrong, so people using computers at their jobs should be much more aware.

I agree about that llms are good at some things. They are great tools for what they can do. Let’s use them for those things! I mean even programming has benefitted a lot from this, especially in education, junior level stuff, prototyping, …

When using any product, a certain responsibility falls on the user. You can’t blame technology for what stupid users do.

sqgl@sh.itjust.works on 11 Sep 17:14 collapse

I recommended to one person (who I didn’t know well) that she use chatGPT to correct her grammar. It is great for that.

However she then paid for a subscription because she likes the “conversations”. Am feeling guilty now. Better check on her that she isn’t losing the plot.

1rre@discuss.tchncs.de on 11 Sep 10:29 collapse

A six year old can read and write Arabic, Chinese, Ge’ez, etc. and yet most people with PhD level experience probably can’t, and it’s probably useless to them. LLMs can do this also. You can count the number of letters in a word, but so can a program written in a few hundred bytes of assembly. It’s completely pointless to make LLMs to do that, as it’d just make them way less efficient than they need to be while adding nothing useful.

skisnow@lemmy.ca on 11 Sep 12:58 next collapse

LOL, it seems like every time I get into a discussion with an AI evangelical, they invariably end up asking me to accept some really poor analogy that, much like an LLM’s output, looks superficially clever at first glance but doesn’t stand up to the slightest bit of scrutiny.

1rre@discuss.tchncs.de on 11 Sep 14:13 collapse

it’s more that the only way to get some anti AI crusader that there are some uses for it is to put it in an analogy that they have to actually process rather than spitting out an “ai bad” kneejerk.

I’m probably far more anti AI than average, for 95% of what it’s pushed for it’s completely useless, but that still leaves 5% that it’s genuinely useful for that some people refuse to accept.

TempermentalAnomaly@lemmy.world on 11 Sep 14:31 next collapse

It’s amazing that if you acknowledge that:

AI has some utility and
The (now tiresome and sloppy) tests they’re using doesn’t negate 1

You are now an AI evangelist. Just as importantly, the level of investment into AI doesn’t justify #1. And when that realization hits business America, a correction will happen and the people who will be effected aren’t the well off, but the average worker. The gains are for the few, the loss for the many.

[deleted] on 12 Sep 02:08 collapse

abir_v@lemmy.world on 11 Sep 14:33 next collapse

I feel this. In my line of work I really don’t like using them for much of anything (programming ofc, like 80% of Lemmy users) because it gets details wrong too often to be useful and I don’t like babysitting.

But when I need a logging message, or to return an error, it’s genuinely a time saver. It’s good at pretty well 5%, as you say.

But using it for art, math, problem solving, any of that kind of stuff that gets tauted around by the business people? Useless, just fully fuckin useless.

1rre@discuss.tchncs.de on 12 Sep 09:08 collapse

I don’t know about “art”, one part of ai image generation is of replacing stock images and erotic photos which frankly I don’t have a huge issue with as they’re both at least semi-exploitative industries anyway in many ways and you just need something that’s good enough.

Obviously these don’t extend to things a reasonable person would consider art, but business majors and tech bros rebranding something shitty to position it as a competitor to or in the same class as something it so obviously isn’t.

abir_v@lemmy.world on 12 Sep 13:35 collapse

Yeah - I first hand have seen business majors I work with try to pitch a song from AI as our new marketing jingle. It was neither good, nor catchy for marketing purposes, but business ghouls hear something that sounds close enough to something someone put real effort into and think that’s the hard part sorted.

Jomega@lemmy.world on 11 Sep 14:45 collapse

it’s more that the only way to get some anti AI crusader that there are some uses for it

Name three.

1rre@discuss.tchncs.de on 11 Sep 15:11 collapse

I’m going to limit to LLMs as that’s the generally accepted term and there’s so many uses for AI in other fields that it’d be unfair.

Translation. LLMs are pretty much perfect for this.
Triaging issues for support. They’re useless for coming to solutions but as good as humans without the need to wait at sending people to the correct department to deal with their issues.
Finding and fixing issues with grammar. Spelling is something that can be caught by spell-checkers, but grammar is more context-aware, another thing that LLMs are pretty much designed for, and useful for people writing in a second language.
Finding starting points to research deeper. LLMs have a lot of data about a lot of things, so can be very useful for getting surface level information eg. about areas in a city you’re visiting, explaining concepts in simple terms etc.
Recipes. LLMs are great at saying what sounds right, so for cooking (not so much baking, but it may work) they’re great at spitting out recipes, including substitutions if needed, that go together without needing to read through how someone’s grandmother used to do xyz unrelated nonsense.

There’s a bunch more, but these were the first five that sprung to mind.

Jomega@lemmy.world on 11 Sep 15:53 next collapse

Right, except they suck at all of those things. Especially the last one. Unless you think glue is an acceptable pizza topping.

1rre@discuss.tchncs.de on 11 Sep 16:01 collapse

Nice, here’s a gold star for finding one case of it doing something wrong. I’ll call the CEO of AI and tell them to call it off, it’s a good thing humans have never said anything like that!

Jomega@lemmy.world on 11 Sep 18:04 collapse

Bruh, you were the one that picked the examples. If you had a better argument you should have used that one instead.

1rre@discuss.tchncs.de on 11 Sep 20:02 collapse

And no matter what I picked, you’d reject them because you’re not actually considering them, you’re just either a troll, a contrarian or a luddite.

Jomega@lemmy.world on 11 Sep 20:13 collapse

Riiiiight. Everyone who disagrees with you is an evil scary luddite. Sure fam.

1rre@discuss.tchncs.de on 11 Sep 21:58 collapse

Who said you were scary?

Frankly I pity you more than anything.

voronaam@lemmy.world on 11 Sep 16:15 collapse

Translation. Only works for unified technical texts. The older non-LLM translation is still better for any general text and human translation for any fiction is a must. Case in point: try to translate Severance TV show transcript to another language. The show makes a heavy use of “Innie/Outie” language that does not exist in modern English. LLM fail to translate that - human translator would be able to find a proper pair of words in the target language.
Triaging issues for support. This one is a double-edged sword. Sure you can triage issues faster with LLM, but other people can also write issues faster with their LLMs. And they are winning more. Overall, LLM is a net negative on your triage cost as a business because while you can process each one faster than before, you are also getting way higher volume of those.
Grammar. It fails in that. I asked LLM about “fascia treatment” but of course I misspelled “fascia”. The “PhD-level” LLM failed to recognize the typo and gave me a long answer about different kinds of “facial treatment” even though for any human the mistake would’ve been obvious. Meaning, it only corrects grammar properly when the words it is working on are simple and trivial.
Starting points for deeper research. So was the web search. No improvement there. Exactly on-par with the tech from two decades ago.
Recipes. Oh, you stumbled upon one of my pet peeves! Recipes are generally in the gutter on the textual Internet now. Somehow a wrong recipe got into LLM training for a few things and now those mistakes are multiplied all over the Internet! You would not know the mistakes if you did not not cook/bake the thing previously. The recipe database was one of the early use cases for the personal computers back in 1990s and it is one of the first ones to fall prey to “innovation”. The recipes online are so bad, that you need an LLM to distill it back to manageable instructions. So, LLM in your example are great at solving the problem they created in the first place! You would not need LLM to get cooking instructions out of 1990s database. But early text generation AIs polluted this section of the Internet so much, that you need the next generation AI to unfuck it. Tech being great at solving the problem it created in the first place is not so great if you think about it.

1rre@discuss.tchncs.de on 12 Sep 08:57 collapse

You’re bringing up edge cases for #1, and it should be replacing google translate and basic human translation, eg allowing people to understand posts online or communicate textually with people with whom they don’t share a common language. Using it for anything high stakes or legal documents is asking for trouble though.

For 2, it’s not for AIs finding issues, it’s for people wanting to book a flight, or seek compensation for a delayed flight, or find out what meals will be served on their flight. Some people prefer to use text or voice communication over a UI, and this makes it easier to provide.

For 3, grammar and spelling are different. I said it wasn’t useful for spellcheck, but even then if you give it the right context it may or may not catch it. I was referring more to word order and punctuation positioning.

For 4, yeah for me it’s on par in terms of results, but much much faster, especially when asking followup questions or specifying constraints. A lot of people aren’t search engine powerusers though, so will find it significantly easier, faster and better than conventional search than having to manage tabs or keep track of what you’ve seen without just scrolling back up in the conversation.

For 5, recipes have been in the gutter for a decade or more now, SEO came before LLMs, but yeah, you’ve actually caught on to an obvious #6 I missed here of text summarisation…

What I’m getting overall though is that you’re not considering how tech-savvy the average person is, which absolutely makes them seem less useful as the more tech savvy you are, both the more you’re aware of their weaknesses and the less you benefit from the speedup by simplification they bring. This does make ai’s shortcomings more dangerous, but as it matures one would hope that it becomes common knowledge.

voronaam@lemmy.world on 12 Sep 23:58 collapse

I think you are correct at the main point:

you’re not considering how tech-savvy the average person is

I am actually having hard time understanding where all of that hype is coming from. The first time I’ve seen AI solve a problem better than a human was back in 1996. I have used various generations of AI tools ever since. LLMs are fun, but it is not like they are that much different from the other AI tools before them. Every time a new AI technology comes around I am finding a use case for it in my own flow. LLMs have their uses as well. But I am not trying to solve ALL the problems with the new tech.

I do not understand “the average person”. And I guess I never will.

echodot@feddit.uk on 11 Sep 13:25 collapse

So if the AI can’t do it then that’s just proof that the AI is too smart to be able to do it? That’s your arguement is it. Nah, it’s just crap

You think just because you attached it to an analogy that makes it make sense. That’s not how it works, look I can do it.

My car is way too technologically sophisticated to be able to fly, therefore AI doesn’t need to be able to work out how many l Rs are in “strawberry”.

See how that made literally no sense whatsoever.

1rre@discuss.tchncs.de on 11 Sep 14:00 collapse

Except you’re expecting it to do everything. Your car is too “technically advanced” to walk on the sidewalk, but wait, you can do that anyway and don’t need to reinvent your legs

echodot@feddit.uk on 11 Sep 10:08 next collapse

Well it also can’t code very well either

[deleted] on 11 Sep 10:17 collapse

echodot@feddit.uk on 11 Sep 11:34 collapse

I feel like that was supposed to be an insult but because it made literally no sense whatsoever, I really can’t tell.

PixelatedSaturn@lemmy.world on 11 Sep 12:56 collapse

No not really, just an observation. It literally said you are a boring person. Not sure whats not to get.

Bye.

echodot@feddit.uk on 11 Sep 13:21 collapse

You need to get back on the dried frog pills.

knatschus@discuss.tchncs.de on 11 Sep 10:10 next collapse

Then why is Google using it for question like that?

Surely it should be advanced enough to realise it’s weakness with this kind of questions and just don’t give an answer.

PixelatedSaturn@lemmy.world on 11 Sep 10:18 collapse

They are using it for every question. It’s pointless. The only reason they are doing it is to blow up their numbers.

… they are trying to be infront. So that some future ai search wouldn’t capture their market share. It’s a safety thing even if it’s not working for all types of questions.

TheGrandNagus@lemmy.world on 11 Sep 11:21 collapse

The only reason they are doing it is to blow up their numbers.

Ding ding ding.

It’s so they can have impressive metrics for shareholders.

“Our AI had n interactions this quarter! Look at that engagement!”, with no thought put into what user problems it actually solves.

It’s the same as web results in the Windows start menu. “Hey shareholders, Bing received n interactions through the start menu, isn’t that great? Look at that engagement!”, completely obfuscating that most of the people who clicked are probably confused elderly users who clicked on a web result without realising.

Line on chart must go up!

PixelatedSaturn@lemmy.world on 11 Sep 12:54 collapse

Yeah, but … they also can’t just do nothing and possibly miss out on something. Especially if they already invested a lot.

figjam@midwest.social on 11 Sep 10:37 next collapse

Understanding the bounds of tech makes it easier for people to gage its utility. The only people who desire ignorance are those that profit from it.

PixelatedSaturn@lemmy.world on 11 Sep 10:58 next collapse

Sure. But you can literally test almost all frontier models for free. It’s not like there is some conspiracy or secret. Even my 73 year old mother uses it and knows it’s general limits.

FishFace@lemmy.world on 11 Sep 12:06 collapse

Saying “it’s worth trillions of dollars huh” isn’t really promoting that attitude.

EnsignWashout@startrek.website on 11 Sep 17:39 next collapse

I find it bizarre that people find these obvious cases to prove the tech is worthless. Like saying cars are worthless because they can’t go under water.

This reaction is because conmen are claiming that current generations of LLM technology are going to remove our need for experts and scientists.

We’re not demanding submersible cars, we’re just laughing about the people paying top dollar for the lastest electric car while plannig an ocean cruise.

I’m confident that there’s going to be a great deal of broken… everything…built with AI “assistance” during the next decade.

PixelatedSaturn@lemmy.world on 11 Sep 19:22 collapse

That’s not what you are doing at all. You are not laughing. Anti ai people are outraged, full of hatred and ready to pounce on anyone who isn’t as anti as they are. It’s a super emotional issue, especially on fediverse.

You may be confident, because you probably don’t know how software is built. Nobody is going to just abandon all the experience they have, vibe code something and release whatever. Thats not how it works.

EnsignWashout@startrek.website on 11 Sep 21:41 collapse

because you probably don’t know how software is built.

Oh shit. Nevermind then.

mrductape@eviltoast.org on 13 Sep 21:22 collapse

Well technically cars can go underwater. They just cannot get out because they stop working.

PixelatedSaturn@lemmy.world on 13 Sep 22:14 collapse

Intentionally missing the point is not an argument in itself.

azertyfun@sh.itjust.works on 11 Sep 23:17 collapse

It’s very funny that you can get ChaptGPT to spell out the word (making each letter an individual token) and still be wrong.

Of course it makes complete sense when you know how LLMs work, but this demo does a very concise job of short-circuiting the cognitive bias that talking machine == thinking machine.

chaosCruiser@futurology.today on 11 Sep 07:40 next collapse

In Copilot terminology, this is a “quick response” instead of the “think deeper” option. The latter actually stops to verify the initial answer before spitting it out.

Deep thinking gave me this: Colorado, Delaware, Florida, Idaho, Indiana, Maryland, North Dakota, Rhode Island, and South Dakota.

It took way longer, but at least the list looks better now. Somehow it missed Nevada, so it clearly didn’t think deep enough.

skisnow@lemmy.ca on 11 Sep 09:46 collapse

“I asked it to burn an extra 2KWh of energy breaking the task up into small parts to think about it in more detail, and it still got the answer wrong”

chaosCruiser@futurology.today on 11 Sep 10:46 collapse

Yeah that pretty much sums it up. Sadly, it didn’t tell me how much coal was burned and how many starving orphan puppies it had to stomp on to produce the result.

FreedomAdvocate@lemmy.net.au on 11 Sep 08:15 next collapse

Gemini is trained on reddit data, what do you expect?

_stranger_@lemmy.world on 11 Sep 12:05 collapse

Honestly? Way more d.

echodot@feddit.uk on 11 Sep 10:07 next collapse

You joke, but I bet you didn’t know that Connecticut contained a “d”

I wonder what other words contain letters we don’t know about.

Rcklsabndn@sh.itjust.works on 11 Sep 10:25 next collapse

The famous ‘invisible D’ of Connecticut, my favorite SCP.

ICastFist@programming.dev on 11 Sep 11:50 next collapse

SCP-00WTFDoC (lovingly called “where’s the fucking D of Connecticut” by the foundation workers, also “what the fuck, doc?”)

People think it’s safe, because it’s “just an invisible D”, not even a dick, just the letter D, and it only manifests verbally when someone tries to say “connecticut” or write it down. When you least expect it, everyone heard “Donnedtidut”, everyone read that thing and a portal to that fucking place opens and drags you in.

kuberoot@discuss.tchncs.de on 11 Sep 11:59 next collapse

That actually sounds like a fun SCP - a word that doesn’t seem to contain a letter, but when testing for the presence of that letter using an algorithm that exclusively checks for that presence, it reports the letter is indeed present. Any attempt to check where in the word the letter is, or to get a list of all letters in that word, spuriously fail. Containment could be fun, probably involving amnestics and widespread societal influence, I also wonder if they could create an algorithm for checking letter presence that can be performed by hand without leaking any other information to the person performing it, reproducing the anomaly without computers.

iamdefinitelyoverthirteen@lemmy.world on 11 Sep 12:23 next collapse

ct -> d is a not-uncommon OCR fuck up. Maybe that’s the source of it’s garbage data?

leftzero@lemmy.dbzer0.com on 11 Sep 15:46 collapse

No, LLMs produce the most statistically likely (in their training data) token to follow a certain list of tokens (there’s nothing remotely resembling reasoning going on in there, it’s pure hard statistics, with some error and randomness thrown in), and there are probably a lot more lists where Colorado is followed by Connecticut than ones where it’s followed by Delaware, so they’re obviously going to be more likely to produce the former.

Moreover, there aren’t going to be many texts listing the spelling of states (maybe transcripts of spelling bees?), so that information is unlikely to be in their training data, and they can’t extrapolate because it’s not really something they do and because they use words or parts of words as tokens, not letters, so they literally have no way of listing the letters of a word if said list is not in their training data (and, again, that’s not something we tend to write, and if we did we wouldn’t include d in Connecticut even if we were reading a misprint). Same with counting how many letters a word has, and stuff like that.

ripcord@lemmy.world on 11 Sep 14:21 collapse

Words are full of mystery! Besides the invisible D, Connecticut has that inaudible C…

Rcklsabndn@sh.itjust.works on 12 Sep 02:12 collapse

I hear the Invisible D and Silent C are happily married.

villainy@lemmy.world on 11 Sep 11:08 next collapse

Every American I know does pronounce it like Connedicut 🤔

Corkyskog@sh.itjust.works on 11 Sep 11:27 collapse

Really? Everyone I know calls it kinetic-cut. But I group up in new england.

echodot@feddit.uk on 11 Sep 11:32 next collapse

That’s how I’ve always heard it pronounced on the rare occasions anybody ever mentions it. But I’ve never been to that part of the US so maybe the accents different there?

XeroxCool@lemmy.world on 11 Sep 14:20 collapse

“Kinetic” with a hard “T” like posh Brit is saying it to the queen? Everyone I’ve ever heard speaking US English pronounces it with a rolled “t” like “kinedic” so the alternate pronunciation still reads like it’d have a “d” sound

TipRing@lemmy.world on 11 Sep 16:02 collapse

This phenomenon is called “T flapping” and it is common in North American English. I got into an argument with my dad who insisted he pronounces the T’s in ‘butter’ when his dialect, like nearly all North Americans pronounces the word as ‘budder’.

HeyThisIsntTheYMCA@lemmy.world on 11 Sep 16:47 collapse

budder is softer than t flapping. further forward with the tongue on the palate.

TipRing@lemmy.world on 11 Sep 17:21 collapse

It’s an approximation, but the t is partially vocalized giving it a ‘d’ sound even if it’s not made exactly the same way.

HeyThisIsntTheYMCA@lemmy.world on 11 Sep 17:50 collapse

i just thought we were getting technical about the linguistics. i got and use both words frequently, thought the distinction might be appreciated. the difference is so subtle we sometimes have to ask each other which one we’re referring to. i’m willing to bet it shows up more on my face than in my voice.

TipRing@lemmy.world on 11 Sep 17:52 collapse

I appreciate the discussion, I get out of my depth pretty quickly on the topic being a linguistic hobbyist rather than someone with actual education and background.

OrteilGenou@lemmy.world on 11 Sep 11:23 next collapse

Connedicut

Aneb@lemmy.world on 11 Sep 13:14 collapse

I was going to make a joke if you’re from connedicut you never pronounce first d in the word. Conne-icut

jaupsinluggies@feddit.uk on 11 Sep 14:29 collapse

The d in Connecticut is between the e and the i. They don’t connect because it was cut.

Uruanna@lemmy.world on 11 Sep 15:07 collapse

Connecticut is Jewish?

Jordan117@lemmy.world on 11 Sep 10:44 next collapse

One of these days AI skeptics will grasp that spelling-based mistakes are an artifact of text tokenization, not some wild stupidity in the model. But today is not that day.

TheGrandNagus@lemmy.world on 11 Sep 11:15 next collapse

You aren’t wrong about why it happens, but that’s irrelevant to the end user.

The result is that it can give some hilariously incorrect responses at times, and therefore it’s not a reliable means of information.

FishFace@lemmy.world on 11 Sep 12:09 next collapse

A calculator app is also incapable of working with letters, does that show that the calculator is not reliable?

What it shows, badly, is that LLMs offer confident answers in situations where their answers are likely wrong. But it’d be much better to show that with examples that aren’t based on inherent technological limitations.

TheGrandNagus@lemmy.world on 12 Sep 12:27 collapse

The difference is that Google decided this was a task best suited for their LLM.

If someone seeked out an LLM specifically for this question, and Google didn’t market their LLM as an assistant that you can ask questions, you’d have a point.

But that’s not the case, so alas, you do not have a point.

FauxLiving@lemmy.world on 11 Sep 14:09 collapse

“It”? Are you conflating the low parameter model that Google uses to generate quick answers with every AI model?

Yes, Google’s quick answer product is largely useless. This is because it’s a cheap model. Google serves billions of searches per day and isn’t going to be paying premium prices to use high parameter models.

You get what you pay for, and nobody pays for Google so their product produces the cheapest possible results and, unsurprisingly, cheap AI models are more prone to error.

TheGrandNagus@lemmy.world on 12 Sep 12:25 collapse

Yes, it. It’s not a person. Were you expecting me to call it anything else?

wieson@feddit.org on 11 Sep 12:31 collapse

Mmh, maybe the solution than is to use the tool for what it’s good, within it’s limitations.

And not promise that it’s omnipotent in every application and advertise/ implement it as such.

Mmmmmmmmmmh.

As long as LLMs are built into everything, it’s legitimate to criticise the little stupidity of the model.

sqgl@sh.itjust.works on 11 Sep 12:09 next collapse

ChatGPT is just as stupid.<img alt="" src="https://sh.itjust.works/pictrs/image/52926e28-d1bf-40fb-93ff-6dbbe360995c.png">

SaveTheTuaHawk@lemmy.ca on 11 Sep 15:30 collapse

it’s actually getting dumber.

panda_abyss@lemmy.ca on 11 Sep 12:51 next collapse

Yesterday i asked Claude Sonnet what was on my calendar (since they just sent a pop up announcing that feature)

It listed my work meetings on Sunday, so I tried to correct it…

You’re absolutely right - I made an error! September 15th is a Sunday, not a weekend day as I implied. Let me correct that: This Week’s Remaining Schedule: Sunday, September 15

Just today when I asked what’s on my calendar it gave me today and my meetings on the next two thursdays. Not the meetings in between, just thursdays.

Something is off in AI land.

Edit: I asked again: gave me meetings for Thursday’s again. Plus it might think I’m driving in F1

FlashMobOfOne@lemmy.world on 11 Sep 12:59 next collapse

A few weeks ago my Pixel wished me a Happy Birthday when I woke up, and it definitely was not my birthday. Google is definitely letting a shitty LLM write code for it now, but the important thing is they’re bypassing human validation.

Stupid. Just stupid.

python@lemmy.world on 11 Sep 14:00 collapse

pixel? ~~have you heard ~about grapheneOS tho…~~~

achance4cheese@sh.itjust.works on 11 Sep 15:08 next collapse

Also, Sunday September 15th is a Monday… I’ve seen so many meeting invites with dates and days that don’t match lately…

panda_abyss@lemmy.ca on 11 Sep 16:20 collapse

Yeah, it said Sunday, I asked if it was sure, then it said I’m right and went back to Sunday.

I assume the training data has the model think it’s a different year or something, but this feature is straight up not working at all for me. I don’t know if they actually tested this at all.

Sonnet seems to have gotten stupider somehow.

Opus isn’t following instructions lately either.

MangoCats@feddit.it on 12 Sep 02:23 collapse

We’ve used the Google AI speakers in the house for years, they make all kinds of hilarious mistakes. They also are pretty convenient and reliable for setting and executing alarms like “7AM weekdays”, and home automation commands like “all lights off”. But otherwise, it’s hit and miss and very frustrating when they push an update that breaks things that used to work.

Blackmist@feddit.uk on 11 Sep 13:36 next collapse

Just another trillion, bro.

Tryenjer@lemmy.world on 11 Sep 14:43 next collapse

Behold the most expensive money burner!

NateNate60@lemmy.world on 11 Sep 15:36 collapse

Just another 1.21 jigawatts of electricity, bro. If we get this new coal plant up and running, it’ll be enough.

ArsonButCute@lemmy.dbzer0.com on 11 Sep 13:57 next collapse

Hey look the markov chain showed its biggest weakness (the markov chain)!

In the training data, it could be assumed by output that Connecticut usually follows Colorado in lists of two or more states containing Colorado. There is no other reason for this to occur as far as I know.

Markov Chain based LLMs (I think thats all of them?) are dice-roll systems constrained to probability maps.

Edit: just to add because I don’t want anyone crawling up my butt about the oversimplification. Yes. I know. That’s not how they work. But when simplified to words so simple a child could understand them, its pretty close.

AlecSadler@lemmy.blahaj.zone on 11 Sep 14:24 next collapse

Oh l I was thinking it’s because people pronounce it Connedicut

ArsonButCute@lemmy.dbzer0.com on 11 Sep 14:54 collapse

Awe cute!

ramjambamalam@lemmy.ca on 11 Sep 17:07 collapse

I was wondering if you’d get similar results for states with the letter R, since there’s lots of prior art mentioning these states as either “D” or “R” during elections.

Sam_Bass@lemmy.world on 11 Sep 14:32 next collapse

So the Dakotas get a pass

morphballganon@mtgzone.com on 11 Sep 15:46 collapse

And Idaho

Dremor@lemmy.world on 11 Sep 14:40 next collapse

Connecticut do have a D in it: mine.

betanumerus@lemmy.ca on 11 Sep 14:52 next collapse

Sure now list the trillion other things that tech can do.

DarkSirrush@lemmy.ca on 11 Sep 15:30 collapse

Have a 40% accuracy on any type of information it can produce? Not handle 2 column pages in its training data, resulting in dozens of scientific papers including references to nonsense pseudoscience words? Invent an entirely new form of slander that its creators can claim isn’t their fault to avoid getting sued in court for it?

dude@lemmings.world on 11 Sep 14:56 next collapse

Well, for anyone who knows a bit about how LLMs work, it’s pretty obvious why LLMs struggle with identifying the letters in the words

BritishJ@lemmy.world on 11 Sep 17:06 collapse

Well go on…

JustTesting@lemmy.hogru.ch on 11 Sep 17:16 next collapse

They don’t look at it letter by letter but in tokens, which are automatically generated separately based on occurrence. So while ‘z’ could be it’s own token, ‘ne’ or even ‘the’ could be treated as a single token vector. of course, ‘e’ would still be a separate token when it occurs in isolation. You could even have ‘le’ and ‘let’ as separate tokens, afaik. And each token is just a vector of numbers, like 300 or 1000 numbers that represent that token in a vector space. So ‘de’ and ‘e’ could be completely different and dissimilar vectors.

so ‘delaware’ could look to an llm more like de-la-w-are or similar.

of course you could train it to figure out letter counts based on those tokens with a lot of training data, though that could lower performance on other tasks and counting letters just isn’t that important, i guess, compared to other stuff

BritishJ@lemmy.world on 11 Sep 18:25 next collapse

Good read. Thank you

MangoCats@feddit.it on 11 Sep 18:41 next collapse

Of course, when the question asks “contains the letter _” you might think an intelligent algorithm would get off its tokens and do a little letter by letter analysis. Related: ChatGPT is really bad at chess, but there are plenty of algorithms that are super-human good at it.

fading_person@lemmy.zip on 11 Sep 18:41 next collapse

Wouldn’t that only explain errors by omission? If you ask for a letter, let’s say D, it would omit words containing that same letter when in a token in conjunction with more letters, like Da, De, etc, but how would it return something where the letter D isn’t even in the word?

JustTesting@lemmy.hogru.ch on 11 Sep 21:37 collapse

Well each token has a vector. So ‘co’ might be [0.8,0.3,0.7] just instead of 3 numbers it’s like 100-1000 long. And each token has a different such vector. Initially, those are just randomly generated. But the training algorithm is allowed to slowly modify them during training, pulling them this way and that, whichever way yields better results during training. So while for us, ‘th’ and ‘the’ are obviously related, for a model no such relation is given. It just sees random vectors and the training reorganizes them tho slowly have some structure. So who’s to say if for the model ‘d’, ‘da’ and ‘co’ are in the same general area (similar vectors) whereas ‘de’ could be in the opposite direction. Here’s an example of what this actually looks like. Tokens can be quite long, depending how common they are, here it’s ones related to disease-y terms ending up close together, as similar things tend to cluster at this step. You might have an place where it’s just common town name suffixes clustered close to each other.

and all of this is just what gets input into the llm, essentially a preprocessing step. So imagine someone gave you a picture like the above, but instead of each dot having some label, it just had a unique color. And then they give you lists of different colored dots and ask you what color the next dot should be. You need to figure out the rules yourself, come up with more and more intricate rules that are correct the most. That’s kinda what an LLM does. To it, ‘da’ and ‘de’ could be identical dots in the same location or completely differents

plus of course that’s before the llm not actually knowing what a letter or a word or counting is. But it does know that 5.6.1.5.4.3 is most likely followed by 7.7.2.9.7(simplilied representation), which when translating back, that maps to ‘there are 3 r’s in strawberry’. it’s actually quite amazing that they can get it halfway right given how they work, just based on ‘learning’ how text structure works.

but so in this example, us state-y tokens are probably close together, ‘d’ is somewhere else, the relation between ‘d’ and different state-y tokens is not at all clear, plus other tokens making up the full state names could be who knows where. And tien there’s whatever the model does on top of that with the data.

for a human it’s easy, just split by letters and count. For an llm it’s trying to correlate lots of different and somewhat unrelated things to their ‘d-ness’, so to speak

fading_person@lemmy.zip on 11 Sep 23:16 collapse

Thank you very much for taking your time to explain this. if you don’t mind, do you recommend some reference for further reading on how llms work internally?

cyberwolfie@lemmy.ml on 12 Sep 05:41 next collapse

You could look up 3Blue1Brown’s explainers on YouTube, they are pretty good and shows a lot of visual examples. He has a lot of other videos on other areas of math.

fading_person@lemmy.zip on 12 Sep 14:31 collapse

I’ll check it later, thanks

JustTesting@lemmy.hogru.ch on 12 Sep 06:51 collapse

For the byte pair encoding (how those tokens get created) i think bpemb.h-its.org does a good job at giving an overview. after that i’d say self attention from 2017 is the seminal work that all of this is based on, and the most crucial to understand. jtlicardo.com/blog/self-attention-mechanism does a good job of explaining it. And jalammar.github.io/illustrated-transformer/ is probably the best explanation of a transformer architecture (llms) out there. Transformers are made up of a lot of self attention.

it does help if you know how matrix multiplications work, and how the backpropagation algorithm is used to train these things. i don’t know of a good easy explanation off the top of my head but xnought.github.io/backprop-explainer/ looks quite good.

and that’s kinda it, you just make the transformers bigger, with more weight, pluck on a lot of engineering around them, like being able to run code and making it run more efficientls, exploit thousands of poor workers to fine tune it better with human feedback, and repeat that every 6-12 month for ever so it can stay up to date.

fading_person@lemmy.zip on 12 Sep 14:31 collapse

Thank you very much

MangoCats@feddit.it on 11 Sep 18:42 collapse

Con-ned-di-cut

Gladaed@feddit.org on 12 Sep 08:10 collapse

Which is State contains 狄? They use a different alphabet, so understanding ours is ridiculous.

Amoxtli@thelemmy.club on 11 Sep 15:11 next collapse

Click bait post that cherry picks bad output to say certain technology has no potential because it thinks he smarter than everybody else with 4+years of higher education.

4am@lemmy.zip on 11 Sep 15:19 collapse

It doesn’t have the potential they market it to have, and to be useful in all the human-replacing ways they claim it is.

That’s what is bad about it.

leftzero@lemmy.dbzer0.com on 11 Sep 15:29 next collapse

Connedicut.

the_crotch@sh.itjust.works on 11 Sep 17:00 collapse

Close. We natives pronounce it ‘kuh ned eh kit’

robocall@lemmy.world on 11 Sep 17:59 collapse

So does everyone else

SaveTheTuaHawk@lemmy.ca on 11 Sep 15:29 next collapse

We’re turfing out students by the tens on academic misconduct. They are handing in papers with references that clearly state “generated by Chat GPT”. Lazy idiots.

NateNate60@lemmy.world on 11 Sep 15:33 next collapse

This is why invisible watermarking of AI-generated content is likely to be so effective. Even primitive watermarks like file metadata. It’s not hard for anyone with technical knowledge to remove, but the thing with AI-generated content is that anyone who dishonestly uses it when they are not supposed to is probably also too lazy to go through the motions of removing the watermarking.

SaveTheTuaHawk@lemmy.ca on 11 Sep 15:41 next collapse

if you are going to do all that, just do the research and learn something.

NateNate60@lemmy.world on 11 Sep 16:05 collapse

Aye that’s exactly the same thing that I said

DharmaCurious@startrek.website on 11 Sep 17:28 collapse

Couldn’t students just generate a paper with ChatGPT, open two windows wide by side and then type it out in a word document?

chaospatterns@lemmy.world on 11 Sep 17:37 next collapse

Depends on the watermark method used. Some people talk about watermarking by subtly adjusting the words used. Like if there’s 5 synonyms and you pick the 1st synonym, next word you pick the 3rd synonym. To check the watermark you have to access to the model and probabilities to see if it matches that. The tricky part about this is that the model can change and so can the probabilities and other things I don’t fully understand.

NateNate60@lemmy.world on 11 Sep 17:37 next collapse

Students view doing that as basically the same amount of work as writing the paper yourself

SaveTheTuaHawk@lemmy.ca on 11 Sep 18:22 next collapse

but that’s work.

MangoCats@feddit.it on 12 Sep 02:18 collapse

I think I’d at least use an OCR program to do the bulk of the typing for me…

JustTesting@lemmy.hogru.ch on 11 Sep 17:40 collapse

Huh that actually does sound like a good use-case of LLMs. Making it easier to weed out cheaters.

Aceticon@lemmy.dbzer0.com on 11 Sep 15:38 next collapse

“This is the technology worth trillions of dollars”

You can make anything fly high in the sky with enough helium, just not for long.

(Welcome to the present day Tech Stock Market)

MangoCats@feddit.it on 11 Sep 18:38 collapse

Bubbles and crashes aren’t a bug in the financial markets, they’re a feature. There are whole legions of investors and analysts who depend on them. Also, they have been a feature of financial markets since anything resembling a financial market was invented.

resipsaloquitur@lemmy.world on 11 Sep 17:10 next collapse

Listen, we just have to boil the ocean five more times.

Then it will hallucinate slightly less.

Or more. There’s no way to be sure since it’s probabilistic.

MangoCats@feddit.it on 11 Sep 18:37 collapse

If you want to get irate about energy usage, shut off your HVAC and open the windows.

resipsaloquitur@lemmy.world on 11 Sep 18:57 next collapse

Worthless comment.

elevenbones@sh.itjust.works on 11 Sep 20:31 collapse

Even more worthless than mine, somehow.

pupbiru@aussie.zone on 12 Sep 01:15 collapse

sounds reasonable… i’ll just go tell large parts of australia where it’s a workplace health and safety issue to be out of AC for more than 15min during the day that they should do their bit for climate change and suck it up… only a few people will die

jumping_redditor@sh.itjust.works on 12 Sep 02:38 collapse

maybe people shouldn’t live there then?

pupbiru@aussie.zone on 12 Sep 02:44 collapse

of course you’re right! we should just shut down some of the largest mines in the world

i foresee no consequences from this

(related note: south australia where one of the largest underground mines in the world is, largely gets its power from renewables)

people should probably move from canada and most of the north of the USA too: far too cold up there during winter

BlueMagma@sh.itjust.works on 11 Sep 17:27 next collapse

I get the sentiment behind this post, and it’s almost always funny when LLM are such dumbass. But this is not a good argument against the technology. It is akin to climate change denier using the argument: “look! It snowed today, climate change is so dumb huh ?”

MangoCats@feddit.it on 11 Sep 18:37 next collapse

AI writes code for me. It makes dumbass mistakes that compilers automatically catch. It takes three or four rounds to correct a lot of random problems that crop up. Above all else, it’s got limited capacity - projects beyond a couple thousand lines of code have to be carefully structured and spoonfed to it - a lot like working with junior developers. However: it’s significantly faster than Googling for the information needed to write the code like I have been doing for the last 20 years, it does produce good sample code (if you give it good prompts), and it’s way less frustrating and slow to work with than a room full of junior developers.

That’s not saying we fire the junior developers, just that their learning specializations will probably be very different from the ones I was learning 20 years ago, just as those were very different than the ones programmers used 40 and 60 years ago.

BlueMagma@sh.itjust.works on 11 Sep 20:47 collapse

I agree, cursor and other IDE integration have been a game changer. It made it way easier for a certain range of problems we used to have in software dev. And for every easy code, like prototyping, or inconsequential testing, it’s so so fast. What I found is that, it is particularly efficient at helping you do stuff you would have been able to do alone, and are able to check once it’s done. Need to be careful when asking stuff you aren’t familiar with though, cause it will comfortably lead you toward a mistake that will waste your time.

Though one thing I have to say: I’m very annoyed by it’s constant agreeing with what I say, and enabling me when I’m doing dumb shit. I wish it would challenge me more and tell me when I’m an idiot.

“Yes you are totally right”, “This is a very common issue that everybody has”, “What a great and insightful question”… I’m so tired of this BS.

MangoCats@feddit.it on 12 Sep 02:15 collapse

Though one thing I have to say: I’m very annoyed by it’s constant agreeing with what I say, and enabling me when I’m doing dumb shit. I wish it would challenge me more and tell me when I’m an idiot.

There’s a balance to be had there, too… I have been comparing a few AI engines to compare their code generation capabilities. If you want an exercise in frustration, try to make an old school keypress driven application on a modern line-oriented terminal interface while still using the terminal for standard text output. I got pretty far with Claude, then my daily time limits were kicking in. Claude did all that “you’re so right” ego stroking garbage, but also got me near to a satisfactory solution. Then I moved into Google AI and it started out with reading my the “you just can’t do that, it won’t work” doom and gloom it got from some downer stack overflow or similar material. Finally, I showed Google my code that was already doing what it was calling impossible and it started helping me to polish the remaining rough spots. But, if you believed its first line answers you’d walk away thinking that something relatively simple was simply impossible.

Lately, I have taken to writing my instructions in a requirements document instead of relying so much on interactive mode. It’s not a perfect approach, but it seems to be much more stable for “larger” projects where you hit the chat length limits and have to start over with the existing code - what you’ve captured in requirements tends to stick around better than just using the existing code as a starting point of how things should be then adding/modifying from there. Ideally, I’d like it if the engine could just take my requirements document and make the app from that, but Claude still seems to struggle when total LOC gets into the 2000-5000 range for a 200-ish lines requirement spec.

Reygle@lemmy.world on 11 Sep 21:23 next collapse

You do know that AI is (if not already) fast approaching a leading CAUSE of climate change?

groet@infosec.pub on 11 Sep 21:49 next collapse

While the environmental impact of AI is absolutely horrible I don’t think it is even in the top 10 of industries. Meat production, Transportation by cars, Airplanes, plastic products etc are all much worse.

The problem is AI is absolutely useless for how big its climate impact is. The other industries at least provide value.

Reygle@lemmy.world on 11 Sep 21:55 collapse

Your opinion isn’t invalid, it’s just incomplete

www.allaboutai.com/resources/…/ai-environment/

groet@infosec.pub on 12 Sep 02:30 collapse

Combining your source with this ourworldindata.org/emissions-by-sector

Well i wasnt wrong in the assumption that AI is absolutely dwarfed by other industries, agriculture and energy production, but it is in the top 10, on the same level as aviation (so like place 9)

BlueMagma@sh.itjust.works on 12 Sep 06:01 collapse

Yes, I know it has an impact, though not as big as you make it seem, (and so is everything). When you divide it to calculate the personal impact, it is way lower than a huge number of other stuff. I agree that we need to address climate change, but I don’t believe this should be the main focus.

Also, every individual should be able to choose how they spend their “carbon allocation”, personally, I don’t eat meat, I never take the plane, I don’t own a car and do everything using bike and trains, my house is carbon negative (building it actually had a negative carbon footprint) which was a huge sacrifice I had to compromise getting a way way smaller house for way more debt than if I had built a cheap standard house (and of course I’m in debt for decade). LLM makes me more efficient at my job so I think I can afford the carbon footprint that comes with it which, as I said, is not as big per individual as you make it appear.

I understand that hanging on Lemmy makes it seem like AI/LLM is the worse thing that has happened to mankind, but it’s really not, there are lots of issues with it, sure. But there is worse stuff to worry about.

I want to finish by saying that I DO support your action to minimize its impact, what you are doing overall is important and necessary, but I think you should revise the individual argument you put up against LLM, cause this one is not great.

PalmTreeIsBestTree@lemmy.world on 11 Sep 22:42 next collapse

It’s not worth the environmental impact

notabot@piefed.social on 12 Sep 08:50 collapse

I get the sentiment behind this post, and it's almost always funny when LLM are such dumbass. But this is not a good argument against the technology.

It's a pretty good argument against the technology, at least as it currently stands. This was a trivial question where anybody with a basic reading ability can see it's just completely wrong, the problem comes when you ask it a question you don't already know the answer to and can't easily check and it give equally wrong answers.

Jaysyn@lemmy.world on 11 Sep 18:09 next collapse

Blows my mind people pay money for wrong answers.

samus12345@sh.itjust.works on 11 Sep 21:05 next collapse

Connedicut.

I wondered if this has been fixed. Not only has it not, the AI has added Nebraska.

MML@sh.itjust.works on 11 Sep 21:49 next collapse

What about Our Kansas? Cause according to Google Arkansas has one o in it. Refreshing the page changes the answer though.

samus12345@sh.itjust.works on 11 Sep 21:53 collapse

Just checked, it sure does say that! AI spouting nonsense is nothing new, but it’s pretty ironic that a large language model can’t even parse what letters are in a word.

boonhet@sopuli.xyz on 11 Sep 23:12 next collapse

Well I mean it’s a statistics machine with a seed thrown in to get different results on different runs. So really, it models the structure of language, but not the meaning. Kinda useless.

monotremata@lemmy.ca on 13 Sep 02:46 collapse

It’s because, for the most part, it doesn’t actually have access to the text itself. Before the data gets to the “thinking” part of the network, the words and letters have been stripped out and replaced with vectors. The vectors capture a lot of aspects of the meaning of words, but not much of their actual text structure.

ilinamorato@lemmy.world on 11 Sep 22:36 next collapse

I would assume it uses a different random seed for every query. Probably fixed sometimes, not fixed other times.

sugar_in_your_tea@sh.itjust.works on 12 Sep 01:54 collapse

You mean Connecdicud.

ilinamorato@lemmy.world on 11 Sep 22:43 next collapse

✅ Colorado

✅ Connedicut

✅ Delaware

❌ District of Columbia (on a technicality)

✅ Florida

But not

❌ I’aho

❌ Iniana

❌ Marylan

❌ Nevaa

❌ North Akota

❌ Rhoe Islan

❌ South Akota

boonhet@sopuli.xyz on 11 Sep 23:09 next collapse

Everyone knows it’s properly spelled “I, the ho” not Idaho. That’s why it didn’t make the list.

individual@toast.ooo on 11 Sep 23:50 collapse

Gosh tier comment.

ilinamorato@lemmy.world on 11 Sep 23:54 collapse

You just described most of my post history.

Jankatarch@lemmy.world on 11 Sep 22:57 next collapse

They took money away from cancer research programs to fund this.

Burninator05@lemmy.world on 12 Sep 00:51 next collapse

After we pump another hundred trillion dollars and half the electricity generated globally into AI you’re going to feel pretty foolish for this comment.

veni_vedi_veni@lemmy.world on 12 Sep 12:38 collapse

Just a couple billion more parameters, bro, I swear, it will replace all the workers

CEOs

jumping_redditor@sh.itjust.works on 12 Sep 02:37 next collapse

only cancer patients benefit from cancer research, CEOs benefit from AI

Jankatarch@lemmy.world on 12 Sep 02:45 collapse

Tbf cancer patients benefit from AI too tho a completely different type that’s not really related to LLM chatbot AI girlfriend technology used in these.

kreskin@lemmy.world on 12 Sep 06:14 collapse

Well as long as we still have enough money to buy weapons for that one particular filthy genocider country in the middle east, we’re fine.

elterly147@literature.cafe on 11 Sep 23:10 next collapse

i rather manually search for info

Kolanaki@pawb.social on 12 Sep 00:13 next collapse

Connecdicud.

RampantParanoia2365@lemmy.world on 12 Sep 01:15 next collapse

I would estimate that Google’s AI is helpful and correct about 7% of the time, for actual questions I’d like the answer to.

skisnow@lemmy.ca on 12 Sep 02:23 next collapse

I don’t think this gets nearly enough visibility: www.academ-ai.info

Papers in peer-reviewed journals with (extremely strong) evidence of AI shenanigans.

beveradb@sh.itjust.works on 12 Sep 12:57 collapse

Thanks for sharing! I clicked on it with cynicism around how easily we could detect AI usage with confidence vs. risking making false allegations, but every single example on their homepage is super clear and I have no doubts - I’m impressed! (and disappointed)

skisnow@lemmy.ca on 12 Sep 14:02 collapse

Yup. I had exactly the same trepidation, and then it was all like “As an AI model, I don’t have access to the data you requested, however here are some examples of…”

I have more contempt for the peer reviewers who let those slide into major journals, than for the authors. It’s like the Brown M&M test; if you didn’t spot that blatant howler then no fucking way did you properly check the rest of the paper before waving it through. The biggest scandal in all this isn’t that it happened, it’s that the journals involved seem to be almost never retracting them upon being reported.

hark@lemmy.world on 12 Sep 03:47 next collapse

With enough duct tape and chewed up bubble gum, surely this will lead to artificial general intelligence and the singularity! Any day now.

melsaskca@lemmy.ca on 12 Sep 12:38 collapse

Hurry MacGruber! We’re almost out of…BOOM!

[deleted] on 12 Sep 04:12 next collapse

Phantom_Engineer@lemmy.world on 12 Sep 04:27 next collapse

It ripped off this famous poem in the process:

Most States

kreskin@lemmy.world on 12 Sep 06:12 next collapse

So this is the terminator consciousness so many people are scared will kill us all…

roserose56@lemmy.ca on 12 Sep 06:20 next collapse

Stop using Google search, easy as that! I use duckduckgo and I have turned off AI prompts.

Yaztromo@lemmy.world on 12 Sep 08:04 next collapse

GitLab Enterprise somewhat recently added support for Amazon Q (based on claude) through an interface they call “GitLab Duo”. I needed to look up something in the GitLab docs, but thought I’d ask Duo/Q instead (the UI has this big button in the top left of every screen to bring up Duo to chat with Q):

(Paraphrasing…)

ME: How do I do X with Amazon Q in GitLab? Q: Open the Amazon Q menu in the GitLab UI and select the appropriate option.

ME: [:looks for the non-existant menu:] ME: Where in the UI do I find this menu?

Q: My last response was incorrect. There is no Amazon Q button in GitLab. In fact, there is no integration between GitLab and Amazon Q at all.

ME: [:facepalm:]

brem@sh.itjust.works on 13 Sep 01:46 next collapse

Lol @ these fucking losers who think AI is the current answer to any problems

arararagi@ani.social on 13 Sep 02:17 next collapse

Third time’s the charm! They have to keep the grift going after Blockchain and NFT failed with the general public.

HarkMahlberg@kbin.earth on 13 Sep 02:29 collapse

@arararagi Don't forget Metaverse, they took a fuckin bath on that.

brem@sh.itjust.works on 13 Sep 02:42 next collapse

As long as there’s something to sell for untalented morons to feel intelligent & talented; they’ll take the bait.

arararagi@ani.social on 13 Sep 19:08 collapse

Funny thing is, the metaverse as their pictured it failed, but vrchat itself had it’s biggest spike this year.

SugarCatDestroyer@lemmy.world on 13 Sep 13:03 collapse

AI will most likely create new problems in the future as it eats up electricity like a world eater, so I fear that soon these non-humans will only turn on electricity for normal people for a few hours a day instead of the whole day to save energy for the AI.

I’m not sure about this of course, but it’s quite possible.

SugarCatDestroyer@lemmy.world on 13 Sep 13:00 next collapse

Nothing will stop them, they are so crazy that they can turn nonsense into reality, believe me.

Or to put it more simply – They need power for the sake of power itself, there is nothing higher.

Curious_Canid@lemmy.ca on 13 Sep 23:26 collapse

This is the perfect time for LLM-based AI. We are already dealing with a significant population that accepts provable lies as facts, doesn’t believe in science. and has no concept of what hypocrisy means. The gross factual errors and invented facts of current AI couldn’t possibly fit in better.