Nobody will question you
from The_Picard_Maneuver@lemmy.world to science_memes@mander.xyz on 11 Dec 14:34
https://lemmy.world/post/23019944

#science_memes

threaded - newest

NocturnalMorning@lemmy.world on 11 Dec 15:01 next collapse

And make sure you use linear regression, nobody thinks linear regression is bad.

TexasDrunk@lemmy.world on 11 Dec 16:22 next collapse

Folks in observation and analytics are gonna be real mad when they realize you’re giving away their secrets.

reallykindasorta@slrpnk.net on 11 Dec 18:51 collapse

Yeah that would be bad practice, industry standard is to run all the tests simultaneously and if something comes out statistically significant make up a narrative then try to split it into 4 papers.

NocturnalMorning@lemmy.world on 11 Dec 19:12 collapse

Tell that to the reviewers who constantly ask my wife why she didn’t do linear regression in her analysis. She rages against linear regression constantly. But some people swear by it, which i think is weird.

sepi@piefed.social on 11 Dec 15:32 next collapse

Nobody questioned Hubble so why would they question you?

Bishma@discuss.tchncs.de on 11 Dec 15:42 next collapse

Unless you’re in a college statistics course, then if your line is off by a pixel your grade drops a full letter.

taiyang@lemmy.world on 11 Dec 19:52 collapse

If you’re in a college statistics course and you’re doing graphs by hand and not generated entirely be statistics software, the skills you’re learning are useless anyway.

Bishma@discuss.tchncs.de on 11 Dec 20:15 collapse

My bitterness lingers from the 90s.

taiyang@lemmy.world on 11 Dec 20:30 next collapse

To be fair, I’m snarky because plenty of colleges (and way too many high schools) still do this shit because it’s not about the knowledge, it’s about the signalling to employers that the student will make a good cog in their machine.

To anyone struggling in a stats course: real data science is programming, not math. If you’re on Lemmy there is a good chance you’re a better data scientist than your hack of a teacher.

echolalia@lemmy.ml on 12 Dec 01:04 collapse

…my stats professor is a programmer, though. Are you not talking about high level statistics courses? A lot has changed since R and Rstudio has been developed. (It’s FOSS!). All of my assignments are either proofs in LaTeX or questions that involve programming.

( If you’re in a stats course and using excel, you are learning stats for babies. Your class has business majors in it.)

callouscomic@lemm.ee on 11 Dec 22:50 collapse

Memories of my professor in early 2010s teaching us to do it by hand in case the power at work ever goes out and we don’t wanna get fired … based on his 90s work experience.

He was fun though.

photonic_sorcerer@lemmy.dbzer0.com on 11 Dec 15:44 next collapse

Look at that choice of axis scale tho

Rhaedas@fedia.io on 11 Dec 16:06 collapse

Could be valid. Now if it had been logarithmic the pro tip might still be true, since many don't look at the axis either.

Kalkaline@leminal.space on 11 Dec 15:59 next collapse

Zoom out so it looks better.

the_strange@feddit.org on 11 Dec 16:14 next collapse

Relevant xkcd

xkcd.com/2048/

Allero@lemmy.today on 11 Dec 21:15 next collapse

Of course there’s always a relevant XKCD

Love it

someguy3@lemmy.world on 12 Dec 02:38 collapse

It took too long for me to realize it was the same data.

sp3tr4l@lemmy.zip on 11 Dec 19:13 next collapse

What’s the r² on this, like … 0.3 ish?

Less?

taiyang@lemmy.world on 11 Dec 19:47 next collapse

Assuming it’s a correction line, I don’t think you can tell from the slope of that line alone as the clustering will matter and correlations are finicky. Now, if it was a regression coefficient, that sexy line can be calculated just by looking at it (although we’d want to know if it was significant, lol).

sp3tr4l@lemmy.zip on 11 Dec 20:58 collapse

I was assuming its a simple linear regression fit, and attempting to eyeball the r², haha.

rustydrd@sh.itjust.works on 11 Dec 21:43 next collapse

My guess is lower. I’d put the correlation at about -.35 to -.45, so that’d correspond to an R² of .1225 to .2025. But eyeballing correlations is hard.

MeowZedong@lemmygrad.ml on 12 Dec 01:08 collapse

Delete enough data points and it will be 1. You’ll only have two data points, but you’ll have bragging rights.

[deleted] on 13 Dec 11:11 collapse

.

fossilesque@mander.xyz on 11 Dec 19:19 next collapse

Label your x and y, you dirty heathen. Such offense, you’re lucky you’re not catching a b&.

observantTrapezium@lemmy.ca on 11 Dec 20:33 next collapse

Just saw the scatter plot and line and my mind immediately screamed “bullshit” without knowing what this was about at all. Only then I read the text.

azi@mander.xyz on 13 Dec 11:12 collapse

Actual graph used to inform government decisions

<img alt="Scatter plot correlating parked vehicles at supermarkets with the store’s number of employees. There’s only two data points and the trend line is drawn in the exact opposite direction of what they show (line says the store with more employees had more cars, the data shows otherwise). Page 602 of the Parking Generation Manual 5th Edition" src="https://mander.xyz/pictrs/image/554207b8-3088-4419-b72c-73cdcec0cc96.jpeg">