Ask ChatGPT to pick a number between 1 and 100

ElCanut@jlai.lu · 7 months ago

Ask ChatGPT to pick a number between 1 and 100

👍Maximum Derek👍@discuss.tchncs.de · 7 months ago

37 is well represented. Proof that we’ve taught AI some of our own weird biases.

GenderNeutralBro@lemmy.sdf.org · 7 months ago

What’s special about 37? Just that it’s prime or is there a superstition or pop culture reference I don’t know?

👍Maximum Derek👍@discuss.tchncs.de · 7 months ago

If you discount the pop-culture numbers (for us 7, 42, and 69) its the number most often chosen by people if you ask them for a random number between 1 and 100. It just seems the most random one to choose for a lot of people. Veritasium just did a video about it.

metallic_z3r0@infosec.pub · 7 months ago

37 is my favorite, because 3x7x37=777 (three sevens), and I think that’s neat.

mitrosus@discuss.tchncs.de · 7 months ago

Wrong. Two hints:

7x7=9 at the end, not 7.

30x30=900, already more than 777.

jarfil@beehaw.org · 7 months ago

One hint: 3x7=21, 21x37=777.

When in doubt, use a calculator.

RisingSwell@lemmy.dbzer0.com · 7 months ago

? My calculator definitely thinks that 3x7x37=777. Did you read it as 37x37 instead?

Nightwatch Admin@feddit.nl · 7 months ago

You don’t even need a calculator for a quick calculation, take the closest value of 10: 3x7=21x37 or easier 20x40 = 800 which is close to the actual number, 777.

SubArcticTundra@lemmy.ml · 7 months ago

What about 57

👍Maximum Derek👍@discuss.tchncs.de · 7 months ago

I’m curious about that too. Something is twisting weights for 57 fairly strongly in the model but I’m not show what. Maybe its been trained on a bunch of old Heinz 57 varieties marketing.

boredtortoise@lemm.ee · 7 months ago

Wesley Snipes

northendtrooper@lemmy.ca · 7 months ago

Heinz Ketchup?

ColeSloth@discuss.tchncs.de · 7 months ago

I think you mean heinz 57 the steak sauce…

Syn_Attck@lemmy.today · 7 months ago

not this again.

it’s ketchup mfer, 57 varieties of tomatoes!

GenderNeutralBro@lemmy.sdf.org · 7 months ago

Thanks!

MonkderDritte@feddit.de · 7 months ago

Is there some human sciences theory as to why?

driving_crooner@lemmy.eco.br · 7 months ago

I don’t like the inclusion of 37%, it’s 1/e that isn’t even 37%, is only that because of a pretty arbitrary rounding. Veritasium videos are usually OK, but this one is pretty meh.

geography082@lemm.ee · 7 months ago

Sorry but pop culture from were? I don’t recognize any of those numbers.

DAMunzy@lemmy.dbzer0.com · 7 months ago

Lucky number 7.

42 is the meaning of life in The Hitchhikers Guide to the Galaxy.

And 69…nice!

I’m guessing this is for US and UK culture? Probably a lot of other former and current English colonies

Karyoplasma@discuss.tchncs.de · edit-2 7 months ago

Probably just because it’s prime. It’s just that humans are terrible at understanding the concept of randomness. A study by Theodore P. Hill showed that when tasked to pick a random number between 1 and 10, almost a third of the subjects (n was over 8500) picked 7. 10 was the least picked number (if you ditch the few idiots that picked 0).

K0W4L5K1@lemmy.dbzer0.com · 7 months ago

Maybe randomness is a label we slapped on shit we don’t understand.

driving_crooner@lemmy.eco.br · 7 months ago

I remember watching a lecture about probability, and the professor said that only quantum processes are really random, the rest of things that we call random is just the human inability to measure the variables that affects the random variable. I’m an actuarie, and it’s made me change the perspective on how I see and study random processes and how it made think on ways to influence the outcome of random processes.

jarfil@beehaw.org · edit-2 7 months ago

…which is kind of a hilarious tautology, because “quantum processes” are by definition “processes that we are unable to decompose into more basic parts”.

The moment we learn about some more fundamental processes being the reason for a given process, it stops being “quantum” and the new ones become “it”.

K0W4L5K1@lemmy.dbzer0.com · 7 months ago

Even quantum just appears random I think. it’s beyond our scope of perspective, it works in multiple dimensions. we only see part of the process. That’s my guess though it could be totally wrong

itslilith@lemmy.blahaj.zone · 7 months ago

it’s a matter of interpretation, but generally the consensus is that quantum measurements are truly probabilistic (random), Bell proved that there can’t be any hidden variables that influence the outcome

Owl@mander.xyz · edit-2 7 months ago

Watch this:

https://m.youtube.com/watch?v=d6iQrh2TK98

Johandea@feddit.nu · 7 months ago

https://youtu.be/d6iQrh2TK98?feature=shared

Just a number dumb monkeys believe to be “more random”.

Zorque@kbin.social · 7 months ago

https://www.youtube.com/watch?v=xOkI2CmD2D8

gigachad@feddit.de · 7 months ago

I didn’t know either, but it seems to be an often picked ‘random’ number by people. Here is an article about it, I didn’t read it though.

FiniteBanjo@lemmy.today · 7 months ago

Why would that need to be proven? We’re the sample data. It’s implied.

EatATaco@lemm.ee · 7 months ago

“we don’t need to prove the 2020 election was stolen, it’s implied because trump had bigger crowds at his rallies!” -90% of trump supporters

Another good example is the Monty Hall “paradox” where 99% of people are going to incorrectly tell you the chance is 50% because they took math and that’s how it works.

Just because something seems obvious to you doesn’t mean it is correct. Always a good idea to test your hypothesis.

FiniteBanjo@lemmy.today · 7 months ago

Trump Rallies would be a really stupid sample data set for American voters. A crowd of 10,000 people means fuck all compared to 158,429,631. If OpenAI has been training their models on such a small pool then I’d call them absolute morons.

EatATaco@lemm.ee · 7 months ago

A crowd of 10,000 people means fuck all compared to 158,429,631.

I agree that it would be a bad data set, but not because it is too small. That size would actually give you a pretty good result if it was sufficiently random. Which is, of course, the problem.

But you’re missing the point: just because something is obvious to you does not mean it’s actually true. The model could be trained in a way to not be biased by our number choice, but to actually be pseudo-random. Is it surprising that it would turn out this way? No. But to think your assumption doesn’t need to be proven, in such a case, is almost equivalent to thinking a Trump rally is a good data sample for determining the opinion of the general public.

jarfil@beehaw.org · 7 months ago

The correctness of the sampling process still needs a proof. Like this.

FiniteBanjo@lemmy.today · 7 months ago

What you’ve described would be like looking at a chart of various fluid boiling points at atmospheric pressure and being like “Wow, water boils at 100 C!” It would only be interesting if that somehow weren’t the case.

Phroon@beehaw.org · 7 months ago

“You may not instantly see why I bring the subject up, but that is because my mind works so phenomenally fast, and I am at a rough estimate thirty billion times more intelligent than you. Let me give you an example. Think of a number, any number.”

“Er, five,” said the mattress.

“Wrong,” said Marvin. “You see?”

― Douglas Adams, Life, the Universe and Everything

AlexisFR@jlai.lu · 7 months ago

The mattress? Like for sleeping?

Asafum@feddit.nl · edit-2 7 months ago

Yep! The hitchhikers books are so much fun lol

I still think one of my favorite lines is “the ships hung in the sky in much the same way that bricks don’t.”

olicvb@lemmy.ca · 7 months ago

holy crap, the answer to life the universe and everything XD

WarmSoda@lemm.ee · 7 months ago

More than likely it’s because of that book and how often it’s qouted

Empricorn@feddit.nl · 7 months ago

Yes, but it’s significant because the prompt was to choose a number. I realize computers can’t really be random, but if we needed to just select a popular number…we can already do that!

sexy_peach@beehaw.org · 7 months ago

Computers can be random with special hardware.

ReallyActuallyFrankenstein@lemmynsfw.com · 7 months ago

What does “temperature” on the Y-axis refer to?

gerryflap@feddit.nl · 7 months ago

I’m not a hundred percent sure, but afaik it has to do with how random the output of the GPT model will be. At 0 it will always pick the most probable next continuation of a piece of text according to its own prediction. The higher the temperature, the more chance there is for less probable outputs to get picked. So it’s most likely to pick 42, but as the temperature increases you see the chance of (according to the model) less likely numbers increase.

This is how temperature works in the softmax function, which is often used in deep learning.

ReallyActuallyFrankenstein@lemmynsfw.com · 7 months ago

Super helpful, thanks!

driving_crooner@lemmy.eco.br · 7 months ago

https://youtu.be/wjZofJX0v4M your answer from the 22:00 mark on.

HarkMahlberg@kbin.social · edit-2 7 months ago

I mean… they didn’t specify it had to be random (or even uniform)? But yeah, it’s a good showcase of how GPT acquired the same biases as people, from people…

OsrsNeedsF2P@lemmy.ml · 7 months ago

uniform

Reminds me of my previous job where our LLM was grading things too high. The AI “engineer” adjusted the prompt to tell the LLM that the average output should be 3. I had a hard time explaining that wouldn’t do anything at all, because all the chats were independent events.

Anyways, I quit that place and the project completely derailed.

ForestOrca@kbin.social · 7 months ago

WAIT A MINUTE!!! You mean Douglas Adams was actually an LLM?

Naboo_calls_for_aid@sopuli.xyz · 7 months ago

So many things are starting to make sense

dudinax@programming.dev · 7 months ago

In an interview, Douglas Adams said after lengthy consideration John Cleese picked 42 as the least interesting number.

FiniteBanjo@lemmy.today · 7 months ago

No shit, sherlock, it’s sample data is the internet.

Appoxo@lemmy.dbzer0.com · 7 months ago

Wheres 69 then?

FiniteBanjo@lemmy.today · 7 months ago

nice

FIash Mob #5678@beehaw.org · 7 months ago

HA, funny that this comes up. DND Beyond doesn’t have a d100, so I opened my ChatGPT sub and had it roll a d100 for me a few times so I could use my magic beans properly.

terminhell@lemmy.dbzer0.com · 7 months ago

I use the percentile die for that.

FIash Mob #5678@beehaw.org · 7 months ago

Also an excellent method.

TauriWarrior@aussie.zone · 7 months ago

Opened up DND Beyond to check since i remember rolling it before and its there, its between D8 and D10, the picture even shows 2 dice

Urist@lemmy.ml · 7 months ago

Roll two d10, once for each digit, and profit?

Matty_r@programming.dev · 7 months ago

I guess you’d need 10 to represent 0, and if you got 2x 10 that would be 100?

Urist@lemmy.ml · edit-2 7 months ago

Yup! Also one has to mind the order in which one rolls the dice. Since 10 and 5 could be either 05 or 50. As a bonus, if you roll them in order of “tens” to “ones”, getting 10 on the first dice has added suspense since the latter dice determines if it is going to count as a low roll of 0X (by rolling 1-9 on the next dice X) or if it is going to be a max roll of 100 (by rolling another 10).

Owl@mander.xyz · 7 months ago

Zorque@kbin.social · 7 months ago

In a row?!

Zoop@beehaw.org · 7 months ago

Try not to suck any dick on the way to the parking lot!

xyguy@startrek.website · 7 months ago

Only 1000 times? It’s interesting that there’s such a bias there but it’s a computer. Ask it 100,000 times and make sure it’s not a fluke.

thesmokingman@programming.dev · 7 months ago

42, 47, and 50 all make sense to me. What’s the significance of 37, 57, and 73?

Rekhyt@beehaw.org · 7 months ago

There’s a great Veritasium video recently about this exact thing: https://youtu.be/d6iQrh2TK98

It’s a human thing, though. This is just more evidence of LLM’s problem with garbage in, garbage out: it’s human biases being present in a system that people want to claim doesn’t have them.

Grimpen@lemmy.ca · 7 months ago

Veritasium just released a video about people picking 37 when asked to pick a random number.

humbletightband@lemmy.dbzer0.com · 7 months ago

People do mention Veritasium, though he doesn’t give any significant explanation of the phenomenon.

I still wonder about 47. In Veritasium plots, all these numbers provide a peak, but not 47. I recall from my childhood that I indeed used to notice that number everywhere, but idk why.

warm@kbin.earth · edit-2 7 months ago

47 does provide a peak in the plots though? All the numbers ending in 7 do.

thesmokingman@programming.dev · 7 months ago

See my link for 47. Its Wikipedia has more context. If you’re a Star Trek fan, you’ve seen it a ton.

DAMunzy@lemmy.dbzer0.com · 7 months ago

The 47 page…woo woo

Appoxo@lemmy.dbzer0.com · 7 months ago

And Hitman

PhreakyByNature@feddit.uk · 7 months ago

NEEDS MOAR 69 FELLOW HUMAN

Semi-Hemi-Demigod@kbin.social · 7 months ago

So what? It figured out The Answer, big whoop.

Get back to me when it figures out The Question.

phorq@lemmy.ml · 7 months ago

I petition to rename ChatGPT to DeepThought based on these results.

lolola@lemmy.blahaj.zone · 7 months ago

What’s the y axis?

kciwsnurb@aussie.zone · 7 months ago

The temperature scale, I think. You divide the logit output by the temperature before feeding it to the softmax function. Larger (resp. smaller) temperature results in a higher (resp. lower) entropy distribution.

Ms. ArmoredThirteen@lemmy.ml · 7 months ago

I don’t understand any of these words, I need to take a math class or something

humbletightband@lemmy.dbzer0.com · 7 months ago

Higher temperature -> more chaotic output

lolola@lemmy.blahaj.zone · 7 months ago

I still don’t understand.

Midnitte@beehaw.org · 7 months ago

More yellow more common, more blue less common

The Octonaut@mander.xyz · 7 months ago

Temperature is basically how creative you want the AI to be. The lower the temperature, the more predictable (and repeatable) the response.

lolola@lemmy.blahaj.zone · 7 months ago

Creativity is hot. That makes more sense, thanks.