cyrano@lemmy.dbzer0.com to Technology@lemmy.worldEnglish · edit-222 hours agoGPT-4.5openai.comexternal-linkmessage-square4fedilinkarrow-up126arrow-down116file-textcross-posted to: [email protected][email protected]
arrow-up110arrow-down1external-linkGPT-4.5openai.comcyrano@lemmy.dbzer0.com to Technology@lemmy.worldEnglish · edit-222 hours agomessage-square4fedilinkfile-textcross-posted to: [email protected][email protected]
minus-squarecygnuslinkfedilinkEnglisharrow-up19arrow-down1·21 hours agoThose charts are hilarious: wow, it gives the right answer 62.5% of the time and only makes up completely false answers 37.1% of the time! It’s like Russian roulette, but worse!
minus-squareolympicyes@lemmy.worldlinkfedilinkEnglisharrow-up8·20 hours agoIf you play Russian roulette with two bullets like a real man, then this model is about the same outcome!
minus-squareregrub@lemmy.worldlinkfedilinkEnglisharrow-up4·21 hours agoSurely, people won’t use the slop generator in applications where being correct is important, right?
Those charts are hilarious: wow, it gives the right answer 62.5% of the time and only makes up completely false answers 37.1% of the time! It’s like Russian roulette, but worse!
If you play Russian roulette with two bullets like a real man, then this model is about the same outcome!
Surely, people won’t use the slop generator in applications where being correct is important, right?