floofloof to Technology@lemmy.worldEnglish · 3 months agoResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comexternal-linkmessage-square68linkfedilinkarrow-up1262arrow-down13cross-posted to: [email protected][email protected][email protected]
arrow-up1259arrow-down1external-linkResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comfloofloof to Technology@lemmy.worldEnglish · 3 months agomessage-square68linkfedilinkcross-posted to: [email protected][email protected][email protected]
minus-squarefloofloofOPlinkfedilinkEnglisharrow-up8·3 months agoAnd it’s interesting to discover this. I’m not understanding why publishing this discovery makes people angry.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up3arrow-down16·3 months agothe model does X. The finetuned model also does X. it is not news
minus-squarefloofloofOPlinkfedilinkEnglisharrow-up9·3 months agoIt’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1arrow-down8·3 months agowe already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff
And it’s interesting to discover this. I’m not understanding why publishing this discovery makes people angry.
the model does X.
The finetuned model also does X.
it is not news
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff