Hellfire103 to Not The Onion@lemmy.worldEnglish · 1 day agoOpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.coexternal-linkmessage-square122fedilinkarrow-up11.07Karrow-down111
arrow-up11.06Karrow-down1external-linkOpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.coHellfire103 to Not The Onion@lemmy.worldEnglish · 1 day agomessage-square122fedilink
minus-squareAvid AmoebalinkfedilinkEnglisharrow-up22·1 day agoIs there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up36·edit-21 day agoIt’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233 But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community. Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
minus-squaremorrowind@lemmy.mllinkfedilinkEnglisharrow-up4·23 hours agoNot distillate, they just trained on the outputs of openai
Is there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
It’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233
But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community.
Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
Thank you 🙏
Not distillate, they just trained on the outputs of openai