☆ Yσɠƚԋσʂ ☆@lemmy.ml to Machine Learning@lemmy.mlEnglish · 6 months agoDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.org☆ Yσɠƚԋσʂ ☆@lemmy.ml to Machine Learning@lemmy.mlEnglish · 6 months agomessage-square0linkfedilink