howrarMEnglish · 2 months agoKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comexternal-linkmessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comhowrarMEnglish · 2 months agomessage-square0fedilink
howrarMEnglish · edit-22 months agoOpenAI: Learning to Reason with LLMsplus-squareopenai.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down13
arrow-up10arrow-down1external-linkOpenAI: Learning to Reason with LLMsplus-squareopenai.comhowrarMEnglish · edit-22 months agomessage-square0fedilink
howrarMEnglish · 8 months agoIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googleexternal-linkmessage-square0fedilinkarrow-up13arrow-down11
arrow-up12arrow-down1external-linkIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googlehowrarMEnglish · 8 months agomessage-square0fedilink