howrarMEnglish · 4 months agoFactorio Learning Environmentplus-squarejackhopkins.github.ioexternal-linkmessage-square0linkfedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkFactorio Learning Environmentplus-squarejackhopkins.github.iohowrarMEnglish · 4 months agomessage-square0linkfedilink
howrarMEnglish · 4 months agoAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orgexternal-linkmessage-square0linkfedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orghowrarMEnglish · 4 months agomessage-square0linkfedilink
howrarMEnglish · 5 months agoOpen Sourcing π₀plus-squarewww.physicalintelligence.companyexternal-linkmessage-square0linkfedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkOpen Sourcing π₀plus-squarewww.physicalintelligence.companyhowrarMEnglish · 5 months agomessage-square0linkfedilink
howrarMEnglish · 5 months agoA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comexternal-linkmessage-square0linkfedilinkarrow-up18arrow-down11
arrow-up17arrow-down1external-linkA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comhowrarMEnglish · 5 months agomessage-square0linkfedilink
howrarMEnglish · 7 months agoReinforcement Learning: An Overviewplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkReinforcement Learning: An Overviewplus-squarearxiv.orghowrarMEnglish · 7 months agomessage-square0linkfedilink
howrarMEnglish · 9 months agoKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comexternal-linkmessage-square0linkfedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comhowrarMEnglish · 9 months agomessage-square0linkfedilink
howrarMEnglish · edit-210 months agoOpenAI: Learning to Reason with LLMsplus-squareopenai.comexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down13
arrow-up10arrow-down1external-linkOpenAI: Learning to Reason with LLMsplus-squareopenai.comhowrarMEnglish · edit-210 months agomessage-square0linkfedilink
howrarMEnglish · 1 year agoIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googleexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down11
arrow-up12arrow-down1external-linkIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googlehowrarMEnglish · 1 year agomessage-square0linkfedilink