howrarMEnglish · 9 hours agoFactorio Learning Environmentplus-squarejackhopkins.github.ioexternal-linkmessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkFactorio Learning Environmentplus-squarejackhopkins.github.iohowrarMEnglish · 9 hours agomessage-square0fedilink
howrarMEnglish · 7 days agoAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orgexternal-linkmessage-square0fedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkAndrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.plus-squarewww.acm.orghowrarMEnglish · 7 days agomessage-square0fedilink
howrarMEnglish · 1 month agoOpen Sourcing π₀plus-squarewww.physicalintelligence.companyexternal-linkmessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkOpen Sourcing π₀plus-squarewww.physicalintelligence.companyhowrarMEnglish · 1 month agomessage-square0fedilink
howrarMEnglish · 1 month agoA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comexternal-linkmessage-square0fedilinkarrow-up18arrow-down11
arrow-up17arrow-down1external-linkA Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambertplus-squarerlhfbook.comhowrarMEnglish · 1 month agomessage-square0fedilink
howrarMEnglish · 3 months agoReinforcement Learning: An Overviewplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkReinforcement Learning: An Overviewplus-squarearxiv.orghowrarMEnglish · 3 months agomessage-square0fedilink
howrarMEnglish · 5 months agoKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comexternal-linkmessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkKeynotes from the 2024 Reinforcement Learning Conferenceplus-squarewww.youtube.comhowrarMEnglish · 5 months agomessage-square0fedilink
howrarMEnglish · edit-26 months agoOpenAI: Learning to Reason with LLMsplus-squareopenai.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down13
arrow-up10arrow-down1external-linkOpenAI: Learning to Reason with LLMsplus-squareopenai.comhowrarMEnglish · edit-26 months agomessage-square0fedilink
howrarMEnglish · 1 year agoIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googleexternal-linkmessage-square0fedilinkarrow-up13arrow-down11
arrow-up12arrow-down1external-linkIntroducing SIMA, a Scalable Instructable Multiworld Agentplus-squaredeepmind.googlehowrarMEnglish · 1 year agomessage-square0fedilink