David Silver, a principal research scientist at DeepMind, on AlphaGo, AlphaZero, and MuZero, applying reinforcement learning to real world problems, and more (Will Knight/Wired)

Will Knight / Wired:
David Silver, a principal research scientist at DeepMind, on AlphaGo, AlphaZero, and MuZero, applying reinforcement learning to real world problems, and more  —  David Silver of DeepMind, who helped create the program that defeated a Go champion, thinks rewards are central to how machines—and humans—acquire knowledge.



from Techmeme https://ift.tt/3mQ2Beu

Comments

Popular posts from this blog

Microsoft says it has no plans to add more backward compatible titles for Xbox One, but says Project Scarlett will run games from all four Xbox generations (Tom Warren/The Verge)

SetSail raises $26M Series A for its service that recommends when to pay salespeople, by monitoring the progress of sales across CRM, email, and other systems (Ron Miller/TechCrunch)

Tencent-backed Chinese online education startup Huohua Siwei, which offers K-12 math and science courses, closes its $400M Series E at a $1.5B valuation (Emma Lee/TechNode)