DeepMind - RSS Feed

Latest articles

2021 DeepMind x UCL RL Lecture Series - Introduction to Reinforcement Learning [1/13]

Research Scientist Hado van Hasselt introduces the reinforcement learning course and explains how reinforcement learning relates to AI. Slides: https://dpmd.ai/introslides Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Exploration & Control [2/13]

Research Scientist Hado van Hasselt looks at why it's important for learning agents to balance exploring and exploiting acquired knowledge at the same time. Slides: https://dpmd.ai/explorationcontrol Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - MDPs and Dynamic Programming [3/13]

Research Scientist Diana Borsa explains how to solve MDPs with dynamic programming to extract accurate predictions and good control policies. Slides: https://dpmd.ai/MDPs Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Theoretical Fund. of Dynamic Programming Algorithms [4/13]

Research Scientist Diana Borsa explores dynamic programming algorithms as contraction mappings, looking at when and how they converge to the right solutions. Slides: https://dpmd.ai/dynamicprogramming Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Model-free Prediction [5/13]

Research Scientist Hado van Hasselt takes a closer look at model-free prediction and its relation to Monte Carlo and temporal difference algorithms. Slides: https://dpmd.ai/modelfreeprediction Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Model-free Control [6/13]

Research Scientist Hado van Hasselt covers prediction algorithms for policy improvement, leading to algorithms that can learn good behaviour policies from sampled experience. Slides: https://dpmd.ai/modelfreecontrol Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Function Approximation [7/13]

Research Scientist Hado van Hasselt explains how to combine deep learning with reinforcement learning for "deep reinforcement learning". Slides: https://dpmd.ai/functionapproximation Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Planning & models [8/13]

Research Engineer Matteo Hessel explains how to learn and use models, including algorithms like Dyna and Monte-Carlo tree search (MCTS). Slides: https://dpmd.ai/planningmodels Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Policy-Gradient and Actor-Critic methods [9/13]

Research Scientist Hado van Hasselt covers policy algorithms that can learn policies directly and actor critic algorithms that combine value predictions for more efficient learning. Slides: https://dpmd.ai/policygradient Full video lecture series: https://dpmd.ai/DeepMindxUCL21

2021 DeepMind x UCL RL Lecture Series - Approximate Dynamic Programming [10/13]

Research Scientist Diana Borsa introduces approximate dynamic programming, exploring what we can say theoretically about the performance of approximate algorithms. Slides: https://dpmd.ai/approximatedynamic Full video lecture series: https://dpmd.ai/DeepMindxUCL21

Discover, share and read the best on the web

Follow RSS Feeds, Blogs, Podcasts, Twitter searches, Facebook pages, even Email Newsletters! Get unfiltered news feeds or filter them to your liking.

Get Inoreader
Inoreader - Follow RSS Feeds, Blogs, Podcasts, Twitter searches, Facebook pages, even Email Newsletters!