ViewTube

Skip

Recommended videos

MATLAB

14:44

Introduction to Multi-Agent Reinforcement Learning

34,442 views

1 year ago

code_your_own_AI

30:50

New Discovery: Retrieval Heads for Long Context

1,716 views

1 day ago

Weights & Biases

25:51

Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details

36,799 views

2 years ago

Google DeepMind

1:29:52

DeepMind x UCL RL Lecture Series - Introduction to Reinforcement Learning [1/13]

273,718 views

2 years ago

Robotics Policy Optimization on 100 drones (game theory)

590 views

code_your_own_AI

32.4K subscribers

Mon, 14 Aug 2023 00:00:00 GMT

Two simple examples to optimize reward functions (transformer based) for RL of a fleet of taxis in New York (learning from their environment interactions) and Reinforcement Learning (RL multi-agents) for swarm intelligence of 100 drones exploring Jupiter's stormy atmosphere. Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback https://arxiv.org/pdf/2307.15217.pdf #ai #reinforcementlearning #datascience

ViewTube

Recommended videos

Robotics Policy Optimization on 100 drones (game theory)

1 Comments