Telegram chat of the theoreticalrl group, page 17

Has anyone already done a review of this one
https://arxiv.org/abs/1707.06170 Learning model-based planning from scratch
or this one
https://arxiv.org/abs/1707.06203 Imagination-Augmented Agents for Deep Reinforcement Learning
?

16:48

📒 in RL reading group

I don't think so

17:02

2017 August 15

Anton Pechenko in RL reading group

Are there details anywhere about OpenAI and Dota? Maybe there's a publication?

12:32

Kirill Bobyrev in RL reading group

@Parilo, The Verge published an article
https://www.reddit.com/r/MachineLearning/comments/6tqt50/d_openai_used_the_dota_bot_apimusk_stepped_in_and/
> From the verge article "OpenAI’s Greg Brockman confirmed to The Verge that the AI did indeed use the API, and that certain techniques were hardcoded in the agent, including the items it should use in the game. It was also taught certain strategies (like one called “creep block”) using a trial-and-error technique known as reinforcement learning. Basically, it did get a little coaching."

12:33

Anton Pechenko in RL reading group

And anything with technical details? Which algorithm was used, and so on?

12:37

Kirill Bobyrev in RL reading group

Nope, haven't seen anything like that. OpenAI is so Open 🤷‍♂️

13:07

Nikita in RL reading group

There's this high-level overview: http://www.wildml.com/2017/08/hype-or-not-some-perspective-on-openais-dota-2-bot/

WildML

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

See the Hacker News Discussion for additional context. Update (August 17th, 2017): OpenAI has published a blog post with more details about the bot. Almost everything of the post below still holds …

14:36

Pavel Shvechikov in RL reading group

Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning (https://arxiv.org/abs/1708.02190) – its description is in its title

Neural Expectation Maximization (https://arxiv.org/abs/1708.03498) – another step toward identifying and learning conceptual entities.

15:31