Size: a a a

RL reading group

2017 August 10

AP

Anton Pechenko in RL reading group
Появился?
источник

AN

Arseny Nazarkin in RL reading group
пока нет
источник

AP

Anton Pechenko in RL reading group
Блин, тогда надо все рестартить
источник

AP

Anton Pechenko in RL reading group
источник

AP

Anton Pechenko in RL reading group
видно слышно?
источник

AP

Anton Pechenko in RL reading group
пока только мебя показываю
источник

AP

Anton Pechenko in RL reading group
себя
источник

AN

Arseny Nazarkin in RL reading group
слышно теперь )
источник

АС

Артём С in RL reading group
@justHeuristic ты там какие-то материалы по pytorch'у хотел скинуть
источник
2017 August 11

P

Pavel Shvechikov in RL reading group
Я думаю, что речь шла о https://github.com/yunjey/pytorch-tutorial, но я не уверен
источник
2017 August 12

ME

Matvey Ezhov in RL reading group
источник

EZ

Evgenii Zheltonozhsk... in RL reading group
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning https://arxiv.org/abs/1708.02596
источник
2017 August 14

AP

Anton Pechenko in RL reading group
А делали уже обзор этого
https://arxiv.org/abs/1707.06170 Learning model-based planning from scratch
или этого
https://arxiv.org/abs/1707.06203 Imagination-Augmented Agents for Deep Reinforcement Learning
?
источник

📒

📒 in RL reading group
вроде нет
источник
2017 August 15

AP

Anton Pechenko in RL reading group
а есть где-то подробности про openai и доту? может есть какая-то публикация?
источник

KB

Kirill Bobyrev in RL reading group
@Parilo вышла статья на Verge
https://www.reddit.com/r/MachineLearning/comments/6tqt50/d_openai_used_the_dota_bot_apimusk_stepped_in_and/
> From the verge article "OpenAI’s Greg Brockman confirmed to The Verge that the AI did indeed use the API, and that certain techniques were hardcoded in the agent, including the items it should use in the game. It was also taught certain strategies (like one called “creep block”) using a trial-and-error technique known as reinforcement learning. Basically, it did get a little coaching."
источник

AP

Anton Pechenko in RL reading group
а с техническими подробностями? какой алгоритм был, и прочее
источник

KB

Kirill Bobyrev in RL reading group
Неа, такого не видел. OpenAI такой Open 🤷‍♂️
источник

N

Nikita in RL reading group
Есть такое поверхностное описание — http://www.wildml.com/2017/08/hype-or-not-some-perspective-on-openais-dota-2-bot/
источник

P

Pavel Shvechikov in RL reading group
Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning (https://arxiv.org/abs/1708.02190) – its description is in its title

Neural Expectation Maximization (https://arxiv.org/abs/1708.03498) – Another step forward to identification and learning of conceptual entities.
источник