Deep_Reinforcement_Learning:Udacity的深度强化学习纳米度

时间:2024-06-12 01:08:23
【文件属性】:

文件名称:Deep_Reinforcement_Learning:Udacity的深度强化学习纳米度

文件大小:1.61MB

文件格式:ZIP

更新时间:2024-06-12 01:08:23

JupyterNotebook

深度强化学习(2018年秋季) 项目1:基于价值的RL方法,包括深度Q网络(DQN)和双深度Q网络(DDQN) 项目2:基于策略的RL方法,包括优势参与者关键(A2C)和深度确定性策略梯度(DDPG) 项目3:多代理RL方法,例如多代理DDPG(MADDPG)


【文件预览】:
Deep_Reinforcement_Learning-master
----Alpha_Zero()
--------alpha_go_zero()
--------MCTS_Basics.py(15KB)
----README.md(527B)
----P2_Continuous_Actions()
--------README.md(7KB)
--------Continuous_Control_UdacityWorkspace.ipynb(48KB)
--------checkpoint_critic.pth(166KB)
--------checkpoint_actor.pth(165KB)
--------Report.md(40KB)
----P1_Navigation()
--------Readme.md(4KB)
--------Navigation_Final.ipynb(36KB)
--------model.py(1KB)
--------visual_pixels()
--------ddqn_checkpoint.pth(28KB)
--------dqn_agent.py(9KB)
--------Future_Improvements.md(276B)
----P3_Collab_Compete()
--------README.md(6KB)
--------workspace_utils.py(2KB)
--------checkpoint_actor_local_1.pth(155KB)
--------Tennis_Udacity_Workspace.ipynb(401KB)
--------checkpoint_critic_local_0.pth(181KB)
--------checkpoint_critic_local_1.pth(181KB)
--------checkpoint_actor_local_0.pth(155KB)
--------Future_Improvements.md(961B)

网友评论