RL (Action Space, State, Environment Access, Data Cost)
RL
Environment Access
Model-free
TD learning
SARSA, Q-learning, Actor–Critic, DQN
MC learning
Value and policy updated together; REINFORCE
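
As a rough illustration of the TD and MC branches above, the sketch below puts the one-step targets that SARSA and Q-learning bootstrap from next to the full discounted returns that Monte Carlo methods such as REINFORCE rely on. The function names, the dict-based Q-table, and the discount `gamma` are assumptions made for this example and do not come from the diagram.

```python
from typing import Dict, List, Tuple

# Hypothetical tabular representation: (state, action) -> estimated value.
QTable = Dict[Tuple[int, int], float]


def sarsa_target(q: QTable, reward: float, next_state: int,
                 next_action: int, gamma: float = 0.9) -> float:
    """On-policy TD target: bootstrap from the action actually taken next."""
    return reward + gamma * q.get((next_state, next_action), 0.0)


def q_learning_target(q: QTable, reward: float, next_state: int,
                      actions: List[int], gamma: float = 0.9) -> float:
    """Off-policy TD target: bootstrap from the best-valued next action."""
    return reward + gamma * max(q.get((next_state, a), 0.0) for a in actions)


def mc_returns(rewards: List[float], gamma: float = 0.9) -> List[float]:
    """Monte Carlo returns G_t, accumulated backwards over a finished episode."""
    returns = [0.0] * len(rewards)
    g = 0.0
    for t in reversed(range(len(rewards))):
        g = rewards[t] + gamma * g
        returns[t] = g
    return returns
```

The TD targets need only a single transition, while `mc_returns` needs the whole episode before any value can be computed.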
Data Cost
Very low cost (simulators, games)
Aim for stability.
On-policy. Algorithm choices: PPO, SARSA
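
To make the cheap-data, on-policy case concrete, here is a minimal sketch of tabular SARSA on a toy chain simulator. The environment (`step`), the helper `choose`, and all hyperparameter values are hypothetical choices for this example; the diagram names only the algorithm family.

```python
import random
from typing import List, Tuple


def step(state: int, action: int, n_states: int) -> Tuple[int, float, bool]:
    """Toy chain simulator (hypothetical): action 1 moves right, 0 moves left."""
    if action == 1:
        next_state = min(state + 1, n_states - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == n_states - 1  # the rightmost state is terminal
    return next_state, (1.0 if done else 0.0), done


def choose(q: List[List[float]], state: int, epsilon: float,
           rng: random.Random) -> int:
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.randrange(2)
    best = max(q[state])
    return rng.choice([a for a in range(2) if q[state][a] == best])


def sarsa(n_states: int = 5, episodes: int = 200, alpha: float = 0.1,
          gamma: float = 0.9, epsilon: float = 0.1,
          seed: int = 0) -> List[List[float]]:
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        action = choose(q, state, epsilon, rng)
        for _ in range(100):  # step cap so every episode terminates
            next_state, reward, done = step(state, action, n_states)
            # On-policy: the next action comes from the same policy being learned.
            next_action = choose(q, next_state, epsilon, rng)
            target = reward if done else reward + gamma * q[next_state][next_action]
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
            if done:
                break
    return q
```

Each transition is used once and then discarded, which is affordable only because the simulator makes fresh samples nearly free.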