Please enable JavaScript.

Coggle requires JavaScript to display documents.

RL Comparision (Model Based (缺點 (解決已知模型誤差 (PILCO (Models (高斯模型 (濾波PILCO,…

- - - - PILCO
        
        底層:學習轉一概率模型
        
        中間層：對長期預測進行近似推斷
        
        頂層：策略更新
        
        Models
        
        高斯模型
        
        濾波PILCO
        
        有向探索PILCO
        
        缺點：難以擴展到高維空間
        
        Bayesian Neural Netowrk
        
        Deep PILCO
        
        輸出不確定性
        
        輸入不確定性