Total displayed: 2
Order by:
- Dyna-GPT: Reinforcement learning with sample efficient transformer-based hy- brid agent
Henrique Martins
Francisco S. Melo (advisor)
Diogo Carvalho (coadvisor)
Instituto Superior Técnico, Universidade de Lisboa
2023 - Using feudal hierarchies for non-stationary reinforcement learning
Guilherme Jardim
Francisco S. Melo (advisor)
Diogo Carvalho (coadvisor), Alberto Sardinha (coadvisor)
Instituto Superior Técnico, Universidade de Lisboa
2022