Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li*, Chen Tang*, Haruki Nishimura, Jean Mercat, Masayoshi Tomizuka, Wei Zhan
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), December 2023