Joint Inference of Reward Machines and Policies for Reinforcement Learning Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu Published on: Sep 12, 2019 Learning