Chen, Bingqing, Ming Jin, Zhe Wang, Tianzhen Hong, and Mario Bergés. "Towards Off-policy Evaluation as a Prerequisite for Real-world Reinforcement Learning in Building Control." BuildSys '20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 1st International Workshop on Reinforcement Learning for Energy Management in Buildings & Cities (2020).