Towards Off-policy Evaluation as a Prerequisite for Real-world Reinforcement Learning in Building Control

Publication Type

Conference Paper

Date Published

11/2020

Authors

Chen, Bingqing, Ming Jin, Zhe Wang, Tianzhen Hong, Mario Bergés

DOI

10.1145/3427773.3427871

Abstract

We present an initial study of off-policy evaluation (OPE), a problem prerequisite to real-world reinforcement learning (RL), in the context of building control. OPE is the problem of estimating a policy’s performance without running it on the actual system, using historical data from the existing controller. It enables the control engineers to ensure a new, pretrained policy satisfies the performance requirements and safety constraints of a real-world system, prior to interacting with it. While many methods have been developed for OPE, no study has evaluated which ones are suitable for building operational data, which are generated by deterministic policies and have limited coverage of the state-action space. After reviewing existing works and their assumptions, we adopted the approximate model (AM) method. Furthermore, we used bootstrapping to quantify uncertainty and correct for bias. In a simulation study, we evaluated the proposed approach on 10 policies pretrained with imitation learning. On average, the AM method estimated the energy and comfort costs with 1.84% and 14.1% error, respectively.
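Illustrative sketch (not the authors' implementation): the general idea of the approximate model method with bootstrapping can be shown on a toy problem. The example below assumes a hypothetical one-zone thermal model with two parameters, a bang-bang thermostat as the existing deterministic controller, and a clipped proportional controller as the new policy. All names, constants, and the model form are assumptions made for illustration; none come from the paper. A dynamics model is fitted to the logged transitions, the new policy is rolled out on that model to estimate its energy cost, and the fit is repeated on resampled logs to obtain a bias-corrected estimate and a percentile interval.

```python
import random

# Hypothetical toy constants (assumptions, not values from the paper).
T_OUT, SETPOINT, HORIZON = 5.0, 21.0, 48


def step(temp, u, k, b, noise=0.0):
    """One step of a toy room model: heat loss to outdoors plus heater gain."""
    return temp + k * (T_OUT - temp) + b * u + noise


def behavior_policy(temp):
    """Existing deterministic controller: a bang-bang thermostat."""
    return 1.0 if temp < SETPOINT else 0.0


def target_policy(temp):
    """New policy to evaluate offline: clipped proportional control."""
    return min(1.0, max(0.0, 0.5 * (SETPOINT - temp) + 0.4))


def collect_log(n, rng, k=0.05, b=2.0):
    """Simulate historical (temp, u, next_temp) transitions from the existing controller."""
    log, temp = [], 18.0
    for _ in range(n):
        u = behavior_policy(temp)
        nxt = step(temp, u, k, b, rng.gauss(0.0, 0.05))
        log.append((temp, u, nxt))
        temp = nxt
    return log


def fit_model(log):
    """Least-squares fit of (k, b) by solving the 2x2 normal equations."""
    s11 = s12 = s22 = r1 = r2 = 0.0
    for temp, u, nxt in log:
        x1, x2, y = T_OUT - temp, u, nxt - temp
        s11 += x1 * x1
        s12 += x1 * x2
        s22 += x2 * x2
        r1 += x1 * y
        r2 += x2 * y
    det = s11 * s22 - s12 * s12
    return (r1 * s22 - r2 * s12) / det, (r2 * s11 - r1 * s12) / det


def rollout_costs(policy, k, b, temp=18.0):
    """Roll a policy out on a noise-free model; return (energy, comfort) costs."""
    energy = comfort = 0.0
    for _ in range(HORIZON):
        u = policy(temp)
        energy += u
        temp = step(temp, u, k, b)
        comfort += abs(temp - SETPOINT)
    return energy, comfort


def am_energy_estimate(log, policy, n_boot=200, seed=0):
    """Model-based energy estimate with bootstrap bias correction and a 95% interval."""
    rng = random.Random(seed)
    point = rollout_costs(policy, *fit_model(log))[0]
    boots = []
    for _ in range(n_boot):
        sample = rng.choices(log, k=len(log))  # resample transitions with replacement
        boots.append(rollout_costs(policy, *fit_model(sample))[0])
    boots.sort()
    corrected = 2.0 * point - sum(boots) / len(boots)  # standard bootstrap bias correction
    interval = (boots[int(0.025 * n_boot)], boots[int(0.975 * n_boot) - 1])
    return corrected, interval
```

Calling `am_energy_estimate(collect_log(500, random.Random(1)), target_policy)` returns the corrected energy estimate and its interval; the comfort cost could be handled the same way by taking the second element of `rollout_costs`. In this toy setup the logged actions are only 0 or 1, so the estimate for the proportional policy relies on the assumed linear model extrapolating to fractional actions, which mirrors the limited state-action coverage the abstract describes.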

Journal

Proceedings of the 1st International Workshop on Reinforcement Learning for Energy Management in Buildings & Cities

Year of Publication

2020