Missing data imputation using mixture factor analysis for building electric load data

We propose a mixture factor analysis (MFA) method for estimating missing values in building electric load data. Buildings consume a tremendous amount of energy. Thanks to the recent advances in data technologies such as machine learning and applied statistics, data-driven approaches to making buildi...

Full description

Saved in:
Bibliographic Details
Published in:Applied energy Vol. 304; p. 117655
Main Authors: Jeong, Dongyeon, Park, Chiwoo, Ko, Young Myoung
Format: Journal Article
Language:English
Published: Elsevier Ltd 15-12-2021
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We propose a mixture factor analysis (MFA) method for estimating missing values in building electric load data. Buildings consume a tremendous amount of energy. Thanks to the recent advances in data technologies such as machine learning and applied statistics, data-driven approaches to making buildings more energy-efficient become a major research area. However, building electric load data suffer from quality issues due to data missing originated from malfunctioning sensors, network stability, and other environmental causes. We note that data missing can occur even under advanced Internet Technology (IT) systems such as energy information systems (EIS) and energy management systems (EMS) due to signal stability and low-speed computers. The existence of missing data may significantly affect building operations, causing inaccuracy in evaluating building status and forecasting future electric demands. In this respect, dealing with missing data problems should be as important as developing highly accurate forecasting algorithms. While investigating load data, we find that building electric loads exhibit distinct patterns with cyclic rotations that we can take advantage of in both model design and selection stages. Motivated by the finding, unlike the previous studies designed for general time-series data, we propose a novel data imputation model to represent patterns and their cyclic rotations in electric load data. Simulation studies reveal that the proposed model works well when the time window size is a divisor of the cycle length, which significantly reduces model selection efforts. Numerical results with two real data sets justify our findings and the performance of the proposed approach against benchmark methods. [Display omitted] •A novel data imputation method for building electric load data is proposed.•Some unique characteristics of building electric loads are investigated.•The mixture of factor analysis captures the characteristics of building loads.•Incorporating building load characteristics improves imputation accuracy.
ISSN:0306-2619
1872-9118
DOI:10.1016/j.apenergy.2021.117655