Data Self-Expansion and DoppelGANger-Based Time-Series Modeling for Realistic Steam Data Generation

Image credit: Unsplash

Abstract

Steam heating is an essential component of numerous industrial production processes that consume energy extensively. In particular, steam data are of significant commercial interest, and the privacy of the heat suppliers and steam users for whom this data is confidential is of paramount importance. However, the complexity and intricacy of steam data make it challenging to generate realistic samples using conventional data generation algorithms. This paper proposes a self-expansion data process method that multiplies samples with a coarser granularity. We then apply the time-series generation model, DoppelGANger, to steam data. Post-processing for the generated data is implemented to eliminate the effects of a certain level of noise. To validate the proposed method and model, we conduct experiments using field data obtained from a thermopower company. The results indicate that DoppelGANger can capture the steam operating characteristics and generate realistic samples reflecting steam data patterns. The difference between the statistic metrics of the true and generated samples is around 2.87%. The mean DTW distance of features’ principal components is around 16.32. Overall, this study offers a promising workflow for data generation across many steam system scenarios. This approach is valuable for energy management, privacy protection, and data sharing in industrial domains.

Publication
In International Conference on Power and Renewable Energy
XinyingCai  蔡昕颖
XinyingCai 蔡昕颖

My research interests include distributed robotics, mobile computing and programmable matter.