TY - GEN
T1 - Anomaly Detection Based on Histogram Methodology and Factor Analysis Using LightGBM for Cooling Systems
AU - Yanabe, Tomu
AU - Nishi, Hiroaki
AU - Hashimoto, Masahiro
N1 - Funding Information:
This work was supported by JST CREST Grant Number JPMJCR19K1, and the commissioned research by National Institute of Information and Communications Technology (NICT, Grant Number 22004) , JAPAN.
Funding Information:
This work was supported by JST CREST Grant Number JPMJCR19K1, and the commissioned research by National Institute of Information and Communications Technology (NICT, Grant Number 22004), JAPAN.
Publisher Copyright:
© 2020 IEEE.
PY - 2020/9
Y1 - 2020/9
N2 - The development of the Internet of Things (IoT) has created an environment in which numerous sensors and actuators are connected to the Internet. Machines and management systems in factories use data from such sensors and actuators to improve their work efficiency, and are essential parts of today's smart factories. The vision of a smart factory is based on the concept of Industry 4.0 (I4.0), a subset of the fourth industrial revolution, in which smart factories support the operator and maintenance processes of the factory from an I4.0 perspective. The analysis of big data gathered by IoT devices in factories, particularly for the use of anomaly detection, can aid in achieving product quality stabilization. For example, if a large refrigerator in a warehouse breaks down, the quality of stock food will deteriorate, and food loss may become significant. In the case of anomaly detection, machine status monitoring and accident prediction are required to reduce the operation and maintenance costs. Furthermore, the introduction cost of such systems can be reduced by generalizing them (the systems). However, the data types as well as the sensor and actuator types, differ between factories. Therefore, nonparametric statistical methods are required for anomaly detection. By contrast, factor analysis requires a costless method, one that does not require an overhaul of machinery. Consequently, it is necessary to adopt a machine learning-based method using sampled data. In this study, we proposed a method of anomaly detection and factor analysis for cooling systems in smart factories using appropriate methodologies for detection and analysis. The proposed method consists of two phases: anomaly detection and factor analysis. In the anomaly detection stage, Gaussian kernel density estimation was used to calculate the occurrence distribution. Two types of anomaly scores, cumulative density value and KL divergence, were defined. The probability distribution was estimated with a constant window frame to reflect a tendency to increase. In the factor analysis stage, target values were predicted using LightGBM. The factor of abnormalities was detected by comparing the results of two predictions: one using all the features, and the other using the data, which excluded a factor to detect the contribution of the factor.
AB - The development of the Internet of Things (IoT) has created an environment in which numerous sensors and actuators are connected to the Internet. Machines and management systems in factories use data from such sensors and actuators to improve their work efficiency, and are essential parts of today's smart factories. The vision of a smart factory is based on the concept of Industry 4.0 (I4.0), a subset of the fourth industrial revolution, in which smart factories support the operator and maintenance processes of the factory from an I4.0 perspective. The analysis of big data gathered by IoT devices in factories, particularly for the use of anomaly detection, can aid in achieving product quality stabilization. For example, if a large refrigerator in a warehouse breaks down, the quality of stock food will deteriorate, and food loss may become significant. In the case of anomaly detection, machine status monitoring and accident prediction are required to reduce the operation and maintenance costs. Furthermore, the introduction cost of such systems can be reduced by generalizing them (the systems). However, the data types as well as the sensor and actuator types, differ between factories. Therefore, nonparametric statistical methods are required for anomaly detection. By contrast, factor analysis requires a costless method, one that does not require an overhaul of machinery. Consequently, it is necessary to adopt a machine learning-based method using sampled data. In this study, we proposed a method of anomaly detection and factor analysis for cooling systems in smart factories using appropriate methodologies for detection and analysis. The proposed method consists of two phases: anomaly detection and factor analysis. In the anomaly detection stage, Gaussian kernel density estimation was used to calculate the occurrence distribution. Two types of anomaly scores, cumulative density value and KL divergence, were defined. The probability distribution was estimated with a constant window frame to reflect a tendency to increase. In the factor analysis stage, target values were predicted using LightGBM. The factor of abnormalities was detected by comparing the results of two predictions: one using all the features, and the other using the data, which excluded a factor to detect the contribution of the factor.
KW - KL-divergence
KW - anomaly detection
KW - cooling system
KW - factor analysis
UR - http://www.scopus.com/inward/record.url?scp=85093358188&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093358188&partnerID=8YFLogxK
U2 - 10.1109/ETFA46521.2020.9211978
DO - 10.1109/ETFA46521.2020.9211978
M3 - Conference contribution
AN - SCOPUS:85093358188
T3 - IEEE International Conference on Emerging Technologies and Factory Automation, ETFA
SP - 952
EP - 958
BT - Proceedings - 2020 25th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2020
Y2 - 8 September 2020 through 11 September 2020
ER -