Development and research of a reinforcement learning method for acoustic diagnostics of industrial equipment

N. A. Verzun; M. O. Kolbanev; A. R. Salieva

doi:10.17586/2226-1494-2025-25-5-961-970


Abstract

This paper addresses the acoustic diagnostics of autonomously operating industrial equipment. It reviews existing approaches, including methods based on convolutional neural networks and supervised learning algorithms, and identifies their limitations: the need for large amounts of labeled training data, poor adaptation to changing conditions, and the lack of a real-time decision-making mechanism. A new approach based on reinforcement learning is proposed; it is characterized by adaptability, high noise resistance, and the ability to learn continuously in a dynamic environment. The proposed method determines whether equipment is operable by analyzing the acoustic signals it emits while running. The method comprises building a neural network, selecting audio recordings from open audio libraries, and training the network with a reinforcement learning algorithm. Diagnosing whether industrial equipment is serviceable or faulty involves four stages: real-time recording of acoustic data from the operating equipment, extraction of features that describe the equipment condition, reinforcement learning of the neural network, and a decision on whether the equipment is serviceable or faulty. An experiment on labeled WAV audio files from open databases identified three equipment states: normal condition, initial stage of a defect, and critical malfunction. Classification accuracy ranged from 89.7 % to 98.5 % and average response time from 0.5 to 0.7 seconds, with a low computing load (on average 36.5 % CPU and 509 MB RAM).
Well-known acoustic diagnostic systems train neural and convolutional neural networks with supervised learning on pre-labeled datasets of acoustic signals emitted by running equipment. In contrast, the proposed approach decomposes the original acoustic signals into spectral components. Each component is analyzed and assigned features that reflect whether the equipment is serviceable or faulty. This approach makes it possible to use reinforcement learning algorithms for strategic decision-making, reduce model training time by pre-selecting significant features, improve diagnostic accuracy, and reduce computational load and hardware resource requirements. The developed algorithm can be used for continuous condition monitoring and predictive maintenance in autonomously functioning industrial systems, where its use allows reliable and timely detection and classification of equipment malfunctions. The algorithm could be refined further to meet the requirements for integration with IoT infrastructure, to increase resistance to external noise, and to adopt more advanced RL algorithms such as PPO.
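The abstract does not give the network architecture, reward scheme, or feature set, so the following Python sketch is only an illustration of the general idea, not the authors' method. It decomposes a signal into spectral band energies and trains an agent with a one-step, bandit-style Q-value update on a linear value function (a stand-in for the neural network). The names `spectral_features`, `synth_clip`, `train_agent`, `diagnose` and `evaluate`, the band count, the +1/-1 reward, and the synthetic tones that replace real labeled WAV files are all assumptions made for this example.

```python
import numpy as np

STATES = ("normal", "initial_defect", "critical_fault")  # the three states named in the abstract
N_BANDS = 8  # assumed number of spectral bands


def spectral_features(signal, n_bands=N_BANDS):
    """Split the power spectrum into equal-width bands; return centered log energies plus a bias term."""
    power = np.abs(np.fft.rfft(signal)) ** 2
    log_e = np.log1p(np.array([band.sum() for band in np.array_split(power, n_bands)]))
    return np.append(log_e - log_e.mean(), 1.0)


def synth_clip(label, rng, sr=8000, dur=0.25):
    """Hypothetical stand-in for a labeled recording: machine hum, noise, and a fault tone per state."""
    t = np.arange(int(sr * dur)) / sr
    clip = np.sin(2 * np.pi * 120 * t) + 0.1 * rng.standard_normal(t.size)
    fault_tone = {1: (1300, 0.5), 2: (3200, 1.0)}.get(int(label))  # (Hz, amplitude), invented values
    if fault_tone is not None:
        freq, amp = fault_tone
        clip = clip + amp * np.sin(2 * np.pi * freq * t)
    return clip


def train_agent(n_steps=3000, eps=0.1, lr=0.5, seed=0):
    """Epsilon-greedy agent: the action is a diagnosis, the reward is +1 if correct and -1 otherwise."""
    rng = np.random.default_rng(seed)
    W = np.zeros((len(STATES), N_BANDS + 1))  # one linear Q-function per action
    for _ in range(n_steps):
        label = int(rng.integers(len(STATES)))
        x = spectral_features(synth_clip(label, rng))
        q = W @ x
        if rng.random() < eps:
            action = int(rng.integers(len(STATES)))  # explore
        else:
            action = int(np.argmax(q))  # exploit
        reward = 1.0 if action == label else -1.0
        # One-step update (no next state); the step is normalized by the feature norm for stability.
        W[action] += lr * (reward - q[action]) * x / (x @ x)
    return W


def diagnose(W, signal):
    """Greedy decision: the state whose estimated value is highest."""
    return STATES[int(np.argmax(W @ spectral_features(signal)))]


def evaluate(W, n_clips=200, seed=1):
    """Fraction of fresh synthetic clips diagnosed correctly by the greedy policy."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(len(STATES), size=n_clips)
    hits = sum(diagnose(W, synth_clip(l, rng)) == STATES[l] for l in labels)
    return hits / n_clips
```

Calling `W = train_agent()` followed by `evaluate(W)` gives the greedy accuracy on synthetic clips only; it says nothing about the accuracy figures reported in the abstract, which were obtained on real recordings with the authors' own model.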

Keywords

acoustic diagnostics, industrial equipment, reinforcement learning, classification of states, RL agent, spectral analysis

About the Authors

N. A. Verzun

Saint Petersburg State University of Economics; Saint Petersburg Electrotechnical University “LETI”
Russian Federation

Natalya A. Verzun — PhD, Associate Professor, Associate Professor; Associate Professor

sc 57208320400

Saint Petersburg, 191023

Saint Petersburg, 197376

M. O. Kolbanev

Saint Petersburg State University of Economics; Saint Petersburg Electrotechnical University “LETI”
Russian Federation

Mikhail O. Kolbanev — D.Sc., Full Professor; Professor

sc 6506189057

Saint Petersburg, 191023

Saint Petersburg, 197376

A. R. Salieva

Digital Economy League
Russian Federation

Adelina R. Salieva — PhD Student, Junior Analyst

Moscow, 127015

For citations:

Verzun N.A., Kolbanev M.O., Salieva A.R. Development and research of a reinforcement learning method for acoustic diagnostics of industrial equipment. Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2025;25(5):961-970. (In Russ.) https://doi.org/10.17586/2226-1494-2025-25-5-961-970

This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN 2226-1494 (Print)
ISSN 2500-0373 (Online)
