Development of an Attention-Based Convolutional Neural Network-Long Short-Term Memory Model for Real-Time Ergonomic Analysis of Sitting Posture

Authors

  • ID Gusrio Tendra Institut Bisnis dan Teknologi Pelita Indonesia, Pekanbaru, Indonesia
  • ID Deny Jollyta Institut Bisnis dan Teknologi Pelita Indonesia, Pekanbaru, Indonesia
  • ID Sumijan Universitas Putra Indonesia "YPTK" Padang, Padang, Indonenesia

DOI:

https://doi.org/10.30812/matrik.v25i2.5678

Keywords:

BlazePose, CNN-LSTM, Ergonomics, Real-Time, Sitting Posture

Abstract

The digital era has increased the prevalence of musculoskeletal disorders caused by poor sitting posture, posing a significant global health and productivity challenge. This study introduces an attentionbased deep learning model as the analytical engine for a proposed virtual ergonomics monitor, Ergo-Guard. The primary objective is to develop a model that accurately performs real-time Movement Quality Assessment of Sitting Posture for computer users, using only a standard webcam to ensure wide accessibility. This research method is a hybrid architecture that combines a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM), enhanced with an attention mechanism and optimized for three-dimensional skeletal data using the BlazePose Computer Vision approach. This framework merges a One-Dimensional CNN to extract spatial features from static poses with a Bidirectional LSTM network to model temporal postural shifts. An integrated attention mechanism enables the model to dynamically focus on critical ergonomic areas, mimicking an expert’s assessment. For validation, a new OfficePosture dataset was created, containing 500 videos of five common office sitting postures. The results indicate that the proposed model achieves 94.2% classification accuracy,
substantially outperforming baselines from a pure CNN (84.6%) and a standard LSTM network (89.2%). Beyond accuracy, the model offers interpretable feedback through visual attention maps. In conclusion, the proposed architecture is an effective solution for monitoring sitting posture and holds considerable promise as an affordable preventive health tool for corporate and educational settings.

Downloads

Download data is not yet available.

References

[1] M. L. Ferreira and e. a. De Luca, “Global, regional, and national burden of low back pain, 1990–2020, its attributable risk factors,

and projections to 2050: A systematic analysis of the Global Burden of Disease Study 2021,” vol. 5, no. 6, pp. e316–e329, June,

2023, https://doi.org/10.1016/S2665-9913(23)00098-X.

[2] Z. Yang, D. Song, J. Ning, and Z. Wu, “A Systematic Review: Advancing Ergonomic Posture Risk Assessment Through the

Integration of Computer Vision and Machine Learning Techniques,” vol. 12, pp. 180 481–180 519, December, 2024, https:

//doi.org/10.1109/ACCESS.2024.3509447.

[3] I. K. Jalata, T.-D. Truong, J. L. Allen, H.-S. Seo, and K. Luu, “Movement Analysis for Neurological and Musculoskeletal Disorders

Using Graph Convolutional Neural Network,” vol. 13, no. 8, p. 194, August, 2021, https://doi.org/10.3390/fi13080194.

[4] P. Paudel, Y.-J. Kwon, D.-H. Kim, and K.-H. Choi, “Industrial Ergonomics Risk Analysis Based on 3D-Human Pose Estimation,”

vol. 11, no. 20, p. 3403, October, 2022, https://doi.org/10.3390/electronics11203403.

[5] A. Avogaro, F. Cunico, B. Rosenhahn, and F. Setti, “Markerless human pose estimation for biomedical applications: A survey,”

vol. 5, p. 1153160, July, 2023, https://doi.org/10.3389/fcomp.2023.1153160.

[6] P.-C. Lin, Y.-J. Chen,W.-S. Chen, and Y.-J. Lee, “Automatic real-time occupational posture evaluation and select corresponding

ergonomic assessments,” vol. 12, no. 1, p. 2139, February, 2022, https://doi.org/10.1038/s41598-022-05812-9.

[7] C. Singhtaun, S. Natsupakpong, and P. Lorprasertkul, “Ergonomic Risk Assessment Using Human Pose Estimation with MediaPipe

Pose,” pp. 465–471, December, 2024, https://doi.org/10.1145/3719384.3719453.

[8] W. Ren, O. Ma, H. Ji, and X. Liu, “Human Posture Recognition Using a Hybrid of Fuzzy Logic and Machine Learning Approaches,”

vol. 8, pp. 135 628–135 639, July, 2020, https://doi.org/10.1109/ACCESS.2020.3011697.

[9] L.Wade, L. Needham, P. McGuigan, and J. Bilzon, “Applications and limitations of current markerless motion capture methods

for clinical gait biomechanics,” vol. 10, p. e12995, February, 2022, https://doi.org/10.7717/peerj.12995.

[10] D. Jollyta, P. Prihandoko, J. Johan, W. Ramdhan, and E. Santoso, “Transfer Learning Model Evaluation on CNN Algorithm:

Indonesian Sign Language System (SIBI),” vol. 6, no. 2, pp. 83–92, May, 2025, https://doi.org/10.35145/jabt.v6i2.213.

[11] W. Xiong and Z. Xu, “Real-Time Clothing Virtual Display Based on Human Pose Estimation,” March, 2024, https://doi.org/10.

3233/FAIA240168.

[12] K. Nguyen-Trong, T. Vu-Van, and P. L. T. Bich, “Graph Convolutional Network for Occupational Disease Prediction with

Multiple Dimensional Data,” vol. 15, no. 7, pp. 1322–1331, 2024, https://doi.org/10.14569/IJACSA.2024.01507128.

[13] D. R. Martins, S. M. Cerqueira, A. Pombeiro, A. F. Da Silva, A. M. A. C. Rocha, and C. P. Santos, “ErgoReport: A Holistic

Posture Assessment Framework Based on Inertial Data and Deep Learning,” vol. 25, no. 7, p. 2282, April, 2025, https://doi.org/

10.3390/s25072282.

[14] E. Bagga and A. Yang, “Real-Time Posture Monitoring and Risk Assessment for Manual Lifting Tasks Using MediaPipe and

LSTM,” MM: International Multimedia Conference, pp. 79–85, October, 2024, https://doi.org/10.1145/3688868.3689199.

[15] L. Zhao, J. Yan, and A. Wang, “A comparative study on real-time sitting posture monitoring systems using pressure sensors,”

vol. 74, no. 6, pp. 474–484, December, 2023, https://doi.org/10.2478/jee-2023-0055.

[16] Vinaya R M and G. C. Mara, “Human Activity Recognition Using CNN and Lstm Deep Learning Algorithms,” vol. 44, no. S6,

pp. 1024–1030, November, 2023, https://doi.org/10.17762/jaz.v44iS6.2353.

[17] A. Zhu, Q. Ke, M. Gong, and J. Bailey, “Adaptive Local-Component-aware Graph Convolutional Network for One-shot

Skeleton-based Action Recognition,” pp. 6027–6036, January, 2023, https://doi.org/10.1109/WACV56688.2023.00598.

[18] Y. Zhao, X. Wang, Y. Luo, and M. S. Aslam, “Research on Human Activity Recognition Algorithm Based on LSTM-1DCNN,”

vol. 77, no. 3, pp. 3325–3347, 2023, https://doi.org/10.32604/cmc.2023.040528.

[19] R. Kapoor, A. Jaiswal, and F. Makedon, “Light-Weight Seated Posture Guidance System with Machine Learning and Computer

Vision,” pp. 595–600, June, 2022, https://doi.org/10.1145/3529190.3535341.

[20] A. Schmidt, H. Shahid, D. Kraft, G. Bieber, and M. Fellmann, “Interactive Exercises for Computer-based Work Using a Webcam,”

pp. 1–8, September, 2023, https://doi.org/10.1145/3615834.3615840.

Downloads

Published

2026-03-11

Issue

Section

Articles

How to Cite

[1]
G. T. Tendra, D. J. Jollyta, and Sumijan, “Development of an Attention-Based Convolutional Neural Network-Long Short-Term Memory Model for Real-Time Ergonomic Analysis of Sitting Posture”, MATRIK, vol. 25, no. 2, pp. 287–298, Mar. 2026, doi: 10.30812/matrik.v25i2.5678.

Similar Articles

1-10 of 110

You may also start an advanced similarity search for this article.