Contextualizing injury severity from occupational accident reports using an optimized deep learning prediction model

Khairuddin, Mohamed Zul Fadhli and Sankaranarayanan, Suresh and Hasikin, Khairunnisa and Razak, Nasrul Anuar Abd and Omar, Rosidah (2024) Contextualizing injury severity from occupational accident reports using an optimized deep learning prediction model. PeerJ Computer Science, 10. e1985. ISSN 2376-5992, DOI https://doi.org/10.7717/peerj-cs.1985.

Full text not available from this repository.
Official URL: https://doi.org/10.7717/peerj-cs.1985

Abstract

Background: This study introduced a novel approach for predicting occupational injury severity by leveraging deep learning -based text classi fi cation techniques to analyze unstructured narratives. Unlike conventional methods that rely on structured data, our approach recognizes the richness of information within injury narrative descriptions with the aim of extracting valuable insights for improved occupational injury severity assessment. Methods: Natural language processing (NLP) techniques were harnessed to preprocess the occupational injury narratives obtained from the US Occupational Safety and Health Administration (OSHA) from January 2015 to June 2023. The methodology involved meticulous preprocessing of textual narratives to standardize text and eliminate noise, followed by the innovative integration of Term FrequencyInverse Document Frequency (TF-IDF) and Global Vector (GloVe) word embeddings for effective text representation. The proposed predictive model adopts a novel Bidirectional Long Short -Term Memory (Bi-LSTM) architecture and is further re fi ned through model optimization, including random search hyperparameters and in-depth feature importance analysis. The optimized Bi-LSTM model has been compared and validated against other machine learning classi fi ers which are na & iuml;ve Bayes, support vector machine, random forest, decision trees, and K -nearest neighbor. Results: The proposed optimized Bi-LSTM models ` superior predictability, boasted an accuracy of 0.95 for hospitalization and 0.98 for amputation cases with faster model processing times. Interestingly, the feature importance analysis revealed predictive keywords related to the causal factors of occupational injuries thereby providing valuable insights to enhance model interpretability. Conclusion: Our proposed optimized Bi-LSTM model offers safety and health practitioners an effective tool to empower workplace safety proactive measures, thereby contributing to business productivity and sustainability. This study lays the foundation for further exploration of predictive analytics in the occupational safety and health domain.

Item Type: Article
Funders: Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia (4118)
Uncontrolled Keywords: Natural language processing; Machine learning; Deep learning; Text classi fi cation; Occupational injury; Occupational safety and health
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
R Medicine > R Medicine (General)
Divisions: Faculty of Engineering > Biomedical Engineering Department
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 08 Oct 2024 02:40
Last Modified: 08 Oct 2024 02:40
URI: http://eprints.um.edu.my/id/eprint/45305

Actions (login required)

View Item View Item