Analysis of Mental Health Disorders via Social Media Mining Using LSTM and Bi-LSTM

Binti Kholifah; Iwan Syarif; Tessy Badriyah

doi:10.22219/kinetik.v10i3.2205

Issue

Vol. 10, No. 3, August 2025

Issue Published : Aug 31, 2025

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Analysis of Mental Health Disorders via Social Media Mining Using LSTM and Bi-LSTM

https://doi.org/10.22219/kinetik.v10i3.2205

Binti Kholifah

Universitas Negeri Surabaya

Iwan Syarif

Politeknik Elektronika Negeri Surabaya

Tessy Badriyah

Politeknik Elektronika Negeri Surabaya

Corresponding Author(s) : Binti Kholifah

bintikholifah@unesa.ac.id

Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, Vol. 10, No. 3, August 2025
Article Published : Aug 31, 2025

Abstract

Mental health disorders are a growing global concern, with many individuals lacking early detection and appropriate treatment. Mental illness can impact a person’s quality of life and often goes undetected until symptoms worsen. One contributing factor to this problem is the limited ways to detect mental disorders in their early stages. Social media, especially platform X, offers the potential to analyze users’ emotional expressions that may indicate a mental disorder, such as depression or anxiety. Psychological symptoms can be explored more broadly using Natural Language Processing. This study optimizes several text preprocessing techniques to extract meaningful information from social media text. To convert words into numerical vectors, several word embedding methods are used, such as Word2Vec, FastText, and GloVe. Meanwhile, the classification process is carried out using LSTM and Bi-LSTM because they are considered capable of studying data sequence patterns, such as sentence structure, effectively. The results show that the addition of expanding contractions, emoticon handling, negation handling, repeated character handling, and spelling correction in the preprocessing text can improve the model performance. In addition, Bi-LSTM with pre-trained FastText shows better results than the other methods in all experiments, achieving 86% accuracy, 87.5% precision, 84% recall, and 85.71% F1-Score.

Keywords

Mental Health Text Pre-processing Word Embedding LSTM BiLSTM

Kholifah, B., Syarif, I., & Badriyah, T. (2025). Analysis of Mental Health Disorders via Social Media Mining Using LSTM and Bi-LSTM . Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, 10(3), 261-270. https://doi.org/10.22219/kinetik.v10i3.2205

Download Citation

References

WHO, “Suicide,” 2024.
N. A. Wirawan, “Angka Kasus Bunuh Diri di Indonesia Meningkat 60% dalam 5 Tahun Terakhir,” 2024.
K. K. R. Biro Komunikasi dan Pelayanan Publik, “Cegah Bunuh Diri, Kemenkes Ajak Remaja Bicara Soal Kesehatan Mental – Sehat Negeriku,” 2024.
E. Velthorst et al., “The impact of loneliness and social relationship dissatisfaction on clinical and functional outcomes in Dutch mental health service users,” Psychiatry Research, vol. 342. 2024. https://doi.org/10.1016/j.psychres.2024.116242
B. Kholifah, I. Syarif, and T. Badriyah, “Mental Disorder Detection via Social Media Mining using Deep Learning,” Kinet. Game Technol. Inf. Syst. Comput. Network, Comput. Electron. Control, vol. 4, pp. 309–316, 2020. https://doi.org/10.22219/kinetik.v5i4.1120
M. L. Zou, M. X. Li, and V. Cho, “Depression and disclosure behavior via social media: A study of university students in China,” Heliyon, vol. 6, no. 2, p. e03368, 2020. https://doi.org/10.1016/j.heliyon.2020.e03368
I. Syarif, N. Ningtias, and T. Badriyah, “Study on Mental Disorder Detection via Social Media Mining,” 2019 4th Int. Conf. Comput. Commun. Secur., pp. 1–6, 2019. https://doi.org/10.1109/CCCS.2019.8888096
C. Ouni, E. Benmohamed, and H. Ltifi, “Deep learning-based Soft word embedding approach for sentiment Deep learning-based Soft word embedding approach for sentiment analysis analysis,” Procedia Comput. Sci., vol. 246, pp. 1355–1364, 2024. https://doi.org/10.1016/j.procs.2024.09.720
R. Phukan, P. J. Goutom, and N. Baruah, “Assamese Fake News Detection: A Comprehensive Exploration of LSTM and Bi-LSTM Techniques,” Procedia Comput. Sci., vol. 235, pp. 2167–2177, 2024. https://doi.org/10.1016/j.procs.2024.04.205
J. Zhao, D. Zeng, Y. Xiao, L. Che, and M. Wang, “User personality prediction based on topic preference and sentiment analysis using LSTM model,” Pattern Recognition Letters, vol. 138. pp. 397–402, 2020. https://doi.org/10.1016/j.patrec.2020.07.035
T. I. Sari, Z. N. Ardilla, N. Hayatin, and R. Maskat, “Abusive comment identification on Indonesian social media data using hybrid deep learning,” IAES Int. J. Artif. Intell., vol. 11, no. 3, pp. 895–904, 2022. https://doi.org/10.11591/ijai.v11.i3.pp895-904
V. Kishore and M. Kumar, “Enhanced Multimodal Fake News Detection with Optimal Feature Fusion and Modified Bi-LSTM Architecture,” Cybernetics and Systems. 2023. https://doi.org/10.1080/01969722.2023.2175155
Z. N. Ardilla, T. I. Sari, N. Hayatin, and C. Fatichah, “Sarcasm detection on news headline using multilayer bidirectional-LSTM with glove embeddings,” AIP Conf. Proc., vol. 2927, no. 1, p. 60035, Mar. 2024. https://doi.org/10.1063/5.0192254
U. Naseem, I. Razzak, and P. W. Eklund, “A survey of pre-processing techniques to improve short-text quality: a case study on hate speech detection on twitter,” Multimedia Tools and Applications, vol. 80, no. 28–29. pp. 35239–35266, 2021. https://doi.org/10.1007/s11042-020-10082-6
M. Siino, I. Tinnirello, and M. La Cascia, “Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers,” Inf. Syst., vol. 121, no. March 2023, p. 102342, 2024. https://doi.org/10.1016/j.is.2023.102342
N. H. Yahya and H. Abdul Rahim, “Linguistic markers of depression: Insights from english-language tweets before and during the COVID-19 pandemic,” Lang. Heal., vol. 1, no. 2, pp. 36–50, 2023. https://doi.org/10.1016/j.laheal.2023.10.001
A. Daud, D. Irwanto, M. Said, M. Mubyl, and Mustamin, “Identification Of Chatbot Usage In Online Store Services Using Natural Language Processing Methods,” Adv. Sustain. Sci. Eng. Technol., vol. 6, no. 2, pp. 1–9, 2024. https://doi.org/10.26877/asset.v6i2.18309
N. Garg and K. Sharma, “Text pre-processing of multilingual for sentiment analysis based on social network data,” Int. J. Electr. Comput. Eng., vol. 12, no. 1, pp. 776–784, 2022. https://doi.org/10.11591/ijece.v12i1.pp776-784
A. Jabbar, S. Iqbal, M. I. Tamimy, A. Rehman, S. A. Bahaj, and T. Saba, “An Analytical Analysis of Text Stemming Methodologies in Information Retrieval and Natural Language Processing Systems,” IEEE Access, vol. 11, no. October, pp. 133681–133702, 2023. https://doi.org/10.1109/ACCESS.2023.3332710
S. Biradar, G. T. Raju, and K. M. Divakar, “Negation Handling and Domain Generalization in Sentiment Analysis using Machine Learning Models,” 2024 International Conference on Knowledge Engineering and Communication Systems, ICKECS 2024. 2024. https://doi.org/10.1109/ICKECS61492.2024.10616885
P. K. Singh and S. Paul, “Deep Learning Approach for Negation Handling in Sentiment Analysis,” IEEE Access, vol. 9, pp. 102579–102592, 2021. https://doi.org/10.1109/ACCESS.2021.3095412
D. J. Ladani and N. P. Desai, “Stopword Identification and Removal Techniques on TC and IR applications: A Survey,” 2020 6th International Conference on Advanced Computing and Communication Systems, ICACCS 2020. pp. 466–472, 2020. https://doi.org/10.1109/ICACCS48705.2020.9074166
Z. H. Kilimci and S. Akyokus, “The Evaluation of Word Embedding Models and Deep Learning Algorithms for Turkish Text Classification,” UBMK 2019 - Proceedings, 4th Int. Conf. Comput. Sci. Eng., pp. 548–553, 2019. https://doi.org/10.1109/UBMK.2019.8907027
M. Mars, “From Word Embeddings to Pre-Trained Language Models: A State-of-the-Art Walkthrough,” Appl. Sci., vol. 12, no. 17, 2022. https://doi.org/10.3390/app12178805
S. Sivakumar, L. S. Videla, T. Rajesh Kumar, J. Nagaraj, S. Itnal, and D. Haritha, “Review on Word2Vec Word Embedding Neural Net,” Proc. - Int. Conf. Smart Electron. Commun. ICOSEC 2020, no. Icosec, pp. 282–290, 2020. https://doi.org/10.1109/ICOSEC49089.2020.9215319
T. Yao, Z. Zhai, and B. Gao, “Text Classification Model Based on fastText,” Proc. 2020 IEEE Int. Conf. Artif. Intell. Inf. Syst. ICAIIS 2020, pp. 154–157, 2020. https://doi.org/10.1109/ICAIIS49377.2020.9194939
P. J. Worth, “Word Embeddings and Semantic Spaces in Natural Language Processing,” International Journal of Intelligence Science, vol. 13, no. 01. pp. 1–21, 2023. https://doi.org/10.4236/ijis.2023.131001
C. O. Ewald and Y. Li, “The role of news sentiment in salmon price prediction using deep learning,” J. Commod. Mark., vol. 36, 2024. https://doi.org/10.1016/j.jcomm.2024.100438
R. Kizito, P. Scruggs, X. Li, M. Devinney, J. Jansen, and R. Kress, “Long Short-Term Memory Networks for Facility Infrastructure Failure and Remaining Useful Life Prediction,” IEEE Access, vol. 9, pp. 67585–67594, 2021. https://doi.org/10.1109/ACCESS.2021.3077192
S. Shan et al., “A Deep Learning Model with Attention-BiLSTM Networks Combining XGBoost Residual Correction for Short-Term Water Demand Forecast,” 2022.
B. H. Nayef, S. N. H. S. Abdullah, R. Sulaiman, and A. M. Saeed, “Text Extraction with Optimal Bi-LSTM,” Comput. Mater. Contin., vol. 76, no. 3, pp. 3549–3567, 2023. https://doi.org/10.32604/cmc.2023.039528
M. A. K. Raiaan et al., “A systematic review of hyperparameter optimization techniques in Convolutional Neural Networks,” Decis. Anal. J., vol. 11, no. March, p. 100470, 2024. https://doi.org/10.1016/j.dajour.2024.100470

References

WHO, “Suicide,” 2024.

N. A. Wirawan, “Angka Kasus Bunuh Diri di Indonesia Meningkat 60% dalam 5 Tahun Terakhir,” 2024.

K. K. R. Biro Komunikasi dan Pelayanan Publik, “Cegah Bunuh Diri, Kemenkes Ajak Remaja Bicara Soal Kesehatan Mental – Sehat Negeriku,” 2024.

E. Velthorst et al., “The impact of loneliness and social relationship dissatisfaction on clinical and functional outcomes in Dutch mental health service users,” Psychiatry Research, vol. 342. 2024. https://doi.org/10.1016/j.psychres.2024.116242

B. Kholifah, I. Syarif, and T. Badriyah, “Mental Disorder Detection via Social Media Mining using Deep Learning,” Kinet. Game Technol. Inf. Syst. Comput. Network, Comput. Electron. Control, vol. 4, pp. 309–316, 2020. https://doi.org/10.22219/kinetik.v5i4.1120

M. L. Zou, M. X. Li, and V. Cho, “Depression and disclosure behavior via social media: A study of university students in China,” Heliyon, vol. 6, no. 2, p. e03368, 2020. https://doi.org/10.1016/j.heliyon.2020.e03368

I. Syarif, N. Ningtias, and T. Badriyah, “Study on Mental Disorder Detection via Social Media Mining,” 2019 4th Int. Conf. Comput. Commun. Secur., pp. 1–6, 2019. https://doi.org/10.1109/CCCS.2019.8888096

C. Ouni, E. Benmohamed, and H. Ltifi, “Deep learning-based Soft word embedding approach for sentiment Deep learning-based Soft word embedding approach for sentiment analysis analysis,” Procedia Comput. Sci., vol. 246, pp. 1355–1364, 2024. https://doi.org/10.1016/j.procs.2024.09.720

R. Phukan, P. J. Goutom, and N. Baruah, “Assamese Fake News Detection: A Comprehensive Exploration of LSTM and Bi-LSTM Techniques,” Procedia Comput. Sci., vol. 235, pp. 2167–2177, 2024. https://doi.org/10.1016/j.procs.2024.04.205

J. Zhao, D. Zeng, Y. Xiao, L. Che, and M. Wang, “User personality prediction based on topic preference and sentiment analysis using LSTM model,” Pattern Recognition Letters, vol. 138. pp. 397–402, 2020. https://doi.org/10.1016/j.patrec.2020.07.035

T. I. Sari, Z. N. Ardilla, N. Hayatin, and R. Maskat, “Abusive comment identification on Indonesian social media data using hybrid deep learning,” IAES Int. J. Artif. Intell., vol. 11, no. 3, pp. 895–904, 2022. https://doi.org/10.11591/ijai.v11.i3.pp895-904

V. Kishore and M. Kumar, “Enhanced Multimodal Fake News Detection with Optimal Feature Fusion and Modified Bi-LSTM Architecture,” Cybernetics and Systems. 2023. https://doi.org/10.1080/01969722.2023.2175155

Z. N. Ardilla, T. I. Sari, N. Hayatin, and C. Fatichah, “Sarcasm detection on news headline using multilayer bidirectional-LSTM with glove embeddings,” AIP Conf. Proc., vol. 2927, no. 1, p. 60035, Mar. 2024. https://doi.org/10.1063/5.0192254

U. Naseem, I. Razzak, and P. W. Eklund, “A survey of pre-processing techniques to improve short-text quality: a case study on hate speech detection on twitter,” Multimedia Tools and Applications, vol. 80, no. 28–29. pp. 35239–35266, 2021. https://doi.org/10.1007/s11042-020-10082-6

M. Siino, I. Tinnirello, and M. La Cascia, “Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers,” Inf. Syst., vol. 121, no. March 2023, p. 102342, 2024. https://doi.org/10.1016/j.is.2023.102342

N. H. Yahya and H. Abdul Rahim, “Linguistic markers of depression: Insights from english-language tweets before and during the COVID-19 pandemic,” Lang. Heal., vol. 1, no. 2, pp. 36–50, 2023. https://doi.org/10.1016/j.laheal.2023.10.001

A. Daud, D. Irwanto, M. Said, M. Mubyl, and Mustamin, “Identification Of Chatbot Usage In Online Store Services Using Natural Language Processing Methods,” Adv. Sustain. Sci. Eng. Technol., vol. 6, no. 2, pp. 1–9, 2024. https://doi.org/10.26877/asset.v6i2.18309

N. Garg and K. Sharma, “Text pre-processing of multilingual for sentiment analysis based on social network data,” Int. J. Electr. Comput. Eng., vol. 12, no. 1, pp. 776–784, 2022. https://doi.org/10.11591/ijece.v12i1.pp776-784

A. Jabbar, S. Iqbal, M. I. Tamimy, A. Rehman, S. A. Bahaj, and T. Saba, “An Analytical Analysis of Text Stemming Methodologies in Information Retrieval and Natural Language Processing Systems,” IEEE Access, vol. 11, no. October, pp. 133681–133702, 2023. https://doi.org/10.1109/ACCESS.2023.3332710

S. Biradar, G. T. Raju, and K. M. Divakar, “Negation Handling and Domain Generalization in Sentiment Analysis using Machine Learning Models,” 2024 International Conference on Knowledge Engineering and Communication Systems, ICKECS 2024. 2024. https://doi.org/10.1109/ICKECS61492.2024.10616885

P. K. Singh and S. Paul, “Deep Learning Approach for Negation Handling in Sentiment Analysis,” IEEE Access, vol. 9, pp. 102579–102592, 2021. https://doi.org/10.1109/ACCESS.2021.3095412

D. J. Ladani and N. P. Desai, “Stopword Identification and Removal Techniques on TC and IR applications: A Survey,” 2020 6th International Conference on Advanced Computing and Communication Systems, ICACCS 2020. pp. 466–472, 2020. https://doi.org/10.1109/ICACCS48705.2020.9074166

Z. H. Kilimci and S. Akyokus, “The Evaluation of Word Embedding Models and Deep Learning Algorithms for Turkish Text Classification,” UBMK 2019 - Proceedings, 4th Int. Conf. Comput. Sci. Eng., pp. 548–553, 2019. https://doi.org/10.1109/UBMK.2019.8907027

M. Mars, “From Word Embeddings to Pre-Trained Language Models: A State-of-the-Art Walkthrough,” Appl. Sci., vol. 12, no. 17, 2022. https://doi.org/10.3390/app12178805

S. Sivakumar, L. S. Videla, T. Rajesh Kumar, J. Nagaraj, S. Itnal, and D. Haritha, “Review on Word2Vec Word Embedding Neural Net,” Proc. - Int. Conf. Smart Electron. Commun. ICOSEC 2020, no. Icosec, pp. 282–290, 2020. https://doi.org/10.1109/ICOSEC49089.2020.9215319

T. Yao, Z. Zhai, and B. Gao, “Text Classification Model Based on fastText,” Proc. 2020 IEEE Int. Conf. Artif. Intell. Inf. Syst. ICAIIS 2020, pp. 154–157, 2020. https://doi.org/10.1109/ICAIIS49377.2020.9194939

P. J. Worth, “Word Embeddings and Semantic Spaces in Natural Language Processing,” International Journal of Intelligence Science, vol. 13, no. 01. pp. 1–21, 2023. https://doi.org/10.4236/ijis.2023.131001

C. O. Ewald and Y. Li, “The role of news sentiment in salmon price prediction using deep learning,” J. Commod. Mark., vol. 36, 2024. https://doi.org/10.1016/j.jcomm.2024.100438

R. Kizito, P. Scruggs, X. Li, M. Devinney, J. Jansen, and R. Kress, “Long Short-Term Memory Networks for Facility Infrastructure Failure and Remaining Useful Life Prediction,” IEEE Access, vol. 9, pp. 67585–67594, 2021. https://doi.org/10.1109/ACCESS.2021.3077192

S. Shan et al., “A Deep Learning Model with Attention-BiLSTM Networks Combining XGBoost Residual Correction for Short-Term Water Demand Forecast,” 2022.

B. H. Nayef, S. N. H. S. Abdullah, R. Sulaiman, and A. M. Saeed, “Text Extraction with Optimal Bi-LSTM,” Comput. Mater. Contin., vol. 76, no. 3, pp. 3549–3567, 2023. https://doi.org/10.32604/cmc.2023.039528

M. A. K. Raiaan et al., “A systematic review of hyperparameter optimization techniques in Convolutional Neural Networks,” Decis. Anal. J., vol. 11, no. March, p. 100470, 2024. https://doi.org/10.1016/j.dajour.2024.100470

Issue

Vol. 10, No. 3, August 2025

Analysis of Mental Health Disorders via Social Media Mining Using LSTM and Bi-LSTM

Corresponding Author(s) : Binti Kholifah

Abstract

Keywords

Download Citation

References

Downloads