
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Predicting Social Media Post Engagement and Virality Using Graph Neural Network Approaches and Content-Based Features
Corresponding Author(s) : Fathimah Az Zahrah
Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control,
Vol. 11, No. 3, August 2026 (Article in Progress)
Abstract
Social media teams increasingly rely on early signals to prioritize content, yet forecasting engagement and identifying viral posts remain difficult under temporal drift and heavy-tailed interaction counts. This study evaluated Graph Neural Network (GNN) approaches for predicting post engagement and virality from pre-posting content-based and contextual features. The Social Media Engagement Report dataset, which contained 100,000 posts across Twitter, LinkedIn, Facebook, and Instagram spanning March 2021–March 2024, was used. Post-release variables (impressions, reach, engagement rate) were excluded to prevent leakage. A homogeneous post–post graph was constructed using k-nearest-neighbor similarity in an embedding space and exact-match links on low-cardinality context. Ridge/Logistic Regression, Random Forest, and XGBoost as the baselines were compared against GraphSAGE and GAT under a chronological train, validation, and test split. Regression used MAE, RMSE, and R2, while virality classification used ROC-AUC, PR-AUC, and Precision at the top 1% ranked posts. GraphSAGE yielded the strongest virality screening, achieving ROC-AUC = 0.66, PR-AUC = 0.54–0.56, and Precision@1% up to 0.75, substantially above non-graph baselines. For regression, GAT produced the lowest errors despite a negative R², indicating limited explained variance. Overall, similarity-graph GNNs are most effective for early virality identification, whereas exact count prediction remains challenging in a strictly pre-posting, time-aware setting.
Keywords
Download Citation
Endnote/Zotero/Mendeley (RIS)BibTeX
- we are social, “Digital 2025 Global Overview Report.” Accessed: Dec. 15, 2025. [Online]. Available: https://wearesocial.com/wp-content/uploads/2025/02/GDR-2025-v2.pdf
- E. Sangiorgio, N. Di Marco, G. Etta, M. Cinelli, R. Cerqueti, and W. Quattrociocchi, “Evaluating the effect of viral posts on social media engagement,” Scientific Reports 2025 15:1, vol. 15, no. 1, pp. 639-, Jan. 2025, doi: 10.1038/s41598-024-84960-6.
- H. Setiawan, W. Handayani, and M. S. I. Komunikasi, “Gatekeeping konten viral media sosial (studi kasus harianjogja.com, jogja.tribunnews.com, & suarajogja.id),” Lektur: Jurnal Ilmu Komunikasi, vol. 5, no. 4, 2022, Accessed: Dec. 28, 2025. [Online]. Available: https://journal.student.uny.ac.id/ilkom/article/view/19182
- D. H. Kim, O. Kuru, J. Zeng, and S. Kim, “Fostering mask-wearing with virality metrics and social media literacy: evidence from the U.S. and Korea,” Front Psychol, vol. 14, p. 1151061, 2023, doi: 10.3389/FPSYG.2023.1151061.
- N. Jalli, “Viral Justice: TikTok Activism, Misinformation, and the Fight for Social Change in Southeast Asia,” Social Media and Society, vol. 11, no. 1, Jan. 2025, doi: 10.1177/20563051251318122;PAGE:STRING:ARTICLE/CHAPTER.
- E. Botha and M. Reyneke, “To share or not to share: The role of content and emotion in viral marketing,” J Public Aff, vol. 13, no. 2, pp. 160–171, May 2013, doi: 10.1002/PA.1471.
- M. Heitmayer, “The Second Wave of Attention Economics. Attention as a Universal Symbolic Currency on Social Media and beyond,” Interact Comput, vol. 37, no. 1, pp. 18–29, Jan. 2025, doi: 10.1093/IWC/IWAE035.
- M. Tahmina Khanom and A. Professor, “Using social media marketing in the digital era: A necessity or a choice,” International Journal of Research in Business and Social Science (2147- 4478), vol. 12, no. 3, pp. 88–98, May 2023, doi: 10.20525/IJRBS.V12I3.2507.
- S. Ren, C. Gong, C. Zhang, and C. Li, “Public opinion communication mechanism of public health emergencies in Weibo: take the COVID-19 epidemic as an example,” Front Public Health, vol. 11, p. 1276083, Nov. 2023, doi: 10.3389/FPUBH.2023.1276083/BIBTEX.
- A. Iamnitchi, L. O. Hall, S. Horawalavithana, F. Mubang, K. W. Ng, and J. Skvoretz, “Modeling information diffusion in social media: data-driven observations,” Front Big Data, vol. 6, p. 1135191, May 2023, doi: 10.3389/FDATA.2023.1135191/BIBTEX.
- E. Sangiorgio, M. Cinelli, R. Cerqueti, and W. Quattrociocchi, “Followers do not dictate the virality of news outlets on social media,” PNAS Nexus, vol. 3, no. 7, Jun. 2024, doi: 10.1093/PNASNEXUS/PGAE257.
- T. Chani, O. O. Olugbara, T. Chani, and O. O. Olugbara, “Measuring Behavioral Influence on Social Media: A Social Impact Theory Approach to Identifying Influential Users,” Journalism and Media 2025, Vol. 6, vol. 6, no. 4, p. 205, Dec. 2025, doi: 10.3390/JOURNALMEDIA6040205.
- H. Metzler and D. Garcia, “Social Drivers and Algorithmic Mechanisms on Digital Media,” Perspectives on Psychological Science, vol. 19, no. 5, pp. 735–748, Sep. 2024, doi: 10.1177/17456916231185057.
- V. Gupta, K. Jung, and S. C. Yoo, “Exploring the Power of Multimodal Features for Predicting the Popularity of Social Media Image in a Tourist Destination,” Multimodal Technologies and Interaction 2020, Vol. 4, vol. 4, no. 3, pp. 1–23, Sep. 2020, doi: 10.3390/MTI4030064.
- K. R. Purba, D. Asirvatham, and R. K. Murugesan, “An analysis and prediction model of outsiders percentage as a new popularity metric on Instagram,” ICT Express, vol. 6, no. 3, pp. 243–248, Sep. 2020, doi: 10.1016/J.ICTE.2020.07.001.
- D. Jeong, H. Son, Y. Choi, and K. Kim, “Enhancing social media post popularity prediction with visual content,” Journal of the Korean Statistical Society 2024 53:3, vol. 53, no. 3, pp. 844–882, May 2024, doi: 10.1007/S42952-024-00270-7.
- S. Carta et al., “Popularity Prediction of Instagram Posts,” Information 2020, Vol. 11, vol. 11, no. 9, Sep. 2020, doi: 10.3390/INFO11090453.
- M. Saari, L. Haapanen, and P. Hurmelinna-Laukkanen, “Social media and international business: views and conceptual framing,” International Marketing Review, vol. 39, no. 7, pp. 25–45, Dec. 2022, doi: 10.1108/IMR-06-2021-0191/FULL/PDF.
- J. Song et al., “Graph Representation-Based Deep Multi-View Semantic Similarity Learning Model for Recommendation,” Future Internet 2022, Vol. 14, vol. 14, no. 2, Jan. 2022, doi: 10.3390/FI14020032.
- J. Zhu and A. Yaseen, “A Recommender for Research Collaborators Using Graph Neural Networks,” Front Artif Intell, vol. 5, p. 881704, Aug. 2022, doi: 10.3389/FRAI.2022.881704/BIBTEX.
- Q. Tong, X. Xu, J. Zhang, and H. Xu, “Public Opinion Propagation Prediction Model Based on Dynamic Time-Weighted Rényi Entropy and Graph Neural Network,” Entropy 2025, Vol. 27, vol. 27, no. 5, May 2025, doi: 10.3390/E27050516.
- A. Golovin et al., “Improving Recommender Systems for Fake News Detection in Social Networks with Knowledge Graphs and Graph Attention Networks,” Mathematics 2025, Vol. 13, vol. 13, no. 6, Mar. 2025, doi: 10.3390/MATH13061011.
- Z. Xu, M. Qian, Z. Xu, and M. Qian, “Predicting Popularity of Viral Content in Social Media through a Temporal-Spatial Cascade Convolutional Learning Framework,” Mathematics 2023, Vol. 11, vol. 11, no. 14, Jul. 2023, doi: 10.3390/MATH11143059.
- Y. Shang et al., “Popularity Prediction of Online Contents via Cascade Graph and Temporal Information,” Axioms 2021, Vol. 10, vol. 10, no. 3, Jul. 2021, doi: 10.3390/AXIOMS10030159.
- P. K. Theodoridis, D. C. Gkikas, P. K. Theodoridis, and D. C. Gkikas, “Maximizing Social Media User Engagement Through Predictive Analytics in Retail Tourism: Identifying Key Performance Indicators That Trigger User Interactions,” Applied Sciences 2025, Vol. 15, vol. 15, no. 21, Nov. 2025, doi: 10.3390/APP152111720.
- “Social Media Engagement Report | Kaggle.” Accessed: Dec. 27, 2025. [Online]. Available: https://www.kaggle.com/datasets/aliredaelblgihy/social-media-engagement-report
- H. M. Lin and J. J. Lyu, “A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing,” BMC Medical Informatics and Decision Making 2025 25:1, vol. 25, no. 1, pp. 257-, Jul. 2025, doi: 10.1186/S12911-025-03094-5.
- P. Zhang, Z. Wang, H. Qiu, W. Zhou, M. Wang, and G. Cheng, “Machine learning applied to serum and cerebrospinal fluid metabolomes revealed altered arginine metabolism in neonatal sepsis with meningoencephalitis,” Comput Struct Biotechnol J, vol. 19, pp. 3284–3292, Jan. 2021, doi: 10.1016/J.CSBJ.2021.05.024.
- “Effect of transforming the targets in regression model — scikit-learn 1.8.0 documentation.” Accessed: Dec. 27, 2025. [Online]. Available: https://scikit-learn.org/stable/auto_examples/compose/plot_transformed_target.html
- P. P. Tricomi, M. Chilese, M. Conti, and A. R. Sadeghi, “Follow Us and Become Famous! Insights and Guidelines From Instagram Engagement Mechanisms,” Web Science Conference, pp. 346–356, Apr. 2023, doi: 10.1145/3578503.3583623.
- C. Papakostas et al., “NAMI: A Neuro-Adaptive Multimodal Architecture for Wearable Human–Computer Interaction,” Multimodal Technologies and Interaction 2025, Vol. 9, vol. 9, no. 10, Oct. 2025, doi: 10.3390/MTI9100108.
- K. Abdesselam et al., “Canada’s 2025 AMR priority pathogens: Evidence-based ranking and public health implications,” PLoS One, vol. 20, no. 9, p. e0330128, Sep. 2025, doi: 10.1371/JOURNAL.PONE.0330128.
- G. Marchesi, A. Ballarino, A. Brusaferri, G. Marchesi, A. Ballarino, and A. Brusaferri, “Assessing Time Series Foundation Models for Probabilistic Electricity Price Forecasting: Toward a Unified Benchmark,” Energies 2025, Vol. 18, vol. 18, no. 23, Nov. 2025, doi: 10.3390/EN18236269.
- M. Kim and K. B. Hwang, “An empirical evaluation of sampling methods for the classification of imbalanced data,” PLoS One, vol. 17, no. 7, p. e0271260, Jul. 2022, doi: 10.1371/JOURNAL.PONE.0271260.
- M. K. Baxi, R. Sharma, and V. Mago, “Studying topic engagement and synergy among candidates for 2020 US Elections,” Soc Netw Anal Min, vol. 12, no. 1, p. 136, Dec. 2022, doi: 10.1007/S13278-022-00959-9.
- N. Husin, H. Fazlurrahman, A. Safitri, R. Dhenabayu, U. A. A. Rauf, and A. M. Fitrah, “Policy Perspective on Proposed Framework of NLP AI to Bridge the Inclusive Support in Higher Education with a Mixed Methods Approach in Indonesia and Malaysia,” International Journal of Information and Education Technology, vol. 15, no. 12, pp. 2686–2699, 2025, doi: 10.18178/ijiet.2025.15.12.2464.
- R. Yang, A. Yu, L. Cai, and D. Meng, “Subspace clustering via graph auto-encoder network for unknown encrypted traffic recognition,” Cybersecurity 2022 5:1, vol. 5, no. 1, pp. 29-, Dec. 2022, doi: 10.1186/S42400-022-00131-Y.
- N. K. Nissa, R. T. Pusparini, A. Setiyoko, and A. M. Arymurthy, “The Implementation of Inductive Graph Neural Networks with L1 Loss for Spatiotemporal Kriging,” AIP Conf Proc, vol. 2941, no. 1, Dec. 2023, doi: 10.1063/5.0184738/2929312.
References
we are social, “Digital 2025 Global Overview Report.” Accessed: Dec. 15, 2025. [Online]. Available: https://wearesocial.com/wp-content/uploads/2025/02/GDR-2025-v2.pdf
E. Sangiorgio, N. Di Marco, G. Etta, M. Cinelli, R. Cerqueti, and W. Quattrociocchi, “Evaluating the effect of viral posts on social media engagement,” Scientific Reports 2025 15:1, vol. 15, no. 1, pp. 639-, Jan. 2025, doi: 10.1038/s41598-024-84960-6.
H. Setiawan, W. Handayani, and M. S. I. Komunikasi, “Gatekeeping konten viral media sosial (studi kasus harianjogja.com, jogja.tribunnews.com, & suarajogja.id),” Lektur: Jurnal Ilmu Komunikasi, vol. 5, no. 4, 2022, Accessed: Dec. 28, 2025. [Online]. Available: https://journal.student.uny.ac.id/ilkom/article/view/19182
D. H. Kim, O. Kuru, J. Zeng, and S. Kim, “Fostering mask-wearing with virality metrics and social media literacy: evidence from the U.S. and Korea,” Front Psychol, vol. 14, p. 1151061, 2023, doi: 10.3389/FPSYG.2023.1151061.
N. Jalli, “Viral Justice: TikTok Activism, Misinformation, and the Fight for Social Change in Southeast Asia,” Social Media and Society, vol. 11, no. 1, Jan. 2025, doi: 10.1177/20563051251318122;PAGE:STRING:ARTICLE/CHAPTER.
E. Botha and M. Reyneke, “To share or not to share: The role of content and emotion in viral marketing,” J Public Aff, vol. 13, no. 2, pp. 160–171, May 2013, doi: 10.1002/PA.1471.
M. Heitmayer, “The Second Wave of Attention Economics. Attention as a Universal Symbolic Currency on Social Media and beyond,” Interact Comput, vol. 37, no. 1, pp. 18–29, Jan. 2025, doi: 10.1093/IWC/IWAE035.
M. Tahmina Khanom and A. Professor, “Using social media marketing in the digital era: A necessity or a choice,” International Journal of Research in Business and Social Science (2147- 4478), vol. 12, no. 3, pp. 88–98, May 2023, doi: 10.20525/IJRBS.V12I3.2507.
S. Ren, C. Gong, C. Zhang, and C. Li, “Public opinion communication mechanism of public health emergencies in Weibo: take the COVID-19 epidemic as an example,” Front Public Health, vol. 11, p. 1276083, Nov. 2023, doi: 10.3389/FPUBH.2023.1276083/BIBTEX.
A. Iamnitchi, L. O. Hall, S. Horawalavithana, F. Mubang, K. W. Ng, and J. Skvoretz, “Modeling information diffusion in social media: data-driven observations,” Front Big Data, vol. 6, p. 1135191, May 2023, doi: 10.3389/FDATA.2023.1135191/BIBTEX.
E. Sangiorgio, M. Cinelli, R. Cerqueti, and W. Quattrociocchi, “Followers do not dictate the virality of news outlets on social media,” PNAS Nexus, vol. 3, no. 7, Jun. 2024, doi: 10.1093/PNASNEXUS/PGAE257.
T. Chani, O. O. Olugbara, T. Chani, and O. O. Olugbara, “Measuring Behavioral Influence on Social Media: A Social Impact Theory Approach to Identifying Influential Users,” Journalism and Media 2025, Vol. 6, vol. 6, no. 4, p. 205, Dec. 2025, doi: 10.3390/JOURNALMEDIA6040205.
H. Metzler and D. Garcia, “Social Drivers and Algorithmic Mechanisms on Digital Media,” Perspectives on Psychological Science, vol. 19, no. 5, pp. 735–748, Sep. 2024, doi: 10.1177/17456916231185057.
V. Gupta, K. Jung, and S. C. Yoo, “Exploring the Power of Multimodal Features for Predicting the Popularity of Social Media Image in a Tourist Destination,” Multimodal Technologies and Interaction 2020, Vol. 4, vol. 4, no. 3, pp. 1–23, Sep. 2020, doi: 10.3390/MTI4030064.
K. R. Purba, D. Asirvatham, and R. K. Murugesan, “An analysis and prediction model of outsiders percentage as a new popularity metric on Instagram,” ICT Express, vol. 6, no. 3, pp. 243–248, Sep. 2020, doi: 10.1016/J.ICTE.2020.07.001.
D. Jeong, H. Son, Y. Choi, and K. Kim, “Enhancing social media post popularity prediction with visual content,” Journal of the Korean Statistical Society 2024 53:3, vol. 53, no. 3, pp. 844–882, May 2024, doi: 10.1007/S42952-024-00270-7.
S. Carta et al., “Popularity Prediction of Instagram Posts,” Information 2020, Vol. 11, vol. 11, no. 9, Sep. 2020, doi: 10.3390/INFO11090453.
M. Saari, L. Haapanen, and P. Hurmelinna-Laukkanen, “Social media and international business: views and conceptual framing,” International Marketing Review, vol. 39, no. 7, pp. 25–45, Dec. 2022, doi: 10.1108/IMR-06-2021-0191/FULL/PDF.
J. Song et al., “Graph Representation-Based Deep Multi-View Semantic Similarity Learning Model for Recommendation,” Future Internet 2022, Vol. 14, vol. 14, no. 2, Jan. 2022, doi: 10.3390/FI14020032.
J. Zhu and A. Yaseen, “A Recommender for Research Collaborators Using Graph Neural Networks,” Front Artif Intell, vol. 5, p. 881704, Aug. 2022, doi: 10.3389/FRAI.2022.881704/BIBTEX.
Q. Tong, X. Xu, J. Zhang, and H. Xu, “Public Opinion Propagation Prediction Model Based on Dynamic Time-Weighted Rényi Entropy and Graph Neural Network,” Entropy 2025, Vol. 27, vol. 27, no. 5, May 2025, doi: 10.3390/E27050516.
A. Golovin et al., “Improving Recommender Systems for Fake News Detection in Social Networks with Knowledge Graphs and Graph Attention Networks,” Mathematics 2025, Vol. 13, vol. 13, no. 6, Mar. 2025, doi: 10.3390/MATH13061011.
Z. Xu, M. Qian, Z. Xu, and M. Qian, “Predicting Popularity of Viral Content in Social Media through a Temporal-Spatial Cascade Convolutional Learning Framework,” Mathematics 2023, Vol. 11, vol. 11, no. 14, Jul. 2023, doi: 10.3390/MATH11143059.
Y. Shang et al., “Popularity Prediction of Online Contents via Cascade Graph and Temporal Information,” Axioms 2021, Vol. 10, vol. 10, no. 3, Jul. 2021, doi: 10.3390/AXIOMS10030159.
P. K. Theodoridis, D. C. Gkikas, P. K. Theodoridis, and D. C. Gkikas, “Maximizing Social Media User Engagement Through Predictive Analytics in Retail Tourism: Identifying Key Performance Indicators That Trigger User Interactions,” Applied Sciences 2025, Vol. 15, vol. 15, no. 21, Nov. 2025, doi: 10.3390/APP152111720.
“Social Media Engagement Report | Kaggle.” Accessed: Dec. 27, 2025. [Online]. Available: https://www.kaggle.com/datasets/aliredaelblgihy/social-media-engagement-report
H. M. Lin and J. J. Lyu, “A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing,” BMC Medical Informatics and Decision Making 2025 25:1, vol. 25, no. 1, pp. 257-, Jul. 2025, doi: 10.1186/S12911-025-03094-5.
P. Zhang, Z. Wang, H. Qiu, W. Zhou, M. Wang, and G. Cheng, “Machine learning applied to serum and cerebrospinal fluid metabolomes revealed altered arginine metabolism in neonatal sepsis with meningoencephalitis,” Comput Struct Biotechnol J, vol. 19, pp. 3284–3292, Jan. 2021, doi: 10.1016/J.CSBJ.2021.05.024.
“Effect of transforming the targets in regression model — scikit-learn 1.8.0 documentation.” Accessed: Dec. 27, 2025. [Online]. Available: https://scikit-learn.org/stable/auto_examples/compose/plot_transformed_target.html
P. P. Tricomi, M. Chilese, M. Conti, and A. R. Sadeghi, “Follow Us and Become Famous! Insights and Guidelines From Instagram Engagement Mechanisms,” Web Science Conference, pp. 346–356, Apr. 2023, doi: 10.1145/3578503.3583623.
C. Papakostas et al., “NAMI: A Neuro-Adaptive Multimodal Architecture for Wearable Human–Computer Interaction,” Multimodal Technologies and Interaction 2025, Vol. 9, vol. 9, no. 10, Oct. 2025, doi: 10.3390/MTI9100108.
K. Abdesselam et al., “Canada’s 2025 AMR priority pathogens: Evidence-based ranking and public health implications,” PLoS One, vol. 20, no. 9, p. e0330128, Sep. 2025, doi: 10.1371/JOURNAL.PONE.0330128.
G. Marchesi, A. Ballarino, A. Brusaferri, G. Marchesi, A. Ballarino, and A. Brusaferri, “Assessing Time Series Foundation Models for Probabilistic Electricity Price Forecasting: Toward a Unified Benchmark,” Energies 2025, Vol. 18, vol. 18, no. 23, Nov. 2025, doi: 10.3390/EN18236269.
M. Kim and K. B. Hwang, “An empirical evaluation of sampling methods for the classification of imbalanced data,” PLoS One, vol. 17, no. 7, p. e0271260, Jul. 2022, doi: 10.1371/JOURNAL.PONE.0271260.
M. K. Baxi, R. Sharma, and V. Mago, “Studying topic engagement and synergy among candidates for 2020 US Elections,” Soc Netw Anal Min, vol. 12, no. 1, p. 136, Dec. 2022, doi: 10.1007/S13278-022-00959-9.
N. Husin, H. Fazlurrahman, A. Safitri, R. Dhenabayu, U. A. A. Rauf, and A. M. Fitrah, “Policy Perspective on Proposed Framework of NLP AI to Bridge the Inclusive Support in Higher Education with a Mixed Methods Approach in Indonesia and Malaysia,” International Journal of Information and Education Technology, vol. 15, no. 12, pp. 2686–2699, 2025, doi: 10.18178/ijiet.2025.15.12.2464.
R. Yang, A. Yu, L. Cai, and D. Meng, “Subspace clustering via graph auto-encoder network for unknown encrypted traffic recognition,” Cybersecurity 2022 5:1, vol. 5, no. 1, pp. 29-, Dec. 2022, doi: 10.1186/S42400-022-00131-Y.
N. K. Nissa, R. T. Pusparini, A. Setiyoko, and A. M. Arymurthy, “The Implementation of Inductive Graph Neural Networks with L1 Loss for Spatiotemporal Kriging,” AIP Conf Proc, vol. 2941, no. 1, Dec. 2023, doi: 10.1063/5.0184738/2929312.