Design and Evaluation of a Sentiment and Topic Modeling Dashboard for Mining Google Play Reviews of a Public Service Mobile Application

Authors

  • Indra Adi Permana Gunadarma University, Indonesia
  • Amrin Fakhruddin Jauhari Universitas Nusa Megarkencana, Yogyakarta, Indonesia
  • Arief Fadhlurrahman Rasyid Universitas Nusa Megarkencana, Yogyakarta, Indonesia
  • Diokta Redho Lastin Universitas Gunadarma, Indonesia
  • Fahmi Fathullah Universitas Nusa Megarkencana, Yogyakarta, Indonesia

DOI:

https://doi.org/10.56127/jts.v5i1.2555

Keywords:

sentiment analysis, topic modelling, app review mining, information system, analytics dashboard, LDA, SVM

Abstract

App-store reviews are a rich source of user feedback for improving mobile service quality, yet they are large in volume, unstructured, and can change rapidly after version updates, making manual triage difficult to perform consistently. This condition requires an approach that can transform review text into prioritized, decision-ready information through a monitoring-oriented information system. Objective: This study aims to design and evaluate an app-review analytics information system that integrates sentiment classification and topic modeling to capture user perception, identify dominant issues, and support faster and more measurable prioritization of app improvements. Methodology: The study employs a quantitative design using an applied case study and a design-and-evaluate approach. Secondary data were collected from 5,000 Google Play reviews over 12 months (text, rating, timestamp), and primary data were obtained through dashboard usability testing with 20 participants using the System Usability Scale (SUS). The analysis includes text preprocessing, TF–IDF-based sentiment classification with multiple model comparisons, LDA topic modeling to extract recurring issues, and integration of outputs into a dashboard for trend monitoring and reporting. Findings: The best-performing model (SVM) achieved Accuracy = 0.86 and Macro-F1 = 0.84 for three-class sentiment classification (positive–neutral–negative). Topic modeling produced 10 dominant topics, with negative sentiment most strongly concentrated in core service issues such as login/OTP failures (72% negative), post-update crashes (69%), payment/transaction errors (65%), and server downtime (63%). The dashboard achieved a usability score of SUS = 82.3, indicating strong operational readiness for monitoring and triage. Implications: The results provide a practical basis for app managers to conduct periodic monitoring, convert dominant complaints into prioritized maintenance backlogs, and assess the impact of updates through sentiment and topic trends. The approach can be adopted as a routine decision-support mechanism for product owners, helpdesk teams, and developers. Originality/Value: This study’s novelty lies in its end-to-end integration of NLP analytics (sentiment and topics) with an information system artifact (dashboard), complemented by standardized usability validation. Unlike studies that stop at model outputs, this work demonstrates how analytics can be operationalized into a continuously usable monitoring system for app-maintenance decision-making.

References

Bangor, A., Kortum, P. T., & Miller, J. T. (2008). An empirical evaluation of the System Usability Scale. International Journal of Human–Computer Interaction, 24(6), 574–594. https://doi.org/10.1080/10447310802205776

Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet Allocation. Journal of Machine Learning Research, 3, 993–1022.

Calefato, F., Lanubile, F., Maiorano, F., & Novielli, N. (2018). Sentiment polarity detection for software development. Empirical Software Engineering, 23(3), 1352–1382. https://doi.org/10.1007/s10664-017-9546-9

Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20, 273–297. https://doi.org/10.1007/BF00994018

Dąbrowski, J., Letier, E., Perini, A., & Susi, A. (2022). Analysing app reviews for software engineering: A systematic literature review. Empirical Software Engineering, 27(2), 43. https://doi.org/10.1007/s10664-021-10065-7

DeLone, W. H., & McLean, E. R. (2003). The DeLone and McLean model of information systems success: A ten-year update. Journal of Management Information Systems, 19(4), 9–30. https://doi.org/10.1080/07421222.2003.11045748

Genc-Nayebi, N., & Abran, A. (2017). A systematic literature review: Opinion mining studies from mobile app store user reviews. Journal of Systems and Software, 125, 207–219. https://doi.org/10.1016/j.jss.2016.11.027

Islam, M. R., & Zibran, M. F. (2018). SentiStrength-SE: Exploiting domain specificity for improved sentiment analysis in software engineering text. Journal of Systems and Software, 145, 125–146. https://doi.org/10.1016/j.jss.2018.08.030

Jongeling, R., Sarkar, P., Datta, S., & Serebrenik, A. (2017). On negative results when using sentiment analysis tools for software engineering research. Empirical Software Engineering, 22(5), 2543–2584. https://doi.org/10.1007/s10664-016-9493-x

Lewis, J. R. (2018). The System Usability Scale: Past, present, and future. International Journal of Human–Computer Interaction, 34(7), 577–590. https://doi.org/10.1080/10447318.2018.1455307

Lin, B., Cassee, N. W., Serebrenik, A., Bavota, G., Novielli, N., & Lanza, M. (2022). Opinion mining for software development: A systematic literature review. ACM Transactions on Software Engineering and Methodology, 31(3), 38. https://doi.org/10.1145/3490388

Maalej, W., Kurtanović, Z., Nabil, H., & Stanik, C. (2016). On the automatic classification of app reviews. Requirements Engineering, 21(3), 311–331. https://doi.org/10.1007/s00766-016-0251-9

Malgaonkar, S., Licorish, S. A., & Savarimuthu, B. T. R. (2022). Prioritizing user concerns in app reviews: A study of requests for new features, enhancements and bug fixes. Information and Software Technology, 144, 106798. https://doi.org/10.1016/j.infsof.2021.106798

Martin, W., Sarro, F., Jia, Y., Zhang, Y., & Harman, M. (2017). A survey of app store analysis for software engineering. IEEE Transactions on Software Engineering, 43(9), 817–847. https://doi.org/10.1109/TSE.2016.2630689

Novielli, N., Calefato, F., Lanubile, F., & Serebrenik, A. (2021). Assessment of off-the-shelf SE-specific sentiment analysis tools: An extended replication study. Empirical Software Engineering, 26, 77. https://doi.org/10.1007/s10664-021-09960-w

Obaidi, M., Nagel, L., Specht, A., & Klünder, J. (2022). Sentiment analysis tools in software engineering: A systematic mapping study. Information and Software Technology, 151, 107018. https://doi.org/10.1016/j.infsof.2022.107018

Panichella, S., Di Sorbo, A., Guzman, E., Visaggio, C. A., Canfora, G., & Gall, H. C. (2015). How can I improve my app? Classifying user reviews for software maintenance and evolution. In 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME) (pp. 281–290). https://doi.org/10.1109/ICSME.2015.7332474

Thelwall, M., Buckley, K., & Paltoglou, G. (2012). Sentiment strength detection for the social web. Journal of the American Society for Information Science and Technology, 63(1), 163–173. https://doi.org/10.1002/asi.21662

Wang, X., Zhang, T., Tan, Y., Shang, W., & Li, Y. (2024). How to effectively mine app reviews concerning software ecosystem? A survey of review characteristics. Journal of Systems and Software, 213, 112040. https://doi.org/10.1016/j.jss.2024.112040

Yang, C., Wu, L., Yu, C., & Zhou, Y. (2021). A phrase-level user requests mining approach in mobile application reviews: Concept, framework, and operation. Information, 12(5), 177. https://doi.org/10.3390/info12050177

Zhang, T., Xu, B., Thung, F., Haryono, S. A., Lo, D., & Jiang, L. (2020). Sentiment analysis for software engineering: How far can pre-trained transformer models go? In 2020 IEEE International Conference on Software Maintenance and Evolution (ICSME). https://doi.org/10.1109/ICSME46990.2020.00017

Downloads

Published

2026-02-04

How to Cite

Indra Adi Permana, Amrin Fakhruddin Jauhari, Arief Fadhlurrahman Rasyid, Diokta Redho Lastin, & Fahmi Fathullah. (2026). Design and Evaluation of a Sentiment and Topic Modeling Dashboard for Mining Google Play Reviews of a Public Service Mobile Application. Jurnal Teknik Dan Science, 5(1), 39–54. https://doi.org/10.56127/jts.v5i1.2555

Citation Check

Similar Articles

1 2 3 4 5 6 7 8 > >> 

You may also start an advanced similarity search for this article.