Application of Reinforcement Learning Techniques in Software Testing of Android Applications: A Systematic Literature Review

Nadiah Mohd Hanim (1), Johanna Ahmad (2), Mohd Arfian Ismail (3), Nor Amalina Mohd Sabri (4), Shahdatunnaim Azmi (5)
(1) Faculty of Computing, Universiti Teknologi Malaysia, Johor Bahru, Johor, Malaysia
(2) Faculty of Computing, Universiti Teknologi Malaysia, Johor Bahru, Johor, Malaysia
(3) Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan, Pahang, Malaysia
(4) Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Parit Raja, Johor, Malaysia
(5) Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Parit Raja, Johor, Malaysia
Fulltext View | Download
How to cite (IJASEIT) :
[1]
N. M. Hanim, J. Ahmad, M. A. Ismail, N. A. Mohd Sabri, and S. Azmi, “Application of Reinforcement Learning Techniques in Software Testing of Android Applications: A Systematic Literature Review”, Int. J. Adv. Sci. Eng. Inf. Technol., vol. 15, no. 2, pp. 647–653, Apr. 2025.
Software testing is a critical process in ensuring the quality and reliability of applications before they are deployed to production. However, it is resource-intensive and often tedious, particularly in the context of Android applications, which pose unique challenges due to their vast state space, diverse user interactions, and variable behaviors. Reinforcement learning (RL), a machine learning framework where agents interact with environments to improve decision-making policies, has gained attention for its potential in software testing. This systematic literature review examines the application of reinforcement learning in software testing of Android applications, focusing on widely researched areas, prevalent techniques, and emerging trends. The review analyzes 22 selected studies from an initial pool of over 30,000 articles published between 2020 and 2024. The findings highlight that automated testing is the primary focus in this domain, with Q-learning emerging as the dominant RL technique. Actor-critic methods, deep Q-networks (DQN), and policy gradient approaches are also explored in several studies, aiming to improve the adaptability and efficiency of testing processes. Most research emphasizes fault detection and coverage maximization, often targeting event-driven interactions and GUI-based behaviors. Despite significant advancements, the study identifies underexplored areas, such as test case prioritization and the integration of user behavior or user interaction data, as promising directions for future research. This review contributes to understanding the current landscape and offers guidance for future RL-based Android application testing investigations.

A. Abo-eleneen, A. Palliyali, and C. Catal, "The role of reinforcement learning in software testing," Inf. Softw. Technol., vol. 165, Dec. 2023, doi: 10.1016/j.infsof.2023.107325.

K. Sugali, C. Sprunger, and V. N. Inukollu, "Software testing: Issues and challenges of artificial intelligence & machine learning," Int. J. Artif. Intell. Appl., vol. 12, no. 1, pp. 101-112, Jan. 2021, doi:10.5121/ijaia.2021.12107.

H. You et al., "Navigating the testing of evolving deep learning systems: An exploratory interview study," in Proc. IEEE/ACM 47th Int. Conf. Softw. Eng. (ICSE), 2025, p. 643, doi:10.1109/ICSE55347.2025.00106.

M. Idham et al., "Test case prioritization for software product line: A systematic mapping study," JOIV Int. J. Inf. Vis., vol. 7, p. 2126, Nov. 2023, doi: 10.30630/joiv.7.3-2.1340.

J. Wang et al., "Software testing with large language models: Survey, landscape, and vision," IEEE Trans. Softw. Eng., vol. 50, no. 4, pp. 911-936, Apr. 2024, doi: 10.1109/TSE.2024.3368208.

S. Haldar, M. Pierce, and L. F. Capretz, "Exploring the integration of generative AI tools in software testing education: A case study on ChatGPT and Copilot for preparatory testing artifacts in postgraduate learning," IEEE Access, vol. 13, 2025, doi:10.1109/access.2025.3545882.

Y. Matsuo et al., "Deep learning, reinforcement learning, and world models," Neural Netw., vol. 152, pp. 267-275, Aug. 2022, doi:10.1016/j.neunet.2022.03.037.

M. Binjubier et al., "A GPU accelerated parallel genetic algorithm for the traveling salesman problem," J. Soft Comput. Data Min., vol. 5, no. 2, pp. 137-150, 2024, doi: 10.30880/jscdm.2024.05.02.010.

F. Pecorelli et al., "Software testing and Android applications: A large-scale empirical study," Empir. Softw. Eng., vol. 27, no. 2, Mar. 2022, doi: 10.1007/s10664-021-10059-5.

Z. Chen et al., "Exploring better black-box test case prioritization via log analysis," ACM Trans. Softw. Eng. Methodol., vol. 32, no. 3, Apr. 2023, doi: 10.1145/3569932.

X. Li, "GUI testing for Android applications: A survey," in Proc. 7th Int. Conf. Comput., Softw. Model. (ICCSM), 2023, pp. 6-10, doi:10.1109/iccsm60247.2023.00010.

M. Naeem, S. T. H. Rizvi, and A. Coronato, "A gentle introduction to reinforcement learning and its application in different fields," IEEE Access, vol. 8, pp. 209320-209344, 2020, doi:10.1109/access.2020.3038605.

P. S. N. Mindom, A. Nikanjam, and F. Khomh, "Integrating reinforcement learning in software testing automation: A promising approach," Empir. Softw. Eng., 2023, doi: 10.5753/ise.2023.235976.

S. Elbaum, G. Rothermel, and J. Penix, "Techniques for improving regression testing in continuous integration development environments," in Proc. ACM SIGSOFT Symp. Found. Softw. Eng., 2014, pp. 235-245, doi: 10.1145/2635868.2635910.

H. A. M. Shaffril, S. F. Samsuddin, and A. A. Samah, "The ABC of systematic literature review: The basic methodological guidance for beginners," Qual. Quant., vol. 55, no. 4, pp. 1319-1346, Aug. 2021, doi: 10.1007/s11135-020-01059-6.

B. Kitchenham and S. Charters, "Guidelines for performing systematic literature reviews in software engineering," Keele Univ., Keele, UK, Tech. Rep. EBSE-2007-01, 2007.

H. Eljak et al., "E-learning-based cloud computing environment: A systematic review, challenges, and opportunities," IEEE Access, vol. 12, pp. 7329-7355, 2024, doi: 10.1109/access.2023.3339250.

C. Tao et al., "A reinforcement learning-based approach to testing GUI of mobile applications," World Wide Web, vol. 27, no. 2, Mar. 2024, doi: 10.1007/s11280-024-01252-9.

E. Collins et al., "Deep reinforcement learning based Android application GUI testing," in Proc. ACM Int. Conf. Proceeding Ser., 2021, pp. 186-194, doi: 10.1145/3474624.3474634.

L. Cai et al., "Automated testing of Android applications integrating residual network and deep reinforcement learning," in Proc. IEEE Int. Conf. Softw. Qual., Rel. Secur. (QRS), 2021, pp. 189-196, doi:10.1109/qrs54544.2021.00030.

L. Cai et al., "Reinforcement learning application testing method based on multi-attribute fusion," in Proc. 9th Int. Conf. Dependable Syst. Appl. (DSA), 2022, pp. 24-33, doi: 10.1109/DSA56465.2022.00013.

Y. Zhao, B. Harrison, and T. Yu, "DinoDroid: Testing Android apps using deep Q-networks," ACM Trans. Softw. Eng. Methodol., vol. 33, no. 5, Jun. 2024, doi: 10.1145/3652150.

H. Guo et al., "SQDroid: A semantic-driven testing for Android apps via Q-learning," in Proc. IEEE Int. Conf. Softw. Qual., Rel. Secur. (QRS), 2021, pp. 301-310, doi: 10.1109/qrs54544.2021.00041.

Y. P. Lopez et al., "Q-funcT: A reinforcement learning approach for automated black box functionality testing," in Proc. 2nd IEEE Int. Conf. Softw. Eng. Artif. Intell. (SEAI), 2022, pp. 119-123, doi:10.1109/seai55746.2022.9832177.

H. Wang et al., "CamDroid: Context-aware model-based automated GUI testing for Android apps," Tsinghua Sci. Technol., vol. 30, no. 1, pp. 55-67, Feb. 2024, doi: 10.26599/tst.2024.9010038.

Y. Lan et al., "Deeply reinforcing Android GUI testing with deep reinforcement learning," in Proc. Int. Conf. Softw. Eng., Feb. 2024, doi: 10.1145/3597503.3623344.

M. Pan et al., "Reinforcement learning based curiosity-driven testing of Android applications," in Proc. 29th ACM SIGSOFT Int. Symp. Softw. Test. Anal., Jul. 2020, pp. 153-164, doi:10.1145/3395363.3397354.

Z. Lv et al., "Fastbot2: Reusable automated model-based GUI testing for Android enhanced by reinforcement learning," in Proc. ACM Int. Conf. Proceeding Ser., Sep. 2022, doi: 10.1145/3551349.3559505.

K. Murase and S. Takada, "Applying reinforcement learning for automated testing of mobile application focusing on state definition, reward, and learning method," in Proc. Int. Conf. Softw. Eng. Knowl. Eng., 2023, pp. 64-69, doi: 10.18293/SEKE23-154.

C. Peng et al., "Hawkeye: Change-targeted testing for Android apps based on deep reinforcement learning," in Proc. ACM Int. Conf. Proceeding Ser., Apr. 2024, pp. 298-308, doi:10.1145/3639477.3639749.

Y. Gao et al., "A deep reinforcement learning-based approach for Android GUI testing," in Lect. Notes Comput. Sci., vol. 13975, pp. 262-276, 2023, doi: 10.1007/978-3-031-25201-3_20.

F. Wang, C. Tao, and J. Gao, "REDQT: A method for automated mobile application GUI testing based on deep reinforcement learning algorithms," Softw. Syst. Model., 2024, doi: 10.1007/s11761-024-00413-y.

Z. Zhang et al., "UniRLTest: Universal platform-independent testing with reinforcement learning via image understanding," in Proc. 31st ACM SIGSOFT Int. Symp. Softw. Test. Anal., Jul. 2022, pp. 805-808, doi: 10.1145/3533767.3543292.

H. N. Yasin, S. H. A. Hamid, and R. J. R. Yusof, "Droidbotx: Test case generation tool for android applications using q-learning," Symmetry, vol. 13, no. 2, Feb. 2021, doi: 10.3390/sym13020310.

M. K. Khan and R. Bryce, "Android GUI test generation with SARSA," in Proc. IEEE 12th Annu. Comput. Commun. Workshop Conf., 2022, pp. 487-493, doi: 10.1109/ccwc54503.2022.9720807.

A. Usman et al., "Test case generation approach for Android applications using reinforcement learning," Eng., Technol. Appl. Sci. Res., vol. 14, no. 4, pp. 15127-15132, Aug. 2024, doi:10.48084/etasr.7422.

Y. Koroglu and A. Sen, "Functional test generation from UI test scenarios using reinforcement learning for android applications," Softw. Test., Verif. Reliab., May 2021, doi:10.1002/stvr.1752.

M. Waqar et al., "Test suite prioritization based on optimization approach using reinforcement learning," Appl. Sci., vol. 12, no. 13, Jul. 2022, doi: 10.3390/app12136772.

M. K. Khan et al., "Post prioritization techniques to improve code coverage for SARSA generated test cases," in Proc. IEEE 13th Annu. Comput. Commun. Workshop Conf., 2023, pp. 1029-1035, doi:10.1109/CCWC57344.2023.10099120.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).