*Result*: The landscape of artificial intelligence tools and platforms for evidence synthesis: a scoping review.
Pan American Health Organization. A Guide for Evidence-Informed Decision-Making, Including in Health Emergencies. Washington, D.C. 2022. https://iris.paho.org/handle/10665.2/55828 . Accessed 28 May 2024.
Cochrane Community Living Systematic Reviews. https://community.cochrane.org/review-development/resources/living-systematic-reviews . Accessed 27 May 2024.
Khangura S, Konnyu K, Cushman R, Grimshaw J, Moher D. Evidence summaries: the evolution of a rapid review approach. Syst Rev. 2012;1:10. (PMID: 22587960335173610.1186/2046-4053-1-10)
Arksey H, O’Malley L. Scoping studies: towards a methodological framework. Int J Soc Res Methodol. 2005;8:19–32. (PMID: 10.1080/1364557032000119616)
Levac D, Colquhoun H, O’Brien KK. Scoping studies: advancing the methodology. Implement Sci. 2010;5(69). https://doi.org/10.1186/1748-5908-5-69 . Accessed 28 May 2024.
Sousa MSA, Wainwright M, Soares CB. Qualitative Evidence Synthesis: an introductory guide. In: Qualitative Evidence Synthesis to inform health policy. Boletim do Instituto de Saúde 2019;20:5-18. https://www.saude.sp.gov.br/resources/instituto-desaude/homepage/bis/pdfs/bis_v20_n2_english.pdf . Accessed 28 May 2024.
Marshall IJ, Wallace BC. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Syst Rev. 2019. https://doi.org/10.1186/s13643-019-1074-9 . (PMID: 10.1186/s13643-019-1074-9318849726935491)
WHO. Evidence, policy, impact. Geneva: WHO guide for evidence-informed decision-making; 2021.
Borah R, Brown AW, Capers PL, Kaiser KA. Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open. 2017;7: e012545. (PMID: 28242767533770810.1136/bmjopen-2016-012545)
Bastian H, Glasziou P, Chalmers I. Seventy-five trials and eleven systematic reviews a day: how will we ever keep up? PLoS Med. 2010;7: e1000326. (PMID: 20877712294343910.1371/journal.pmed.1000326)
Takwoingi Y, Hopewell S, Tovey D, Sutton AJ. A multicomponent decision tool for prioritising the updating of systematic reviews. BMJ. 2013;347:f7191–f7191. (PMID: 2433645310.1136/bmj.f7191)
Elliott J, Lawrence R, Minx JC, Oladapo OT, Ravaud P, Tendal Jeppesen B, Thomas J, Turner T, Vandvik PO, Grimshaw JM. Decision makers need constantly updated evidence synthesis. Nature. 2021;600:383–5. (PMID: 3491207910.1038/d41586-021-03690-1)
Epistemonikos a free, relational, collaborative, multilingual database of health evidence. . http://www.epistemonikos.org . Accessed 27 May 2024.
Lassoued Y, Deleris L. Thesaurus-based hierarchical semantic grouping of medical terms in information extraction. In: Exploring Complexity in Health: An Interdisciplinary Systems Approach. IOS Press Ebooks. 2016;446–450. https://ebooks.iospress.nl/publication/44651 . Accessed 28 May 2024.
Deleris L, Deparis S, Sacaleanu B, Tounsi L. Risk Information Extraction and Aggregation. Third International Conference: Algorithmic Decision Theory. 2013. https://www.researchgate.net/publication/291354497_Risk_Information_Extraction_and_Aggregation_Experimenting_with_Medline_Abstracts . Accessed 28 May 2024.
Tsafnat G, Glasziou P, Choong MK, Dunn A, Galgani F, Coiera E. Systematic review automation technologies. Syst Rev. 2014. https://doi.org/10.1186/2046-4053-3-74 . (PMID: 10.1186/2046-4053-3-74250051284100748)
IBM Data and AI Team. Open source large language models: benefits, risks and types. 2023. https://www.ibm.com/blog/open-source-large-language-models-benefits-risks-and-types/ . Accessed 27 May 2024.
Clusmann J, Kolbinger FR, Muti HS, Carrero ZI, Eckardt J-N, Laleh NG, Löffler CML, Schwarzkopf S-C, Unger M, Veldhuizen GP. The future landscape of large language models in medicine. Communications Medicine. 2023;3:141. (PMID: 378168371056492110.1038/s43856-023-00370-1)
C A, Carter C. Large language models and intelligence analysis. In: CETaS Expert Analysis. 2023. https://cetas.turing.ac.uk/publications/large-language-models-and-intelligence-analysis#:~:text=LLMs%20are%20deep%20neural%20networks,such%20as%20Reddit%20and%20Wikipedia . Accessed 27 May 2024.
World Health Organization. WHO calls for safe and ethical AI for health. Geneva. 2023. https://www.who.int/news/item/16-05-2023-whocalls-for-safe-and-ethical-ai-for-health . Accessed 28 May 2024.
Minssen T, Vayena E, Cohen IG. The challenges for regulating medical use of ChatGPT and other large language models. JAMA. 2023;330:315. (PMID: 3741048210.1001/jama.2023.9651)
Panch T, Pearson-Stuttard J, Greaves F, Atun R. Artificial intelligence: opportunities and risks for public health. Lancet Digit Health. 2019;1:e13–4. (PMID: 3332323610.1016/S2589-7500(19)30002-0)
Reddy CL, Mitra S, Meara JG, Atun R, Afshar S. Artificial Intelligence and its role in surgical care in low-income and middle-income countries. Lancet Digit Health. 2019;1:e384–6. (PMID: 3332321710.1016/S2589-7500(19)30200-6)
O’Mara-Eves A, Thomas J, McNaught J, Miwa M, Ananiadou S. Using text mining for study identification in systematic reviews: a systematic review of current approaches. Syst Rev. 2015;4:5. (PMID: 25588314432053910.1186/2046-4053-4-5)
Jakhar D, Kaur I. Artificial intelligence, machine learning and deep learning: definitions and differences. Clin Exp Dermatol. 2020;45:131–2. (PMID: 3123362810.1111/ced.14029)
Amazon Web Services (2024) What is a neural network? In: Amazon. https://aws.amazon.com/what-is/neural-network/ . Accessed 7 Jan 2025.
Cohen KB. Biomedical natural language processing and text mining. In: Methods in Biomedical Informatics. Elsevier. 2014;41–177. https://doi.org/10.1016/B978-0-12-401678-1.00006-3 .
Khalil H, Ameen D, Zarnegar A. Tools to support the automation of systematic reviews: a scoping review. J Clin Epidemiol. 2022;144:22–42. (PMID: 3489623610.1016/j.jclinepi.2021.12.005)
Jimenez RC, Lee T, Rosillo N, et al. Machine learning computational tools to assist the performance of systematic reviews: a mapping review. BMC Med Res Methodol. 2022;22:322. (PMID: 10.1186/s12874-022-01805-4)
dos Santos AO, da Silva ES, Couto LM, Reis GVL, Belo VS, Santos ÁOD, da Silva ES, Couto LM, Reis GVL, Belo VS. The use of artificial intelligence for automating or semi-automating biomedical literature analyses: a scoping review. J Biomed Inform. 2023. https://doi.org/10.1016/j.jbi.2023.104389 . (PMID: 10.1016/j.jbi.2023.10438937187321)
Giattino C, Mathieu E, Samborska V, Roser M. Artificial intelligence. 2023. In: OurWorldInData.org. https://ourworldindata.org/artificial-intelligence (. Accessed 27 May 2024.
WHO (2021) Ethics and governance of artificial intelligence for health: WHO guidance. Geneva.
Tricco AC, Lillie E, Zarin W, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. 2018;169:467–73. (PMID: 3017803310.7326/M18-0850)
Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A. Rayyan—a web and mobile app for systematic reviews. Syst Rev. 2016;5:1–10. (PMID: 10.1186/s13643-016-0384-4)
de la Torre-lopez J, Ramirez A, Romero JR. Artificial intelligence to automate the systematic review of scientific literature. Computing. 2023. https://doi.org/10.1007/s00607-023-01181-x . (PMID: 10.1007/s00607-023-01181-x)
Robson B. Studies in using a universal exchange and inference language for evidence based medicine. Semi-automated learning and reasoning for PICO methodology, systematic review, and environmental epidemiology. Comput Biol Med. 2016;79:299–323. (PMID: 2784644610.1016/j.compbiomed.2016.10.009)
Feng LY, Chiam YK, Lo SK. Text-mining Techniques and Tools for Systematic Literature Reviews: A Systematic Literature Review. 24TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2017). 2017;41–50. https://doi.org/10.1109/APSEC.2017.10 . https://ieeexplore.ieee.org/document/8305926 . Accessed 28 May 2024.
Fleuren WWM, Alkema W. Application of text mining in the biomedical domain. Methods. 2015;74:97–106. (PMID: 2564151910.1016/j.ymeth.2015.01.015)
Park SE, Thomas J. Evidence synthesis software. Evid Based Med. 2018;23:140–1.
Schmidt L, Sinyor M, Webb RT, Marshall C, Knipe D, Eyles EC, John A, Gunnell D, Higgins JPT. A narrative review of recent tools and innovations toward automating living systematic reviews and evidence syntheses. Z Evid Fortbild Qual Gesundhwes. 2023;181:65–75. (PMID: 3759616010.1016/j.zefq.2023.06.007)
Sutton A, O’Keefe H, Johnson EE, Marshall C. A mapping exercise using automated techniques to develop a search strategy to identify systematic review tools. Res Synth Methods. 2023;14:874–81. (PMID: 3766990510.1002/jrsm.1665)
Afzal M, Alam F, Malik KM, Malik GM. Clinical context–aware biomedical text summarization using deep neural network: model development and validation. J Med Internet Res. 2020;22: e19810. (PMID: 33095174764781210.2196/19810)
Schmidt L, Finnerty Mutlu AN, Elmore R, Olorisade BK, Thomas J, Higgins JPT. Data extraction methods for systematic review (semi)automation: update of a living systematic review. F1000Res. 2023;10:401. (PMID: 10.12688/f1000research.51117.2)
Wallace BC, Dahabreh IJ, Schmid CH, Lau J, Trikalinos TA. Modernizing the systematic review process to inform comparative effectiveness: tools and methods. J Comp Eff Res. 2013;2:273–82. (PMID: 2423662610.2217/cer.13.17)
Del Giglio A, da Costa MUP. The use of artificial intelligence to improve the scientific writing of non-native english speakers. Rev Assoc Med Bras (1992). 2023;69:e20230560–e20230560. (PMID: 377293761050889210.1590/1806-9282.20230560)
Li J, Dada A, Kleesiek J, Egger J (2023) ChatGPT in healthcare: a taxonomy and systematic review. medRxiv 2023.
Roco-Videla Á, Aguilera-Eguía R, Olguín-Barraza M, Flores-Fernández C. El papel de la inteligencia artificial en las revisiones sistemáticas: implicaciones y desafíos para la divulgación científica TT - The role of artificial intelligence in systematic reviews: implications and challenges for scientific dissemination. Angiol (Barcelona). 2023;75:344–5.
Temsah O, Khan SA, Chaiah Y, et al. Overview of early ChatGPT’s presence in medical literature: insights from a hybrid literature review by ChatGPT and human experts. Cureus. 2023;15:e37281–e37281. (PMID: 3703838110082551)
Kamdar BB, Shah PA, Sakamuri S, Kamdar BS, Oh J. A novel search builder to expedite search strategies for systematic reviews. Int J Technol Assess Health Care. 2015;31:51–3. (PMID: 25989817484810710.1017/S0266462315000136)
Afzal M, Hussain M, Ali T, Hussain J, Khan WA, Lee S, Kang BH. Knowledge-based query construction using the CDSS knowledge base for efficient evidence retrieval. Sensors (Basel). 2015;15:21294–314. (PMID: 26343669461047410.3390/s150921294)
Aljaber B, Martinez D, Stokes N, Bailey J. Improving MeSH classification of biomedical articles using citation contexts. J Biomed Inform. 2011;44:881–96. (PMID: 2168380210.1016/j.jbi.2011.05.007)
Ananiadou S, Rea B, Okazaki N, Procter R, Thomas J. Supporting systematic reviews using text mining. Soc Sci Comput Rev. 2009;27:509–23. (PMID: 10.1177/0894439309332293)
Grames EM, Stillman AN, Tingley MW, Elphick CS. An automated approach to identifying search terms for systematic reviews using keyword co-occurrence networks. Methods Ecol Evol. 2019;10:1645–54. (PMID: 10.1111/2041-210X.13268)
Müller H, Pachnanda S, Pahl F, Rosenqvist C, IEEE,. The application of artificial intelligence on different types of literature reviews - a comparative study. INTERNATIONAL CONFERENCE ON APPLIED ARTIFICIAL INTELLIGENCE (ICAPAI). 2022;2022:38–44.
O’Keefe H, Rankin J, Wallace SA, Beyer F. Investigation of text-mining methodologies to aid the construction of search strategies in systematic reviews of diagnostic test accuracy-a case study. Res Synth Methods. 2023;14:79–98. (PMID: 3584112510.1002/jrsm.1593)
Ruiz N, Winter R, Rosa FD, Shukla P, Kazemian H (2021) Method and tool for generating table of relevance in literature review (MTTR). PROCEEDINGS OF THE 22ND EUROPEAN CONFERENCE ON KNOWLEDGE MANAGEMENT (ECKM 2021) 648–656.
Tercero-Hidalgo JR, Khan KS, Bueno-Cavanillas A, Fernández-López R, Huete JF, Amezcua-Prieto C, Zamora J, Fernández-Luna JM. Artificial intelligence in COVID-19 evidence syntheses was underutilized, but impactful: a methodological study. J Clin Epidemiol. 2022;148:124–34. (PMID: 35513213905939010.1016/j.jclinepi.2022.04.027)
Vasuki V, Cohen T. Reflective random indexing for semi-automatic indexing of the biomedical literature. J Biomed Inform. 2010;43:694–700. (PMID: 2038226510.1016/j.jbi.2010.04.001)
Westgate MJ. revtools: an R package to support article screening for evidence synthesis. Res Synth Methods. 2019;10:606–14. (PMID: 3135554610.1002/jrsm.1374)
Tóth B, Berek L, Gulácsi L, Péntek M, Zrubka Z. Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed. Syst Rev. 2024;13:174. (PMID: 389781321122925710.1186/s13643-024-02592-3)
Adam GP, Pappas D, Papageorgiou H, Evangelou E, Trikalinos TA. A novel tool that allows interactive screening of PubMed citations showed promise for the semi-automation of identification of biomedical literature. J Clin Epidemiol. 2022;150:63–71. (PMID: 3573830610.1016/j.jclinepi.2022.06.007)
Afzal M, Hussain M, Malik KM, Lee S. Impact of automatic query generation and quality recognition using deep learning to curate evidence from biomedical literature: empirical study. JMIR Med Inform. 2019;7: e13430. (PMID: 31815673692870310.2196/13430)
Ajiji P, Cottin J, Picot C, Uzunali A, Ripoche E, Cucherat M, Maison P. Feasibility study and evaluation of expert opinion on the semi-automated meta-analysis and the conventional meta-analysis. Eur J Clin Pharmacol. 2022;78:1177–84. (PMID: 3550147610.1007/s00228-022-03329-8)
Allot A, Lee K, Chen Q, Luo L, Lu Z. LitSuggest: a web-based system for literature recommendation and curation using machine learning. Nucleic Acids Res. 2021;49:W352–8. (PMID: 33950204826272310.1093/nar/gkab326)
Brassey J, Price C, Edwards J, Zlabinger M, Bampoulidis A, Hanbury A. Developing a fully automated evidence synthesis tool for identifying, assessing and collating the evidence. BMJ Evid Based Med. 2021;26:24–7. (PMID: 3146724710.1136/bmjebm-2018-111126)
Burgard T, Bittermann A. Reducing literature screening workload with machine learning a systematic review of tools and their performance. ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY. 2023;231:3–15. (PMID: 10.1027/2151-2604/a000509)
Denzler T, Enders MR, Akello P, Syst AI. Towards a semi-automated approach for systematic literature reviews completed research. 2021. DIGITAL INNOVATION AND ENTREPRENEURSHIP (AMCIS 2021).
Orel E, Ciglenecki I, Thiabaud A, Temerev A, Calmy A, Keiser O, Merzouki A. An Automated Literature Review Tool (LiteRev) for streamlining and accelerating research using natural language processing and machine learning: descriptive performance evaluation study. J Med Internet Res. 2023. https://doi.org/10.2196/39736 . (PMID: 10.2196/397363771326110541641)
Pérez-Pérez M, Ferreira T, Lourenço A, Igrejas G, Fdez-Riverola F. Boosting biomedical document classification through the use of domain entity recognizers and semantic ontologies for document representation: the case of gluten bibliome. Neurocomputing. 2022;484:223–37. (PMID: 10.1016/j.neucom.2021.10.100)
Thomas J, Noel-Storr A, Marshall I, et al. Living systematic reviews: 2. Combining human and machine effort. J Clin Epidemiol. 2017;91:31–7. (PMID: 2891200310.1016/j.jclinepi.2017.08.011)
Thomas J, McDonald S, Noel-Storr A, Shemilt I, Elliott J, Mavergames C, Marshall IJ. Machine learning reduced workload with minimal risk of missing studies: development and evaluation of a randomized controlled trial classifier for Cochrane Reviews. J Clin Epidemiol. 2021;133:140–51. (PMID: 3317127510.1016/j.jclinepi.2020.11.003)
Timsina P, El-Gayar O, Liu J, Syst AI. Active Learning for the Automation of Medical Systematic Review Creation. AMCIS 2015 Proceedings. 2015;22. https://aisel.aisnet.org/amcis2015/BizAnalytics/GeneralPresentations/22 . Accessed 28 May 2024.
Shemilt I, Arno A, Thomas J, et al. Cost-effectiveness of Microsoft Academic Graph with machine learning for automated study identification in a living map of coronavirus disease 2019 (COVID-19) research. Wellcome Open Res. 2021;6:210. (PMID: 3868601910.12688/wellcomeopenres.17141.1)
Martenot V, Masdeu V, Cupe J, Gehin F, Blanchon M, Dauriat J, Horst A, Renaudin M, Girard P, Zucker J-D. LiSA: an assisted literature search pipeline for detecting serious adverse drug events with deep learning. BMC Med Inform Decis Mak. 2022;22:338. (PMID: 36550485977350610.1186/s12911-022-02085-0)
Park J, Djelassi M, Chima D, Hernandez R, Poroshin V, Iliescu A-M, Domalik D, Southall N. Validation of a natural language machine learning model for safety literature surveillance. Drug Saf. 2023. https://doi.org/10.1007/s40264-023-01367-4 . (PMID: 10.1007/s40264-023-01367-43793853910584766)
Kwabena AE, Wiafe O-B, John B-D, Bernard A, Boateng FAF. An automated method for developing search strategies for systematic review using natural language processing (NLP). MethodsX. 2023. https://doi.org/10.1016/j.mex.2022.101935 . (PMID: 10.1016/j.mex.2022.10193536590320)
Perez-Iratxeta C, Bork P, Andrade MA. XplorMed: a tool for exploring MEDLINE abstracts. Trends Biochem Sci. 2001;26:573–5. (PMID: 1155179510.1016/S0968-0004(01)01926-0)
Alshami A, Elsayed M, Ali E, Eltoukhy AEE, Zayed T. Harnessing the power of ChatGPT for automating systematic review process: methodology, case study, limitations, and future directions. SYSTEMS. 2023. https://doi.org/10.3390/systems11070351WE-SocialScienceCitationIndex(SSCI) . (PMID: 10.3390/systems11070351WE-SocialScienceCitationIndex(SSCI))
Kontonatsios G, Brockmeier AJ, Przybyła P, McNaught J, Mu T, Goulermas JY, Ananiadou S. A semi-supervised approach using label propagation to support citation screening. J Biomed Inform. 2017;72:67–76. (PMID: 28648605572608510.1016/j.jbi.2017.06.018)
Stansfield C, Thomas J, Kavanagh J. “Clustering” documents automatically to support scoping reviews of research: a case study. Res Synth Methods. 2013;4:230–41. (PMID: 2605384310.1002/jrsm.1082)
States DJ, Ade AS, Wright ZC, Bookvich AV, Athey BD. MiSearch adaptive pubMed search tool. Bioinformatics. 2009;25:974–6. (PMID: 1832650710.1093/bioinformatics/btn033)
Surian D, Dunn AG, Orenstein L, Bashir R, Coiera E, Bourgeois FT. A shared latent space matrix factorisation method for recommending new trial evidence for systematic review updates. J Biomed Inform. 2018;79:32–40. (PMID: 2941035610.1016/j.jbi.2018.01.008)
Wallace BC, Small K, Brodley CE, Lau J, Schmid CH, Bertram L, Lill CM, Cohen JT, Trikalinos TA. Toward modernizing the systematic review pipeline in genetics: efficient updating via data mining. Genet Med. 2012;14:663–9. (PMID: 22481134390855010.1038/gim.2012.7)
Goetz T, von der Lieth C-W. PubFinder: a tool for improving retrieval rate of relevant PubMed abstracts. Nucleic Acids Res. 2005;33:W774–8. (PMID: 15980583116019010.1093/nar/gki429)
Howard BE, Phillips J, Miller K, Tandon A, Mav D, Shah MR, Holmgren S, Pelch KE, Walker V, Rooney AA. SWIFT-Review: a text-mining workbench for systematic review. Syst Rev. 2016;5:1–16. (PMID: 10.1186/s13643-016-0263-z)
Olorisade BK, Brereton P, Andras P. The use of bibliography enriched features for automatic citation screening. J Biomed Inform. 2019. https://doi.org/10.1016/j.jbi.2019.103202 . (PMID: 10.1016/j.jbi.2019.10320231075531)
Pham B, Jovanovic J, Bagheri E, et al. Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow. Syst Rev. 2021. https://doi.org/10.1186/s13643-021-01700-x . (PMID: 10.1186/s13643-021-01700-x349304398690960)
Tsafnat G, Glasziou P, Karystianis G, Coiera E. Automated screening of research studies for systematic reviews using study characteristics. Syst Rev. 2018;7:64. (PMID: 29695296591875210.1186/s13643-018-0724-7)
Røst TB, Slaughter L, Nytro O, Muller AE, Vist GE, Røst TB, Slaughter L, Nytrø Ø, Muller AE, Vist GE. Using neural networks to support high-quality evidence mapping. BMC Bioinformatics. 2021;22:496. (PMID: 34674636852936810.1186/s12859-021-04396-x)
Kusa W, Hanbury A, Knoth P. Automation of citation screening for systematic literature reviews using neural networks: a replicability study. ADVANCES IN INFORMATION RETRIEVAL, PT I. 2022;13185:584–98.
Bekhuis T, Tseytlin E, Mitchell KJ, Demner-Fushman D. Feature engineering and a proposed decision-support system for systematic reviewers of medical evidence. PLoS ONE. 2014. https://doi.org/10.1371/journal.pone.0086277 . (PMID: 10.1371/journal.pone.0086277244750993903545)
Evans DA, Hersh WR, Monarch IA, Lefferts RG, Handerson SK. Automatic indexing of abstracts via natural-language processing using a simple thesaurus. Med Decis Making. 1991;11:S108–15. (PMID: 177083910.1177/0272989X9101104s21)
Tetzlaff J, Cadarette SM, O’Blenis P, Ruiz K. PNS15 pragmatic artificial intelligence-based reference screening in systematic reveiws. ARE two robots better than one? Value in Health. 2019;22:S290–S290. (PMID: 10.1016/j.jval.2019.04.1381)
Tetzlaff J, Murad MH, Wang Z. AI4 can we decrease the screening burden in systematic reviews? Performance of two natural language processors to exclude records. Value in Health. 2020;23:S1–2. (PMID: 10.1016/j.jval.2020.04.007)
Matsui K, Utsumi T, Aoki Y, Maruki T, Takeshima M, Yoshikazu T. Large Language Model Demonstrates Human-Comparable Sensitivity in Initial Screening of Systematic Reviews: A Semi-Automated Strategy Using GPT-3.5. 2023. https://ssrn.com/abstract=4520426 or https://doi.org/10.2139/ssrn.4520426 . Accessed 28 May 2024.
Robinson A, Thorne W, Wu BP, Pandor A, Essat M, Stevenson M, Song X. Bio-SIEVE: exploring Instruction Tuning Large Language Models for Systematic Review Automation. 2023. arXiv preprint arXiv:2308.06610.
Tsubota T, Bollegala D, Zhao Y, Jin Y, Kozu T. Improvement of intervention information detection for automated clinical literature screening during systematic review. J Biomed Inform. 2022;134: 104185. (PMID: 3603806610.1016/j.jbi.2022.104185)
Dennstädt F, Zink J, Putora PM, Hastings J, Cihoric N. Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain. Syst Rev. 2024;13:158. (PMID: 388795341118040710.1186/s13643-024-02575-4)
Felizardo KR, Salleh N, Martins RM, Mendes E, MacDonell SG, Maldonado JC. Using visual text mining to support the study selection activity in systematic literature reviews. 2011 International Symposium on Empirical Software Engineering and Measurement. 2011;77–86. https://doi.org/10.1109/ESEM.2011.16 . https://ieeexplore.ieee.org/document/6092556 . Accessed 28 May 2024.
Cohen AM, Hersh WR, Peterson K, Yen P-Y. Reducing workload in systematic review preparation using automated citation classification. J Am Med Inform Assoc. 2006;13:206–19. (PMID: 1635735210.1197/jamia.M1929)
Cohen AM, Ambert K, McDonagh M. Cross-topic learning for work prioritization in systematic review creation and update. J Am Med Inform Assoc. 2009;16:690–704. (PMID: 19567792274472010.1197/jamia.M3162)
Cohen AM, Smalheiser NR, McDonagh MS, Yu C, Adams CE, Davis JM, Yu PS. Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine. J Am Med Inform Assoc. 2015;22:707–17. (PMID: 25656516445711210.1093/jamia/ocu025)
Xiong Z, Liu T, Tse G, Gong M, Gladding PA, Smaill BH, Stiles MK, Gillis AM, Zhao J. A machine learning aided systematic review and meta-analysis of the relative risk of atrial fibrillation in patients with diabetes mellitus. Front Physiol. 2018. https://doi.org/10.3389/fphys.2018.00835 . (PMID: 10.3389/fphys.2018.00835304984536249421)
Kempf S, Krug M, Puppe F. KIETA: key-insight extraction from scientific tables. Appl Intell. 2023;53:9513–30. (PMID: 10.1007/s10489-022-03957-8)
Serban R, ten Teije A, van Harmelen F, Marcos M, Polo-Conde C. Extraction and use of linguistic patterns for modelling medical guidelines. Artif Intell Med. 2007;39:137–49. (PMID: 1696324110.1016/j.artmed.2006.07.012)
Wang JC, Su GD, Wan CR, Huang XW, Sun LL. A keyword-based literature review data generating algorithm-analyzing a field from scientific publications. SYMMETRY-BASEL. 2020. https://doi.org/10.3390/sym12060903WE-ScienceCitationIndexExpanded(SCI-EXPANDED) . (PMID: 10.3390/sym12060903WE-ScienceCitationIndexExpanded(SCI-EXPANDED))
Aliyu MB, Iqbal R, James A (2018) The canonical model of structure for data extraction in systematic reviews of scientific research articles. In: 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS). IEEE, pp 264–271.
Kaiser K, Miksch S. Versioning computer-interpretable guidelines: semi-automatic modeling of “living guidelines” using an information extraction method. Artif Intell Med. 2009;46:55–66. (PMID: 1895099410.1016/j.artmed.2008.08.009)
Kiritchenko S, De Bruijn B, Carini S, Martin J, Sim I. ExaCT: automatic extraction of clinical trial characteristics from journal publications. BMC Med Inform Decis Mak. 2010;10:1–17. (PMID: 10.1186/1472-6947-10-56)
Nur S, Adams CE, Brailsford DF (2016) Using built-in functions of Adobe Acrobat Pro DC to help the selection process in systematic reviews of randomised trials. Systematic reviews 5 (1) (no pagination), 2016 Article number: 33 Date of publication: 18 feb 2016. https://doi.org/10.1186/s13643-016-0207-7.
Baviskar D, Ahirrao S, Potdar V, Kotecha K. Efficient automated processing of the unstructured documents using artificial intelligence: a systematic literature review and future directions. IEEE ACCESS. 2021;9:72894–936. (PMID: 10.1109/ACCESS.2021.3072900)
Gates A, Johnson C, Hartling L. Technology-assisted title and abstract screening for systematic reviews: a retrospective evaluation of the Abstrackr machine learning tool. Syst Rev. 2018;7:45. (PMID: 29530097584851910.1186/s13643-018-0707-8)
Walker VR, Schmitt CP, Wolfe MS, et al. Evaluation of a semi-automated data extraction tool for public health literature-based reviews: Dextr. Environ Int. 2022. https://doi.org/10.1016/j.envint.2021.107025 . (PMID: 10.1016/j.envint.2021.107025363797298960996)
Barnickel T, Weston J, Collobert R, Mewes H-W, Stümpflen V. Large scale application of neural network based semantic role labeling for automated relation extraction from biomedical texts. PLoS ONE. 2009. https://doi.org/10.1371/journal.pone.0006393 . (PMID: 10.1371/journal.pone.0006393196364322712690)
Guo E, Gupta M, Deng J, Park Y-J, Paget M, Naugler C. Automated paper screening for clinical reviews using large language models. 2023. arXiv preprint arXiv:2305.00844.
Susnjak T. PRISMA-DFLLM: An extension of PRISMA for systematic literature reviews using domain-specific finetuned large language models. 2023. arXiv preprint arXiv:2306.14905.
Tian S, Jin Q, Yeganova L, Lai P-T, Zhu Q, Chen X, Yang Y, Chen Q, Kim W, Comeau DC. Opportunities and challenges for ChatGPT and large language models in biomedicine and health. 2023. arXiv preprint arXiv:2306.10070.
Gates A, Vandermeer B, Hartling L. Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool. J Clin Epidemiol. 2018;96:54–62. (PMID: 2928976110.1016/j.jclinepi.2017.12.015)
Jardim PSJ, Rose CJ, Ames HM, Echavez JFM, de Velde S, Muller AE. Automating risk of bias assessment in systematic reviews: a real-time mixed methods comparison of human researchers to a machine learning system. BMC Med Res Methodol. 2022;22:167. (PMID: 35676632917402410.1186/s12874-022-01649-y)
Sarker A, Mollá D, Paris C. Automatic evidence quality prediction to support evidence-based decision making. Artif Intell Med. 2015;64:89–103. (PMID: 2598313310.1016/j.artmed.2015.04.001)
Soboczenski F, Trikalinos TA, Kuiper J, Bias RG, Wallace BC, Marshall IJ. Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study. BMC Med Inform Decis Mak. 2019;19:96. (PMID: 31068178650519010.1186/s12911-019-0814-z)
Nashwan AJ, Jaradat JH. Streamlining Systematic Reviews: Harnessing Large Language Models for Quality Assessment and Risk-of-Bias Evaluation. Cureus. 2023;15(8):e43023. https://doi.org/10.7759/cureus.43023 . Accessed 28 May 2024.
Diaz Milian R, Moreno Franco P, Freeman WD, Halamka JD. Revolution or peril? The controversial role of large language models in medical manuscript writing. Mayo Clin Proc. 2023;98:1444–8. (PMID: 3779372310.1016/j.mayocp.2023.07.009)
Tai RH, Bentley LR, Xia X, Sitt JM, Fankhauser SC, Chicas-Mosier AM, Monteith BG. Use of large language models to aid analysis of textual data. 2023. bioRxiv 2023–2027.
Tang L, Sun Z, Idnay B, Nestor JG, Soroush A, Elias PA, Xu Z, Ding Y, Durrett G, Rousseau JF. Evaluating large language models on medical evidence summarization. NPJ Digit Med. 2023;6:158. (PMID: 376204231044991510.1038/s41746-023-00896-7)
Van Veen D, Van Uden C, Blankemeier L, Delbrouck J-B, Aali A, Bluethgen C, Pareek A, Polacin M, Collins W, Ahuja N. Clinical text summarization: adapting large language models can outperform human experts. 2023. arXiv preprint arXiv:2309.07430.
Yu B. Evaluating pre-trained language models on multi-document summarization for literature reviews. Proceedings of the Third Workshop on Scholarly Document Processing. 2022;188–192. https://aclanthology.org/2022.sdp-1.22/ . Accessed 28 May 2024.
Li Z, Belkadi S, Micheletti N, Han L, Shardlow M, Nenadic G. Large language models and control mechanisms improve text readability of biomedical abstracts. 2023. arXiv preprint arXiv:2309.13202.
Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW. Large language models in medicine. Nat Med. 2023;29:1930–40. (PMID: 3746075310.1038/s41591-023-02448-8)
Costa ICP, Nascimento MCD, Treviso P, Chini LT, Roza BA, Barbosa SFF, Mendes KDS. Using the chat generative pre-trained transformer in academic writing in health: a scoping review. Rev Lat Am Enfermagem. 2024;32: e4194. (PMID: 3892226511182606)
Bannach-Brown A, Przybyła P, Thomas J, Rice ASC, Ananiadou S, Liao J, Macleod MR. Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error. Syst Rev. 2019. https://doi.org/10.1186/s13643-019-0942-7 . (PMID: 10.1186/s13643-019-0942-7306469596334440)
Chai KEK, Lines RLJ, Gucciardi DF, Ng L. Research Screener: a machine learning tool to semi-automate abstract screening for systematic reviews. Syst Rev. 2021. https://doi.org/10.1186/s13643-021-01635-3 . (PMID: 10.1186/s13643-021-01635-3337950038017894)
Gartlehner G, Wagner G, Lux L, Affengruber L, Dobrescu A, Kaminski-Hartenthaler A, Viswanathan M. Assessing the accuracy of machine-assisted abstract screening with DistillerAI: a user study. Syst Rev. 2019. https://doi.org/10.1186/s13643-019-1221-3 . (PMID: 10.1186/s13643-019-1221-3318292506905114)
Gates A, Guitard S, Pillay J, Elliott SA, Dyson MP, Newton AS, Hartling L. Performance and usability of machine learning for screening in systematic reviews: a comparative evaluation of three tools. Syst Rev. 2019. https://doi.org/10.1186/s13643-019-1222-2 . (PMID: 10.1186/s13643-019-1222-2318704346929355)
Gates A, Gates M, Sebastianski M, Guitard S, Elliott SA, Hartling L. The semi-automation of title and abstract screening: a retrospective exploration of ways to leverage Abstrackr’s relevance predictions in systematic and rapid reviews. BMC Med Res Methodol. 2020;20:139. (PMID: 32493228726859610.1186/s12874-020-01031-w)
Halfpenny N, Alleman C, Eaton J, van Vliet M. PNS335 using machine learning for efficiency improvements in systematic literature reviews of clinical efficacy and safety. Value in Health. 2019;22:S821–S821. (PMID: 10.1016/j.jval.2019.09.2235)
Hamel C, Kelly SE, Thavorn K, Rice DB, Wells GA, Hutton B. An evaluation of DistillerSR’s machine learning-based prioritization tool for title/abstract screening - impact on reviewer-relevant outcomes. BMC Med Res Methodol. 2020;20:256. (PMID: 33059590755919810.1186/s12874-020-01129-1)
Marshall IJ, Johnson BT, Wang Z, Rajasekaran S, Wallace BC. Semi-automated evidence synthesis in health psychology: current methods and future prospects. Health Psychol Rev. 2020;14:145–58. (PMID: 31941434702979710.1080/17437199.2020.1716198)
Matwin S, Kouznetsov A, Inkpen D, Frunza O, O’Blenis P. A new algorithm for reducing the workload of experts in performing systematic reviews. J Am Med Inform Assoc. 2010;17:446–53. (PMID: 20595313299565310.1136/jamia.2010.004325)
Mo Y, Kontonatsios G, Ananiadou S. Supporting systematic reviews using LDA-based document representations. Syst Rev. 2015;4:1–12. (PMID: 10.1186/s13643-015-0117-0)
Rathbone J, Hoffmann T, Glasziou P. Faster title and abstract screening? Evaluating Abstrackr, a semi-automated online screening program for systematic reviewers. Syst Rev. 2015;4:1–7. (PMID: 10.1186/s13643-015-0067-6)
Reddy SM, Patel S, Weyrich M, Fenton J, Viswanathan M. Comparison of a traditional systematic review approach with review-of-reviews and semi-automation as strategies to update the evidence. Syst Rev. 2020. https://doi.org/10.1186/s13643-020-01450-2 . (PMID: 10.1186/s13643-020-01450-2330769757574591)
Schneider J, Hoang L, Kansara Y, Cohen AM, Smalheiser NR. Evaluation of publication type tagging as a strategy to screen randomized controlled trial articles in preparing systematic reviews. JAMIA Open. 2022. https://doi.org/10.1093/jamiaopen/ooac015 . (PMID: 10.1093/jamiaopen/ooac015356515229150077)
Wallace BC, Small K, Brodley CE, Lau J, Trikalinos TA. Deploying an interactive machine learning system in an evidence-based practice center: Abstrackr. Proceedings of the 2nd ACM SIGHIT international health informatics symposium. 2012;819–824. https://doi.org/10.1145/2110363.211046 . Accessed 28 May 2024–824.
Raza Abidi SS, Kershaw M, Milios E. Augmenting GEM-encoded clinical practice guidelines with relevant best evidence autonomously retrieved from MEDLINE. Health Informatics J. 2005;11:95–110. (PMID: 10.1177/1460458205050684)
Coiera E, Liu SD. Evidence synthesis, digital scribes, and translational challenges for artificial intelligence in healthcare. Cell Rep Med. 2022. https://doi.org/10.1016/j.xcrm.2022.100860 . (PMID: 10.1016/j.xcrm.2022.100860365130719798027)
Cowie K, Rahmatullah A, Hardy N, Holub K, Kallmes K. Web-based software tools for systematic literature review in medicine: systematic search and feature analysis. JMIR Med Inform. 2022;10:e33219–e33219. (PMID: 35499859911208010.2196/33219)
Schmidt L, Olorisade BK, McGuinness LA, Thomas J, Higgins JPT. Data extraction methods for systematic review (semi)automation: a living review protocol. F1000Res. 2020. https://doi.org/10.12688/f1000research.22781.2.
Wagner G, Lukyanenko R, Paré G. Artificial intelligence and the conduct of literature reviews. J Inf Technol. 2022;37:209–26. (PMID: 10.1177/02683962211048201)
Beller E, Clark J, Tsafnat G, et al. Making progress with the automation of systematic reviews: principles of the International Collaboration for the Automation of Systematic Reviews (ICASR). Syst Rev. 2018. https://doi.org/10.1186/s13643-018-0740-7 . (PMID: 10.1186/s13643-018-0740-7297780965960503)
Hair K, Wilson E, Wong C, Tsang A, Macleod M, Bannach-Brown A. Systematic online living evidence summaries: emerging tools to accelerate evidence synthesis. Clin Sci. 2023;137:773–84. (PMID: 10.1042/CS20220494)
Harrison H, Griffin SJ, Kuhn I, Usher-Smith JA. Software tools to support title and abstract screening for systematic reviews in healthcare: an evaluation. BMC Med Res Methodol. 2020;20:1–12. (PMID: 10.1186/s12874-020-0897-3)
Bui DDA, Del Fiol G, Jonnalagadda S. PDF text classification to leverage information extraction from publication reports. J Biomed Inform. 2016;61:141–8. (PMID: 27044929489391110.1016/j.jbi.2016.03.026)
Elamin MB, Flynn DN, Bassler D, Briel M, Alonso-Coello P, Karanicolas PJ, Guyatt GH, Malaga G, Furukawa TA, Kunz R. Choice of data extraction tools for systematic reviews depends on resources and review complexity. J Clin Epidemiol. 2009;62:506–10. (PMID: 1934897710.1016/j.jclinepi.2008.10.016)
Feng Y, Liang S, Zhang Y, Chen S, Wang Q, Huang T, Sun F, Liu X, Zhu H, Pan H. Automated medical literature screening using artificial intelligence: a systematic review and meta-analysis. J Am Med Inform Assoc. 2022;29:1425–32. (PMID: 35641139927764610.1093/jamia/ocac066)
Tsunoda DF, Moreira PSD, Guimaraes AJR. Machine learning and automated systematic literature review: a systematic review. REVISTA TECNOLOGIA E SOCIEDADE. 2020;16:337–54. (PMID: 10.3895/rts.v16n45.12119)
Pan American Health Organization. Q&A on artificial intelligence for supporting public health: Reference tool to support the exchange of information and promote open conversations and debates. Washington. 2024. https://iris.paho.org/handle/10665.2/59315 . Accessed 28 May 2024.
Raiaan MAK, Mukta MSH, Fatema K, Fahad NM, Sakib S, Mim M, Jannat M, Ahmad J, Ali ME, Azam S. A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges. 2023. https://www.authorea.com/users/619798/articles/683245-a-review-on-large-language-models-architectures-applications-taxonomies-openissues-and-challenges. Accessed 28 May 2024.
Prainsack B, Forgó N. New AI regulation in the EU seeks to reduce risk without assessing public benefit. Nat Med. 2024;30:1235–7. (PMID: 3849966110.1038/s41591-024-02874-2)
Andreoletti M, Haller L, Vayena E, Blasimme A. Mapping the ethical landscape of digital biomarkers: a scoping review. PLOS Digital Health. 2024;3: e0000519. (PMID: 387536051109830810.1371/journal.pdig.0000519)
Blasimme A, Vayena E. The ethics of AI in biomedical research, patient care and public health. SSRN Electron J. 2019. https://doi.org/10.2139/ssrn.3368756 . (PMID: 10.2139/ssrn.3368756)
van Altena AJ, Spijker R, Leeflang MMG, Olabarriaga SD. Training sample selection: impact on screening automation in diagnostic test accuracy reviews. Res Synth Methods. 2021;12:831–41. (PMID: 34390193929289210.1002/jrsm.1518)
Rajadhyax A, Moon D, Bhagat A, et al. MSR100 applicability of artificial intelligence in targeted literature review. Value in Health. 2022;25:S369–S369. (PMID: 10.1016/j.jval.2022.09.1831)
Van De Schoot R, De Bruin J, Schram R, Zahedi P, De Boer J, Weijdema F, Kramer B, Huijts M, Hoogerwerf M, Ferdinands G. An open source machine learning framework for efficient and transparent systematic reviews. Nat Mach Intell. 2021;3:125–33. (PMID: 10.1038/s42256-020-00287-7)
Olmos-Vega FM, Stalmeijer RE, Varpio L, Kahlke R. A practical guide to reflexivity in qualitative research: AMEE Guide No. 149. Med Teach. 2023;45:241–51. (PMID: 10.1080/0142159X.2022.2057287)
*Further Information*
*Evidence synthesis (ES) involves rigorous, reproducible methodologies, which are increasingly being presented as 'Living' systematic reviews. As such, ES are critical to evidence-informed decision-making processes, such as the development, implementation, evaluation and monitoring of health technology assessments, practice guidelines and policies. However, the ES process is time-intensive, typically requiring months or years and extensive manual effort. Technological advancements, particularly artificial intelligence (AI), offer opportunities to automate various ES steps, potentially increasing efficiency and reducing costs. AI tools and platforms, including large language models (LLMs), facilitate faster ES through advanced natural language processing (NLP) capabilities. Despite their potential, AI tools have limitations, including risks of automation bias and lack of true semantic understanding, requiring careful evaluation to ensure trustworthiness. We conducted the first scoping review to update and map all data science tools, including LLMs, which are either being developed and/or deployed to optimise ES steps and assess their impact in both low- and middle-income countries (LMICs) and high-income countries (HICs). Our scoping review identified 137 studies and 388 of such AI tools and platforms to respond to the World Health Organization's call for safe and ethical AI in health, documenting the current landscape to identify barriers and facilitators to equitable and sustainable access for glocal researchers. We further outline three recommendations: (1) promote collaborative AI platforms ensuring equity of access to include gap regions identified (Latin America, Africa, Middle East), (2) establish evaluation standards for methods testing and reporting, and (3) emphasise human input and multidisciplinary capacity building for developing and implementing AI tools in ES.
(© 2026. Pan American Health Organization.)*
*Declarations. Ethics approval and consent to participate: Ethical approval is not required for scoping reviews. Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interests.*