Professor Honghan Wu

  • Professor of Health Informatics and AI (Public Health)

Biography

I am a Professor of Health Informatics and AI at the School of Health and Wellbeing, University of Glasgow, and a co-director of Health Data Research UK Scotland. I am also an honorary professor at Hong Kong University and an honorary associate professor at UCL. I was a Turing Fellow of The Alan Turing Institute (2020-2023) and a Rutherford Fellow of Health Data Research UK (2018-2022). I received my BEng and PhD degrees from Southeast University, China. Before my PhD, I worked in industry for about six years, primarily as a software developer.

My research lab website is at https://knowlab.github.io/. I also co-lead the Edinburgh Clinical Natural Language Processing group (https://www.ed.ac.uk/usher/clinical-natural-language-processing) and co-organise the Turing Health Equity group (https://www.turing.ac.uk/research/interest-groups/health-equity).

Research interests

Machine learning, natural language processing, knowledge graphs, and their applications in medicine. Details of my research and team updates can be found at https://knowlab.github.io/.

Publications

Number of items: 53.

2025

Tran, Tran Q.B., Lip, Stefanie ORCID: https://orcid.org/0000-0001-8515-9018, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Visweswaran, Shyam, Pell, Jill P. ORCID: https://orcid.org/0000-0002-8898-7035 and Padmanabhan, Sandosh ORCID: https://orcid.org/0000-0003-3869-5808 (2025) A transformer-based framework for counterfactual estimation of antihypertensive treatment effect on COVID-19 infection risk - a proof-of-concept study. American Journal of Hypertension, (doi: 10.1093/ajh/hpaf055) (PMID:40247607) (Early Online Publication)

2024

Gao, Y. et al. (2024) Optimising the paradigms of human AI collaborative clinical coding. npj Digital Medicine, 7, 368. (doi: 10.1038/s41746-024-01363-7) (PMID:39702575) (PMCID:PMC11659570)

Ji, Shaoxiong, Li, Xiaobo, Sun, Wei, Dong, Hang, Taalas, Ara, Zhang, Yijia, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Pitkänen, Esa and Marttinen, Pekka (2024) A unified review of deep learning for automated medical coding. ACM Computing Surveys, 56(12), 306. (doi: 10.1145/3664615)

Greene, Charlotte, Blackbourn, Luke, McGurnaghan, Stuart, Mercer, Stewart, Smith, Daniel, Wild, Sarah, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Jackson, Caroline and Scottish Diabetes Research Network Epidemiology Group (2024) Antidepressant and antipsychotic prescribing in patients with type 2 diabetes in Scotland: a time-trend analysis from 2004-2021. British Journal of Clinical Pharmacology, 90(11), pp. 2802-2810. (doi: 10.1111/bcp.16171) (PMID:38981672)

Wu, Jinge, Dong, Hang, Li, Zexi, Wang, Haowei, Li, Runci, Patra, Arijit, Dai, Chengliang, Ali, Waqar, Scordis, Phil and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2024) A hybrid framework with large language models for rare disease phenotyping. BMC Medical Informatics and Decision Making, 24(1), 289. (doi: 10.1186/s12911-024-02698-7) (PMID:39375687) (PMCID:PMC11460004)

Guellil, I., Andres, S., Guthrie, B., Anand, A., Zhang, H., Hasan, A.K., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Alex, B. (2024) Enhancing Natural Language Processing Capabilities in Geriatric Patient Care: An Annotation Scheme and Guidelines. In: 29th International Conference on Natural Language & Information Systems. NLDB 2024, University of Turin, Italy, 25-27 June 2024, pp. 207-217. ISBN 9783031702419 (doi: 10.1007/978-3-031-70242-6_20)

Kim, Y., Wu, J., Abdulle, Y., Gao, Y. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2024) Human-in-the-Loop Chest X-Ray Diagnosis: Enhancing Large Multimodal Models with Eye Fixation Inputs. In: Second International Workshop, TAI4H 2024, Jeju, South Korea, 4 Aug 2024, pp. 66-80. ISBN 9783031677502 (doi: 10.1007/978-3-031-67751-9_6)

Wu, Jinge, Kim, Yunsoo and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2024) Hallucination benchmark in medical visual question answering. arXiv, (doi: 10.48550/arXiv.2401.05827)

Francis, Farah, Luz, Saturnino, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Stock, Sarah J. and Townsend, Rosemary (2024) Machine learning on cardiotocography data to classify fetal outcomes: a scoping review. Computers in Biology and Medicine, 172, 108220. (doi: 10.1016/j.compbiomed.2024.108220) (PMID:38489990)

Feng, Wei, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Ma, Hui, Tao, Zhenhuan, Xu, Mengdie, Zhang, Xin, Lu, Shan, Wan, Cheng and Liu, Yun (2024) Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study. Journal of the American Medical Informatics Association, 31(2), pp. 445-455. (doi: 10.1093/jamia/ocad228) (PMID:38062850) (PMCID:PMC10797279)

Kim, Y. and Wu, H. ORCID: https://orcid.org/0000-0002-0213-5668 (2024) Knowlab's Submission to L+M Shared Task: All you need is continued pretraining of chemistry texts even for molecule captioning. In: 1st Workshop on Language + Molecules (L+M 2024). Proceedings, Bangkok, 15 Aug 2024, pp. 92-97. ISBN 9798891761483 (doi: 10.18653/v1/2024.langmol-1.11)

Kim, Y., Wu, J., Abdulle, Y. and Wu, H. ORCID: https://orcid.org/0000-0002-0213-5668 (2024) MedExQA: Medical Question Answering Benchmark with Multiple Explanations. In: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, Bangkok, Thailand, 16 Aug 2024, pp. 167-181. ISBN 9798891761308 (doi: 10.18653/v1/2024.bionlp-1.14)

2023

Fu, Y., Zhang, G., Lu, X., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Zhang, D. (2023) RMCA U-net: Hard exudates segmentation for retinal fundus images. Expert Systems with Applications, 234, 120987. (doi: 10.1016/j.eswa.2023.120987)

Groves, Emily, Wang, Minhong, Abdulle, Yusuf, Kunz, Holger, Hoelscher-Obermaier, Jason, Wu, Ronin and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2023) Benchmarking and analyzing in-context learning, fine-tuning and supervised learning for biomedical knowledge curation: a focused study on chemical entities of biological interest. arXiv, (doi: 10.48550/arXiv.2312.12989)

Wu, Jinge, Kim, Yunsoo, Keller, Eva C., Chow, Jamie, Levine, Adam P., Pontikos, Nikolas, Ibrahim, Zina, Taylor, Paul, Williams, Michelle C. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2023) Exploring multimodal large language models for radiology report error-checking. arXiv, (doi: 10.48550/arXiv.2312.13103)

Guellil, I. et al. (2023) Natural language processing for detecting adverse drug events: a systematic review protocol. [Protocols]

Francis, Farah, Luz, Saturnino, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Townsend, Rosemary and Stock, Sarah J. (2023) Machine Learning to Classify Cardiotocography for Fetal Hypoxia Detection. In: 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, 24-27 July 2023, ISBN 9798350324471 (doi: 10.1109/embc40787.2023.10340803)

Groza, Tudor, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Dinger, Marcel E., Danis, Daniel, Hilton, Coleman, Bagley, Anita, Davids, Jon R., Luo, Ling, Lu, Zhiyong and Robinson, Peter N. (2023) Term-BLAST-like alignment tool for concept recognition in noisy clinical texts. Bioinformatics, 39(12), btad716. (doi: 10.1093/bioinformatics/btad716) (PMID:38001031) (PMCID:PMC10710372)

Zhang, H., Casey, A., Guellil, I., Suárez-Paniagua, V., MacRae, C., Marwick, C., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Guthrie, B. and Alex, B. (2023) FLAP: a framework for linking free-text addresses to the Ordnance Survey Unique Property Reference Number database. Frontiers in Digital Health, 5, 1186208. (doi: 10.3389/fdgth.2023.1186208) (PMID:38090654) (PMCID:PMC10715280)

Thygesen, J. H. et al. (2023) A nationwide study of 331 rare diseases among 58 million individuals: prevalence, demographics, and COVID-19 outcomes. medRxiv, (doi: 10.1101/2023.10.12.23296948)

Casey, A. et al. (2023) Understanding the performance and reliability of NLP tools: a comparison of four NLP tools predicting stroke phenotypes in radiology reports. Frontiers in Digital Health, 5, 1184919. (doi: 10.3389/fdgth.2023.1184919) (PMID:37840686) (PMCID:PMC10569314)

Alsaleh, Mohanad M., Allery, Freya, Choi, Jung Won, Hama, Tuankasfee, McQuillin, Andrew, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Thygesen, Johan H. (2023) Prediction of disease comorbidity using explainable artificial intelligence and machine learning techniques: A systematic review. International Journal of Medical Informatics, 175, 105088. (doi: 10.1016/j.ijmedinf.2023.105088) (PMID:37156169)

Wang, Minhong, Kloczko, Ewa, Altayeb, Alla, Farrugia, Michael, Gupta, Girish, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Hirani, Nik (2023) Towards automated dermatology triage: deep learning and knowledge-driven approaches. Research Square, (doi: 10.21203/rs.3.rs-2889033/v1)

Greene, Charlotte R.L., Ward-Penny, Hanna, Ioannou, Marianna F., Wild, Sarah H., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Smith, Daniel J. and Jackson, Caroline A. (2023) Antidepressant and antipsychotic drug prescribing and diabetes outcomes: A systematic review of observational studies. Diabetes Research and Clinical Practice, 199, 110649. (doi: 10.1016/j.diabres.2023.110649) (PMID:37004975)

Dong, Hang, Suárez-Paniagua, Víctor, Zhang, Huayu, Wang, Minhong, Casey, Arlene, Davidson, Emma, Chen, Jiaoyan, Alex, Beatrice, Whiteley, William and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2023) Ontology-driven and weakly supervised rare disease identification from clinical notes. BMC Medical Informatics and Decision Making, 23, 86. (doi: 10.1186/s12911-023-02181-9) (PMID:37147628) (PMCID:PMC10162001)

Davidson, E. M. et al. (2023) The epidemiological characteristics of stroke phenotypes defined with ICD-10 and free-text: a cohort study linked to electronic health records. medRxiv, (doi: 10.1101/2023.04.03.23288096)

Kuan, V. et al. (2023) Identifying and visualising multimorbidity and comorbidity patterns in patients in the English National Health Service: a population-based study. Lancet Digital Health, 5(1), e16-e27. (doi: 10.1016/S2589-7500(22)00187-X) (PMID:36460578)

Ibrahim, Z., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Wiratunga, N. (2023) Preface: The 6th International Workshop on Knowledge Discovery in Healthcare Data (KDH). KDH@IJCAI 2023 Knowledge Discovery from Healthcare Data 2023, Macao, China, 20 Aug 2023.

Wu, J., Shi, D., Hasan, A. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2023) KnowLab at RadSum23: Comparing Pre-trained Language Models in Radiology Report Summarization. In: The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, Toronto, Canada, July 2023, pp. 535-540. ISBN 9781959429852 (doi: 10.18653/v1/2023.bionlp-1.54)

2022

Wu, H. et al. (2022) A survey on clinical natural language processing in the United Kingdom from 2007 to 2022. npj Digital Medicine, 5, 186. (doi: 10.1038/s41746-022-00730-6) (PMID:36544046) (PMCID:PMC9770568)

Dong, Hang, Falis, Matúš, Whiteley, William, Alex, Beatrice, Matteson, Joshua, Ji, Shaoxiong, Chen, Jiaoyan and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2022) Automated clinical coding: what, why, and where we are? npj Digital Medicine, 5, 159. (doi: 10.1038/s41746-022-00705-7) (PMID:36273236) (PMCID:PMC9588058)

Guellil, Imane, Wu, Jinge, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Sun, Tony and Alex, Beatrice (2022) Edinburgh_UCL_Health@SMM4H'22: From Glove to Flair for Handling Imbalanced Healthcare Corpora Related to Adverse Drug Events, Change in Medication and Self-reporting Vaccination. In: Proceedings of the 29th International Conference on Computational Linguistics, Gyeongju, Republic of Korea, 12-17 Oct 2022, pp. 148-152.

Chen, Q. et al. (2022) Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations. Database, 2022, baac069. (doi: 10.1093/database/baac069) (PMID:36043400) (PMCID:PMC9428574)

Wu, Jinge, Smith, Rowena and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2022) Adverse Childhood Experiences identification from clinical notes with ontologies and NLP. arXiv, (doi: 10.48550/arXiv.2208.11466)

Wu, Jinge, Smith, Rowena and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2022) Ontology-driven self-supervision for adverse childhood experiences identification using social media datasets. arXiv, (doi: 10.48550/arXiv.2208.11701)

Cheung, Jason Pui Yin, Kuang, Xihe, Lai, Marcus Kin Long, Cheung, Kenneth Man-Chee, Karppinen, Jaro, Samartzis, Dino, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Zhao, Fengdong, Zheng, Zhaomin and Zhang, Teng (2022) Learning-based fully automated prediction of lumbar disc degeneration progression with specified clinical parameters and preliminary validation. European Spine Journal, 31(8), pp. 1960-1968. (doi: 10.1007/s00586-021-07020-x) (PMID:34657211)

Wan, Cheng, Read, Stephanie, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Lu, Shan, Zhang, Xin, Wild, Sarah H. and Liu, Yun (2022) Prediction of five-year cardiovascular disease risk in people with type 2 diabetes mellitus: derivation in Nanjing, China and external validation in Scotland, UK. Global Heart, 17(1), 46. (doi: 10.5334/gh.1131) (PMID:36051323) (PMCID:PMC9336685)

Kuang, X. et al. (2022) Spine-GFlow: A hybrid learning framework for robust multi-tissue segmentation in lumbar MRI without manual annotation. Computerized Medical Imaging and Graphics, 99, 102091. (doi: 10.1016/j.compmedimag.2022.102091)

Thygesen, J. H. et al. (2022) COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records. Lancet Digital Health, 4(7), e542-e557. (doi: 10.1016/S2589-7500(22)00091-7) (PMID:35690576) (PMCID:PMC9179175)

Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Sylolypavan, Aneeta, Wang, Minhong and Wild, Sarah (2022) Quantifying Health Inequalities Induced by Data and AI Models. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, Austria, 23-29 July 2022, pp. 5192-5198. ISBN 9781956792003 (doi: 10.24963/ijcai.2022/721)

Straw, Isabel and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2022) Investigating for bias in healthcare algorithms: a sex-stratified analysis of supervised machine learning models in liver disease prediction. BMJ Health Care Inform, 29, e100457. (doi: 10.1136/bmjhci-2021-100457) (PMID:35470133) (PMCID:PMC9039354)

Zhang, Huayu, Thygesen, Johan H., Shi, Ting, Gkoutos, Georgios V., Hemingway, Harry, Guthrie, Bruce, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Genomics England Research Consortium (2022) Increased COVID-19 mortality rate in rare disease patients: a retrospective cohort study in participants of the Genomics England 100,000 Genomes project. Orphanet Journal of Rare Diseases, 17, 166. (doi: 10.1186/s13023-022-02312-x) (PMID:35414031) (PMCID:PMC9003178)

Ibrahim, Z.M. et al. (2022) A knowledge distillation ensemble framework for predicting short- and long-term hospitalization outcomes from electronic health records data. IEEE Journal of Biomedical and Health Informatics, 26(1), pp. 423-435. (doi: 10.1109/JBHI.2021.3089287) (PMID:34129509)

Francis, F., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Luz, S., Townsend, R. and Stock, S. (2022) Detecting Intrapartum Fetal Hypoxia from Cardiotocography Using Machine Learning. In: 49th Computing in Cardiology Conference CinC 2022, Tampere, Finland, 4-7 Sept 2022, ISBN 9798350300970 (doi: 10.22489/CinC.2022.339)

Wang, Minhong, Francis, Farah, Kunz, Holger, Zhang, Xiang, Wan, Cheng, Liu, Yun, Taylor, Paul, Wild, Sarah H. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2022) Artificial intelligence models for predicting cardiovascular diseases in people with type 2 diabetes: A systematic review. Intelligence-Based Medicine, 6, 100072. (doi: 10.1016/j.ibmed.2022.100072)

2021

Zhang, Huayu, Thygesen, Johan and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2021) Increased COVID-19 related mortality rate for patients with rare diseases: a retrospective cohort study with data from Genomics England. Lancet, 398(Sup 2), S95. (PMCID:PMC8617313)

Fairfield, C. J. et al. (2021) ToKSA - Tokenized Key Sentence Annotation - a novel method for rapid approximation of ground truth for natural language processing. medRxiv, (doi: 10.1101/2021.10.06.21264629)

Davidson, E. M. et al. (2021) The reporting quality of natural language processing studies: systematic review of studies of radiology reports. BMC Medical Imaging, 21, 142. (doi: 10.1186/s12880-021-00671-8) (PMID:34600486) (PMCID:PMC8487512)

Rannikmäe, Kristiina, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Tominey, Steven, Whiteley, William, Allen, Naomi, Sudlow, Cathie and UK Biobank (2021) Developing automated methods for disease subtyping in UK Biobank: an exemplar study on stroke. BMC Medical Informatics and Decision Making, 21, 191. (doi: 10.1186/s12911-021-01556-0) (PMID:34130677) (PMCID:PMC8204419)

Casey, A. et al. (2021) A systematic review of natural language processing applied to radiology reports. BMC Medical Informatics and Decision Making, 21, 179. (doi: 10.1186/s12911-021-01533-7) (PMID:34082729) (PMCID:PMC8176715)

Zhang, Huayu, Ferguson, Amy, Robertson, Grant, Jiang, Muchen, Zhang, Teng, Sudlow, Cathie, Smith, Keith, Rannikmae, Kristiina and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2021) Benchmarking network-based gene prioritization methods for cerebral small vessel disease. Briefings in Bioinformatics, 22(5), bbab006. (doi: 10.1093/bib/bbab006) (PMID:33634312) (PMCID:PMC8425308)

Dong, Hang, Suárez-Paniagua, Víctor, Zhang, Huayu, Wang, Minhong, Whitfield, Emma and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2021) Rare Disease Identification from Clinical Notes with Ontologies and Weak Supervision. In: 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Online, 31 Oct-4 Nov 2021, pp. 2294-2298. ISBN 9781728111797 (doi: 10.1109/embc46164.2021.9630043)

Mirza, L. et al. (2021) Investigating the association between physical health comorbidities and disability in individuals with severe mental illness. European Psychiatry, 64(1), e77. (doi: 10.1192/j.eurpsy.2021.2255) (PMID:34842128) (PMCID:PMC8727716)

This list was generated on Sun Jun 15 03:07:59 2025 BST.
Number of items: 53.

Articles

Tran, Tran Q.B., Lip, Stefanie ORCID logoORCID: https://orcid.org/0000-0001-8515-9018, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Visweswaran, Shyam, Pell, Jill P. ORCID logoORCID: https://orcid.org/0000-0002-8898-7035 and Padmanabhan, Sandosh ORCID logoORCID: https://orcid.org/0000-0003-3869-5808 (2025) A transformer-based framework for counterfactual estimation of antihypertensive treatment effect on COVID-19 infection risk - a proof-of-concept study. American Journal of Hypertension, (doi: 10.1093/ajh/hpaf055) (PMID:40247607) (Early Online Publication)

Gao, Y. et al. (2024) Optimising the paradigms of human AI collaborative clinical coding. npj Digital Medicine, 7, 368. (doi: 10.1038/s41746-024-01363-7) (PMID:39702575) (PMCID:PMC11659570)

Ji, Shaoxiong, Li, Xiaobo, Sun, Wei, Dong, Hang, Taalas, Ara, Zhang, Yijia, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Pitkänen, Esa and Marttinen, Pekka (2024) A unified review of deep learning for automated medical coding. ACM Computing Surveys, 56(12), 306. (doi: 10.1145/3664615)

Greene, Charlotte, Blackbourn, Luke, McGurnaghan, Stuart, Mercer, Stewart, Smith, Daniel, Wild, Sarah, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Jackson, Caroline and Scottish Diabetes Research Network Epidemiology Group (2024) Antidepressant and antipsychotic prescribing in patients with type 2 diabetes in Scotland: a time-trend analysis from 2004-2021. British Journal of Clinical Pharmacology, 90(11), pp. 2802-2810. (doi: 10.1111/bcp.16171) (PMID:38981672)

Wu, Jinge, Dong, Hang, Li, Zexi, Wang, Haowei, Li, Runci, Patra, Arijit, Dai, Chengliang, Ali, Waqar, Scordis, Phil and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2024) A hybrid framework with large language models for rare disease phenotyping. BMC Medical Informatics and Decision Making, 24(1), 289. (doi: 10.1186/s12911-024-02698-7) (PMID:39375687) (PMCID:PMC11460004)

Wu, Jinge, Kim, Yunsoo and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2024) Hallucination benchmark in medical visual question answering. arXiv, (doi: 10.48550/arXiv.2401.05827)

Francis, Farah, Luz, Saturnino, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Stock, Sarah J. and Townsend, Rosemary (2024) Machine learning on cardiotocography data to classify fetal outcomes: a scoping review. Computers in Biology and Medicine, 172, 108220. (doi: 10.1016/j.compbiomed.2024.108220) (PMID:38489990)

Feng, Wei, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Ma, Hui, Tao, Zhenhuan, Xu, Mengdie, Zhang, Xin, Lu, Shan, Wan, Cheng and Liu, Yun (2024) Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study. Journal of the American Medical Informatics Association, 31(2), pp. 445-455. (doi: 10.1093/jamia/ocad228) (PMID:38062850) (PMCID:PMC10797279)

Fu, Y., Zhang, G., Lu, X., Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 and Zhang, D. (2023) RMCA U-net: Hard exudates segmentation for retinal fundus images. Expert Systems with Applications, 234, 120987. (doi: 10.1016/j.eswa.2023.120987)

Groves, Emily, Wang, Minhong, Abdulle, Yusuf, Kunz, Holger, Hoelscher-Obermaier, Jason, Wu, Ronin and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2023) Benchmarking and analyzing in-context learning, fine-tuning and supervised learning for biomedical knowledge curation: a focused study on chemical entities of biological interest. arXiv, (doi: 10.48550/arXiv.2312.12989)

Wu, Jinge, Kim, Yunsoo, Keller, Eva C., Chow, Jamie, Levine, Adam P., Pontikos, Nikolas, Ibrahim, Zina, Taylor, Paul, Williams, Michelle C. and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2023) Exploring multimodal large language models for radiology report error-checking. arXiv, (doi: 10.48550/arXiv.2312.13103)

Groza, Tudor, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Dinger, Marcel E., Danis, Daniel, Hilton, Coleman, Bagley, Anita, Davids, Jon R., Luo, Ling, Lu, Zhiyong and Robinson, Peter N. (2023) Term-BLAST-like alignment tool for concept recognition in noisy clinical texts. Bioinformatics, 39(12), btad716. (doi: 10.1093/bioinformatics/btad716) (PMID:38001031) (PMCID:PMC10710372)

Zhang, H., Casey, A., Guellil, I., Suárez-Paniagua, V., MacRae, C., Marwick, C., Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Guthrie, B. and Alex, B. (2023) FLAP: a framework for linking free-text addresses to the Ordnance Survey Unique Property Reference Number database. Frontiers in Digital Health, 5, 1186208. (doi: 10.3389/fdgth.2023.1186208) (PMID:38090654) (PMCID:PMC10715280)

Thygesen, J. H. et al. (2023) A nationwide study of 331 rare diseases among 58 million individuals: prevalence, demographics, and COVID-19 outcomes. medRxiv, (doi: 10.1101/2023.10.12.23296948)

Casey, A. et al. (2023) Understanding the performance and reliability of NLP tools: a comparison of four NLP tools predicting stroke phenotypes in radiology reports. Frontiers in Digital Health, 5, 1184919. (doi: 10.3389/fdgth.2023.1184919) (PMID:37840686) (PMCID:PMC10569314)

Alsaleh, Mohanad M., Allery, Freya, Choi, Jung Won, JW, Choi, Hama, Tuankasfee, McQuillin, Andrew, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 and Thygesen, Johan H. (2023) Prediction of disease comorbidity using explainable artificial intelligence and machine learning techniques: A systematic review. International Journal of Medical Informatics, 175, 105088. (doi: 10.1016/j.ijmedinf.2023.105088) (PMID:37156169)

Wang, Minhong, Kloczko, Ewa, Altayeb, Alla, Farrugia, Michael, Gupta, Girish, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 and Hirani, Nik (2023) Towards automated dermatology triage: deep learning and knowledge-driven approaches. Research Square, (doi: 10.21203/rs.3.rs-2889033/v1)

Greene, Charlotte R.L., Ward-Penny, Hanna, Ioannou, Marianna F., Wild, Sarah H., Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Smith, Daniel J. and Jackson, Caroline A. (2023) Antidepressant and antipsychotic drug prescribing and diabetes outcomes: A systematic review of observational studies. Diabetes Research and Clinical Practice, 199, 110649. (doi: 10.1016/j.diabres.2023.110649) (PMID:37004975)

Dong, Hang, Suárez‑Paniagua, Víctor, Zhang, Huayu, Wang, Minhong, Casey, Arlene, Davidson, Emma, Chen, Jiaoyan, Alex, Beatrice, Whiteley, William and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2023) Ontology-driven and weakly supervised rare disease identification from clinical notes. BMC Medical Informatics and Decision Making, 23, 86. (doi: 10.1186/s12911-023-02181-9) (PMID:37147628) (PMCID:PMC10162001)

Davidson, E. M. et al. (2023) The epidemiological characteristics of stroke phenotypes defined with ICD-10 and free-text: a cohort study linked to electronic health records. MedRxiv, (doi: 10.1101/2023.04.03.23288096)

Kuan, V. et al. (2023) Identifying and visualising multimorbidity and comorbidity patterns in patients in the English National Health Service: a population-based study. Lancet Digital Health, 5(1), e16-e27. (doi: 10.1016/S2589-7500(22)00187-X) (PMID:36460578)

Wu, H. et al. (2022) A survey on clinical natural language processing in the United Kingdom from 2007 to 2022. npj Digital Medicine, 5, 186. (doi: 10.1038/s41746-022-00730-6) (PMID:36544046) (PMCID:PMC9770568)

Dong, Hang, Falis, Matúš, Whiteley, William, Alex, Beatrice, Matteson, Joshua, Ji, Shaoxiong, Chen, Jiaoyan and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2022) Automated clinical coding: what, why, and where we are? npj Digital Medicine, 5, 159. (doi: 10.1038/s41746-022-00705-7) (PMID:36273236) (PMCID:PMC9588058)

Chen, Q. et al. (2022) Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations. Database, 2022, baac069. (doi: 10.1093/database/baac069) (PMID:36043400) (PMCID:PMC9428574)

Wu, Jinge, Smith, Rowena and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2022) Adverse Childhood Experiences identification from clinical notes with ontologies and NLP. arXiv, (doi: 10.48550/arXiv.2208.11466)

Wu, Jinge, Smith, Rowena and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2022) Ontology-driven self-supervision for adverse childhood experiences identification using social media datasets. arXiv, (doi: 10.48550/arXiv.2208.11701)

Cheung, Jason Pui Yin, Kuang, Xihe, Lai, Marcus Kin Long, Cheung, Kenneth Man‑Chee, Karppinen, Jaro, Samartzis, Dino, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Zhao, Fengdong, Zheng, Zhaomin and Zhang, Teng (2022) Learning-based fully automated prediction of lumbar disc degeneration progression with specified clinical parameters and preliminary validation. European Spine Journal, 31(8), pp. 1960-1968. (doi: 10.1007/s00586-021-07020-x) (PMID:34657211)

Wan, Cheng, Read, Stephanie, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Lu, Shan, Zhang, Xin, Wild, Sarah H. and Liu, Yun (2022) Prediction of five-year cardiovascular disease risk in people with type 2 diabetes mellitus: derivation in Nanjing, China and external validation in Scotland, UK. Global Heart, 17(1), 46. (doi: 10.5334/gh.1131) (PMID:36051323) (PMCID:PMC9336685)

Kuang, X. et al. (2022) Spine-GFlow: A hybrid learning framework for robust multi-tissue segmentation in lumbar MRI without manual annotation. Computerized Medical Imaging and Graphics, 99, 102091. (doi: 10.1016/j.compmedimag.2022.102091)

Thygesen, J. H. et al. (2022) COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records. Lancet Digital Health, 7(4), e542-e557. (doi: 10.1016/S2589-7500(22)00091-7) (PMID:35690576) (PMCID:PMC9179175)

Straw, Isabel and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2022) Investigating for bias in healthcare algorithms: a sex-stratified analysis of supervised machine learning models in liver disease prediction. BMJ Health Care Inform, 29, e100457. (doi: 10.1136/bmjhci-2021-100457) (PMID:35470133) (PMCID:PMC9039354)

Zhang, Huayu, Thygesen, Johan H., Shi, Ting, Gkoutos, Georgios V., Hemingway, Harry, Guthrie, Bruce, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 and Genomics England Research Consortium (2022) Increased COVID-19 mortality rate in rare disease patients: a retrospective cohort study in participants of the Genomics England 100,000 Genomes project. Orphanet Journal of Rare Diseases, 17, 166. (doi: 10.1186/s13023-022-02312-x) (PMID:35414031) (PMCID:PMC9003178)

Ibrahim, Z.M. et al. (2022) A knowledge distillation ensemble framework for predicting short- and long-term hospitalization outcomes from electronic health records data. IEEE Journal of Biomedical and Health Informatics, 26(1), pp. 423-435. (doi: 10.1109/JBHI.2021.3089287) (PMID:34129509)

Wang, Minhong, Francis, Farah, Kunz, Holger, Zhang, Xiang, Wan, Cheng, Liu, Yun, Taylor, Paul, Wild, Sarah H. and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2022) Artificial intelligence models for predicting cardiovascular diseases in people with type 2 diabetes: A systematic review. Intelligence-Based Medicine, 6, 100072. (doi: 10.1016/j.ibmed.2022.100072)

Zhang, Huayu, Thygesen, Johan and Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668 (2021) Increased COVID-19 related mortality rate for patients with rare diseases: a retrospective cohort study with data from Genomics England. Lancet, 398(Sup 2), S95. (PMCID:PMC8617313)

Fairfield, C. J. et al. (2021) ToKSA - Tokenized Key Sentence Annotation - a novel method for rapid approximation of ground truth for natural language processing. medRxiv, (doi: 10.1101/2021.10.06.21264629)

Davidson, E. M. et al. (2021) The reporting quality of natural language processing studies: systematic review of studies of radiology reports. BMC Medical Imaging, 21, 142. (doi: 10.1186/s12880-021-00671-8) (PMID:34600486) (PMCID:PMC8487512)

Rannikmäe, Rannikmäe, Wu, Honghan ORCID logoORCID: https://orcid.org/0000-0002-0213-5668, Tominey, Steven, Whiteley, William, Allen, Naomi, Sudlow, Cathie and UK Biobank (2021) Developing automated methods for disease subtyping in UK Biobank: an exemplar study on stroke. BMC Medical Informatics and Decision Making, 21, 191. (doi: 10.1186/s12911-021-01556-0) (PMID:34130677) (PMCID:PMC8204419)

Casey, A. et al. (2021) A systematic review of natural language processing applied to radiology reports. BMC Medical Informatics and Decision Making, 21, 179. (doi: 10.1186/s12911-021-01533-7) (PMID:34082729) (PMCID:PMC8176715)

Zhang, Huayu, Ferguson, Amy, Robertson, Grant, Jiang, Muchen, Zhang, Teng, Sudlow, Cathie, Smith, Keith, Rannikmae, Kristiina and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2021) Benchmarking network-based gene prioritization methods for cerebral small vessel disease. Briefings in Bioinformatics, 22(5), bbab006. (doi: 10.1093/bib/bbab006) (PMID:33634312) (PMCID:PMC8425308)

Mirza, L. et al. (2021) Investigating the association between physical health comorbidities and disability in individuals with severe mental illness. European Psychiatry, 64(1), e77. (doi: 10.1192/j.eurpsy.2021.2255) (PMID:34842128) (PMCID:PMC8727716)

Conference or Workshop Item

Ibrahim, Z., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Wiratunga, N. (2023) Preface: The 6th International Workshop on Knowledge Discovery in Healthcare Data (KDH). KDH@IJCAI 2023 Knowledge Discovery from Healthcare Data 2023, Macao, China, 20 Aug 2023.

Conference Proceedings

Guellil, I., Andres, S., Guthrie, B., Anand, A., Zhang, H., Hasan, A.K., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 and Alex, B. (2024) Enhancing Natural Language Processing Capabilities in Geriatric Patient Care: An Annotation Scheme and Guidelines. In: 29th International Conference on Natural Language & Information Systems. NLDB 2024, University of Turin, Italy, 25-27 June 2024, pp. 207-217. ISBN 9783031702419 (doi: 10.1007/978-3-031-70242-6_20)

Kim, Y., Wu, J., Abdulle, Y., Gao, Y. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2024) Human-in-the-Loop Chest X-Ray Diagnosis: Enhancing Large Multimodal Models with Eye Fixation Inputs. In: Second International Workshop, TAI4H 2024, Jeju, South Korea, 4 Aug 2024, pp. 66-80. ISBN 9783031677502 (doi: 10.1007/978-3-031-67751-9_6)

Kim, Y. and Wu, H. ORCID: https://orcid.org/0000-0002-0213-5668 (2024) Knowlab's Submission to L+M Shared Task: All you need is continued pretraining of chemistry texts even for molecule captioning. In: 1st Workshop on Language + Molecules (L+M 2024). Proceedings, Bangkok, 15 Aug 2024, pp. 92-97. ISBN 9798891761483 (doi: 10.18653/v1/2024.langmol-1.11)

Kim, Y., Wu, J., Abdulle, Y. and Wu, H. ORCID: https://orcid.org/0000-0002-0213-5668 (2024) MedExQA: Medical Question Answering Benchmark with Multiple Explanations. In: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, Bangkok, Thailand, 16 Aug 2024, pp. 167-181. ISBN 9798891761308 (doi: 10.18653/v1/2024.bionlp-1.14)

Francis, Farah, Luz, Saturnino, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Townsend, Rosemary and Stock, Sarah S. (2023) Machine Learning to Classify Cardiotocography for Fetal Hypoxia Detection. In: 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, 24-27 July 2023, ISBN 9798350324471 (doi: 10.1109/embc40787.2023.10340803)

Wu, J., Shi, D., Hasan, A. and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2023) KnowLab at RadSum23: Comparing Pre-trained Language Models in Radiology Report Summarization. In: The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, Toronto, Canada, July 2023, pp. 535-540. ISBN 9781959429852 (doi: 10.18653/v1/2023.bionlp-1.54)

Guellil, Imane, Wu, Jinge, Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Sun, Tony and Alex, Beatrice (2022) Edinburgh_UCL_Health@SMM4H'22: From Glove to Flair for Handling Imbalanced Healthcare Corpora Related to Adverse Drug Events, Change in Medication and Self-reporting Vaccination. In: Proceedings of the 29th International Conference on Computational Linguistics, Gyeongju, Republic of Korea, 12-17 Oct 2022, pp. 148-152.

Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Sylolypavan, Aneeta, Wang, Minhong and Wild, Sarah (2022) Quantifying Health Inequalities Induced by Data and AI Models. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, Austria, 23-29 July 2022, pp. 5192-5198. ISBN 9781956792003 (doi: 10.24963/ijcai.2022/721)

Francis, F., Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668, Luz, S., Townsend, R. and Stock, S. (2022) Detecting Intrapartum Fetal Hypoxia from Cardiotocography Using Machine Learning. In: 49th Computing in Cardiology Conference CinC 2022, Tampere, Finland, 4-7 Sept 2022, ISBN 9798350300970 (doi: 10.22489/CinC.2022.339)

Dong, Hang, Suárez-Paniagua, Víctor, Zhang, Huayu, Wang, Minhong, Whitfield, Emma and Wu, Honghan ORCID: https://orcid.org/0000-0002-0213-5668 (2021) Rare Disease Identification from Clinical Notes with Ontologies and Weak Supervision. In: 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Online, 31 Oct-4 Nov 2021, pp. 2294-2298. ISBN 9781728111797 (doi: 10.1109/embc46164.2021.9630043)

Protocols

Guellil, I. et al. (2023) Natural language processing for detecting adverse drug events: a systematic review protocol. [Protocols]

This list was generated on Sun Jun 15 03:07:59 2025 BST.

Prior publications

Article

Ruochen Huang et al. (2025) Evaluation and Bias Analysis of Large Language Models in Generating Synthetic Electronic Health Records: Comparative Study Journal of Medical Internet Research Crossref. (doi: 10.2196/65317)

Jamie Chow, Ryan Lee, Honghan Wu (2025) How Do Radiologists Currently Monitor AI in Radiology and What Challenges Do They Face? An Interview Study and Qualitative Analysis Journal of Imaging Informatics in Medicine Crossref. (doi: 10.1007/s10278-025-01493-8)

Tuankasfee Hama, Mohanad M Alsaleh, Freya Allery, Jung Won Choi, Christopher Tomlinson, Honghan Wu, Alvina Lai, Nikolas Pontikos, Johan H Thygesen (2025) Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review Journal of Medical Internet Research Crossref. (doi: 10.2196/57358)

Ruochen Huang et al. (2024) Evaluation and Bias Analysis of Large Language Models in Generating Synthetic Electronic Health Records: Comparative Study (Preprint) Crossref. (doi: 10.2196/preprints.65317)

Tuankasfee Hama, Mohanad M Alsaleh, Freya Allery, Jung Won Choi, Christopher Tomlinson, Honghan Wu, Alvina Lai, Nikolas Pontikos, Johan H Thygesen (2024) Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review (Preprint) Crossref. (doi: 10.2196/preprints.57358)

Thygesen JH et al. (2021) Understanding COVID-19 trajectories from a nationwide linked electronic health record cohort of 56 million people: phenotypes, severity, waves & vaccination Europe PubMed Central. (doi: 10.1101/2021.11.08.21265312)

Hang Dong, Víctor Suárez-Paniagua, William Whiteley, Honghan Wu (2021) Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation Journal of Biomedical Informatics Honghan Wu. ISSN 23318422 (doi: 10.48550/arxiv.2010.15728)

Wood A et al. (2021) Linked electronic health records for research on a nationwide cohort of more than 54 million people in England: data resource. BMJ (Clinical research ed.) Europe PubMed Central. (doi: 10.1136/bmj.n826)

Whitfield E, Coffey C, Zhang H, Shi T, Wu X, Li Q, Wu H (2021) Axes of Prognosis: Identifying Subtypes of COVID-19 Outcomes Europe PubMed Central. (doi: 10.1101/2021.03.16.21253371)

Wu H et al. (2021) Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2. Journal of the American Medical Informatics Association : JAMIA Europe PubMed Central. (doi: 10.1093/jamia/ocaa295)

Whitfield E, Coffey C, Zhang H, Shi T, Wu X, Li Q, Wu H (2021) Axes of Prognosis: Identifying Subtypes of COVID-19 Outcomes. AMIA ... Annual Symposium proceedings. AMIA Symposium Europe PubMed Central.

Yuan Y et al. (2020) Development and Validation of a Prognostic Risk Score System for COVID-19 Inpatients: A Multi-Center Retrospective Study in China. Engineering (Beijing, China) Europe PubMed Central. (doi: 10.1016/j.eng.2020.10.013)

Kuang X, Cheung JP, Wu H, Dokos S, Zhang T (2020) MRI-SegFlow: a novel unsupervised deep learning pipeline enabling accurate vertebral segmentation of MRI images. Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Europe PubMed Central. (doi: 10.1109/embc44109.2020.9175987)

Wu H et al. (2020) Knowledge Driven Phenotyping. Studies in health technology and informatics Europe PubMed Central. (doi: 10.3233/shti200425)

(2020) Risk prediction for poor outcome and death in hospital in-patients with COVID-19: derivation in Wuhan, China and external validation in London, UK medrxiv Honghan Wu. (doi: 10.2139/ssrn.3590468)

Ibrahim ZM, Wu H, Hamoud A, Stappen L, Dobson RJB, Agarossi A (2020) On classifying sepsis heterogeneity in the ICU: insight using machine learning. Journal of the American Medical Informatics Association : JAMIA Europe PubMed Central. (doi: 10.1093/jamia/ocz211)

Wu H, Hodgson K, Dyson S, Morley KI, Ibrahim ZM, Iqbal E, Stewart R, Dobson RJ, Sudlow C (2019) Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach. JMIR medical informatics Europe PubMed Central. (doi: 10.2196/14782)

Kugathasan P, Wu H, Gaughran F, Nielsen RE, Pritchard M, Dobson R, Stewart R, Stubbs B (2019) Association of physical health multimorbidity with mortality in people with schizophrenia spectrum disorders: Using a novel semantic search system that captures physical diseases in electronic patient records. Schizophrenia research Europe PubMed Central. (doi: 10.1016/j.schres.2019.10.061)

Bean DM, Teo J, Wu H, Oliveira R, Patel R, Bendayan R, Shah AM, Dobson RJB, Scott PA (2019) Semantic computational analysis of anticoagulation use in atrial fibrillation from real world data. PloS one Europe PubMed Central. (doi: 10.1371/journal.pone.0225625)

Honghan Wu, Karen Hodgson, Sue Dyson, Katherine I Morley, Zina M Ibrahim, Ehtesham Iqbal, Robert Stewart, Richard JB Dobson, Cathie Sudlow (2019) Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach (Preprint) Crossref. (doi: 10.2196/preprints.14782)

(2019) Named Entity Recognition for Electronic Health Records: A Comparison of Rule-based and Machine Learning Approaches Honghan Wu. ISSN 23318422 (doi: 10.48550/arxiv.1903.03985)

Honghan Wu et al. (2018) SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research Journal of the American Medical Informatics Association Crossref Metadata Search. ISSN 1067-5027 (doi: 10.1093/jamia/ocx160)

Bean, D.M., Wu, H., Iqbal, E., Dzahini, O., Ibrahim, Z.M., Broadbent, M., Stewart, R., Dobson, R.J.B. (2018) Erratum: Author Correction: Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records (Scientific reports (2017) 7 1 (16416)) Scientific reports Scopus - Elsevier. ISSN 20452322 (doi: 10.1038/s41598-018-22521-4)

Ehtesham Iqbal et al. (2017) ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records PLOS ONE Crossref Metadata Search. ISSN 1932-6203 (doi: 10.1371/journal.pone.0187121)

Daniel M. Bean, Honghan Wu, Olubanke Dzahini, Matthew Broadbent, Robert Stewart, Richard J. B. Dobson (2017) Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records Scientific Reports Crossref Metadata Search. ISSN 2045-2322 (doi: 10.1038/s41598-017-16674-x)

Honghan Wu et al. (2017) SemEHR: surfacing semantic data from clinical notes in electronic health records for tailored care, trial recruitment, and clinical research The Lancet Crossref Metadata Search. ISSN 0140-6736 (doi: 10.1016/s0140-6736(17)33032-5)

(2017) Automated PDF highlighting to support faster curation of literature for Parkinson's and Alzheimer's disease. Database : the journal of biological databases and curation Europe PubMed Central. (doi: 10.1093/database/bax027)

Wu, H., Qu, Y., Li, H. (2010) Searching semantic web documents based on RDF sentences Jisuanji Yanjiu yu Fazhan/Computer Research and Development Scopus - Elsevier.

Wu, H., Qu, Y. (2009) Understanding semantic web entity: Concept space based summarization method Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition) Scopus - Elsevier. (doi: 10.3969/j.issn.1001-0505.2009.04.014)

Other

Nguyen, Q. et al. (2024) Advancing Question-Answering in Ophthalmology with Retrieval-Augmented Generation (RAG): Benchmarking Open-source and Proprietary Large Language Models medRxiv Scopus - Elsevier. (doi: 10.1101/2024.11.18.24317510)

Kim, Y., Wu, J., Abdulle, Y., Gao, Y., Wu, H. (2024) Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2404.02370)

Hasan, A., Wu, J., Nguyen, Q.N., Andres, S., Guellil, I., Zhang, H., Casey, A., Alex, B., Guthrie, B., Wu, H. (2024) Infusing clinical knowledge into tokenisers for language models arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.14312)

Wu, J., Wu, Z., Li, R., Hasan, A., Kim, Y., Cheung, J.P.Y., Zhang, T., Wu, H. (2024) Integrating Knowledge Retrieval and Large Language Models for Clinical Report Correction arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.15045)

Wu, Z., Hasan, A., Wu, J., Kim, Y., Cheung, J.P.Y., Zhang, T., Wu, H. (2024) KnowLab_AIMed at MEDIQA-CORR 2024: Chain-of-Though (CoT) prompting strategies for medical error detection and correction arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.09103)

Wu, J., Hasan, A., Wu, H. (2024) RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.03062)

Wu, J., Kim, Y., Shi, D., Cliffton, D., Liu, F., Wu, H. (2024) SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2409.13321)

Banerjee A et al. (2020) Excess deaths in people with cardiovascular diseases during the COVID-19 pandemic. Europe PubMed Central. (doi: 10.1101/2020.06.10.20127175)

Carr E et al. (2020) Evaluation and Improvement of the National Early Warning Score (NEWS2) for COVID-19: a multi-hospital study Europe PubMed Central. (doi: 10.1101/2020.04.24.20078006)

Ibrahim, Z.M. et al. (2020) A knowledge distillation ensemble framework for predicting short and long-term hospitalisation outcomes from electronic health records data arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2011.09361)

Bendayan, R. et al. (2020) Identifying physical health comorbidities in a cohort of individuals with severe mental illness: An application of SemEHR arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2002.08901)

Ibrahim, Z., Wu, H., Dobson, R. (2020) Modeling rare interactions in time series data through qualitative change: application to outcome prediction in intensive care units arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2004.01431)

Wu, H., Hodgson, K., Dyson, S., Morley, K.I., Ibrahim, Z.M., Iqbal, E., Stewart, R., Dobson, R.J.B., Sudlow, C. (2019) Efficiently Reusing Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: Methodology Study arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.1903.03995)

Wu H et al. (2017) SemEHR: A General-purpose Semantic Search System to Surface Semantic Data from Clinical Notes for Tailored Care, Trial Recruitment and Clinical Research Europe PubMed Central. (doi: 10.1101/235622)

Richard Jackson et al. (2017) CogStack - Experiences Of Deploying Integrated Information Retrieval And Extraction Services In A Large National Health Service Foundation Trust Hospital Crossref Metadata Search. (doi: 10.1101/123299)

Jose Manuel Gomez-Perez, Jeff Z. Pan, Guido Vetere, Honghan Wu (2017) Enterprise Knowledge Graph: An Introduction Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_1)

Jeff Z. Pan, Jose Manuel Gomez-Perez, Guido Vetere, Honghan Wu, Yuting Zhao, Marco Monti (2017) Enterprise Knowledge Graph: Looking into the Future Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_9)

Ronald Denaux, Yuan Ren, Boris Villazon-Terrazas, Panos Alexopoulos, Alessandro Faraotti, Honghan Wu (2017) Knowledge Architecture for Organisations Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_3)

Boris Villazon-Terrazas, Nuria Garcia-Santa, Yuan Ren, Alessandro Faraotti, Honghan Wu, Yuting Zhao, Guido Vetere, Jeff Z. Pan (2017) Knowledge Graph Foundations Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_2)

Alessandro Moschitti et al. (2017) Question Answering and Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_7)

Honghan Wu, Ronald Denaux, Panos Alexopoulos, Yuan Ren, Jeff Z. Pan (2017) Understanding Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_6)

Zina M. Ibrahim, Honghan Wu, Robbie Mallah, Richard J. B. Dobson (2016) Category-Driven Association Rule Mining Research and Development in Intelligent Systems XXXIII Crossref Metadata Search. (doi: 10.1007/978-3-319-47175-4_2)

Honghan Wu, Zina M. Ibrahim, Ehtesham Iqbal, Richard J. B. Dobson (2016) Encoding Medication Episodes for Adverse Drug Event Prediction Research and Development in Intelligent Systems XXXIII Crossref Metadata Search. (doi: 10.1007/978-3-319-47175-4_18)

Yuting Zhao, Guido Vetere, Jeff Z. Pan, Alessandro Faraotti, Marco Monti, Honghan Wu (2016) Meta-Level Properties for Reasoning on Dynamic Data Semantic Technology Crossref Metadata Search. (doi: 10.1007/978-3-319-31676-5_19)

Jeff Z. Pan, José Manuel Gómez Pérez, Yuan Ren, Honghan Wu, Haofen Wang, Man Zhu (2015) Graph Pattern Based RDF Data Compression Semantic Technology Crossref Metadata Search. (doi: 10.1007/978-3-319-15615-6_18)

Honghan Wu, Boris Villazon-Terrazas, Jeff Z. Pan, Jose Manuel Gomez-Perez (2014) Exploiting Semantic Web Datasets: A Graph Pattern Based Approach The Semantic Web and Web Science Crossref Metadata Search. (doi: 10.1007/978-3-662-45495-4_15)

Conference Proceedings

Ibrahim, Z., Wu, H., Wiratunga, N. (2023) Preface: The 6th International Workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073

Bach, K., Bunescu, R., Farri, O., Guo, A., Hasan, S., Ibrahim, Z., Marling, C., Raffa, J., Rubin, J., Wu, H. (2018) Preface: The 3rd international workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073

Chandra Pandey, Zina Ibrahim, Honghan Wu, Ehtesham Iqbal, Richard Dobson (2017) Improving RNN with Attention and Embedding for Adverse Drug Reactions Proceedings of the 2017 International Conference on Digital Health - DH '17 Crossref Metadata Search. (doi: 10.1145/3079452.3079501)

Ibrahim, Z., Wu, H., Bach, K., Dobson, R., Denaxas, S., Wiratunga, N., Massie, S., Sani, S. (2017) Preface: The 2nd International Workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073

Wang, H., Sun, Q., Oellrich, A., Wu, H., Dobson, R. (2017) The psycho-ENV corpus: Research articles annotated for knowledge discovery on correlating mental diseases and environmental factors CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073

Chen, J., Chen, H., Zheng, G., Pan, J.Z., Wu, H., Zhang, N. (2014) Big smog meets web science: Smog disaster analysis based on social media and device data on the web WWW 2014 Companion - Proceedings of the 23rd International Conference on World Wide Web Scopus - Elsevier. (doi: 10.1145/2567948.2576941)

Wu, H., Villazon-Terrazas, B., Pan, J.Z., Gomez-Perez, J.M. (2014) How redundant is it?-An empirical analysis on linked datasets CEUR Workshop Proceedings Scopus - Elsevier.

Jeff Z. Pan, Yuan Ren, Honghan Wu, Man Zhu (2013) Query generation for semantic datasets Proceedings of the seventh international conference on Knowledge capture - K-CAP '13 Crossref Metadata Search. (doi: 10.1145/2479832.2479859)

Ago Luberg, Michael Granitzer, Honghan Wu, Priit Järv, Tanel Tammet (2012) Information retrieval and deduplication for tourism recommender sightsplanner Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics - WIMS '12 Crossref Metadata Search. (doi: 10.1145/2254129.2254191)

Honghan Wu, Ago Luberg, Tanel Tammet (2012) Ranking domain objects by wisdom of web pages Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics - WIMS '12 Crossref Metadata Search. (doi: 10.1145/2254129.2254210)

Cheng, G., Wu, H., Ge, W., Qu, Y. (2008) Searching Semantic Web objects based on class hierarchies CEUR Workshop Proceedings Scopus - Elsevier.

Hu, W., Zhao, Y., Li, D., Cheng, G., Wu, H., Qu, Y. (2007) Falcon-AO: Results for OAEI 2007 CEUR Workshop Proceedings Scopus - Elsevier.

Book Section

Gomez-Perez, J.M., Pan, J.Z., Vetere, G., Wu, H. (2017) Enterprise knowledge graph: An introduction Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_1)

Pan, J.Z., Gomez-Perez, J.M., Vetere, G., Wu, H., Zhao, Y., Monti, M. (2017) Enterprise knowledge graph: Looking into the future Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_9)

Denaux, R., Ren, Y., Villazon-Terrazas, B., Alexopoulos, P., Faraotti, A., Wu, H. (2017) Knowledge architecture for organisations Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_3)

Villazon-Terrazas, B., Garcia-Santa, N., Ren, Y., Faraotti, A., Wu, H., Zhao, Y., Vetere, G., Pan, J.Z. (2017) Knowledge graph foundations Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_2)

Moschitti, A. et al. (2017) Question answering and knowledge graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_7)

Wu, H., Denaux, R., Alexopoulos, P., Ren, Y., Pan, J.Z. (2017) Understanding Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_6)

Book

Jeff Z. Pan, Guido Vetere, Jose Manuel Gomez-Perez, Honghan Wu (2017) Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6)

Supervision

  • Xu, Chao
    Developing a risk stratification tool to detect ADHD in children and adolescents

Professional activities & recognition

Research fellowships

  • 2018 - 2022: Health Data Research UK

Editorial boards

  • 2021: BMC Medical Informatics and Decision Making
  • 2022: BMC Digital Health

Professional & learned societies

  • 2020 - 2023: Turing Fellow, Alan Turing Institute