Professor Honghan Wu
- Professor of Health Informatics and AI (Public Health)
Biography
I am a Professor of Health Informatics and AI at the School of Health and Wellbeing, University of Glasgow. I am a co-director of Health Data Research UK Scotland. I am also an honorary professor at Hong Kong University and an honorary associate professor at UCL. I am a former Turing Fellow of The Alan Turing Institute (2020-2023) and Rutherford Fellow of Health Data Research UK (2018-2022). I received my BEng and PhD degrees from Southeast University, China. Before my PhD, I worked in industry for about six years, primarily as a software developer.
My research lab website is at https://knowlab.github.io/. I also co-lead the Edinburgh Clinical Natural Language Processing group (https://www.ed.ac.uk/usher/clinical-natural-language-processing) and co-organise the Turing Health Equity group (https://www.turing.ac.uk/research/interest-groups/health-equity).
Research interests
Machine learning, natural language processing, knowledge graphs, and their applications in medicine. Details of my research and team updates can be found at https://knowlab.github.io/.
Publications
Prior publications
Article
Ruochen Huang et al. (2025) Evaluation and Bias Analysis of Large Language Models in Generating Synthetic Electronic Health Records: Comparative Study Journal of Medical Internet Research Crossref. (doi: 10.2196/65317)
Jamie Chow, Ryan Lee, Honghan Wu (2025) How Do Radiologists Currently Monitor AI in Radiology and What Challenges Do They Face? An Interview Study and Qualitative Analysis Journal of Imaging Informatics in Medicine Crossref. (doi: 10.1007/s10278-025-01493-8)
Tuankasfee Hama, Mohanad M Alsaleh, Freya Allery, Jung Won Choi, Christopher Tomlinson, Honghan Wu, Alvina Lai, Nikolas Pontikos, Johan H Thygesen (2025) Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review Journal of Medical Internet Research Crossref. (doi: 10.2196/57358)
Ruochen Huang et al. (2024) Evaluation and Bias Analysis of Large Language Models in Generating Synthetic Electronic Health Records: Comparative Study (Preprint) Crossref. (doi: 10.2196/preprints.65317)
Tuankasfee Hama, Mohanad M Alsaleh, Freya Allery, Jung Won Choi, Christopher Tomlinson, Honghan Wu, Alvina Lai, Nikolas Pontikos, Johan H Thygesen (2024) Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review (Preprint) Crossref. (doi: 10.2196/preprints.57358)
Thygesen JH et al. (2021) Understanding COVID-19 trajectories from a nationwide linked electronic health record cohort of 56 million people: phenotypes, severity, waves & vaccination Europe PubMed Central. (doi: 10.1101/2021.11.08.21265312)
Hang Dong, Víctor Suárez-Paniagua, William Whiteley, Honghan Wu (2021) Explainable automated coding of clinical notes using hierarchical label-wise attention networks and label embedding initialisation Journal of Biomedical Informatics Honghan Wu. ISSN 23318422 (doi: 10.48550/arxiv.2010.15728)
Wood A et al. (2021) Linked electronic health records for research on a nationwide cohort of more than 54 million people in England: data resource. BMJ (Clinical research ed.) Europe PubMed Central. (doi: 10.1136/bmj.n826)
Whitfield E, Coffey C, Zhang H, Shi T, Wu X, Li Q, Wu H (2021) Axes of Prognosis: Identifying Subtypes of COVID-19 Outcomes Europe PubMed Central. (doi: 10.1101/2021.03.16.21253371)
Wu H et al. (2021) Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2. Journal of the American Medical Informatics Association : JAMIA Europe PubMed Central. (doi: 10.1093/jamia/ocaa295)
Whitfield E, Coffey C, Zhang H, Shi T, Wu X, Li Q, Wu H (2021) Axes of Prognosis: Identifying Subtypes of COVID-19 Outcomes. AMIA ... Annual Symposium proceedings. AMIA Symposium Europe PubMed Central.
Yuan Y et al. (2020) Development and Validation of a Prognostic Risk Score System for COVID-19 Inpatients: A Multi-Center Retrospective Study in China. Engineering (Beijing, China) Europe PubMed Central. (doi: 10.1016/j.eng.2020.10.013)
Kuang X, Cheung JP, Wu H, Dokos S, Zhang T (2020) MRI-SegFlow: a novel unsupervised deep learning pipeline enabling accurate vertebral segmentation of MRI images. Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Europe PubMed Central. (doi: 10.1109/embc44109.2020.9175987)
Wu H et al. (2020) Knowledge Driven Phenotyping. Studies in health technology and informatics Europe PubMed Central. (doi: 10.3233/shti200425)
(2020) Risk prediction for poor outcome and death in hospital in-patients with COVID-19: derivation in Wuhan, China and external validation in London, UK medrxiv Honghan Wu. (doi: 10.2139/ssrn.3590468)
Ibrahim ZM, Wu H, Hamoud A, Stappen L, Dobson RJB, Agarossi A (2020) On classifying sepsis heterogeneity in the ICU: insight using machine learning. Journal of the American Medical Informatics Association : JAMIA Europe PubMed Central. (doi: 10.1093/jamia/ocz211)
Wu H, Hodgson K, Dyson S, Morley KI, Ibrahim ZM, Iqbal E, Stewart R, Dobson RJ, Sudlow C (2019) Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach. JMIR medical informatics Europe PubMed Central. (doi: 10.2196/14782)
Kugathasan P, Wu H, Gaughran F, Nielsen RE, Pritchard M, Dobson R, Stewart R, Stubbs B (2019) Association of physical health multimorbidity with mortality in people with schizophrenia spectrum disorders: Using a novel semantic search system that captures physical diseases in electronic patient records. Schizophrenia research Europe PubMed Central. (doi: 10.1016/j.schres.2019.10.061)
Bean DM, Teo J, Wu H, Oliveira R, Patel R, Bendayan R, Shah AM, Dobson RJB, Scott PA (2019) Semantic computational analysis of anticoagulation use in atrial fibrillation from real world data. PloS one Europe PubMed Central. (doi: 10.1371/journal.pone.0225625)
Honghan Wu, Karen Hodgson, Sue Dyson, Katherine I Morley, Zina M Ibrahim, Ehtesham Iqbal, Robert Stewart, Richard JB Dobson, Cathie Sudlow (2019) Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach (Preprint) Crossref. (doi: 10.2196/preprints.14782)
(2019) Named Entity Recognition for Electronic Health Records: A Comparison of Rule-based and Machine Learning Approaches Honghan Wu. ISSN 23318422 (doi: 10.48550/arxiv.1903.03985)
Honghan Wu et al. (2018) SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research Journal of the American Medical Informatics Association Crossref Metadata Search. ISSN 1067-5027 (doi: 10.1093/jamia/ocx160)
Bean, D.M., Wu, H., Iqbal, E., Dzahini, O., Ibrahim, Z.M., Broadbent, M., Stewart, R., Dobson, R.J.B. (2018) Erratum: Author Correction: Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records (Scientific reports (2017) 7 1 (16416)) Scientific reports Scopus - Elsevier. ISSN 20452322 (doi: 10.1038/s41598-018-22521-4)
Ehtesham Iqbal et al. (2017) ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records PLOS ONE Crossref Metadata Search. ISSN 1932-6203 (doi: 10.1371/journal.pone.0187121)
Daniel M. Bean, Honghan Wu, Olubanke Dzahini, Matthew Broadbent, Robert Stewart, Richard J. B. Dobson (2017) Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records Scientific Reports Crossref Metadata Search. ISSN 2045-2322 (doi: 10.1038/s41598-017-16674-x)
Honghan Wu et al. (2017) SemEHR: surfacing semantic data from clinical notes in electronic health records for tailored care, trial recruitment, and clinical research The Lancet Crossref Metadata Search. ISSN 0140-6736 (doi: 10.1016/s0140-6736(17)33032-5)
(2017) Automated PDF highlighting to support faster curation of literature for Parkinson's and Alzheimer's disease. Database : the journal of biological databases and curation Europe PubMed Central. (doi: 10.1093/database/bax027)
Wu, H., Qu, Y., Li, H. (2010) Searching semantic web documents based on RDF sentences Jisuanji Yanjiu yu Fazhan/Computer Research and Development Scopus - Elsevier.
Wu, H., Qu, Y. (2009) Understanding semantic web entity: Concept space based summarization method Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition) Scopus - Elsevier. (doi: 10.3969/j.issn.1001-0505.2009.04.014)
Other
Nguyen, Q. et al. (2024) Advancing Question-Answering in Ophthalmology with Retrieval-Augmented Generation (RAG): Benchmarking Open-source and Proprietary Large Language Models medRxiv Scopus - Elsevier. (doi: 10.1101/2024.11.18.24317510)
Kim, Y., Wu, J., Abdulle, Y., Gao, Y., Wu, H. (2024) Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2404.02370)
Hasan, A., Wu, J., Nguyen, Q.N., Andres, S., Guellil, I., Zhang, H., Casey, A., Alex, B., Guthrie, B., Wu, H. (2024) Infusing clinical knowledge into tokenisers for language models arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.14312)
Wu, J., Wu, Z., Li, R., Hasan, A., Kim, Y., Cheung, J.P.Y., Zhang, T., Wu, H. (2024) Integrating Knowledge Retrieval and Large Language Models for Clinical Report Correction arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.15045)
Wu, Z., Hasan, A., Wu, J., Kim, Y., Cheung, J.P.Y., Zhang, T., Wu, H. (2024) KnowLab_AIMed at MEDIQA-CORR 2024: Chain-of-Though (CoT) prompting strategies for medical error detection and correction arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.09103)
Wu, J., Hasan, A., Wu, H. (2024) RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2406.03062)
Wu, J., Kim, Y., Shi, D., Cliffton, D., Liu, F., Wu, H. (2024) SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arXiv.2409.13321)
Banerjee A et al. (2020) Excess deaths in people with cardiovascular diseases during the COVID-19 pandemic. Europe PubMed Central. (doi: 10.1101/2020.06.10.20127175)
Carr E et al. (2020) Evaluation and Improvement of the National Early Warning Score (NEWS2) for COVID-19: a multi-hospital study Europe PubMed Central. (doi: 10.1101/2020.04.24.20078006)
Ibrahim, Z.M. et al. (2020) A knowledge distillation ensemble framework for predicting short and long-term hospitalisation outcomes from electronic health records data arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2011.09361)
Bendayan, R. et al. (2020) Identifying physical health comorbidities in a cohort of individuals with severe mental illness: An application of SemEHR arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2002.08901)
Ibrahim, Z., Wu, H., Dobson, R. (2020) Modeling rare interactions in time series data through qualitative change: application to outcome prediction in intensive care units arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.2004.01431)
Wu, H., Hodgson, K., Dyson, S., Morley, K.I., Ibrahim, Z.M., Iqbal, E., Stewart, R., Dobson, R.J.B., Sudlow, C. (2019) Efficiently Reusing Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: Methodology Study arXiv Scopus - Elsevier. ISSN 23318422 (doi: 10.48550/arxiv.1903.03995)
Wu H et al. (2017) SemEHR: A General-purpose Semantic Search System to Surface Semantic Data from Clinical Notes for Tailored Care, Trial Recruitment and Clinical Research Europe PubMed Central. (doi: 10.1101/235622)
Richard Jackson et al. (2017) CogStack - Experiences Of Deploying Integrated Information Retrieval And Extraction Services In A Large National Health Service Foundation Trust Hospital Crossref Metadata Search. (doi: 10.1101/123299)
Jose Manuel Gomez-Perez, Jeff Z. Pan, Guido Vetere, Honghan Wu (2017) Enterprise Knowledge Graph: An Introduction Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_1)
Jeff Z. Pan, Jose Manuel Gomez-Perez, Guido Vetere, Honghan Wu, Yuting Zhao, Marco Monti (2017) Enterprise Knowledge Graph: Looking into the Future Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_9)
Ronald Denaux, Yuan Ren, Boris Villazon-Terrazas, Panos Alexopoulos, Alessandro Faraotti, Honghan Wu (2017) Knowledge Architecture for Organisations Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_3)
Boris Villazon-Terrazas, Nuria Garcia-Santa, Yuan Ren, Alessandro Faraotti, Honghan Wu, Yuting Zhao, Guido Vetere, Jeff Z. Pan (2017) Knowledge Graph Foundations Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_2)
Alessandro Moschitti et al. (2017) Question Answering and Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_7)
Honghan Wu, Ronald Denaux, Panos Alexopoulos, Yuan Ren, Jeff Z. Pan (2017) Understanding Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6_6)
Zina M. Ibrahim, Honghan Wu, Robbie Mallah, Richard J. B. Dobson (2016) Category-Driven Association Rule Mining Research and Development in Intelligent Systems XXXIII Crossref Metadata Search. (doi: 10.1007/978-3-319-47175-4_2)
Honghan Wu, Zina M. Ibrahim, Ehtesham Iqbal, Richard J. B. Dobson (2016) Encoding Medication Episodes for Adverse Drug Event Prediction Research and Development in Intelligent Systems XXXIII Crossref Metadata Search. (doi: 10.1007/978-3-319-47175-4_18)
Yuting Zhao, Guido Vetere, Jeff Z. Pan, Alessandro Faraotti, Marco Monti, Honghan Wu (2016) Meta-Level Properties for Reasoning on Dynamic Data Semantic Technology Crossref Metadata Search. (doi: 10.1007/978-3-319-31676-5_19)
Jeff Z. Pan, José Manuel Gómez Pérez, Yuan Ren, Honghan Wu, Haofen Wang, Man Zhu (2015) Graph Pattern Based RDF Data Compression Semantic Technology Crossref Metadata Search. (doi: 10.1007/978-3-319-15615-6_18)
Honghan Wu, Boris Villazon-Terrazas, Jeff Z. Pan, Jose Manuel Gomez-Perez (2014) Exploiting Semantic Web Datasets: A Graph Pattern Based Approach The Semantic Web and Web Science Crossref Metadata Search. (doi: 10.1007/978-3-662-45495-4_15)
Conference Proceedings
Ibrahim, Z., Wu, H., Wiratunga, N. (2023) Preface: The 6th International Workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073
Bach, K., Bunescu, R., Farri, O., Guo, A., Hasan, S., Ibrahim, Z., Marling, C., Raffa, J., Rubin, J., Wu, H. (2018) Preface: The 3rd international workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073
Chandra Pandey, Zina Ibrahim, Honghan Wu, Ehtesham Iqbal, Richard Dobson (2017) Improving RNN with Attention and Embedding for Adverse Drug Reactions Proceedings of the 2017 International Conference on Digital Health - DH '17 Crossref Metadata Search. (doi: 10.1145/3079452.3079501)
Ibrahim, Z., Wu, H., Bach, K., Dobson, R., Denaxas, S., Wiratunga, N., Massie, S., Sani, S. (2017) Preface: The 2nd International Workshop on Knowledge Discovery in Healthcare Data (KDH) CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073
Wang, H., Sun, Q., Oellrich, A., Wu, H., Dobson, R. (2017) The psycho-ENV corpus: Research articles annotated for knowledge discovery on correlating mental diseases and environmental factors CEUR Workshop Proceedings Scopus - Elsevier. ISSN 16130073
Chen, J., Chen, H., Zheng, G., Pan, J.Z., Wu, H., Zhang, N. (2014) Big smog meets web science: Smog disaster analysis based on social media and device data on the web WWW 2014 Companion - Proceedings of the 23rd International Conference on World Wide Web Scopus - Elsevier. (doi: 10.1145/2567948.2576941)
Wu, H., Villazon-Terrazas, B., Pan, J.Z., Gomez-Perez, J.M. (2014) How redundant is it?-An empirical analysis on linked datasets CEUR Workshop Proceedings Scopus - Elsevier.
Jeff Z. Pan, Yuan Ren, Honghan Wu, Man Zhu (2013) Query generation for semantic datasets Proceedings of the seventh international conference on Knowledge capture - K-CAP '13 Crossref Metadata Search. (doi: 10.1145/2479832.2479859)
Ago Luberg, Michael Granitzer, Honghan Wu, Priit Järv, Tanel Tammet (2012) Information retrieval and deduplication for tourism recommender sightsplanner Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics - WIMS '12 Crossref Metadata Search. (doi: 10.1145/2254129.2254191)
Honghan Wu, Ago Luberg, Tanel Tammet (2012) Ranking domain objects by wisdom of web pages Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics - WIMS '12 Crossref Metadata Search. (doi: 10.1145/2254129.2254210)
Cheng, G., Wu, H., Ge, W., Qu, Y. (2008) Searching Semantic Web objects based on class hierarchies CEUR Workshop Proceedings Scopus - Elsevier.
Hu, W., Zhao, Y., Li, D., Cheng, G., Wu, H., Qu, Y. (2007) Falcon-AO: Results for OAEI 2007 CEUR Workshop Proceedings Scopus - Elsevier.
Book Section
Gomez-Perez, J.M., Pan, J.Z., Vetere, G., Wu, H. (2017) Enterprise knowledge graph: An introduction Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_1)
Pan, J.Z., Gomez-Perez, J.M., Vetere, G., Wu, H., Zhao, Y., Monti, M. (2017) Enterprise knowledge graph: Looking into the future Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_9)
Denaux, R., Ren, Y., Villazon-Terrazas, B., Alexopoulos, P., Faraotti, A., Wu, H. (2017) Knowledge architecture for organisations Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_3)
Villazon-Terrazas, B., Garcia-Santa, N., Ren, Y., Faraotti, A., Wu, H., Zhao, Y., Vetere, G., Pan, J.Z. (2017) Knowledge graph foundations Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_2)
Moschitti, A. et al. (2017) Question answering and knowledge graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_7)
Wu, H., Denaux, R., Alexopoulos, P., Ren, Y., Pan, J.Z. (2017) Understanding Knowledge Graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations Scopus - Elsevier. ISBN 9783319456546 9783319456522 (doi: 10.1007/978-3-319-45654-6_6)
Book
Jeff Z. Pan, Guido Vetere, Jose Manuel Gomez-Perez, Honghan Wu (2017) Exploiting Linked Data and Knowledge Graphs in Large Organisations Crossref Metadata Search. (doi: 10.1007/978-3-319-45654-6)
Supervision
- Xu, Chao
Developing a risk stratification tool to detect ADHD in children and adolescents
Professional activities & recognition
Research fellowships
- 2018 - 2022: Health Data Research UK
Editorial boards
- 2021: BMC Medical Informatics and Decision Making
- 2022: BMC Digital Health
Professional & learned societies
- 2020 - 2023: Turing Fellow, Alan Turing Institute