Dr Tanaya Guha

Senior Lecturer (School of Computing Science)

S132 Lilybank Gardens, University of Glasgow

Biography

Please see my personal website, which is regularly updated.

I am a Senior Lecturer in Computing Science at the University of Glasgow, where I am a member of the Social AI group within the GIST section. I also hold an Honorary Associate Professor position in the Department of Computer Science, University of Warwick.

My research focuses on developing machine intelligence capabilities to understand human behaviour by combining Deep Learning, Computer Vision, and Signal/Speech Processing.

I received my PhD in Electrical & Computer Engineering from the University of British Columbia (UBC), Vancouver in 2013. After graduation, I was a Postdoctoral Fellow at SAIL, University of Southern California (USC), Los Angeles. In 2015, I joined IIT Kanpur, India as an Assistant Professor of Electrical Engineering. In 2018, I moved to the University of Warwick as an Assistant Professor, and later became an Associate Professor. Since 2021, I have been a Senior Lecturer at the University of Glasgow.

Research groups

Glasgow Interactive Systems (GIST)

Publications

Number of items: 30.

2026

Altalhi, Sahar ORCID: https://orcid.org/0000-0001-7862-9974, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID: https://orcid.org/0000-0002-9048-0524 (2026) Depression markers in speech: An approach based on tract variables dynamics. Journal of the Acoustical Society of America, 160(1), pp. 277-288. (doi: 10.1121/10.0044193) (PMID:42390167)

Mooney, Michael ORCID: https://orcid.org/0009-0009-5329-0420, Ho, Edmond S.L. ORCID: https://orcid.org/0000-0001-5862-106X and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2026) Conditioning On The Receivers State: Towards A Personalized Surprisal. The 4th Workshop on Eye Movements and the Assessment of Reading Comprehension, Koblenz, Germany, 18-20 June 2026. (Accepted for Publication)

2025

Bian, Tongfei, Ma, Yiming, Chollet, Mathieu ORCID: https://orcid.org/0000-0001-9858-6844, Sanchez, Victor and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2025) Interact with Me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions. In: IEEE International Conference on Multimedia & Expo (ICME) 2025, Nantes, France, 30 June-4 July 2025, ISBN 9798331594954 (doi: 10.1109/ICME59968.2025.11210231)

Bian, Tongfei, Chollet, Mathieu ORCID: https://orcid.org/0000-0001-9858-6844 and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2025) Robust Understanding of Human-Robot Social Interactions through Multimodal Distillation. In: 33rd ACM International Conference on Multimedia (MM '25), Dublin, Ireland, 27-31 Oct 2025, pp. 5726-5734. ISBN 9798400720352 (doi: 10.1145/3746027.3755463)

Taka, Evdoxia ORCID: https://orcid.org/0000-0001-7011-3367, Bhattacharya, Debadyuti, Garde-Hansen, Joanne, Sharma, Sanjay and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2025) Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust. In: 27th International Conference on Multimodal Interaction (ICMI 2025), Canberra, Australia, 13-17 Oct 2025, pp. 466-474. ISBN 9798400714993 (doi: 10.1145/3716553.3750785)

Leyva, Roberto, Shen, Guodong, Bahadir, Ozan, Sanchez, Victor and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2025) Boosting Tiny Face Detection in Videos with an Integral Score Framework. In: 19th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2025), Clearwater, Florida, USA, 27-29 May 2025, ISBN 9798331553418 (doi: 10.1109/FG61629.2025.11099181)

Liao, Jiashu, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2025) Self-supervised random mask attention GAN in tackling pose-invariant face recognition. Pattern Recognition, 159, 111112. (doi: 10.1016/j.patcog.2024.111112)

Madan, Surbhi, Gahalawat, Monika, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891, Goecke, Roland and Subramanian, Ramanathan (2025) Explainable human-centered traits from head motion and facial expression dynamics. PLoS ONE, 20(1), e0313883. (doi: 10.1371/journal.pone.0313883) (PMID:39823428) (PMCID:PMC11741400)

2024

Ghosh, Bishal, Li, Emma ORCID: https://orcid.org/0000-0003-4200-0669 and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2024) Active Listener: Continuous Generation of Listener’s Head Motion Response in Dyadic Interactions. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, 6-11 April 2025, ISBN 9798350368741 (doi: 10.1109/ICASSP49660.2025.10889429)

Li, G. et al. (2024) Detecting in-car VR Motion Sickness from Lower Face Action Units. In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024, pp. 1019-1028. ISBN 9798331516475 (doi: 10.1109/ISMAR62088.2024.00118)

Ajayi, Olayinka, Wen, Hongkai and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2024) NAPE: Numbering as a Position Encoding in graphs. IEEE Access, 12, pp. 166200-166210. (doi: 10.1109/access.2024.3495703)

Fringi, Evangelia ORCID: https://orcid.org/0009-0008-9642-660X, Alshubaily, Nesreen, Picinali, Lorenzo, Brewster, Stephen Anthony ORCID: https://orcid.org/0000-0001-9720-3899, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID: https://orcid.org/0000-0002-9048-0524 (2024) Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments. In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024, pp. 321-330. ISBN 9798400704628 (doi: 10.1145/3678957.3685740)

Alsenani, Basmah, Esposito, Anna, Vinciarelli, Alessandro ORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2024) Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System. In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024, pp. 3797-3804. ISBN 9781643685489 (doi: 10.3233/FAIA240941)

Aloshban, Nujud Ibrahim Z, Esposito, Anna, Vinciarelli, Alessandro ORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2024) On the effects of obfuscating speaker attributes in privacy-aware depression detection. Pattern Recognition Letters, 186, pp. 300-305. (doi: 10.1016/j.patrec.2024.10.016)

Styles, Olly, Miller, Sam, Cerda-Mardini, Patricia, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891, Sanchez, Victor and Vidgen, Bertie (2024) WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting. In: Conference on Language Modeling (COLM) 2024, Philadelphia, Pennsylvania, USA, 07-09 Oct 2024.

2023

Gahalawat, Monika, Fernandez Rojas, Raul, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891, Subramanian, Ramanathan and Goecke, Roland (2023) Explainable Depression Detection via Head Motion Patterns. In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023, pp. 261-270. ISBN 9798400700552 (doi: 10.1145/3577190.3614130)

Alsenani, Basmah, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID: https://orcid.org/0000-0002-9048-0524 (2023) Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack. In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023, pp. 651-655. (doi: 10.21437/Interspeech.2023-454)

Ma, Yiming, Sanchez, Victor, Nikan, Soodeh, Upadhyay, Devesh, Atote, Bhushan and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, pp. 2617-2625. ISBN 9798350302493 (doi: 10.1109/CVPRW59228.2023.00260)

Shirian, Amir, Ahmadian, Mona, Somandepalli, Krishna and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

2022

Min, Kyle, Roy, Sourya, Tripathi, Subarna, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Majumdar, Somdeb (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Styles, Olly, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, Amir, Somandepalli, Krishna and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, Amir, Somandepalli, Krishna, Sanchez, Victor and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, Debaleena, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, Jiashu, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Yiming, Sanchez, Victor and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, Amir, Tripathi, Subarna and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

2021

Somandepalli, Krishna, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891, Martinez, Victor R., Kumar, Naveen, Adam, Hartwig and Narayanan, Shrikanth (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Shirian, Amir and Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, Kien, Tripathi, Subarna, Du, Bang, Guha, Tanaya ORCID: https://orcid.org/0000-0003-2167-4891 and Nguyen, Truong Q (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Mon Jul 20 11:33:20 2026 BST.

Supervision

Altalhi, Sahar: An Analysis of Oral Presentations in View of an Analysis of Public Speaking
Bian, Tongfei: Vision-based social understanding and prediction
Gan, Zhuowei: Theoretical Framework for Emotional Intelligence in Large Language Models
Ghosh, Bishal: Adapting Nonverbal Communication Dynamics to Human-Robot Social Interaction
Gutierrez Serafin, Benjamin: Designing Mindful Intervention with Therapeutic Music on Earables to Manage Occupational Fatigue
Moghanloo, Yasaman: Modeling the Heart–Lung Axis through Multimodal Learning for Health Monitoring
Mulkana, Sundas Rafat: Robot Motion Planning in Dynamic Environment
