Dr Tanaya Guha

  • Senior Lecturer (School of Computing Science)

email: Tanaya.Guha@glasgow.ac.uk

S132 Lilybank Gardens, University of Glasgow

Import to contacts

ORCID iDhttps://orcid.org/0000-0003-2167-4891

Biography

Please see my personal website, which is regularly updated.

I am a Senior Lecturer of Computing Science at University of Glasgow, where I am a member of the Social AI group within GIST section. I also hold an Honorary Associate Professor position in the Department of Computer Science, University of Warwick

My research focuses on developing machine intelligence capabilities to understand human behaviour combining Deep Learning, Computer Vision, and Signal/Speech Processing.  

I received my PhD degree in Electrical & Computer Engineering from the University of British Columbia (UBC), Vancouver in 2013. After graduation, I was a Postdoctoral Fellow at SAILUniversity of Southern California (USC), Los Angeles. In 2015, I joined IIT Kanpur, India as an Assistant Professor of Electrical Engineering. In 2018, I moved to University of Warwick as an Assistant Professor, and later became an Associate Professor.  Since 2021, I am a Senior Lecturer in the University of Glasgow

Publications

List by: Type | Date

Jump to: 2024 | 2023 | 2022 | 2021
Number of items: 20.

2024

Fringi, E., Alshubaily, N., Picinali, L., Brewster, S. A. , Guha, T. and Vinciarelli, A. (2024) Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments. In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024, pp. 321-330. ISBN 9798400704628 (doi: 10.1145/3678957.3685740)

Alsenani, B., Esposito, A., Vinciarelli, A. and Guha, T. (2024) Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System. In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024, pp. 3797-3804. ISBN 9781643685489 (doi: 10.3233/FAIA240941)

ALOSHBAN, N. I. Z., Esposito, A., Vinciarelli, A. and Guha, T. (2024) On the effects of obfuscating speaker attributes in privacy-aware depression detection. Pattern Recognition Letters, 186, pp. 300-305. (doi: 10.1016/j.patrec.2024.10.016)

Li, G. et al. (2024) Detecting in-car VR Motion Sickness from Lower Face Action Units. In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024, (Accepted for Publication)

Styles, O., Miller, S., Cerda-Mardini, P. and Guha, T. (2024) WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting. In: Conference on Language Modeling (COLM) 2024, Pennsylvania, Philadelphia, USA, 07-09 Oct 2024, (Accepted for Publication)

2023

Gahalawat, M., Fernandez Rojas, R., Guha, T. , Subramanian, R. and Goecke, R. (2023) Explainable Depression Detection via Head Motion Patterns. In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023, pp. 261-270. ISBN 9798400700552 (doi: 10.1145/3577190.3614130)

Alsenani, B., Guha, T. and Vinciarelli, A. (2023) Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack. In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023, pp. 651-655. (doi: 10.21437/Interspeech.2023-454)

Ma, Y., Sanchez, V., Nikan, S., Upadhyay, D., Atote, B. and Guha, T. (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, pp. 2617-2625. ISBN 9798350302493 (doi: 10.1109/CVPRW59228.2023.00260)

Shirian, A., Ahmadian, M., Somandepalli, K. and Guha, T. (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

2022

Min, K., Roy, S., Tripathi, S., Guha, T. and Majumdar, S. (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Styles, O., Guha, T. and Sanchez, V. (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, A., Somandepalli, K. and Guha, T. (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, A., Somandepalli, K., Sanchez, V. and Guha, T. (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, D., Guha, T. and Sanchez, V. (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, J., Guha, T. and Sanchez, V. (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Y., Sanchez, V. and Guha, T. (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, A., Tripathi, S. and Guha, T. (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

2021

Somandepalli, K., Guha, T. , Martinez, V. R., Kumar, N., Adam, H. and Narayanan, S. (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Shirian, A. and Guha, T. (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, K., Tripathi, S., Du, B., Guha, T. and Nguyen, T. Q. (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Thu Dec 5 13:17:30 2024 GMT.
Number of items: 20.

Articles

ALOSHBAN, N. I. Z., Esposito, A., Vinciarelli, A. and Guha, T. (2024) On the effects of obfuscating speaker attributes in privacy-aware depression detection. Pattern Recognition Letters, 186, pp. 300-305. (doi: 10.1016/j.patrec.2024.10.016)

Styles, O., Guha, T. and Sanchez, V. (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, A., Somandepalli, K. and Guha, T. (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, A., Tripathi, S. and Guha, T. (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

Somandepalli, K., Guha, T. , Martinez, V. R., Kumar, N., Adam, H. and Narayanan, S. (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Conference Proceedings

Fringi, E., Alshubaily, N., Picinali, L., Brewster, S. A. , Guha, T. and Vinciarelli, A. (2024) Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments. In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024, pp. 321-330. ISBN 9798400704628 (doi: 10.1145/3678957.3685740)

Alsenani, B., Esposito, A., Vinciarelli, A. and Guha, T. (2024) Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System. In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024, pp. 3797-3804. ISBN 9781643685489 (doi: 10.3233/FAIA240941)

Li, G. et al. (2024) Detecting in-car VR Motion Sickness from Lower Face Action Units. In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024, (Accepted for Publication)

Styles, O., Miller, S., Cerda-Mardini, P. and Guha, T. (2024) WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting. In: Conference on Language Modeling (COLM) 2024, Pennsylvania, Philadelphia, USA, 07-09 Oct 2024, (Accepted for Publication)

Gahalawat, M., Fernandez Rojas, R., Guha, T. , Subramanian, R. and Goecke, R. (2023) Explainable Depression Detection via Head Motion Patterns. In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023, pp. 261-270. ISBN 9798400700552 (doi: 10.1145/3577190.3614130)

Alsenani, B., Guha, T. and Vinciarelli, A. (2023) Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack. In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023, pp. 651-655. (doi: 10.21437/Interspeech.2023-454)

Ma, Y., Sanchez, V., Nikan, S., Upadhyay, D., Atote, B. and Guha, T. (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, pp. 2617-2625. ISBN 9798350302493 (doi: 10.1109/CVPRW59228.2023.00260)

Shirian, A., Ahmadian, M., Somandepalli, K. and Guha, T. (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

Min, K., Roy, S., Tripathi, S., Guha, T. and Majumdar, S. (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Shirian, A., Somandepalli, K., Sanchez, V. and Guha, T. (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, D., Guha, T. and Sanchez, V. (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, J., Guha, T. and Sanchez, V. (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Y., Sanchez, V. and Guha, T. (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, A. and Guha, T. (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, K., Tripathi, S., Du, B., Guha, T. and Nguyen, T. Q. (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Thu Dec 5 13:17:30 2024 GMT.

Supervision

  • Alsenani, Basmah Mohammed E
    Novel Frameworks for Systematic Assessment of Privacy Risks in Affective Speech AI Models
  • Altalhi, Sahar
    An Analysis of Oral Presentations in View of an Analysis of Public Speaking
  • Bian, Tongfei
    Vision-based social understanding and prediction
  • Elfleet, Morad
    Enhancing Immersive Virtual Interactions with Real-time Behaviour- Driven Virtual Agents
  • Ghosh, Bishal
    Adapting Nonverbal Communication Dynamics to Human-Robot Social Interaction
  • Gutierrez Serafin, Benjamin
    Designing Mindful Intervention with Therapeutic Music on Earables to Manage Occupational Fatigue
  • Li, Xinyu
    Interpretable Framework for Affective Computing Applications
  • Mulkana, Sundas Rafat
    Robot Motion Planning in Dynamic Environment
  • Noolkar, Amey Anil
    Digital sensing and intervention for wellbeing in workplace