Dr Tanaya Guha

  • Senior Lecturer (School of Computing Science)

email: Tanaya.Guha@glasgow.ac.uk

S132 Lilybank Gardens, University of Glasgow

ORCID iDhttps://orcid.org/0000-0003-2167-4891

Biography

I am a Senior Lecturer (Associate Professor) in the School of Computing Science, University of Glasgow, where I am a member of the Glasgow Interactive Systems (GIST) section. My research focuses on developing machine intelligence capabilities to understand human activities and behaviour combining machine learning, computer vision and signal/speech processing.

I received my PhD in Electrical and Computer Engineering from the University of British Columbia (UBC), Vancouver. After graduation, I was a Postdoctoral Fellow at the Signal and Information Processing Institute, University of Southern California (USC), Los Angeles. Subsequently, I was an Assistant Professor of Electrical Engineering at IIT Kanpur, India. Later I moved to the UK to join the Department of Computer Science, University of Warwick, where I still hold an Honorary Associate Professor position.

I was a recipient of Warwick Global Research Priority award, ICME Outstanding Area Chair award and Mensa Canada Woodhams memorial scholarship, among other awards and honours. I am a member of ISCA, IEEE, an elected member of IEEE MSA Technical Committee and an Executive Committee member of AAAC. I serve in the Editorial Board of Nature Scientific Reports and APSIPA Transactions on Signal and Information Processing. I am actively involved in the Organising/Program Committees of various conferences including Interspeech, WACV, ICME, ICMI and ACII. More information about my research and activities can be found on my personal website.

Publications

List by: Type | Date

Jump to: 2023 | 2022 | 2021
Number of items: 13.

2023

Shirian, A., Ahmadian, M., Somandepalli, K. and Guha, T. (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

Ma, Y., Sanchez, V., Nikan, S., Upadhyay, D., Atote, B. and Guha, T. (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, (Accepted for Publication)

2022

Min, K., Roy, S., Tripathi, S., Guha, T. and Majumdar, S. (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Styles, O., Guha, T. and Sanchez, V. (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, A., Somandepalli, K. and Guha, T. (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, A., Somandepalli, K., Sanchez, V. and Guha, T. (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, D., Guha, T. and Sanchez, V. (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, J., Guha, T. and Sanchez, V. (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Y., Sanchez, V. and Guha, T. (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, A., Tripathi, S. and Guha, T. (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

2021

Somandepalli, K., Guha, T. , Martinez, V. R., Kumar, N., Adam, H. and Narayanan, S. (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Shirian, A. and Guha, T. (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, K., Tripathi, S., Du, B., Guha, T. and Nguyen, T. Q. (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Fri Jun 9 16:43:35 2023 BST.
Number of items: 13.

Articles

Styles, O., Guha, T. and Sanchez, V. (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, A., Somandepalli, K. and Guha, T. (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, A., Tripathi, S. and Guha, T. (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

Somandepalli, K., Guha, T. , Martinez, V. R., Kumar, N., Adam, H. and Narayanan, S. (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Conference Proceedings

Shirian, A., Ahmadian, M., Somandepalli, K. and Guha, T. (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

Ma, Y., Sanchez, V., Nikan, S., Upadhyay, D., Atote, B. and Guha, T. (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, (Accepted for Publication)

Min, K., Roy, S., Tripathi, S., Guha, T. and Majumdar, S. (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Shirian, A., Somandepalli, K., Sanchez, V. and Guha, T. (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, D., Guha, T. and Sanchez, V. (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, J., Guha, T. and Sanchez, V. (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Y., Sanchez, V. and Guha, T. (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, A. and Guha, T. (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, K., Tripathi, S., Du, B., Guha, T. and Nguyen, T. Q. (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Fri Jun 9 16:43:35 2023 BST.

Grants

  • PI: Latent Graph Learning and Classification. Intel Corporation Small Grant. 2022 - 2023.
  • PI: Multimodal Learning for In-Car Driver Activity Monitoring. Ford University Research Program. 2021 - 2023.
  • PI: Crossmodal Biometric Matching. Warwick RDF Award. 2019.
  • PI: Application-aware Image Quality Assessment. Indian Space Research Organization. 2016 - 2018. CoI: Holistic Scene Understanding. Samsung Research. 2017 - 2018.
  • PI: NVIDIA Academic GPU Grant. 2017.