Number of items: 20.
2024
Fringi, E., Alshubaily, N., Picinali, L., Brewster, S. A., Guha, T. and Vinciarelli, A.
(2024)
Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments.
In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024,
pp. 321-330.
ISBN 9798400704628
(doi: 10.1145/3678957.3685740)
Alsenani, B., Esposito, A., Vinciarelli, A. and Guha, T.
(2024)
Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System.
In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024,
pp. 3797-3804.
ISBN 9781643685489
(doi: 10.3233/FAIA240941)
Aloshban, N. I. Z., Esposito, A., Vinciarelli, A. and Guha, T.
(2024)
On the effects of obfuscating speaker attributes in privacy-aware depression detection.
Pattern Recognition Letters, 186,
pp. 300-305.
(doi: 10.1016/j.patrec.2024.10.016)
Li, G. et al.
(2024)
Detecting in-car VR Motion Sickness from Lower Face Action Units.
In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024,
(Accepted for Publication)
Styles, O., Miller, S., Cerda-Mardini, P. and Guha, T.
(2024)
WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting.
In: Conference on Language Modeling (COLM) 2024, Philadelphia, Pennsylvania, USA, 07-09 Oct 2024,
(Accepted for Publication)
2023
Gahalawat, M., Fernandez Rojas, R., Guha, T., Subramanian, R. and Goecke, R.
(2023)
Explainable Depression Detection via Head Motion Patterns.
In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023,
pp. 261-270.
ISBN 9798400700552
(doi: 10.1145/3577190.3614130)
Alsenani, B., Guha, T. and Vinciarelli, A.
(2023)
Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack.
In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023,
pp. 651-655.
(doi: 10.21437/Interspeech.2023-454)
Ma, Y., Sanchez, V., Nikan, S., Upadhyay, D., Atote, B. and Guha, T.
(2023)
Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention.
In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023,
pp. 2617-2625.
ISBN 9798350302493
(doi: 10.1109/CVPRW59228.2023.00260)
Shirian, A., Ahmadian, M., Somandepalli, K. and Guha, T.
(2023)
Heterogeneous Graph Learning for Acoustic Event Classification.
In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023,
ISBN 9781728163277
(doi: 10.1109/ICASSP49357.2023.10095073)
2022
Min, K., Roy, S., Tripathi, S., Guha, T. and Majumdar, S.
(2022)
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection.
In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022,
pp. 371-387.
ISBN 9783031198328
(doi: 10.1007/978-3-031-19833-5_22)
Styles, O., Guha, T. and Sanchez, V.
(2022)
Multi-camera trajectory forecasting with trajectory tensors.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11),
pp. 8482-8491.
(doi: 10.1109/TPAMI.2021.3107958)
(PMID:34437059)
Shirian, A., Somandepalli, K. and Guha, T.
(2022)
Self-supervised graphs for audio representation learning with limited labeled data.
IEEE Journal of Selected Topics in Signal Processing, 16(6),
pp. 1391-1401.
(doi: 10.1109/JSTSP.2022.3190083)
Shirian, A., Somandepalli, K., Sanchez, V. and Guha, T.
(2022)
Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs.
In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022,
pp. 2428-2432.
(doi: 10.21437/Interspeech.2022-10670)
Roy, D., Guha, T. and Sanchez, V.
(2022)
Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data.
In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022,
pp. 212-221.
ISBN 9781665478939
(doi: 10.1109/DCC52660.2022.00029)
Liao, J., Guha, T. and Sanchez, V.
(2022)
Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition.
In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022,
pp. 911-915.
ISBN 9781665496209
(doi: 10.1109/ICIP46576.2022.9897944)
Ma, Y., Sanchez, V. and Guha, T.
(2022)
FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion.
In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022,
pp. 3256-3260.
ISBN 9781665496209
(doi: 10.1109/ICIP46576.2022.9897322)
Shirian, A., Tripathi, S. and Guha, T.
(2022)
Dynamic emotion modeling with learnable graphs and graph inception network.
IEEE Transactions on Multimedia, 24,
pp. 780-790.
(doi: 10.1109/TMM.2021.3059169)
2021
Somandepalli, K., Guha, T., Martinez, V. R., Kumar, N., Adam, H. and Narayanan, S.
(2021)
Computational media intelligence: human-centered machine analysis of media.
Proceedings of the IEEE, 109(5),
pp. 891-910.
(doi: 10.1109/JPROC.2020.3047978)
Shirian, A. and Guha, T.
(2021)
Compact Graph Architecture for Speech Emotion Recognition.
In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021,
pp. 6284-6288.
ISBN 9781728176055
(doi: 10.1109/ICASSP39728.2021.9413876)
Nguyen, K., Tripathi, S., Du, B., Guha, T. and Nguyen, T. Q.
(2021)
In Defense of Scene Graphs for Image Captioning.
In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021,
pp. 1387-1396.
ISBN 9781665428125
(doi: 10.1109/ICCV48922.2021.00144)
This list was generated on Thu Dec 5 13:17:30 2024 GMT.