Junchen Fu
Research title: Efficiently Adapting Multimodal Foundation Models for Recommendation
Publications
2025
Ye, Yu, Fu, Junchen, Song, Yu, Zheng, Kaiwen and Jose, Joemon ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
Are Multimodal Embeddings Truly Beneficial for Recommendation? A Deep Dive into Whole vs. Individual Modalities.
In: 48th European Conference on Information Retrieval (ECIR 2026), Delft, The Netherlands, 30 March - 1 April 2026,
(Accepted for Publication)
Yu, Haitao, Fang, Yubo, Ge, Xuri, Xin, Xin, Wang, Zihan, Fu, Junchen, Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759, Ma, Weizhi and Ren, Zhaochun
(2025)
R3AG 2025: Workshop on Refined and Reliable Retrieval-Augmented Generation.
In: 3rd International ACM SIGIR Conference on Information Retrieval in the Asia Pacific, Xi'an, China, 7-10 December 2025,
pp. 461-464.
ISBN 9798400722189
(doi: 10.1145/3767695.3769524)
Fu, Junchen, Ge, Xuri, Xin, Xin, Karatzoglou, Alexandros, Arapakis, Ioannis, Zheng, Kaiwen, Ni, Yongxin and Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
Efficient and effective adaptation of multimodal foundation models in sequential recommendation.
IEEE Transactions on Knowledge and Data Engineering,
(doi: 10.1109/TKDE.2025.3608071)
(Early Online Publication)
Fu, Junchen, Ge, Xuri, Xin, Xin, Yu, Haitao, Feng, Yue, Karatzoglou, Alexandros, Arapakis, Ioannis and Jose, Joemon ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
The 1st EReL@MIR Workshop on Efficient Representation Learning for Multimodal Information Retrieval.
In: WWW '25: The ACM Web Conference 2025, Sydney, Australia, 28 Apr - 02 May 2025,
pp. 2149-2152.
ISBN 9798400713316
(doi: 10.1145/3701716.3717559)
Liu, Zhiyu, Fu, Junchen, Zheng, Kaiwen and Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
Exploring Multimodal Pre-trained Models for Speech Emotion Recognition.
In: ACM Web Conference 2025, Sydney, Australia, 28 April - 2 May 2025,
pp. 2176-2180.
ISBN 9798400713316
(doi: 10.1145/3701716.3717561)
He, Yaoqin, Fu, Junchen, Zheng, Kaiwen, Xu, Songpei, Chen, Fuhai, Li, Jie, Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759 and Ge, Xuri
(2025)
Double-Filter: Efficient Fine-tuning of Pre-trained Vision-Language Models via Patch&Layer Filtering.
In: ICML 2025, Vancouver, Canada, 13-19 July 2025,
(Accepted for Publication)
Zhuang, Ziyi, Du, Hanwen, Han, Hui, Li, Youhua, Fu, Junchen, Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759 and Ni, Yongxin
(2025)
Bridging the Gap: Teacher-Assisted Wasserstein Knowledge Distillation for Efficient Multi-Modal Recommendation.
In: 2025 ACM Web Conference, Sydney, Australia, 28 Apr – 02 May 2025,
pp. 2464-2475.
ISBN 9798400712746
(doi: 10.1145/3696410.3714852)
Zheng, Kaiwen, Ge, Xuri ORCID: https://orcid.org/0000-0002-3925-4951, Fu, Junchen, Peng, Jun and Jose, Joemon ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis.
In: 2025 IEEE International Conference on Multimedia and Expo (ICME), Nantes, France, 30 Jun - 04 Jul 2025,
(Accepted for Publication)
Ge, Xuri, Li, Linqing, Xu, Songpei, Zheng, Kaiwen, He, Yaoqin, Fu, Junchen and Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759
(2025)
The DenseCap-Guided Attention Network For Image-Text Matching.
In: ACM Web Conference 2025, Sydney, Australia, 28 April - 2 May 2025,
(Accepted for Publication)
2024
Ge, Xuri ORCID: https://orcid.org/0000-0002-3925-4951, Fu, Junchen, Chen, Fuhai, An, Shan, Sebe, Nicu and Jose, Joemon M. ORCID: https://orcid.org/0000-0001-9228-1759
(2024)
Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning.
In: 32nd ACM Multimedia Conference (MM2024), Melbourne, Australia, 28 Oct - 01 Nov 2024,
pp. 8189-8198.
ISBN 9798400706868
(doi: 10.1145/3664647.3681443)
Fu, Junchen, Ge, Xuri ORCID: https://orcid.org/0000-0002-3925-4951, Xin, Xin, Karatzoglou, Alexandros, Arapakis, Ioannis, Wang, Jie and Jose, Joemon ORCID: https://orcid.org/0000-0001-9228-1759
(2024)
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT.
In: 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024), Washington D.C., USA, 14-18 July 2024,
pp. 687-697.
ISBN 9798400704314
(doi: 10.1145/3626772.3657725)