News

NEW! (Always) I am looking for self-motivated undergraduate and graduate students. Feel free to contact me about research guidance or supervision.

NEW! (Jan. 2025) I joined the School of Computer Science and Technology at Guangdong University of Technology (GDUT) as a Lecturer.

NEW! (Dec. 2024) I passed the final defense of my Ph.D. thesis, “Describing Multimedia with Semantic Alignment”, and became a Ph.D.

NEW! (Sept. 2024) I will attend the ECCV 2024 conference on-site in Milan, Italy. Looking forward to meeting you there!

NEW! (Aug. 2024) One paper was accepted to IEEE Transactions on Circuits and Systems for Video Technology (TCSVT).


Bio

I am currently a Lecturer in the School of Computer Science and Technology at Guangdong University of Technology (GDUT). Before that, I obtained my Ph.D. in computer science in 2024 through the joint doctoral program between Sun Yat-sen University and JD.COM, supervised by Prof. Hongyang Chao, Prof. Jianlin Feng, and Dr. Tao Mei. During my doctoral studies, I had the privilege of being mentored by Dr. Ting Yao, and I collaborated closely with Dr. Yingwei Pan, Dr. Yehao Li, and Dr. Jingwen Chen on various research projects.

Research Interests

  • Computer Vision
  • Multimodal Learning
  • Multimedia Analysis

Education

  • 08/2019 - 12/2024, Sun Yat-sen University (SYSU)
    • Ph.D. in Computer Science and Technology
    • Joint Ph.D. Program with JD.com
    • Thesis topic: Describing Multimedia with Semantic Alignment
  • 08/2015 - 07/2019, Sun Yat-sen University (SYSU)
    • B.Eng. in Software Engineering
    • Recipient of the National Scholarship Award and the Outstanding Undergraduate Award

Experience

  • 01/2025 - Present, Guangdong University of Technology (GDUT), Guangzhou
    • Lecturer @ School of Computer Science and Technology
  • 03/2024 - 09/2024, HiDream.ai Inc., Beijing
  • 07/2020 - 05/2023, Computer Vision and Multimedia Lab at JD Explore Academy, Beijing
    • Research Intern (Star Intern Award)
    • Mentor: Ting Yao
  • 07/2018 - 08/2019, Computer Vision and Multimedia Lab at JD AI Research, Beijing
    • Research Intern (Star Intern Award)
    • Mentor: Ting Yao
  • 03/2018 - 06/2018, Pixtalks Technology, Guangzhou

Academic Services

  • Conference Reviewer: NeurIPS, CVPR, ACM MM, etc.
  • Journal Reviewer: TCSVT, TMM, TOMM, CVIU, etc.

Publications [Google Scholar]

2024

Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning

Jianjie Luo, Jingwen Chen, Yehao Li, Yingwei Pan, Jianlin Feng, Hongyang Chao, Ting Yao

In ECCV, 2024.

PDF Website

Exploring Vision-Language Foundation Model for Novel Object Captioning

Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei

In IEEE Transactions on Circuits and Systems for Video Technology, 2024.

PDF

2023

Semantic-Conditional Diffusion Networks for Image Captioning

Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei

In CVPR, 2023.

PDF Code

2022

Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing

Jingwen Chen, Jianjie Luo, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei

In ACM Transactions on Multimedia Computing, Communications, and Applications, 2022.

PDF

2021

CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

In ACM Multimedia, 2021.

PDF

2020

Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training

Yingwei Pan, Yehao Li, Jianjie Luo, Jun Xu, Ting Yao, Tao Mei

In arXiv, 2020. Later accepted to ACM Multimedia, 2022.

PDF Website

Selected Awards

  • Outstanding Undergraduate, SYSU 2019, Top 5% in SYSU.
  • Chinese National Scholarship, 2018, Top 1% in SYSU.
  • Finalist Winner of the 2018 American College Students Mathematical Modeling Contest, Top 0.4%.