Youjia Zhang

I am a fourth-year Ph.D. student at SKKU (Sungkyunkwan University), South Korea, advised by Prof. Sungeun Hong in the AI & Media Lab (AIM Lab). My research interests include VLM Pruning, Multimodal Learning, Audio-Visual Recognition, Parameter-Efficient Model Tuning, and Test-Time Adaptation.

🔥 News

2026.02: 🎉🎉 One paper is accepted to CVPR 2026.
2025.09: 🎉🎉 One paper is accepted to NeurIPS 2025.
2025.07: 🎉🎉 One paper is accepted to ACM MM 2025.
2025.06: 🎉🎉 One paper is accepted to IJCV 2025 (Q1, JCR Top 3%, IF 11.6).
2025.04: 🎉🎉 One paper is accepted to CVPR 2025 (Highlight).
2025.01: 🎉🎉 One paper is accepted to Pattern Recognition (Q1, JCR Top 6.9%).
2023.06: 🎉🎉 One paper is accepted to ICMR 2023 (Oral).
2022.12: 🎉🎉 One paper is accepted to ACCV 2022 (Oral).

📝 Selected Publications

ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models
- Youngeun Kim, Youjia Zhang*, Huiling Liu, Aecheon Jung, Sunwoo Lee, and Sungeun Hong
- CVPR 2026 [Project page]
Backpropagation-Free Test-Time Adaptation via Probabilistic Gaussian Alignment
- Youjia Zhang, Youngeun Kim, Young-Geun Choi, Hongyeob Kim, Huiling Liu and Sungeun Hong
- NeurIPS 2025 [Project page]
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data
- Yoorhim Cho*, Hongyeob Kim*, Semin Kim, Youjia Zhang, Yunseok Choi and Sungeun Hong
- ACM MM 2025 [Project page]
CAT-TPT: Class-Agnostic Text-based Test-time Prompt Tuning for Vision-Language Models
- Youjia Zhang, Huiling Liu, Youngeun Kim and Sungeun Hong
- IJCV 2025 [Q1, JCR Top 3%, IF 11.6] [DOI]
Question-Aware Gaussian Experts for Audio-Visual Question Answering
- Hongyeob Kim*, Inyoung Jung*, Dayoon Suh, Youjia Zhang, Sangmin Lee and Sungeun Hong
- CVPR 2025 [Highlight] [Project page]
Memory-Efficient Cross-Modal Attention for RGB-X Segmentation and Crowd Counting
- Youjia Zhang, Soyun Choi, and Sungeun Hong
- Pattern Recognition 2025 [Q1] [DOI]
Spatio-channel attention blocks for cross-modal crowd counting
- Youjia Zhang, Soyun Choi, and Sungeun Hong
- ACCV 2022 [Oral] [Project page]

📂 Projects

RGB-X Path Networks for Multi-modal Multi-task Learning (2023.03 ~ 2026.02)
- Funded by National Research Foundation of Korea (NRF)
- Develop path networks for RGB-X data (e.g., depth, thermal, tactile, text) to enable efficient multi-modal fusion and knowledge transfer across tasks and environments, thereby supporting generalization and adaptability in complex real-world settings.
RGB-D Object Detection and Segmentation based on Multimodal Fusion (2023.03 ~ 2023.10)
- Funded by Samsung Electronics
- Develop an object detection and segmentation framework that effectively fuses depth information with RGB images, enabling robots to automatically identify and manipulate specific objects.
Visuo-Tactile Perception for Human-Like Manipulation of Deformable Objects with Dynamic Center of Mass (2021.09 ~ 2023.08)
- Funded by Samsung Research Funding & Incubation Center for Future Technology
- Develop core technologies for stable grasping and manipulation of soft, deformable objects through visual-tactile fusion and visual-tactile perception, enabling robots to manipulate objects at a human level.

💡 Patents

Spatio-channel attention blocks for cross-modal crowd counting (Registration number C-2022-055027)

Apparatus and Method for Prompt Tuning (U.S. Patent Application No. 19/545,275; Pending)

🏆 Honors & Awards

SKKU STEM Full Scholarship, 2024 – Present
IEIE Encouraging Paper Award, 2024
Oral Presentation, ACCV 2022
2BK21 Excellent Research Award, Inha University, 2023
Global Vision Scholarship, Inha University, 2021 – 2023
Mathematical Contest in Modeling (MCM), Meritorious Winner, 2017

📚 Academic Activities

Reviewer
- ACM International Conference on Multimedia (ACM MM)
- Neural Information Processing Systems (NeurIPS)
Teaching Assistant
- Advanced Computer Vision, SKKU, Fall 2025
- Introduction to Deep Learning, SKKU, Spring 2025
- Advanced Computer Vision, SKKU, Fall 2024
- Computer Vision, Inha University, Fall 2021

🎓 Education

2024–Present: Ph.D. Student, Immersive Media Engineering, Sungkyunkwan University, advised by Prof. Sungeun Hong
2021–2024: Ph.D. Student, Information and Communication Engineering, Inha University, advised by Prof. Sungeun Hong
2018–2021: M.S., School of Computer Science and Technology, CQUPT, advised by Prof. Xu Zhang
2014–2018: B.S., School of Mathematics and Statistics, Chongqing University of Posts and Telecommunications (CQUPT)
