About Me

My name is Pengpeng Zeng. I am a fourth-year PhD student at University of Electronic Science and Technology of China, advised by Prof.Jingkuan Song, Prof.Lianli Gao and Prof.Heng Tao SHEN.

My research interests are Machine Learning, Deep Learning, Computer Vision and Reinforcement Learning etc.

Email:is.pengpengzeng@gmail.com

[Google Scholar] [GitHub] [DBLP]

News



  • Feb. 27, 2023: One Paper Accepted by CVPR2024! New!
  • Nov. 20, 2023: One Paper Accepted by TCSVT! New!
  • Sep. 29, 2023: One Paper Accepted by TNNLS! New!
  • Jul. 26, 2023: One Paper Accepted by ACM MM2023! New!
  • Jul. 20, 2023: One Paper Accepted by TPAMI! New!
  • Jul. 10, 2023: One Paper Accepted by TMM! New!
  • Jul. 2, 2023: One Paper Accepted by TCSVT!
  • Jan. 12, 2023: One Paper Accepted by PR!
  • Dec. 25, 2022: One Paper Accepted by TCSVT!
  • Sep. 15, 2022: One Paper Accepted by NeurIPS 2022!
  • Aug. 18, 2022: One Paper Accepted by TIP!
  • Jun. 30, 2022: Two Papers Accepted by ACM MM 2022!
  • Apr. 21, 2022: One Paper Accepted by IJCAI 2022!

Publications



Journal Conference

SPT: Spatial Pyramid Transformer for Image Captioning
Haonan Zhang, Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Jingkuan Song, Heng Tao Shen
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023
[PDF][code]


Visual Commonsense-aware Representation Network for Video Captioning
Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
[PDF][code]


Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Xinyu Lyu, Lianli Gao, Pengpeng Zeng, Heng Tao Shen, Jingkuan Song
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
[PDF][code]


Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation
Chaofang Zheng, Lianli Gao, Xinyu Lyu, Pengpeng Zeng, Abdulmotaleb El Saddik, Heng Tao Shen
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023
[PDF][code]


Memory-based Augmentation Network for Video Captioning
Shuaiqi Jing, Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen
IEEE Transactions on Multimedia (TMM), 2023
[PDF][code]


Learning Visual Question Answering on Controlled Semantic Noisy Labels
Haonan Zhang, Pengpeng Zeng, Yuxuan Hu, Jin Qian, Jingkuan Song, Lianli Gao
Pattern Recognition (PR), 2023
[PDF][code]


Complementarity-aware Space Learning for Video-Text Retrieval
Jinkuan Zhu, Pengpeng Zeng, Lianli Gao, Gongfu Li, Dongliang Liao, Jingkuan Song
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023
[PDF][code]


Video Question Answering with Prior Knowledge and Object-sensitive Learning
Pengpeng Zeng, Haonan Zhang, Lianli Gao, Jingkuan Song, Heng Tao Shen
IEEE Transactions on Image Processing (TIP), 2022
[PDF][code]


Hierarchical Representation Network with Auxiliary Tasks for Video Captioning and Video Question Answering
Lianli Gao, Yu Lei, Pengpeng Zeng, Jingkuan Song, Meng Wang, Heng Tao Shen
IEEE Transactions on Image Processing (TIP), 2022
[PDF][code]


Text-Instance Graph: Exploring Relational Semantics for Text-based Visual Question Answering
Xiangpeng Li, Bo Wu, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Chuang Gan
Pattern Recognition (PR), 2022
[PDF][code]


Relation-aware Aggregation Network with Auxiliary Guidance for Text-based Person Search
Pengpeng Zeng, Shuaiqi Jing, Jingkuan Song, Kaixuan Fan, Xiangpeng Li, Liansuo We, Yuan Guo
World Wide Web Journal (WWWJ), 2021
[PDF]


Generalized Pyramid Co-Attention with Learnable Aggregation Net for Video Question Answering
Lianli Gao, Tangming Chen, Xiangpeng Li, Pengpeng Zeng, Lei Zhao, Yuan-Fang Li
Pattern Recognition (PR), 2021
[PDF] [code]


Rich Visual Knowledge-based Augmentation Network for Visual Question Answering
Liyang Zhang, Shuaicheng Liu, Donghao Liu, Pengpeng Zeng, Xiangpeng Li, Lianli Gao, Jingkuan Song
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2020
[PDF] [code]




Journal Conference

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval
Kaipeng Fang, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Zhi-Qi Cheng, Xiyao Li, Heng Tao Shen
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[PDF][code]


Depth-Aware Sparse Transformer for Video-Language Learning
Haonan Zhang, Lianli Gao, Pengpeng Zeng, Alan Hanjalic, Heng Tao Shen , Heng Tao Shen
ACM International Conference on Multimedia (MM), 2023
[PDF][code]


Progressive Tree-Structured Prototype Network for End-to-End Image Captioning
Pengpeng Zeng, Jinkuan Zhu, Jingkuan Song, Lianli Gao
ACM International Conference on Multimedia (MM), 2022
[PDF][code]


Dynamic Scene Graph Generation via Temporal Prior Inference
Shuang Wang, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao
ACM International Conference on Multimedia (MM), 2022
[PDF][code]


S2 Transformer for Image Captioning
Pengpeng Zeng, Haonan Zhang, Jingkuan Song, Lianli Gao
International Joint Conference on Artificial Intelligence (IJCAI), 2022
[PDF] [code]


Support-set based Multi-modal Representation Enhancement for Video Captioning
Xiaoya Chen, Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen
IEEE International Conference on Multimedia & Expo (ICME), 2022
[PDF] [code]


Learning To Generate Scene Graph From Head To Tail
Chaofan Zheng, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao
IEEE International Conference on Multimedia & Expo (ICME), 2022
[PDF] [code]


Conceptual and Syntactical Cross-modal Alignment with Cross-level Consistency for Image-Text Matching
Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Shuaiqi Jing, Jingkuan Song
ACM International Conference on Multimedia (MM), 2021
[PDF]


Hierarchical Representation Network with Auxiliary Task for Video Captioning
Yu Lei, Zhonghai He, Pengpeng Zeng, Jingkuan Song, Lianli Gao
IEEE International Conference on Multimedia & Expo (ICME), 2021 (Oral)
[PDF] [code]


Structured Two-stream Attention Network for Video Question Answering
Lianli Gao, Pengpeng Zeng, Jingkuan Song, YuanFang Li, Wu Liu, Tao Mei and Heng Tao Shen
AAAI Conference on Artificial Intelligence (AAAI), 2019
[PDF]


Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQA
Lianli Gao, Pengpeng Zeng,Jingkuan Song, Xianglong Liu, Heng Tao Shen
ACM International Conference on Multimedia (MM), 2018
[PDF]


From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen
International Joint Conference on Artificial Intelligence (IJCAI), 2018
[PDF]


Experience



Education
Ph.D, Computer Science and Technology, 2019-2023, University of Electronic Science and Technology of China (UESTC)
M.S., Computer Technology, 2016-2019, University of Electronic Science and Technology of China (UESTC)
B.S., Digital Media Technology, 2012-2016, Xi'an University of Technology (XUT)

Research Intern JD AI CV-Lab 2018.6-2018.09
Advisor: Dr.Wu liu
Deep learning for visual question answering

Academic Service



Journal Reviewer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), since 2023
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), since 2023
Knowledge-Based Systems (KBS), since 2023
ACM Transactions on Data Science (TDS), since 2021
ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), since 2020

Conference Reviewer
IEEE International Conference on Computer Vision (ICCV) 2023 - PC Member
IEEE Winter Conference on Applications of Computer Vision (WACV) 2023 -PC Member
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022/2023 - PC Member
European Conference on Computer Vision (ECCV) 2022 - PC Member
International Conference on Pattern Recognition (ICPR) 2021/2022 - PC Member
AAAI Conference on Artificial Intelligence (AAAI) 2021/2022/2023 - PC Member
ACM International Conference on Multimedia (MM) 2021/2022/2023 - PC Member