Publications

2024

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model

Haogeng Liu , Quanzeng You , Xiaotian Han , Yongfei Liu , Huaibo Huang , Ran He , and Hongxia Yang

In NeurIPS , 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Yuang Ai , Xiaoqiang Zhou , Huaibo Huang , Xiaotian Han , Zhengyu Chen , Quanzeng You , and Hongxia Yang

In NeurIPS , 2024
Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning

Jiaqi Wang , Chenxu Zhao , Lingjuan Lyu , Quanzeng You , Mengdi Huai , and Fenglong Ma

In Forty-first International Conference on Machine Learning , 2024
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model

Haogeng Liu , Quanzeng You , Yiqi Wang , Xiaotian Han , Bohan Zhai , Yongfei Liu , Wentao Chen , Yiren Jian , Yunzhe Tao , Jianbo Yuan , and others

In Findings of the Association for Computational Linguistics ACL 2024 , 2024
Law of Vision Representation in MLLMs

Shijia Yang , Bohan Zhai , Quanzeng You , Jianbo Yuan , Hongxia Yang , and Chenfeng Xu

arXiv preprint arXiv:2408.16357, 2024
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Xiaotian Han , Yiren Jian , Xuefeng Hu , Haogeng Liu , Yiqi Wang , Qihang Fan , Yuang Ai , Huaibo Huang , Ran He , Zhenheng Yang , and others

arXiv preprint arXiv:2409.12568, 2024
ViTAR: Vision transformer with any resolution

Qihang Fan , Quanzeng You , Xiaotian Han , Yongfei Liu , Yunzhe Tao , Huaibo Huang , Ran He , and Hongxia Yang

arXiv preprint arXiv:2403.18361, 2024
InfiMM-HD: A leap forward in high-resolution multimodal understanding

Haogeng Liu , Quanzeng You , Xiaotian Han , Yiqi Wang , Bohan Zhai , Yongfei Liu , Yunzhe Tao , Huaibo Huang , Ran He , and Hongxia Yang

arXiv preprint arXiv:2403.01487, 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models: A Comprehensive Survey on Emerging Trends in Multimodal Reasoning

Yiqi Wang , Wentao Chen , Xiaotian Han , Xudong Lin , Haiteng Zhao , Yongfei Liu , Bohan Zhai , Jianbo Yuan , Quanzeng You , and Hongxia Yang

arXiv e-prints, 2024
COCO is "ALL" You Need for Visual Instruction Fine-tuning

Xiaotian Han , Yiqi Wang , Bohan Zhai , Quanzeng You , and Hongxia Yang

arXiv e-prints, 2024

2023

Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation

Shanshan Wang , Yiyang Chen , Zhenwei He , Xun Yang , Mengzhu Wang , Quanzeng You , and Xingyi Zhang

In Proceedings of the 31st ACM International Conference on Multimedia , 2023
TransMOT: Spatial-temporal graph transformer for multiple object tracking

Peng Chu , Jiang Wang , Quanzeng You , Haibin Ling , and Zicheng Liu

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2023
MMPTrack: Large-scale densely annotated multi-camera multiple people tracking benchmark

Xiaotian Han , Quanzeng You , Chunyu Wang , Zhizheng Zhang , Peng Chu , Houdong Hu , Jiang Wang , and Zicheng Liu

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2023
Deep frequency filtering for domain generalization

Shiqi Lin , Zhizheng Zhang , Zhipeng Huang , Yan Lu , Cuiling Lan , Peng Chu , Quanzeng You , Jiang Wang , Zicheng Liu , Amey Parulkar , and others

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2023
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement

Andre Abrantes , Jiang Wang , Peng Chu , Quanzeng You , and Zicheng Liu

arXiv preprint arXiv:2306.04774, 2023
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

Huangjie Zheng , Zhendong Wang , Jianbo Yuan , Guanghan Ning , Pengcheng He , Quanzeng You , Hongxia Yang , and Mingyuan Zhou

arXiv preprint arXiv:2310.06389, 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis

Xiaohui Chen , Yongfei Liu , Yingxiang Yang , Jianbo Yuan , Quanzeng You , Li-Ping Liu , and Hongxia Yang

arXiv preprint arXiv:2311.17126, 2023
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts

Tianqi Chen , Yongfei Liu , Zhendong Wang , Jianbo Yuan , Quanzeng You , Hongxia Yang , and Mingyuan Zhou

arXiv preprint arXiv:2312.01408, 2023
InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Xiaotian Han , Quanzeng You , Yongfei Liu , Wentao Chen , Huangjie Zheng , Khalil Mrini , Xudong Lin , Yiqi Wang , Bohan Zhai , Jianbo Yuan , and others

arXiv e-prints, 2023

2022

Lifelong unsupervised domain adaptive person re-identification with coordinated anti-forgetting and adaptation

Zhipeng Huang , Zhizheng Zhang , Cuiling Lan , Wenjun Zeng , Peng Chu , Quanzeng You , Jiang Wang , Zicheng Liu , and Zheng-jun Zha

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2022
Qualifier: Question-guided self-attentive multimodal fusion network for audio visual scene-aware dialog

Muchao Ye , Quanzeng You , and Fenglong Ma

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2022
SA-VQA: structured alignment of visual and semantic representations for visual question answering

Peixi Xiong , Quanzeng You , Pei Yu , Zicheng Liu , and Ying Wu

arXiv preprint arXiv:2201.10654, 2022
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

Quanzeng You , Jiang Wang , Peng Chu , Andre Abrantes , and Zicheng Liu

arXiv preprint arXiv:2206.07011, 2022

2021

4D tracking utilizing depth data from multiple 3D cameras

Hao Jiang , Quanzeng You , and Zhengyou Zhang

Jul 2021

US Patent 11,062,469
Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification

Zhizheng Zhang , Cuiling Lan , Wenjun Zeng , Quanzeng You , Zicheng Liu , Kecheng Zheng , and Zhibo Chen

arXiv preprint arXiv:2103.13917, Jul 2021
Writing by memorizing: Hierarchical retrieval-based medical report generation

Xingyi Yang , Muchao Ye , Quanzeng You , and Fenglong Ma

arXiv preprint arXiv:2106.06471, Jul 2021

2020

Double-layer conditional random fields model for human action recognition

Tianliang Liu , Xiaodong Dong , Yanzhang Wang , Xiubin Dai , Quanzeng You , and Jiebo Luo

Signal Processing: Image Communication, Jul 2020
Real-time 3D deep multi-camera tracking

Quanzeng You , and Hao Jiang

arXiv preprint arXiv:2003.11753, Jul 2020
A benchmark dataset for understandable medical language translation

Junyu Luo , Zifei Zheng , Hanzhong Ye , Muchao Ye , Yaqing Wang , Quanzeng You , Cao Xiao , and Fenglong Ma

arXiv preprint arXiv:2012.02420, Jul 2020

2019

Real-time multiple people hand localization in 4D point clouds

Hao Jiang , and Quanzeng You

arXiv preprint arXiv:1903.01695, Jul 2019
Action4D: Online action recognition in the crowd and clutter

Quanzeng You , and Hao Jiang

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , Jul 2019
Sentiment recognition for short annotated GIFs using visual-textual fusion

Tianliang Liu , Junwei Wan , Xiubin Dai , Feng Liu , Quanzeng You , and Jiebo Luo

IEEE Transactions on Multimedia, Jul 2019

2018

Image captioning at will: A versatile scheme for effectively injecting sentiments into image descriptions

Quanzeng You , Hailin Jin , and Jiebo Luo

arXiv preprint arXiv:1801.10121, Jul 2018
Touch your heart: A tone-aware chatbot for customer care on social media

Tianran Hu , Anbang Xu , Zhe Liu , Quanzeng You , Yufan Guo , Vibha Sinha , Jiebo Luo , and Rama Akkiraju

In Proceedings of the 2018 CHI conference on human factors in computing systems , Jul 2018
End-to-End Convolutional Semantic Embeddings

Quanzeng You , Zhengyou Zhang , and Jiebo Luo

In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , Jul 2018
Action4D: Real-time action recognition in the crowd and clutter

Quanzeng You , and Hao Jiang

arXiv preprint arXiv:1806.02424, Jul 2018
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM

Yuxiao Chen* , Jianbo Yuan* , Quanzeng You , and Jiebo Luo

In ACM Multimedia Conference, Seoul, Korea , Jul 2018
"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention

Tianlang Chen , Zhongping Zhang , Quanzeng You , Chen Fang , Zhaowen Wang , Hailin Jin , and Jiebo Luo

In ECCV 2018 , Jul 2018
Risk Prediction on Electronic Health Records with Prior Medical Knowledge

Fenglong Ma , Jing Gao , Qiuling Suo , Quanzeng You , Jing Zhou , and Aidong Zhang

In SIGKDD , Jul 2018
KAME: Knowledge-based attention model for diagnosis prediction in healthcare

Fenglong Ma , Quanzeng You , Houping Xiao , Radha Chitta , Jing Zhou , and Jing Gao

In Proceedings of the 27th ACM International Conference on Information and Knowledge Management , Jul 2018

2017

Image-based appraisal of real estate properties

Quanzeng You , Ran Pang , Liangliang Cao , and Jiebo Luo

IEEE Transactions on Multimedia, Jul 2017
Visual Sentiment Analysis by Attending on Local Image Regions

Quanzeng You , Hailin Jin , and Jiebo Luo

In Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17) , Jul 2017
Cultural Diffusion and Trends in Facebook Photographs

Quanzeng You , Darío García-García , Manohar Paluri , Jiebo Luo , and Jungseock Joo

In ICWSM , Jul 2017
Semantic natural language vector space

Zhaowen Wang , Quanzeng You , Hailin Jin , and Chen Fang

Oct 2017

US Patent 9,792,534
Image captioning with weak supervision

Zhaowen Wang , Quanzeng You , Hailin Jin , and Chen Fang

Nov 2017

US Patent 9,811,765
Aesthetic quality assessment of photos with faces

Weining Wang , Jiexiong Huang , Xiangmin Xu , Quanzeng You , and Jiebo Luo

In Image and Graphics: 9th International Conference, ICIG 2017, Shanghai, China, September 13-15, 2017, Revised Selected Papers, Part III 9 , Nov 2017
Social multimedia sentiment analysis

Jiebo Luo , Damian Borth , and Quanzeng You

In Proceedings of the 25th ACM international conference on Multimedia , Nov 2017
When saliency meets sentiment: Understanding how image content invokes emotion and sentiment

Honglin Zheng , Tianlang Chen , Quanzeng You , and Jiebo Luo

In 2017 IEEE International Conference on Image Processing (ICIP) , Nov 2017
Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks

Fenglong Ma , Radha Chitta , Jing Zhou , Quanzeng You , Tong Sun , and Jing Gao

In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining , Nov 2017

2016

Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark

Quanzeng You , Jiebo Luo , Hailin Jin , and Jianchao Yang

In The Thirtieth AAAI Conference on Artificial Intelligence (AAAI) , Nov 2016
Cross-modality consistent regression for joint visual-textual sentiment analysis of social multimedia

Quanzeng You , Jiebo Luo , Hailin Jin , and Jianchao Yang

In Proceedings of the Ninth ACM international conference on Web search and data mining , Nov 2016
Image captioning with semantic attention

Quanzeng You , Hailin Jin , Zhaowen Wang , Chen Fang , and Jiebo Luo

In CVPR 2016 , Nov 2016
Voting with feet: who are leaving Hillary Clinton and Donald Trump

Yu Wang , Yang Feng , Jiebo Luo , and Xiyang Zhang

In 2016 IEEE International Symposium on Multimedia (ISM) , Nov 2016
User characteristic prediction using images posted in online social networks

Quanzeng You , and Sumit Bhatia

Nov 2016

US Patent 9,489,592
Sampling for Nyström extension-based spectral clustering: Incremental perspective and novel analysis

Xianchao Zhang , Linlin Zong , Quanzeng You , and Xing Yong

ACM Transactions on Knowledge Discovery from Data (TKDD), Nov 2016
Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications

Quanzeng You

In ACM MM (DS) , Nov 2016
Adaptive greedy dictionary selection for web media summarization

Yang Cong , Ji Liu , Gan Sun , Quanzeng You , Yuncheng Li , and Jiebo Luo

IEEE Transactions on Image Processing, Nov 2016
The effect of pets on happiness: A data-driven approach via large-scale social media

Yuchen Wu , Jianbo Yuan , Quanzeng You , and Jiebo Luo

In 2016 IEEE International Conference on Big Data (Big Data) , Nov 2016
A picture tells a thousand words—About you! User interest profiling from user generated visual content

Quanzeng You , Sumit Bhatia , and Jiebo Luo

Signal Processing, Nov 2016
Robust visual-textual sentiment analysis: When attention meets tree-structured recursive neural networks

Quanzeng You , Liangliang Cao , Hailin Jin , and Jiebo Luo

In Proceedings of the 24th ACM international conference on Multimedia , Nov 2016

2015

Robust image sentiment analysis using progressively trained and domain transferred deep networks

Quanzeng You , Jiebo Luo , Hailin Jin , and Jianchao Yang

In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas, USA , Nov 2015
Snap n’ Shop: Visual Search-Based Mobile Shopping Made a Breeze by Machine and Crowd Intelligence

Quanzeng You , Jianbo Yuan , Jiaqi Wang , Philip Guo , and Jiebo Luo

In IEEE International Conference on Semantic Computing , Nov 2015
Sentiment analysis using social multimedia

Jianbo Yuan , Quanzeng You , and Jiebo Luo

Multimedia Data Mining and Analytics: Disruptive Innovation, Nov 2015
Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User’s Online Photo Collections

Danning Zheng , Tianran Hu , Quanzeng You , and Jiebo Luo

In AAAI International Conference on Weblogs and Social Media (ICWSM) , Nov 2015
Joint Visual-Textual Sentiment Analysis with Deep Neural Networks

Quanzeng You , Jiebo Luo , Hailin Jin , and Jianchao Yang

In ACM Multimedia , Nov 2015
A Multifaceted Approach to Social Multimedia-based Prediction of Elections

Quanzeng You , Liangliang Cao , Yang Cong , Xianchao Zhang , and Jiebo Luo

IEEE Transactions on Multimedia, Nov 2015
A Picture Tells a Thousand Words–About You! User Interest Profiling from User Generated Visual Content

Quanzeng You , Sumit Bhatia , and Jiebo Luo

arXiv preprint arXiv:1504.04558, Nov 2015

2014

Transit tomography using probabilistic time geography: planning routes without a road map

Quanzeng You , and John Krumm

Journal of Location Based Services, Nov 2014
The eyes of the beholder: Gender prediction using images posted in online social networks

Quanzeng You , Sumit Bhatia , Tong Sun , and Jiebo Luo

In IEEE International Conference on Data Mining, Workshop on Social Multimedia Data Mining , Nov 2014
Inferring home location from user’s photo collections based on visual content and mobility patterns

Danning Zheng , Tianran Hu , Quanzeng You , Henry Kautz , and Jiebo Luo

In Proceedings of the 3rd ACM multimedia workshop on geotagging and its applications in multimedia , Nov 2014

2013

Sentribute: image sentiment analysis from a mid-level perspective

Jianbo Yuan , Sean Mcdonough , Quanzeng You , and Jiebo Luo

In Proceedings of the second international workshop on issues of sentiment discovery and opinion mining , Nov 2013
Towards understanding the effectiveness of election related images in social media

Junhuan Zhu , Jiebo Luo , Quanzeng You , and John R Smith

In 2013 IEEE 13th International Conference on Data Mining Workshops , Nov 2013
Towards social imagematics: sentiment analysis in social multimedia

Quanzeng You , and Jiebo Luo

In Proceedings of the thirteenth international workshop on multimedia data mining , Nov 2013
Are there cultural differences in event driven information propagation over social media?

Jianbo Yuan , Quanzeng You , and Jiebo Luo

In Proceedings of the 2nd international workshop on Socially-aware multimedia , Nov 2013

2011

An improved spectral clustering algorithm based on random walk

Xianchao Zhang , and Quanzeng You

Frontiers of Computer Science in China, Nov 2011
Clusterability analysis and incremental sampling for Nyström extension based spectral clustering

Xianchao Zhang , and Quanzeng You

In 2011 IEEE 11th International Conference on Data Mining , Nov 2011