publications

publications by categories in reverse chronological order. generated by jekyll-scholar.

2024

  1. Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model
    Haogeng Liu ,  Quanzeng You ,  Xiaotian Han ,  Yongfei Liu ,  Huaibo Huang ,  Ran He ,  and  Hongxia Yang
    In NeurIPS , 2024
  2. DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
    Yuang Ai ,  Xiaoqiang Zhou ,  Huaibo Huang ,  Xiaotian Han ,  Zhengyu Chen ,  Quanzeng You ,  and  Hongxia Yang
    In NeurIPS , 2024
  3. Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning
    Jiaqi Wang ,  Chenxu Zhao ,  Lingjuan Lyu ,  Quanzeng You ,  Mengdi Huai ,  and  Fenglong Ma
    In Forty-first International Conference on Machine Learning , 2024
  4. InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model
    Haogeng Liu ,  Quanzeng You ,  Yiqi Wang ,  Xiaotian Han ,  Bohan Zhai ,  Yongfei Liu ,  Wentao Chen ,  Yiren Jian ,  Yunzhe Tao ,  Jianbo Yuan ,  and  others
    In Findings of the Association for Computational Linguistics ACL 2024 , 2024
  5. Law of Vision Representation in MLLMs
    Shijia Yang ,  Bohan Zhai ,  Quanzeng You ,  Jianbo Yuan ,  Hongxia Yang ,  and  Chenfeng Xu
    arXiv preprint arXiv:2408.16357, 2024
  6. InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
    Xiaotian Han ,  Yiren Jian ,  Xuefeng Hu ,  Haogeng Liu ,  Yiqi Wang ,  Qihang Fan ,  Yuang Ai ,  Huaibo Huang ,  Ran He ,  Zhenheng Yang ,  and  others
    arXiv preprint arXiv:2409.12568, 2024
  7. Vitar: Vision transformer with any resolution
    Qihang Fan ,  Quanzeng You ,  Xiaotian Han ,  Yongfei Liu ,  Yunzhe Tao ,  Huaibo Huang ,  Ran He ,  and  Hongxia Yang
    arXiv preprint arXiv:2403.18361, 2024
  8. InfiMM-HD: A leap forward in high-resolution multimodal understanding
    Haogeng Liu ,  Quanzeng You ,  Xiaotian Han ,  Yiqi Wang ,  Bohan Zhai ,  Yongfei Liu ,  Yunzhe Tao ,  Huaibo Huang ,  Ran He ,  and  Hongxia Yang
    arXiv preprint arXiv:2403.01487, 2024
  9. Exploring the Reasoning Abilities of Multimodal Large Language Models: A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
    Yiqi Wang ,  Wentao Chen ,  Xiaotian Han ,  Xudong Lin ,  Haiteng Zhao ,  Yongfei Liu ,  Bohan Zhai ,  Jianbo Yuan ,  Quanzeng You ,  and  Hongxia Yang
    arXiv e-prints, 2024
  10. COCO is "ALL" You Need for Visual Instruction Fine-tuning
    Xiaotian Han ,  Yiqi Wang ,  Bohan Zhai ,  Quanzeng You ,  and  Hongxia Yang
    arXiv e-prints, 2024

2023

  1. Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation
    Shanshan Wang ,  Yiyang Chen ,  Zhenwei He ,  Xun Yang ,  Mengzhu Wang ,  Quanzeng You ,  and  Xingyi Zhang
    In Proceedings of the 31st ACM International Conference on Multimedia , 2023
  2. Transmot: Spatial-temporal graph transformer for multiple object tracking
    Peng Chu ,  Jiang Wang ,  Quanzeng You ,  Haibin Ling ,  and  Zicheng Liu
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2023
  3. Mmptrack: Large-scale densely annotated multi-camera multiple people tracking benchmark
    Xiaotian Han ,  Quanzeng You ,  Chunyu Wang ,  Zhizheng Zhang ,  Peng Chu ,  Houdong Hu ,  Jiang Wang ,  and  Zicheng Liu
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2023
  4. Deep frequency filtering for domain generalization
    Shiqi Lin ,  Zhizheng Zhang ,  Zhipeng Huang ,  Yan Lu ,  Cuiling Lan ,  Peng Chu ,  Quanzeng You ,  Jiang Wang ,  Zicheng Liu ,  Amey Parulkar ,  and  others
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2023
  5. RefineVIS: Video Instance Segmentation with Temporal Attention Refinement
    Andre Abrantes ,  Jiang Wang ,  Peng Chu ,  Quanzeng You ,  and  Zicheng Liu
    arXiv preprint arXiv:2306.04774, 2023
  6. Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
    Huangjie Zheng ,  Zhendong Wang ,  Jianbo Yuan ,  Guanghan Ning ,  Pengcheng He ,  Quanzeng You ,  Hongxia Yang ,  and  Mingyuan Zhou
    arXiv preprint arXiv:2310.06389, 2023
  7. Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
    Xiaohui Chen ,  Yongfei Liu ,  Yingxiang Yang ,  Jianbo Yuan ,  Quanzeng You ,  Li-Ping Liu ,  and  Hongxia Yang
    arXiv preprint arXiv:2311.17126, 2023
  8. Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts
    Tianqi Chen ,  Yongfei Liu ,  Zhendong Wang ,  Jianbo Yuan ,  Quanzeng You ,  Hongxia Yang ,  and  Mingyuan Zhou
    arXiv preprint arXiv:2312.01408, 2023
  9. InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
    Xiaotian Han ,  Quanzeng You ,  Yongfei Liu ,  Wentao Chen ,  Huangjie Zheng ,  Khalil Mrini ,  Xudong Lin ,  Yiqi Wang ,  Bohan Zhai ,  Jianbo Yuan ,  and  others
    arXiv e-prints, 2023

2022

  1. Lifelong unsupervised domain adaptive person re-identification with coordinated anti-forgetting and adaptation
    Zhipeng Huang ,  Zhizheng Zhang ,  Cuiling Lan ,  Wenjun Zeng ,  Peng Chu ,  Quanzeng You ,  Jiang Wang ,  Zicheng Liu ,  and  Zheng-jun Zha
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2022
  2. Qualifier: Question-guided self-attentive multimodal fusion network for audio visual scene-aware dialog
    Muchao Ye ,  Quanzeng You ,  and  Fenglong Ma
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2022
  3. Sa-VQA: structured alignment of visual and semantic representations for visual question answering
    Peixi Xiong ,  Quanzeng You ,  Pei Yu ,  Zicheng Liu ,  and  Ying Wu
    arXiv preprint arXiv:2201.10654, 2022
  4. Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
    Quanzeng You ,  Jiang Wang ,  Peng Chu ,  Andre Abrantes ,  and  Zicheng Liu
    arXiv preprint arXiv:2206.07011, 2022

2021

  1. 4D tracking utilizing depth data from multiple 3D cameras
    Hao Jiang ,  Quanzeng You ,  and  Zhengyou Zhang
    Jul 2021
    US Patent 11,062,469
  2. Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification
    Zhizheng Zhang ,  Cuiling Lan ,  Wenjun Zeng ,  Quanzeng You ,  Zicheng Liu ,  Kecheng Zheng ,  and  Zhibo Chen
    arXiv preprint arXiv:2103.13917, Jul 2021
  3. Writing by memorizing: Hierarchical retrieval-based medical report generation
    Xingyi Yang ,  Muchao Ye ,  Quanzeng You ,  and  Fenglong Ma
    arXiv preprint arXiv:2106.06471, Jul 2021

2020

  1. Double-layer conditional random fields model for human action recognition
    Tianliang Liu ,  Xiaodong Dong ,  Yanzhang Wang ,  Xiubin Dai ,  Quanzeng You ,  and  Jiebo Luo
    Signal Processing: Image Communication, Jul 2020
  2. Real-time 3d deep multi-camera tracking
    Quanzeng You ,  and  Hao Jiang
    arXiv preprint arXiv:2003.11753, Jul 2020
  3. A benchmark dataset for understandable medical language translation
    Junyu Luo ,  Zifei Zheng ,  Hanzhong Ye ,  Muchao Ye ,  Yaqing Wang ,  Quanzeng You ,  Cao Xiao ,  and  Fenglong Ma
    arXiv preprint arXiv:2012.02420, Jul 2020

2019

  1. Real-time multiple people hand localization in 4d point clouds
    Hao Jiang ,  and  Quanzeng You
    arXiv preprint arXiv:1903.01695, Jul 2019
  2. Action4d: Online action recognition in the crowd and clutter
    Quanzeng You ,  and  Hao Jiang
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , Jul 2019
  3. Sentiment recognition for short annotated GIFs using visual-textual fusion
    Tianliang Liu ,  Junwei Wan ,  Xiubin Dai ,  Feng Liu ,  Quanzeng You ,  and  Jiebo Luo
    IEEE Transactions on Multimedia, Jul 2019

2018

  1. Image captioning at will: A versatile scheme for effectively injecting sentiments into image descriptions
    Quanzeng You ,  Hailin Jin ,  and  Jiebo Luo
    arXiv preprint arXiv:1801.10121, Jul 2018
  2. Touch your heart: A tone-aware chatbot for customer care on social media
    Tianran Hu ,  Anbang Xu ,  Zhe Liu ,  Quanzeng You ,  Yufan Guo ,  Vibha Sinha ,  Jiebo Luo ,  and  Rama Akkiraju
    In Proceedings of the 2018 CHI conference on human factors in computing systems , Jul 2018
  3. End-to-End Convolutional Semantic Embeddings
    Quanzeng You ,  Zhengyou Zhang ,  and  Jiebo Luo
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , Jul 2018
  4. Action4d: Real-time action recognition in the crowd and clutter
    Quanzeng You ,  and  Hao Jiang
    arXiv preprint arXiv:1806.02424, Jul 2018
  5. Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM
    Yuxiao Chen* ,  Jianbo Yuan* ,  Quanzeng You ,  and  Jiebo Luo
    In ACM Multimedia Conference, Seoul, Korea , Jul 2018
  6. "Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention
    Tianlang Chen ,  Zhongping Zhang ,  Quanzeng You ,  Chen Fang ,  Zhaowen Wang ,  Hailin Jin ,  and  Jiebo Luo
    In ECCV 2018 , Jul 2018
  7. Risk Prediction on Electronic Health Records with Prior Medical Knowledge
    Fenglong Ma ,  Jing Gao ,  Qiuling Suo ,  Quanzeng You ,  Jing Zhou ,  and  Aidong Zhang
    In SIGKDD , Jul 2018
  8. Kame: Knowledge-based attention model for diagnosis prediction in healthcare
    Fenglong Ma ,  Quanzeng You ,  Houping Xiao ,  Radha Chitta ,  Jing Zhou ,  and  Jing Gao
    In Proceedings of the 27th ACM International Conference on Information and Knowledge Management , Jul 2018

2017

  1. Image-based appraisal of real estate properties
    Quanzeng You ,  Ran Pang ,  Liangliang Cao ,  and  Jiebo Luo
    IEEE transactions on multimedia, Jul 2017
  2. Visual Sentiment Analysis by Attending on Local Image Regions
    Quanzeng You ,  Hailin Jin ,  and  Jiebo Luo
    In Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17) , Jul 2017
  3. Cultural Diffusion and Trends in Facebook Photographs
    Quanzeng You ,  Darío García-García ,  Manohar Paluri ,  Jiebo Luo ,  and  Jungseock Joo
    In ICWSM , Jul 2017
  4. Semantic natural language vector space
    Zhaowen Wang ,  Quanzeng You ,  Hailin Jin ,  and  Chen Fang
    Oct 2017
    US Patent 9,792,534
  5. Image captioning with weak supervision
    Zhaowen Wang ,  Quanzeng You ,  Hailin Jin ,  and  Chen Fang
    Nov 2017
    US Patent 9,811,765
  6. Aesthetic quality assessment of photos with faces
    Weining Wang ,  Jiexiong Huang ,  Xiangmin Xu ,  Quanzeng You ,  and  Jiebo Luo
    In Image and Graphics: 9th International Conference, ICIG 2017, Shanghai, China, September 13-15, 2017, Revised Selected Papers, Part III 9 , Nov 2017
  7. Social multimedia sentiment analysis
    Jiebo Luo ,  Damian Borth ,  and  Quanzeng You
    In Proceedings of the 25th ACM international conference on Multimedia , Nov 2017
  8. When saliency meets sentiment: Understanding how image content invokes emotion and sentiment
    Honglin Zheng ,  Tianlang Chen ,  Quanzeng You ,  and  Jiebo Luo
    In 2017 IEEE International Conference on Image Processing (ICIP) , Nov 2017
  9. Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks
    Fenglong Ma ,  Radha Chitta ,  Jing Zhou ,  Quanzeng You ,  Tong Sun ,  and  Jing Gao
    In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining , Nov 2017

2016

  1. Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark
    Quanzeng You ,  Jiebo Luo ,  Hailin Jin ,  and  Jianchao Yang
    In The Thirtieth AAAI Conference on Artificial Intelligence (AAAI) , Nov 2016
  2. Cross-modality consistent regression for joint visual-textual sentiment analysis of social multimedia
    Quanzeng You ,  Jiebo Luo ,  Hailin Jin ,  and  Jianchao Yang
    In Proceedings of the Ninth ACM international conference on Web search and data mining , Nov 2016
  3. Image captioning with semantic attention
    Quanzeng You ,  Hailin Jin ,  Zhaowen Wang ,  Chen Fang ,  and  Jiebo Luo
    In CVPR 2016 , Nov 2016
  4. Voting with feet: who are leaving Hillary Clinton and Donald Trump
    Yu Wang ,  Yang Feng ,  Jiebo Luo ,  and  Xiyang Zhang
    In 2016 IEEE International Symposium on Multimedia (ISM) , Nov 2016
  5. User characteristic prediction using images posted in online social networks
    Quanzeng You ,  and  Sumit Bhatia
    Nov 2016
    US Patent 9,489,592
  6. Sampling for nyström extension-based spectral clustering: Incremental perspective and novel analysis
    Xianchao Zhang ,  Linlin Zong ,  Quanzeng You ,  and  Xing Yong
    ACM Transactions on Knowledge Discovery from Data (TKDD), Nov 2016
  7. Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications
    Quanzeng You
    In ACM MM (DS) , Nov 2016
  8. Adaptive greedy dictionary selection for web media summarization
    Yang Cong ,  Ji Liu ,  Gan Sun ,  Quanzeng You ,  Yuncheng Li ,  and  Jiebo Luo
    IEEE Transactions on Image Processing, Nov 2016
  9. The effect of pets on happiness: A data-driven approach via large-scale social media
    Yuchen Wu ,  Jianbo Yuan ,  Quanzeng You ,  and  Jiebo Luo
    In 2016 IEEE International Conference on Big Data (Big Data) , Nov 2016
  10. A picture tells a thousand words—About you! User interest profiling from user generated visual content
    Quanzeng You ,  Sumit Bhatia ,  and  Jiebo Luo
    Signal Processing, Nov 2016
  11. Robust visual-textual sentiment analysis: When attention meets tree-structured recursive neural networks
    Quanzeng You ,  Liangliang Cao ,  Hailin Jin ,  and  Jiebo Luo
    In Proceedings of the 24th ACM international conference on Multimedia , Nov 2016

2015

  1. Robust image sentiment analysis using progressively trained and domain transferred deep networks
    Quanzeng You ,  Jiebo Luo ,  Hailin Jin ,  and  Jianchao Yang
    In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas, USA , Nov 2015
  2. Snap n’ Shop: Visual Search-Based Mobile Shopping Made a Breeze by Machine and Crowd Intelligence
    Quanzeng You ,  Jianbo Yuan ,  Jiaqi Wang ,  Philip Guo ,  and  Jiebo Luo
    In IEEE International Conference on Semantic Computing , Nov 2015
  3. Sentiment analysis using social multimedia
    Jianbo Yuan ,  Quanzeng You ,  and  Jiebo Luo
    Multimedia Data Mining and Analytics: Disruptive Innovation, Nov 2015
  4. Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User’s Online Photo Collections
    Danning Zheng ,  Tianran Hu ,  Quanzeng You ,  and  Jiebo Luo
    In AAAI International Conference on Weblogs and Social Media (ICWSM) , Nov 2015
  5. Joint Visual-Textual Sentiment Analysis with Deep Neural Networks
    Quanzeng You ,  Jiebo Luo ,  Hailin Jin ,  and  Jianchao Yang
    In ACM Multimedia , Nov 2015
  6. A Multifaceted Approach to Social Multimedia-based Prediction of Elections
    Quanzeng You ,  Liangliang Cao ,  Yang Cong ,  Xianchao Zhang ,  and  Jiebo Luo
    IEEE Transactions on Multimedia, Nov 2015
  7. A Picture Tells a Thousand Words–About You! User Interest Profiling from User Generated Visual Content
    Quanzeng You ,  Sumit Bhatia ,  and  Jiebo Luo
    arXiv preprint arXiv:1504.04558, Nov 2015

2014

  1. Transit tomography using probabilistic time geography: planning routes without a road map
    Quanzeng You ,  and  John Krumm
    Journal of Location Based Services, Nov 2014
  2. The eyes of the beholder: Gender prediction using images posted in online social networks
    Quanzeng You ,  Sumit Bhatia ,  Tong Sun ,  and  Jiebo Luo
    In IEEE International Conference on Data Mining, Workshop on Social Multimedia Data Mining , Nov 2014
  3. Inferring home location from user’s photo collections based on visual content and mobility patterns
    Danning Zheng ,  Tianran Hu ,  Quanzeng You ,  Henry Kautz ,  and  Jiebo Luo
    In Proceedings of the 3rd ACM multimedia workshop on geotagging and its applications in multimedia , Nov 2014

2013

  1. Sentribute: image sentiment analysis from a mid-level perspective
    Jianbo Yuan ,  Sean Mcdonough ,  Quanzeng You ,  and  Jiebo Luo
    In Proceedings of the second international workshop on issues of sentiment discovery and opinion mining , Nov 2013
  2. Towards understanding the effectiveness of election related images in social media
    Junhuan Zhu ,  Jiebo Luo ,  Quanzeng You ,  and  John R Smith
    In 2013 IEEE 13th International Conference on Data Mining Workshops , Nov 2013
  3. Towards social imagematics: sentiment analysis in social multimedia
    Quanzeng You ,  and  Jiebo Luo
    In Proceedings of the thirteenth international workshop on multimedia data mining , Nov 2013
  4. Are there cultural differences in event driven information propagation over social media?
    Jianbo Yuan ,  Quanzeng You ,  and  Jiebo Luo
    In Proceedings of the 2nd international workshop on Socially-aware multimedia , Nov 2013

2011

  1. An improved spectral clustering algorithm based on random walk
    Xianchao Zhang ,  and  Quanzeng You
    Frontiers of Computer Science in China, Nov 2011
  2. Clusterability analysis and incremental sampling for nyström extension based spectral clustering
    Xianchao Zhang ,  and  Quanzeng You
    In 2011 IEEE 11th International Conference on Data Mining , Nov 2011