This is a navigation window which overlays the main content of the page. Pressing the "X" at the top right corner of the modal will close the modal and bring you back to where you were on the page.
See posts about
This is a call to action window which overlays the main content of the page. Pressing the "X" at the top right corner of the modal will close the modal and bring you back to where you were on the page.
Let's Collaborate
This is a search window which overlays the main content of the page. Pressing the "X" at the top right corner of the modal will close the modal and bring you back to where you were on the page.
Assistant Professor, David Cheriton School of Computer Science, University of Waterloo
Canada CIFAR Artificial Intelligence Chair
Wenhu Chen is currently an assistant professor in the David R. Cheriton School of Computer Science at University of Waterloo. He obtained his PhD from the computing science department of University of California, Santa Barbara in 2021, and he spent a wonderful postdoctoral year at Google Research. His main research interests include natural language processing, large language models, vision-language interaction, image generation, etc.
Research Interests
Natural Language Processing
Multimodal Learning
Knowledge Reasoning and Grounding
Highlights
Canada CIFAR AI Chair in 2022
WACV best student paper honorable mention
UCSB CS Outstanding Dissertation Award
Tencent AI Gift Award
Publications
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Wenhu Chen and Xueguang Ma and Xinyi Wang and William W Cohen
Wenhu Chen and Hexiang Hu and Chitwan Saharia and William W Cohen
2022
Explanations from Large Language Models Make Small Reasoners Better
Shiyang Li and Jianshu Chen and Yelong Shen and Zhiyu Chen and Xinlu Zhang and Zekun Li and Hong Wang and Jing Qian and Baolin Peng and Yi Mao and Wenhu Chen and Xifeng Yan
2022
Large language models are few (1)-shot table reasoners
Wenhu Chen
2022
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text
Wenhu Chen and Hexiang Hu and Xi Chen and Pat Verga and William W Cohen
2022
Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering
Wenhu Chen and Pat Verga and Michiel de Jong and John Wieting and William Cohen
2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan and Pengda Qin and Yuhong Li and Hui Xue and Wenhu Chen
2022
HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data
Kai Nakamura and Sharon Levy and Yi-Lin Tuan and Wenhu Chen and William Yang Wang
2022
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Wenhu Chen and Hexiang Hu and Yandong Li and Nataniel Ruiz and Xuhui Jia and Ming-Wei Chang and William W Cohen
2023
DePlot: One-shot visual language reasoning by plot-to-table translation
Fangyu Liu and Julian Martin Eisenschlos and Francesco Piccinno and Syrine Krichene and Chenxi Pang and Kenton Lee and Mandar Joshi and Wenhu Chen and Nigel Collier and Yasemin Altun
2022
Controllable Dialogue Simulation with In-Context Learning
Zekun Li and Wenhu Chen and Shiyang Li and Hong Wang and Jing Qian and Xifeng Yan
2022
QA Is the New KR: Question-Answer Pairs as Knowledge Bases
Wenhu Chen and William W Cohen and Michiel De Jong and Nitish Gupta and Alessandro Presta and Pat Verga and John Wieting
2022
Using meta-information in neural machine translation
Evgeny Matusov and Wenhu Chen and Shahram Khadivi
2022
On the Risk of Misinformation Pollution with Large Language Models
Yikang Pan and Liangming Pan and Wenhu Chen and Preslav Nakov and Min-Yen Kan and William Yang Wang
2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Kai Zhang and Lingbo Mo and Wenhu Chen and Huan Sun and Yu Su
2023
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen and Ming Yin and Max Ku and Elaine Wan and Xueguang Ma and Jianyu Xu and Tony Xia and Xinyi Wang and Pan Lu
2023
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
LI Yizhi and Ruibin Yuan and Ge Zhang and Yinghao Ma and Xingran Chen and Hanzhi Yin and Chenghao Xiao and Chenghua Lin and Anton Ragni and Emmanouil Benetos and Norbert Gyenge and Roger Dannenberg and Ruibo Liu and Wenhu Chen and Gus Xia and Yemin Shi and Wenhao Huang and Zili Wang and Yike Guo and Jie Fu
2023
Few-shot In-context Learning for Knowledge Base Question Answering
Tianle Li and Xueguang Ma and Alex Zhuang and Yu Gu and Yu Su and Wenhu Chen
2023
DreamEdit: Subject-driven Image Editing
Tianle Li and Max Ku and Cong Wei and Wenhu Chen
2023
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Ruibin Yuan and Yinghao Ma and Yizhi Li and Ge Zhang and Xingran Chen and Hanzhi Yin and Le Zhuo and Yiqi Liu and Jiawen Huang and Zeyue Tian and Binyue Deng and Ningzhi Wang and Wenhu Chen and Gus Xia and Wei Xue and Si Liu and Shi Wang and Ruibo Liu and Yike Guo and Jie Fu
2023
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
Zihao Deng and Yinghao Ma and Yudong Liu and Rongchen Guo and Ge Zhang and Wenhu Chen and Wenhao Huang and Emmanouil Benetos
2023
EDIS: Entity-Driven Image Search over Multimodal Web Content
Siqi Liu and Weixi Feng and Wenhu Chen and William Yang Wang
2023
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue and Xingwei Qu and Ge Zhang and Yao Fu and Wenhao Huang and Huan Sun and Yu Su and Wenhu Chen
2023
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Xiang Yue and Yuansheng Ni and Kai Zhang and Tianyu Zheng and Ruoqi Liu and Ge Zhang and Samuel Stevens and Dongfu Jiang and Weiming Ren and Yuxuan Sun and Cong Wei and Botao Yu and Ruibin Yuan and Renliang Sun and Ming Yin and Boyuan Zheng and Zhenzhu Yang and Yibo Liu and Wenhao Huang and Huan Sun and Yu Su and Wenhu Chen
2023
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark
Ge Zhang and Xinrun Du and Bei Chen and Yiming Liang and Tongxu Luo and Tianyu Zheng and Kang Zhu and Yuyang Cheng and Chunpu Xu and Shuyue Guo and Haoran Zhang and Xingwei Qu and Junjie Wang and Ruibin Yuan and Yizhi Li and Zekun Wang and Yudong Liu and Yu-Hsuan Tsai and Fengji Zhang and Chenghua Lin and Wenhao Huang and Wenhu Chen and Jie Fu
2024
Interactive Natural Language Processing
Zekun Wang and Ge Zhang and Kexin Yang and Ning Shi and Wangchunshu Zhou and Shaochun Hao and Guangzheng Xiong and Yizhi Li and Mong Yuan Sim and Xiuying Chen and Qingqing Zhu and Zhenzhu Yang and Adam Nik and Qi Liu and Chenghua Lin and Shi Wang and Ruibo Liu and Wenhu Chen and Ke Xu and Dayiheng Liu and Yike Guo and Jie Fu
2023
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan and Li Dong and Shaohan Huang and Zhiliang Peng and Wenhu Chen and Furu Wei
2023
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Le Zhuo and Ruibin Yuan and Jiahao Pan and Yinghao Ma and Yizhi LI and Ge Zhang and Si Liu and Roger Dannenberg and Jie Fu and Chenghua Lin and Emmanouil Benetos and Wenhu Chen and Wei Xue and Yike Guo
2023
VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Max Ku and Dongfu Jiang and Cong Wei and Xiang Yue and Wenhu Chen
2023
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu and Kelvin CK Chan and Yu-Chuan Su and Wenhu Chen and Yandong Li and Kihyuk Sohn and Yang Zhao and Xue Ben and Boqing Gong and William Cohen and Ming-Wei Chang and Xuhui Jia
2024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Kai Zhang and Yi Luan and Hexiang Hu and Kenton Lee and Siyuan Qiao and Wenhu Chen and Yu Su and Ming-Wei Chang
2024
Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang and Alfonso Amayuelas and Kexun Zhang and Liangming Pan and Wenhu Chen and William Yang Wang
2024
Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering
Yubo Wang and Xueguang Ma and Wenhu Chen
2023
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks
Dongfu Jiang and Yishan Li and Ge Zhang and Wenhao Huang and Yuchen Lin and Wenhu Chen
2023
ImagenHub: Standardizing the evaluation of conditional image generation models
Max Ku and Tianle Li and Kai Zhang and Yujie Lu and Xingyu Fu and Wenwen Zhuang and Wenhu Chen
2023
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei and Yang Chen and Haonan Chen and Hexiang Hu and Ge Zhang and Jie Fu and Alan Ritter and Wenhu Chen
2023
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Tianyu Zheng and Ge Zhang and Tianhao Shen and Xueling Liu and Bill Yuchen Lin and Jie Fu and Wenhu Chen and Xiang Yue
2024
E^ 2-LLM: Efficient and Extreme Length Extension of Large Language Models
Jiaheng Liu and Zhiqi Bai and Yuanxing Zhang and Chenchen Zhang and Yu Zhang and Ge Zhang and Jiakai Wang and Haoran Que and Yukang Chen and Wenbo Su and Tiezheng Ge and Jie Fu and Wenhu Chen and Bo Zheng
2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
Ruibin Yuan and Hanfeng Lin and Yi Wang and Zeyue Tian and Shangda Wu and Tianhao Shen and Ge Zhang and Yuhang Wu and Cong Liu and Ziya Zhou and Ziyang Ma and Liumeng Xue and Ziyu Wang and Qin Liu and Tianyu Zheng and Yizhi Li and Yinghao Ma and Yiming Liang and Xiaowei Chi and Ruibo Liu and Zili Wang and Pengfei Li and Jingcheng Wu and Chenghua Lin and Qifeng Liu and Tao Jiang and Wenhao Huang and Wenhu Chen and Emmanouil Benetos and Jie Fu and Gus Xia and Roger Dannenberg and Wei Xue and Shiyin Kang and Yike Guo
2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren and Harry Yang and Ge Zhang and Cong Wei and Xinrun Du and Stephen Huang and Wenhu Chen
2024
Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation
Tianyu Zheng and Shuyue Guo and Xingwei Qu and Jiawei Guo and Weixu Zhang and Xinrun Du and Chenghua Lin and Wenhao Huang and Wenhu Chen and Jie Fu and Ge Zhang
2024
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Yuelin Bai and Xinrun Du and Yiming Liang and Yonggang Jin and Ziqiang Liu and Junting Zhou and Tianyu Zheng and Xincheng Zhang and Nuo Ma and Zekun Wang and Ruibin Yuan and Haihong Wu and Hongquan Lin and Wenhao Huang and Jiajun Zhang and Wenhu Chen and Chenghua Lin and Jie Fu and Min Yang and Shiwen Ni and Ge Zhang
2024
AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Max Ku and Cong Wei and Weiming Ren and Huan Yang and Wenhu Chen
2024
Reward Guided Latent Consistency Distillation
Jiachen Li and Weixi Feng and Wenhu Chen and William Yang Wang
2024
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning
Xingwei Qu and Yiming Liang and Yucheng Wang and Tianyu Zheng and Tommy Yue and Lei Ma and Stephen W Huang and Jiajun Zhang and Wenhu Chen and Chenghua Lin and Jie Fu and Ge Zhang
2024
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
Alex Zhuang and Ge Zhang and Tianyu Zheng and Xinrun Du and Junjie Wang and Weiming Ren and Stephen W Huang and Jie Fu and Xiang Yue and Wenhu Chen
2024
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
Siwei Wu and Yizhi Li and Kang Zhu and Ge Zhang and Yiming Liang and Kaijing Ma and Chenghao Xiao and Haoran Zhang and Bohao Yang and Wenhu Chen and Wenhao Huang and Noura Al Moubayed and Jie Fu and Chenghua Lin
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.