Qi Chen


I am currently a postdoctoral researcher at the Australian Institute for Machine Learning (AIML), the University of Adelaide, working with Prof. Anton van den Hengel and A/Prof. Qi Wu. I received my Ph.D. in computer science from the University of Adelaide in 2024, supervised by A/Prof. Qi Wu and Asst. Prof. Yuankai Qi. Prior to my Ph.D., I completed my master’s degree at South China University of Technology, under the supervision of Prof. Jian Chen and Prof. Mingkui Tan.

My research focuses mainly on Controllable Generative AI for Multi-modality, (Multimodal) Large Language Models (LLMs), and Multimodal AI for Real-world Applications/Domains (e.g., Medicine, Architecture, and the Internet). I have over 20 peer-reviewed publications, most in flagship journals and conference proceedings, including IEEE-TPAMI/TIP/TMM, CVPR, NeurIPS, and ICCV. My research has attracted over 1,100 citations with an h-index of 14 (Google Scholar). I also serve as a reviewer for top-tier journals and conferences, including Nature Communications, IEEE-TPAMI, IJCV, CVPR, ICML, NeurIPS, ICLR, ICCV, and ECCV.

🌟🌟 I am currently on the job market (Assistant Prof. or Research Scientist). 🌟🌟

news

Sep 26, 2024 One paper is accepted by NeurIPS 2024!
Sep 20, 2024 One paper is accepted by ACCV 2024 (Oral)!
May 12, 2024 One paper is accepted by TPAMI!
Feb 27, 2024 Two papers are accepted by CVPR 2024!
Dec 19, 2023 One paper is accepted by AAAI 2024!
Jul 18, 2023 One paper is accepted by ICCV 2023!
Sep 15, 2022 One paper is accepted by NeurIPS 2022!
Mar 03, 2022 One paper is accepted by CVPR 2022!

selected publications

  1. NeurIPS
    Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
    Qi Chen, Bowen Zhang, Gang Wang, and Qi Wu
    In Conference on Neural Information Processing Systems (NeurIPS), 2024
  2. ACCV
    Act Like a Radiologist: Radiology Report Generation across Anatomical Regions
    Qi Chen, Yutong Xie, Biao Wu, Xiaomin Chen, James Ang, Minh-Son To, Xiaojun Chang, and Qi Wu
    In Asian Conference on Computer Vision (ACCV) (Oral), 2024
  3. TPAMI
    Towards lightweight super-resolution with dual regression learning
    Yong Guo, Jingdong Wang, Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, Jian Chen, and Mingkui Tan
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
  4. CVPR
    G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images
    Zixiong Huang*, Qi Chen*, Libo Sun, Yifan Yang, Naizhou Wang, Qi Wu, and Mingkui Tan
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  5. CVPR
    PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
    Yutong Xie*, Qi Chen*, Sinuo Wang, Minh-Son To, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, and Qi Wu
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  6. AAAI
    Webvln: Vision-and-language navigation on websites
    Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, and Qi Wu
    In AAAI Conference on Artificial Intelligence (AAAI), 2024
  7. ICCV
    Prompt switch: Efficient clip adaptation for text-video retrieval
    Chaorui Deng*, Qi Chen*, Pengda Qin, Da Chen, and Qi Wu
    In International Conference on Computer Vision (ICCV), 2023
  8. NeurIPS
    Learning distinct and representative modes for image captioning
    Qi Chen, Chaorui Deng, and Qi Wu
    In Conference on Neural Information Processing Systems (NeurIPS), 2022
  9. CVPR
    V2C: Visual voice cloning
    Qi Chen, Mingkui Tan, Yuankai Qi, Jiaqiu Zhou, Yuanqing Li, and Qi Wu
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  10. ACM MM
    R-GAN: Exploring human-like way for reasonable text-to-image synthesis via generative adversarial networks
    Yanyuan Qiao, Qi Chen, Chaorui Deng, Ning Ding, Yuankai Qi, Mingkui Tan, Xincheng Ren, and Qi Wu
    In ACM International Conference on Multimedia (ACM MM), 2021
  11. TPAMI
    Towards accurate and compact architectures via neural architecture transformer
    Yong Guo, Yin Zheng, Mingkui Tan, Qi Chen, Zhipeng Li, Jian Chen, Peilin Zhao, and Junzhou Huang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
  12. CVPR
    Contrastive neural architecture search with neural architecture comparators
    Yaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, and Mingkui Tan
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2021
  13. ACM MM
    Dynamic extension nets for few-shot semantic segmentation
    Lizhao Liu*, Qi Chen*, Junyi Cao, Minqian Liu, Yong Guo, and Mingkui Tan
    In ACM International Conference on Multimedia (ACM MM), 2020
  14. TIP
    Scripted video generation with a bottom-up generative adversarial network
    Qi Chen, Qi Wu, Jian Chen, Qingyao Wu, Anton van den Hengel, and Mingkui Tan
    IEEE Transactions on Image Processing (TIP), 2020
  15. CVPR
    Closed-loop matters: Dual regression networks for single image super-resolution
    Yong Guo, Jian Chen, Jingdong Wang, Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, and Mingkui Tan
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2020
  16. CVPR
    Intelligent home 3d: Automatic 3d-house design from linguistic descriptions only
    Qi Chen, Qi Wu, Rui Tang, Yuhan Wang, Shuai Wang, and Mingkui Tan
    In Conference on Computer Vision and Pattern Recognition (CVPR), 2020
  17. ACCV
    Modular graph attention network for complex visual relational reasoning
    Yihan Zheng, Zhiquan Wen, Mingkui Tan, Runhao Zeng, Qi Chen, Yaowei Wang, and Qi Wu
    In Asian Conference on Computer Vision (ACCV), 2020
  18. NeurIPS
    Nat: Neural architecture transformer for accurate and compact architectures
    Yong Guo, Yin Zheng, Mingkui Tan, Qi Chen, Jian Chen, Peilin Zhao, and Junzhou Huang
    In Conference on Neural Information Processing Systems (NeurIPS), 2019
  19. TMM
    Auto-embedding generative adversarial networks for high resolution image synthesis
    Yong Guo*, Qi Chen*, Jian Chen, Qingyao Wu, Qinfeng Shi, and Mingkui Tan
    IEEE Transactions on Multimedia (TMM), 2019