Weijian Deng

PhD, 3D Generation & Modeling, ML Generalization

Email  •  CV  •  Google Scholar  •  GitHub  •  LinkedIn  •  PhD Thesis

Prospective Students

Student Recruitment

I am looking for highly motivated students interested in ambitious research at the intersection of multimodal intelligence, physical 3D vision, and diffusion models. The directions below are active starting points; I also welcome strong proposals that connect naturally to these themes.

01

Self-Evaluating and Self-Improving MLLMs

We extend my PhD work on unsupervised model evaluation to large multimodal models. The goal is to build AI agents that can understand and evaluate their own strategies, identify failure modes, and take the right actions to improve in new environments, starting with spatial intelligence.

02

Physical 3D Vision and World Models

Generated videos, geometry, and scenes should be useful, not merely plausible, so we study physical consistency, geometry, object behavior, world models, and aircraft-design applications. Physical 3D vision and world models ground self-improving AI agents in the physical world. These projects study how agents explore, perceive, understand, manipulate, and interact with 3D environments.

03

Understanding and Steering Diffusion Models

We analyze which properties of diffusion models lead to specific behaviors, then use those insights to make generation more useful. Example questions include which generated images improve generalization, how multi-view generation helps models understand the observed world, and how latent spaces can be explored or post-trained.

Before Contacting Me

Please prepare a concise email with the materials below. For PhD and postdoctoral positions, contact me at least one month before the relevant admission or scholarship deadline.

  • Academic transcript and CV.
  • A sample of research writing, such as a paper, thesis chapter, or technical report.
  • A proposed research topic aligned with my research interests.
  • A short proposal including the research question, relation to existing work, datasets or software required, and a timeline with dated milestones.
  • Evidence of strong mathematical and computational skills, Python programming, and familiarity with revision control.
  • For visiting PhD students: a description of your current work, publication list or Google Scholar profile, and funding plan.
Contact by email
News Recent highlights
2026 - Assistant Professor
  • Feb 2026 Paper One paper on latent spaces for diffusion models was accepted to CVPR 2026.
  • Jan 2026 Paper One paper on generated-image detection was accepted to ICLR 2026.
2023 - 2025 Postdoctoral Research
  • Nov 2025 Fellowship Selected as a DAAD AINeT Fellow, a fellowship for international researchers in Explainable AI.
  • Sep 2025 Paper One paper on robust object detection was accepted to NeurIPS 2025.
  • Jun 2025 Paper Two papers on object geometry and efficient image generation were accepted to ICCV 2025.
  • Jun 2025 Paper One paper on model reliability was accepted to IEEE TPAMI 2025 [Paper].
  • May 2025 Service Recognized by CVPR 2025 as an outstanding reviewer.
  • May 2025 Paper One paper on MLLM ranking was accepted to ICML 2025.
  • Mar 2025 Paper One paper on shape decomposition was accepted to SIGGRAPH 2025.
  • Feb 2025 Paper One paper on training-free 6-DoF pose estimation was accepted to CVPR 2025.
  • Nov 2024 Paper One paper on 3D modeling with LLMs was accepted to 3DV 2025 [Project].
  • Nov 2024 Service Recognized by NeurIPS 2024 as a top reviewer.
  • Oct 2024 Service Recognized by ACM MM 2024 as outstanding Area Chair. [My talk] on my PhD research.
  • Sep 2024 Paper One paper on model generalization prediction was accepted to NeurIPS 2024 [Paper].
  • May 2024 Paper One paper on unsupervised model ranking was accepted to TMLR 2024 [Paper].
  • May 2024 Paper One paper on calibration analysis for VLMs was accepted to ICML 2024 [Paper].
  • May 2024 Award Received ICML 2024 Early-Career Research Travel Award.
  • Mar 2024 Service Serving as an Action Editor for Transactions on Machine Learning Research.
  • Feb 2024 Paper One paper on 3D reconstruction was accepted to CVPR 2024.
  • Jan 2024 Program Joined HEX International Singapore's Youth Entrepreneurship Programs [Reflection].
  • Dec 2023 Teaching Taught Introduction to Computer Science at SDUW.
  • Nov 2023 Thesis My PhD thesis is available on ANU Open Research.
  • Nov 2023 Service Recognized by NeurIPS 2023 as a top reviewer.
  • Oct 2023 Paper One paper on novel view synthesis of refractive objects was accepted to WACV 2024 [Paper].
  • Sep 2023 Paper One paper on the robustness of visual foundation models was accepted to NeurIPS 2023 [Paper].
  • Jul 2023 Paper One paper on out-of-distribution predictive calibration was accepted to ICCV 2023 [Paper].
  • Apr 2023 Paper One paper on out-of-distribution generalization prediction was accepted to ICML 2023 [Paper].
  • Feb 2023 Paper One paper on dataset-level analysis was accepted to CVPR 2023 [Paper].
  • Jan 2023 Position Started the Research Fellow position.
2019 - 2022 PhD
  • Jan 2023 Thesis Submitted PhD thesis.
  • Dec 2022 Thesis Completed PhD oral presentation.
  • Oct 2022 Paper One paper on multi-task learning was accepted to WACV 2023 [Paper].
  • Oct 2022 Award Received NeurIPS 2022 Scholar Award.
  • Sep 2022 Paper One paper on model invariance and generalization was accepted to NeurIPS 2022 [Paper].
  • Jul 2022 Service Recognized by ICML 2022 as a top 10% reviewer.
  • Jun 2022 Event Organized the CVPR 2022 Tutorial on Evaluating Models Beyond the Textbook: Out-of-distribution and Without Labels.
  • May 2022 Paper One paper on fine-grained classification was accepted to IEEE TIP [Paper].
  • Dec 2021 Paper One paper on model decision understanding was accepted to IEEE TPAMI [Project].
  • May 2021 Paper One paper on generalization prediction was accepted to ICML 2021 [Paper, Project].
  • Mar 2021 Paper One paper on generalization prediction was accepted to CVPR 2021 [Paper, Project].
  • Jul-Sep 2020 Internship Summer intern at NEC Labs America.
  • Aug 2020 Event VisDA-2020 challenge concluded. Congratulations to the final teams.
  • Jun 2020 Service Recognized by ECCV 2020 as a top reviewer.
  • Jan 2020 Paper One paper was accepted to IEEE TCSVT [Paper].
  • Jul 2019 PhD Started the PhD program at The Australian National University, supported by the AGRTP scholarship.
  • Jun 2019 Degree Received M.Eng. from the University of the Chinese Academy of Sciences.
  • Jun 2019 Award Won 3rd place in vehicle re-identification at the CVPR 2019 AI-City Challenge [Paper, Code].
  • Jul-Nov 2018 Position Research assistant at Singapore University of Technology and Design (SUTD).
  • Mar 2018 Paper One paper was accepted to CVPR 2018 [Paper, Code].
  • Jul 2017 Paper One paper was accepted to ICCV 2017 [Paper, Code].
Research

My research focuses on Predicting Model Generalization, Monitoring Model Reliability, Enhancing Model Generalization, and 3D Modeling & Generation.

My current interests center on two connected directions. First, I study self-evaluating and self-improving AI agents that can understand and evaluate their own strategies, then choose appropriate actions to improve their behavior. Second, I study physical 3D vision and world models as the foundation for deploying such agents in the physical world, where they need to explore, perceive, understand, manipulate, and interact with 3D environments.

I. Predicting Model Generalization

PhD research topic. This line of work studies how deep neural networks interact with data and how their generalization can be estimated without human annotations. It develops criteria for predicting model resilience, identifying failure cases, and guiding future model training. These [slides] provide an overview of this research.

Ranked from within: Ranking large multimodal models for visual question answering without labels

Weijie Tu, Weijian Deng, Dylan Campbell, Yu Yao, Jiyang Zheng, Tom Gedeon, Tongliang Liu

ICML 2025 [ BibTex ]

MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

NeurIPS 2024 [Paper, BibTex]

What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon

TMLR 2024 [Paper, BibTex]

A Bag-of-Prototypes Representation for Dataset-Level Applications

Weijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng

CVPR 2023 [Paper, Code, BibTex, Poster]

Confidence and Dispersity Speak: Characterising Prediction Matrix for Unsupervised Accuracy Estimation

Weijian Deng, Yumin Suh, Stephen Gould, Liang Zheng

ICML 2023 [Paper, Slides, Poster, BibTex, Code ]

AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?

Weijian Deng, Liang Zheng

IEEE TPAMI 2022 [Project, BibTex, Paper]

What Does Rotation Prediction Tell Us about Classifier Accuracy under Varying Testing Environments?

Weijian Deng, Stephen Gould, Liang Zheng

ICML 2021 (Spotlight) [Paper, BibTex, Project, Slides, Poster]

Are Labels Always Necessary for Classifier Accuracy Evaluation?

Weijian Deng, Liang Zheng

CVPR 2021 [Paper, Project, BibTex, Poster, Slides]

II. Monitoring Model Reliability

Toward a Holistic Evaluation of Robustness in Clip Models

Weijie Tu, Weijian Deng, Tom Gedeon

IEEE TPAMI 2025 [Paper, BibTex, ]

An Empirical Study into What Matters for Calibrating Vision-Language Models

Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon

ICML 2024 [Paper, BibTex, ]

A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

Weijie Tu, Weijian Deng, Tom Gedeon

NeurIPS 2023 [Paper, BibTex, OpenReview ]

Adaptive Calibrator Ensemble: Navigating Test Set Difficulty in Out-of-Distribution Scenarios

Yuli Zou*, Weijian Deng* (equal contribution), Liang Zheng

ICCV 2023 [Paper, Poster, Code, BibTex ]

On the Strong Correlation Between Model Invariance and Generalization

Weijian Deng, Stephen Gould, Liang Zheng

NeurIPS 2022 (Spotlight) [Paper, OpenReview, Slides, Poster, BibTex]

III. 3D Modeling & Generation

Pos3R: 6D Pose Estimation for Unseen Objects Made Easy

Weijian Deng, Dylan Campbell, Chunyi Sun, Jiahao Zhang, Shubham Kanitkar, Matthew Shaffer, Stephen Gould

CVPR 2025 [ BibTex ]

Unsupervised Decomposition of 3D Shapes into Expressive and Editable Extruded Profile Primitives

Chunyi Sun, Junlin Han, Runjia Li, Weijian Deng, Dylan Campbell, Stephen Gould

ACM SIGGRAPH 2025[ BibTex ]

Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions

ZiYi Dong, Chengxing Zhou, Weijian Deng, Pengxu Wei, Xiangyang Ji, Liang Lin

ICCV 2025 [ BibTex ]

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Jiahao Zhang, Anoop Cherian, Cristian Rodriguez, Weijian Deng, Stephen Gould

ICCV 2025 [ BibTex ]

3D-GPT: Procedural 3D Modeling with Large Language Models

Chunyi Sun, Junlin Han,Weijian Deng, Xinlong Wang, Zishan Qin, Stephen Gould

3DV 2025 [ Project, BibTex ]

Ray Deformation Networks for Novel View Synthesis of Refractive Objects

Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew Shaffer, Stephen Gould

WACV 2024 [ Project, Paper, Poster, Slides, BibTex ]

Differentiable Neural Surface Refinement for Transparent Objects

Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew Shaffer, Stephen Gould

CVPR 2024 [ Project, Poster, Slides, BibTex ]

IV. Enhancing Model Generalization

Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy

Qing Zhao, Weijian Deng, Pengxu Wei, ZiYi Dong, Hannan Lu, Xiangyang Ji, Liang Lin

NeurIPS 2025 [ BibTex ]

Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification

Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao

CVPR 2018 [Paper, Code, BibTex, Poster]

Similarity-preserving Image-image Domain Adaptation for Person Re-identification

Weijian Deng, Liang Zheng, Qixiang Ye, Yi Yang, Jianbin Jiao

Arxiv 2019 [Paper]

Domain Alignment with Triplets

Weijian Deng, Liang Zheng, Jianbin Jiao

Arxiv 2019 [Paper]

Rethinking Triplet Loss for Domain Adaptation

Weijian Deng, Liang Zheng, Yifan Sun, Jianbin Jiao

IEEE TCSVT 2020 [Paper, BibTex]

(Journal version of "Domain alignment with triplets")

Fine-grained Classification via Categorical Memory Networks

Weijian Deng, Joshua Marsh, Stephen Gould, Liang Zheng

IEEE TIP 2022 [ BibTex, Paper ]

Split to Learn: Gradient Split for Multi-Task Human Image Analysis

Weijian Deng, Yumin Suh, Xiang Yu, Masoud Faraki, Liang Zheng, Manmohan Chandraker

WACV 2023 [ Paper, US Patent, Poster, Slides, BibTex ]

Ranking Models in Unlabeled New Environments

Xiaoxiao Sun, Yunzhong Hou, Weijian Deng, Hongdong Li, Liang Zheng

ICCV 2021 [Paper, Code, BibTex ]

SVDNet for Pedestrian Retrieval

Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang

ICCV 2017 (Spotlight) [Paper, Code, BibTex, Poster]

Academic Activity

Action Editor, Transactions on Machine Learning Research

Lecturer, "Introduction to Computer Science", SDUW (Joint ANU-SDUW Program, Winter Semester 2023)

ACM Multimedia Area Chair, 2024 & 2025

Reviewer: NeurIPS 2022-2025; ICML 2022-2025; ICLR 2022-2025; ICCV 2021, 2023, 2025; CVPR 2021-2025; ECCV 2020, 2024; ACM MM 2020-2023; IEEE-TPAMI; IEEE-TIP; IJCV

Co-organizer: ECCV 2020 Workshop on "Visual Domain Adaptation Challenge"

Co-organizer: CVPR 2022 Tutorial on "Evaluating Models Beyond the Textbook: Out-of-distribution and Without Labels"

Guest speaker: SUTD 2018/12 (image-image translation); ANU 2019/09 (SVDNet)

Awards & Honors

DAAD AINeT Fellow in Explainable AI, 2025

CVPR 2025 Outstanding Reviewer, 2025

NeurIPS 2024 Top Reviewer, 2024

ACM MM 2024 Outstanding Area Chair, 2024

NeurIPS 2023 Top Reviewer, 2023

ICML 2022 Top 10% Reviewer, 2022

ECCV 2020 Outstanding Reviewer, 2022

Australian Government Research Training Program (AGRTP) Scholarship, 2019-2023

The Third Place in Vehicle Re-identification track of CVPR 2019 AI-City Challenge, 2019

China National Scholarship (Master), 2018

China National Scholarship (Bachelor), 2014, 2015

Grants & Funds

Academic research grant from the Google PaliGemma Academic Program, 2024 - 2025

Academic research grant from the Google Cloud Research Credits Program, 2024 - 2025

ICML Early-Career Travel Fund, 2024

ANU Early-Career Travel Fund, 2024

ANU-SDUW Teaching Fellowship, 2023

NeurIPS 2022 Scholar Award, 2022