Wanli Ouyang

38
Papers
1,429
Total Citations
1
Affiliations

Affiliations

The University of Sydney

Papers (38)

WorldSimBench: Towards Video Generation Models as World Simulators

ICML 2025
806
citations

DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior

ECCV 2024arXiv
279
citations

Improving Video Generation with Human Feedback

NeurIPS 2025
106
citations

Point Cloud Pre-training with Diffusion Models

CVPR 2024
59
citations

HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction

ICLR 2025
34
citations

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

CVPR 2025arXiv
30
citations

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

AAAI 2024arXiv
25
citations

ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems

CVPR 2025
15
citations

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

CVPR 2024
14
citations

WeatherGFM: Learning a Weather Generalist Foundation Model via In-context Learning

ICLR 2025
9
citations

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

ECCV 2024arXiv
9
citations

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

AAAI 2024arXiv
9
citations

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

NeurIPS 2025
7
citations

PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling

ICLR 2025arXiv
7
citations

Boosting Residual Networks with Group Knowledge

AAAI 2024arXiv
6
citations

MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

NeurIPS 2025
3
citations

SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation Learning

NeurIPS 2025arXiv
2
citations

Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis

AAAI 2025
2
citations

LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents

NeurIPS 2025arXiv
2
citations

CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation

ICCV 2025
2
citations

scMRDR: A scalable and flexible framework for unpaired single-cell multi-omics data integration

NeurIPS 2025
2
citations

GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction

AAAI 2025
1
citations

Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions

CVPR 2024
0
citations

Taming Stable Diffusion for Text to 360 Panorama Image Generation

CVPR 2024
0
citations

Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution

CVPR 2025
0
citations

Neuro-3D: Towards 3D Visual Decoding from EEG Signals

CVPR 2025
0
citations

CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

ICML 2024
0
citations

FiT: Flexible Vision Transformer for Diffusion Model

ICML 2024
0
citations

Towards a Self-contained Data-driven Global Weather Forecasting Framework

ICML 2024
0
citations

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

ICCV 2025
0
citations

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

AAAI 2025
0
citations

Frozen CLIP Transformer Is an Efficient Point Cloud Encoder

AAAI 2024
0
citations

EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds

ICCV 2025
0
citations

ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide Sequencing

AAAI 2024
0
citations

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

ICCV 2025
0
citations

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

CVPR 2025
0
citations

UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

CVPR 2024
0
citations

Point Transformer V3: Simpler Faster Stronger

CVPR 2024
0
citations