Mengdi Wang

17
Papers
115
Total Citations

Papers (17)

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

ICLR 2025
46
citations

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

NeurIPS 2025
20
citations

Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models

NeurIPS 2025
19
citations

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

ICML 2025
15
citations

Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models

NeurIPS 2025
10
citations

Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data

ICLR 2025
3
citations

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

AAAI 2024arXiv
2
citations

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

ICML 2024
0
citations

MaxMin-RLHF: Alignment with Diverse Human Preferences

ICML 2024
0
citations

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

CVPR 2025
0
citations

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

ICML 2024
0
citations

Preacher: Paper-to-Video Agentic System

ICCV 2025
0
citations

TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients

AAAI 2024
0
citations

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective

ICML 2024
0
citations

Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling

ICML 2024
0
citations

Information-Directed Pessimism for Offline Reinforcement Learning

ICML 2024
0
citations

Theoretical insights for diffusion guidance: A case study for Gaussian mixture models

ICML 2024
0
citations