Mengdi Wang
17
Papers
115
Total Citations
Papers (17)
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
ICLR 2025
46
citations
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
NeurIPS 2025
20
citations
Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models
NeurIPS 2025
19
citations
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models
ICML 2025
15
citations
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
NeurIPS 2025
10
citations
Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
ICLR 2025
3
citations
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
AAAI 2024
2
citations
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
ICML 2024
0
citations
MaxMin-RLHF: Alignment with Diverse Human Preferences
ICML 2024
0
citations
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
CVPR 2025
0
citations
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
ICML 2024
0
citations
Preacher: Paper-to-Video Agentic System
ICCV 2025
0
citations
TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients
AAAI 2024
0
citations
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
ICML 2024
0
citations
Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling
ICML 2024
0
citations
Information-Directed Pessimism for Offline Reinforcement Learning
ICML 2024
0
citations
Theoretical insights for diffusion guidance: A case study for Gaussian mixture models
ICML 2024
0
citations