Papers (47)
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
ICLR 2024
1,128
citations
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
ICLR 2024
476
citations
Large Language Models Are Not Robust Multiple Choice Selectors
ICLR 2024
370
citations
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
CVPR 2025arXiv
44
citations
FlowIE: Efficient Image Enhancement via Rectified Flow
CVPR 2024
31
citations
LiDAR-based Person Re-identification
CVPR 2024
19
citations
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
CVPR 2024
16
citations
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding
ICCV 2025
16
citations
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering
CVPR 2025arXiv
10
citations
Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
AAAI 2025
6
citations
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
CVPR 2025
6
citations
Continuous Visual Autoregressive Generation via Score Maximization
ICML 2025
5
citations
Secret Lies in Color: Enhancing AI-Generated Images Detection with Color Distribution Analysis
CVPR 2025
4
citations
Path Choice Matters for Clear Attributions in Path Methods
ICLR 2024
4
citations
Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution
ECCV 2024arXiv
4
citations
A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets
ICCV 2025arXiv
3
citations
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
CVPR 2025
2
citations
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark
NeurIPS 2025arXiv
2
citations
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
ICCV 2025
1
citations
LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction
CVPR 2024
0
citations
SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction
CVPR 2024
0
citations
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
CVPR 2024
0
citations
Memory-based Adapters for Online 3D Scene Perception
CVPR 2024
0
citations
Towards Accurate Post-training Quantization for Diffusion Models
CVPR 2024
0
citations
Language Generation with Strictly Proper Scoring Rules
ICML 2024
0
citations
Exploring the Benefit of Activation Sparsity in Pre-training
ICML 2024
0
citations
On Prompt-Driven Safeguarding for Large Language Models
ICML 2024
0
citations
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
CVPR 2025
0
citations
Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind
ICML 2024
0
citations
EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models
CVPR 2025
0
citations
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
CVPR 2025
0
citations
Learning Counterfactually Decoupled Attention for Open-World Model Attribution
ICCV 2025
0
citations
EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Clients
ICCV 2025
0
citations
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
ICCV 2025
0
citations
WalkVLM: Aid Visually Impaired People Walking by Vision Language Model
ICCV 2025
0
citations
MCID: Multi-aspect Copyright Infringement Detection for Generated Images
ICCV 2025
0
citations
Authentic 4D Driving Simulation with a Video Generation Model
ICCV 2025
0
citations
SpectralAR: Spectral Autoregressive Visual Generation
ICCV 2025
0
citations
Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment
ICCV 2025
0
citations
From Imitation to Innovation: The Emergence of AI's Unique Artistic Styles and the Challenge of Copyright Protection
ICCV 2025
0
citations
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space
AAAI 2025
0
citations
Teaching Large Language Models to Translate with Comparison
AAAI 2024
0
citations
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA
AAAI 2024
0
citations
Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models
AAAI 2024
0
citations
Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Gronding
AAAI 2024
0
citations
Generative Multi-Modal Knowledge Retrieval with Large Language Models
AAAI 2024
0
citations
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
CVPR 2024
0
citations