Jie Zhou

47
Papers
2,147
Total Citations
1
Affiliations

Affiliations

Tencent Inc.

Papers (47)

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

ICLR 2024
1,128
citations

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

ICLR 2024
476
citations

Large Language Models Are Not Robust Multiple Choice Selectors

ICLR 2024
370
citations

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

CVPR 2025arXiv
44
citations

FlowIE: Efficient Image Enhancement via Rectified Flow

CVPR 2024
31
citations

LiDAR-based Person Re-identification

CVPR 2024
19
citations

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

CVPR 2024
16
citations

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

ICCV 2025
16
citations

CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering

CVPR 2025arXiv
10
citations

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

AAAI 2025
6
citations

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

CVPR 2025
6
citations

Continuous Visual Autoregressive Generation via Score Maximization

ICML 2025
5
citations

Secret Lies in Color: Enhancing AI-Generated Images Detection with Color Distribution Analysis

CVPR 2025
4
citations

Path Choice Matters for Clear Attributions in Path Methods

ICLR 2024
4
citations

Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

ECCV 2024arXiv
4
citations

A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets

ICCV 2025arXiv
3
citations

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

CVPR 2025
2
citations

Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark

NeurIPS 2025arXiv
2
citations

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

ICCV 2025
1
citations

LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction

CVPR 2024
0
citations

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

CVPR 2024
0
citations

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

CVPR 2024
0
citations

Memory-based Adapters for Online 3D Scene Perception

CVPR 2024
0
citations

Towards Accurate Post-training Quantization for Diffusion Models

CVPR 2024
0
citations

Language Generation with Strictly Proper Scoring Rules

ICML 2024
0
citations

Exploring the Benefit of Activation Sparsity in Pre-training

ICML 2024
0
citations

On Prompt-Driven Safeguarding for Large Language Models

ICML 2024
0
citations

Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding

CVPR 2025
0
citations

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

ICML 2024
0
citations

EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models

CVPR 2025
0
citations

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

CVPR 2025
0
citations

Learning Counterfactually Decoupled Attention for Open-World Model Attribution

ICCV 2025
0
citations

EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Clients

ICCV 2025
0
citations

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

ICCV 2025
0
citations

WalkVLM: Aid Visually Impaired People Walking by Vision Language Model

ICCV 2025
0
citations

MCID: Multi-aspect Copyright Infringement Detection for Generated Images

ICCV 2025
0
citations

Authentic 4D Driving Simulation with a Video Generation Model

ICCV 2025
0
citations

SpectralAR: Spectral Autoregressive Visual Generation

ICCV 2025
0
citations

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

ICCV 2025
0
citations

From Imitation to Innovation: The Emergence of AI's Unique Artistic Styles and the Challenge of Copyright Protection

ICCV 2025
0
citations

Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space

AAAI 2025
0
citations

Teaching Large Language Models to Translate with Comparison

AAAI 2024
0
citations

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

AAAI 2024
0
citations

Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models

AAAI 2024
0
citations

Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Gronding

AAAI 2024
0
citations

Generative Multi-Modal Knowledge Retrieval with Large Language Models

AAAI 2024
0
citations

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

CVPR 2024
0
citations