Zihao Wang
25
Papers
55
Total Citations
Papers (25)
Where am I? Cross-View Geo-localization with Natural Language Descriptions
ICCV 2025
16
citations
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting
CVPR 2025
11
citations
ACE: Anti-Editing Concept Erasure in Text-to-Image Models
CVPR 2025
8
citations
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
AAAI 2024arXiv
7
citations
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
6
citations
Learning Hierarchical Polynomials with Three-Layer Neural Networks
ICLR 2024
5
citations
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
AAAI 2025
2
citations
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
CVPR 2019
0
citations
OnePose: One-Shot Object Pose Estimation Without CAD Models
CVPR 2022arXiv
0
citations
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
CVPR 2023arXiv
0
citations
Learning Transformation-Predictive Representations for Detection and Description of Local Features
CVPR 2023
0
citations
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
ICCV 2019
0
citations
Weakly-supervised 3D Shape Completion in the Wild
ECCV 2020
0
citations
Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap
ECCV 2022
0
citations
Transforming and Combining Rewards for Aligning Large Language Models
ICML 2024
0
citations
Open-World Skill Discovery from Unsegmented Demonstration Videos
ICCV 2025
0
citations
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
AAAI 2025
0
citations
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
AAAI 2025
0
citations
ProAgent: Building Proactive Cooperative Agents with Large Language Models
AAAI 2024
0
citations
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image
AAAI 2024
0
citations
Selecting Large Language Model to Fine-tune via Rectified Scaling Law
ICML 2024
0
citations
Posterior Collapse of a Linear Latent Variable Model
NeurIPS 2022
0
citations
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents
NeurIPS 2023
0
citations
Concept Algebra for (Score-Based) Text-Controlled Generative Models
NeurIPS 2023
0
citations
Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks
NeurIPS 2023
0
citations