Zihao Wang

25
Papers
55
Total Citations

Papers (25)

Where am I? Cross-View Geo-localization with Natural Language Descriptions

ICCV 2025
16
citations

ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting

CVPR 2025
11
citations

ACE: Anti-Editing Concept Erasure in Text-to-Image Models

CVPR 2025
8
citations

NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning

AAAI 2024
7
citations

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

ICCV 2025
6
citations

Learning Hierarchical Polynomials with Three-Layer Neural Networks

ICLR 2024
5
citations

Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception

AAAI 2025
2
citations

Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing

CVPR 2019
0
citations

OnePose: One-Shot Object Pose Estimation Without CAD Models

CVPR 2022
0
citations

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction

CVPR 2023
0
citations

Learning Transformation-Predictive Representations for Detection and Description of Local Features

CVPR 2023
0
citations

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval

ICCV 2019
0
citations

Weakly-supervised 3D Shape Completion in the Wild

ECCV 2020
0
citations

Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap

ECCV 2022
0
citations

Transforming and Combining Rewards for Aligning Large Language Models

ICML 2024
0
citations

Open-World Skill Discovery from Unsegmented Demonstration Videos

ICCV 2025
0
citations

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds

AAAI 2025
0
citations

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance

AAAI 2025
0
citations

ProAgent: Building Proactive Cooperative Agents with Large Language Models

AAAI 2024
0
citations

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image

AAAI 2024
0
citations

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

ICML 2024
0
citations

Posterior Collapse of a Linear Latent Variable Model

NeurIPS 2022
0
citations

Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents

NeurIPS 2023
0
citations

Concept Algebra for (Score-Based) Text-Controlled Generative Models

NeurIPS 2023
0
citations

Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks

NeurIPS 2023
0
citations