Dan Guo

20
Papers
235
Total Citations

Papers (20)

Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition

AAAI 2025
41
citations

Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture

CVPR 2024
35
citations

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

CVPR 2025
28
citations

Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning

AAAI 2024
26
citations

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering

AAAI 2024, arXiv
24
citations

Dense Audio-Visual Event Localization Under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration

AAAI 2025
18
citations

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations

CVPR 2025
15
citations

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production

AAAI 2025
13
citations

Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing

AAAI 2025
13
citations

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding

CVPR 2025
10
citations

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

AAAI 2025
6
citations

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

AAAI 2025
6
citations

EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer

AAAI 2024
0
citations

Moderating the Generalization of Score-based Generative Model

ICCV 2025
0
citations

KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

AAAI 2024
0
citations

Towards Open-Vocabulary Audio-Visual Event Localization

CVPR 2025
0
citations

PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement

AAAI 2025
0
citations

Patch-level Sounding Object Tracking for Audio-Visual Question Answering

AAAI 2025
0
citations

Data-Free Quantization via Pseudo-label Filtering

CVPR 2024
0
citations

MMAD: Multi-label Micro-Action Detection in Videos

ICCV 2025
0
citations