Dan Guo
20
Papers
235
Total Citations
Papers (20)
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
41
citations
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture
CVPR 2024
35
citations
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
28
citations
Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning
AAAI 2024
26
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024arXiv
24
citations
Dense Audio-Visual Event Localization Under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration
AAAI 2025
18
citations
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations
CVPR 2025
15
citations
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production
AAAI 2025
13
citations
Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing
AAAI 2025
13
citations
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
CVPR 2025
10
citations
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
AAAI 2025
6
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
6
citations
EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer
AAAI 2024
0
citations
Moderating the Generalization of Score-based Generative Model
ICCV 2025
0
citations
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking
AAAI 2024
0
citations
Towards Open-Vocabulary Audio-Visual Event Localization
CVPR 2025
0
citations
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
AAAI 2025
0
citations
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI 2025
0
citations
Data-Free Quantization via Pseudo-label Filtering
CVPR 2024
0
citations
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
0
citations