Papers: 74
Total Citations: 2,570
h-index: 2

Papers (74)

DETRs Beat YOLOs on Real-time Object Detection

CVPR 2024
2,424
citations

ParCo: Part-Coordinating Text-to-Motion Synthesis

ECCV 2024
43
citations

Adversarial Diffusion Compression for Real-World Image Super-Resolution

CVPR 2025
25
citations

Hyperspectral Image Reconstruction via Combinatorial Embedding of Cross-Channel Spatio-Spectral Clues

AAAI 2024, arXiv
21
citations

Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning

ICLR 2025, arXiv
19
citations

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

ECCV 2024
13
citations

CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning

AAAI 2024, arXiv
12
citations

iSegMan: Interactive Segment-and-Manipulate 3D Gaussians

CVPR 2025
4
citations

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

ICML 2025
4
citations

DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering

ICCV 2025
2
citations

Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting

ICCV 2025
1
citations

Cross-View Graph Consistency Learning for Invariant Graph Representations

AAAI 2025
1
citations

Efficient Spiking Point Mamba for Point Cloud Analysis

ICCV 2025
1
citations

FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation

CVPR 2024
0
citations

Directed Graph Grammars for Sequence-based Learning

ICML 2025
0
citations

GPEN: Global Position Encoding Network for Enhanced Subgraph Representation Learning

ICML 2025
0
citations

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints

ICML 2024
0
citations

Representing Molecules as Random Walks Over Interpretable Grammars

ICML 2024
0
citations

Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model

CVPR 2017, arXiv
0
citations

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild

CVPR 2017, arXiv
0
citations

Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework

CVPR 2018, arXiv
0
citations

Light Field Spatial Super-Resolution via Deep Combinatorial Geometry Embedding and Structural Consistency Regularization

CVPR 2020, arXiv
0
citations

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification

CVPR 2020
0
citations

Learning a Weakly-Supervised Video Actor-Action Segmentation Model With a Wise Selection

CVPR 2020, arXiv
0
citations

CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning

CVPR 2021, arXiv
0
citations

Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification

CVPR 2021
0
citations

Geometry-Aware Guided Loss for Deep Crack Recognition

CVPR 2022
0
citations

Training-Free Transformer Architecture Search

CVPR 2022, arXiv
0
citations

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

CVPR 2022, arXiv
0
citations

ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation

CVPR 2023, arXiv
0
citations

A Unified Pyramid Recurrent Network for Video Frame Interpolation

CVPR 2023, arXiv
0
citations

Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation

CVPR 2023, arXiv
0
citations

Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

CVPR 2023, arXiv
0
citations

From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm

CVPR 2023, arXiv
0
citations

High-Frequency Stereo Matching Network

CVPR 2023
0
citations

Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation

CVPR 2023, arXiv
0
citations

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

CVPR 2023
0
citations

Attention on Attention for Image Captioning

ICCV 2019
0
citations

CDNet: Centripetal Direction Network for Nuclear Instance Segmentation

ICCV 2021
0
citations

ReCU: Reviving the Dead Weights in Binary Neural Networks

ICCV 2021, arXiv
0
citations

The Devil is in the Crack Orientation: A New Perspective for Crack Detection

ICCV 2023
0
citations

LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer Normalization

ICCV 2023
0
citations

Towards Real-World Burst Image Super-Resolution: Benchmark and Method

ICCV 2023
0
citations

TopoSeg: Topology-Aware Nuclear Instance Segmentation

ICCV 2023
0
citations

Learning to Distill Global Representation for Sparse-View CT

ICCV 2023, arXiv
0
citations

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

ICCV 2023, arXiv
0
citations

Deep Multiview Clustering by Contrasting Cluster Assignments

ICCV 2023, arXiv
0
citations

Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures

ECCV 2020
0
citations

Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning

ECCV 2022
0
citations

Locality Guidance for Improving Vision Transformers on Tiny Datasets

ECCV 2022
0
citations

When Active Learning Meets Implicit Semantic Data Augmentation

ECCV 2022
0
citations

NDF: Neural Deformable Fields for Dynamic Human Modelling

ECCV 2022
0
citations

Solving Most Systems of Random Quadratic Equations

NeurIPS 2017
0
citations

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

ICCV 2023, arXiv
0
citations

Temporal-aware Query Routing for Real-time Video Instance Segmentation

ICCV 2025
0
citations

CLEP: A Novel Contrastive Learning Method for Evolutionary Reentrancy Vulnerability Detection

AAAI 2025
0
citations

Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy

AAAI 2025
0
citations

Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification

AAAI 2025
0
citations

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

AAAI 2025
0
citations

DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs

AAAI 2025
0
citations

Attack-inspired Calibration Loss for Calibrating Crack Recognition

AAAI 2025
0
citations

Parallel Vertex Diffusion for Unified Visual Grounding

AAAI 2024, arXiv
0
citations

Practical Privacy-Preserving MLaaS: When Compressive Sensing Meets Generative Networks

AAAI 2024
0
citations

Secure Distributed Sparse Gaussian Process Models Using Multi-Key Homomorphic Encryption

AAAI 2024
0
citations

GraCo: Granularity-Controllable Interactive Segmentation

CVPR 2024
0
citations

Mind Marginal Non-Crack Regions: Clustering-Inspired Representation Learning for Crack Segmentation

CVPR 2024
0
citations

Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders

NeurIPS 2018
0
citations

Adaptively Aligned Image Captioning via Adaptive Attention Time

NeurIPS 2019
0
citations

Online Convex Optimization Over Erdos-Renyi Random Networks

NeurIPS 2020
0
citations

CentripetalText: An Efficient Text Instance Representation for Scene Text Detection

NeurIPS 2021
0
citations

Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

NeurIPS 2022
0
citations

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning

NeurIPS 2023
0
citations

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

ICML 2016
0
citations

DAG-GNN: DAG Structure Learning with Graph Neural Networks

ICML 2019
0
citations