Qi Tian

156
Papers
2,469
Total Citations

Papers (156)

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

CVPR 2024
1,061
citations

ControlVideo: Training-free Controllable Text-to-video Generation

ICLR 2024
331
citations

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

CVPR 2024
241
citations

Bottom-Up Temporal Action Localization with Mutual Regularization

ECCV 2020
209
citations

Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization

ECCV 2020
181
citations

GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

CVPR 2024
164
citations

Towards 3D Molecule-Text Interpretation in Language Models

ICLR 2024
73
citations

LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection

AAAI 2024 (arXiv)
47
citations

Improving Image Restoration through Removing Degradations in Textual Representations

CVPR 2024
45
citations

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

CVPR 2024
37
citations

LION: Implicit Vision Prompt Tuning

AAAI 2024 (arXiv)
35
citations

CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing

ECCV 2020
15
citations

C-CLIP: Multimodal Continual Learning for Vision-Language Model

ICLR 2025
13
citations

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

ICLR 2024
9
citations

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

ECCV 2024
5
citations

Boosting Segment Anything Model Towards Open-Vocabulary Learning

AAAI 2025
1
citation

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

ICCV 2025
1
citation

Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration

AAAI 2025
1
citation

Multi-Cue Correlation Filters for Robust Visual Tracking

CVPR 2018
0
citations

Deep Hashing via Discrepancy Minimization

CVPR 2018
0
citations

Learning Channel-Wise Interactions for Binary Convolutional Neural Networks

CVPR 2019
0
citations

Structural Relational Reasoning of Point Clouds

CVPR 2019
0
citations

Deep Fitting Degree Scoring Network for Monocular 3D Object Detection

CVPR 2019
0
citations

BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation

CVPR 2019
0
citations

Iterative Reorganization With Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning

CVPR 2019
0
citations

Variational Convolutional Neural Network Pruning

CVPR 2019
0
citations

Towards Visual Feature Translation

CVPR 2019 (arXiv)
0
citations

Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling

CVPR 2019
0
citations

Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition

CVPR 2019
0
citations

Deep Modular Co-Attention Networks for Visual Question Answering

CVPR 2019
0
citations

Learning to Learn Image Classifiers With Visual Analogy

CVPR 2019
0
citations

GhostNet: More Features From Cheap Operations

CVPR 2020 (arXiv)
0
citations

Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction

CVPR 2020 (arXiv)
0
citations

Unsupervised Person Re-Identification via Softened Similarity Learning

CVPR 2020 (arXiv)
0
citations

Frequency Domain Compact 3D Convolutional Neural Networks

CVPR 2020
0
citations

Polishing Decision-Based Adversarial Noise With a Customized Sampling

CVPR 2020
0
citations

Joint Demosaicing and Denoising With Self Guidance

CVPR 2020
0
citations

A Semi-Supervised Assessor of Neural Architectures

CVPR 2020 (arXiv)
0
citations

Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations

CVPR 2020 (arXiv)
0
citations

Learning to Select Base Classes for Few-Shot Classification

CVPR 2020 (arXiv)
0
citations

Creating Something From Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing

CVPR 2020 (arXiv)
0
citations

CARS: Continuous Evolution for Efficient Neural Architecture Search

CVPR 2020 (arXiv)
0
citations

AdderNet: Do We Really Need Multiplications in Deep Learning?

CVPR 2020 (arXiv)
0
citations

Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification

CVPR 2020
0
citations

Projection & Probability-Driven Black-Box Attack

CVPR 2020 (arXiv)
0
citations

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

CVPR 2020
0
citations

Video Super-Resolution With Temporal Group Attention

CVPR 2020 (arXiv)
0
citations

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification

CVPR 2020
0
citations

Rethinking Performance Estimation in Neural Architecture Search

CVPR 2020 (arXiv)
0
citations

Gradually Vanishing Bridge for Adversarial Domain Adaptation

CVPR 2020 (arXiv)
0
citations

Label Decoupling Framework for Salient Object Detection

CVPR 2020 (arXiv)
0
citations

Cross-Domain Detection via Graph-Induced Prototype Alignment

CVPR 2020 (arXiv)
0
citations

Learning Temporal Co-Attention Models for Unsupervised Video Action Localization

CVPR 2020
0
citations

Noise-Aware Fully Webly Supervised Object Detection

CVPR 2020
0
citations

Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio

CVPR 2020 (arXiv)
0
citations

CondenseNet V2: Sparse Feature Reactivation for Deep Networks

CVPR 2021 (arXiv)
0
citations

UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification

CVPR 2021 (arXiv)
0
citations

Towards Compact CNNs via Collaborative Compression

CVPR 2021 (arXiv)
0
citations

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation

CVPR 2021
0
citations

A Fourier-Based Framework for Domain Generalization

CVPR 2021 (arXiv)
0
citations

DATA: Domain-Aware and Task-Aware Self-Supervised Learning

CVPR 2022 (arXiv)
0
citations

HyperDet3D: Learning a Scene-Conditioned 3D Object Detector

CVPR 2022 (arXiv)
0
citations

Contextual Similarity Distillation for Asymmetric Image Retrieval

CVPR 2022
0
citations

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

CVPR 2022
0
citations

One-Bit Active Query With Contrastive Pairs

CVPR 2022
0
citations

Partial Class Activation Attention for Semantic Segmentation

CVPR 2022
0
citations

Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks

CVPR 2022
0
citations

DeeCap: Dynamic Early Exiting for Efficient Image Captioning

CVPR 2022
0
citations

Learning To Learn by Jointly Optimizing Neural Architecture and Weights

CVPR 2022
0
citations

Domain-Agnostic Prior for Transfer Semantic Segmentation

CVPR 2022 (arXiv)
0
citations

Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization

CVPR 2023 (arXiv)
0
citations

Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator

CVPR 2023
0
citations

Adapting Shortcut With Normalizing Flow: An Efficient Tuning Framework for Visual Recognition

CVPR 2023
0
citations

Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training

CVPR 2023
0
citations

Integrally Pre-Trained Transformer Pyramid Networks

CVPR 2023 (arXiv)
0
citations

Federated Domain Generalization With Generalization Adjustment

CVPR 2023
0
citations

Visual Recognition by Request

CVPR 2023 (arXiv)
0
citations

RIDE: Reversal Invariant Descriptor Enhancement

ICCV 2015
0
citations

Scalable Person Re-Identification: A Benchmark

ICCV 2015
0
citations

Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification

ICCV 2015
0
citations

Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis

ICCV 2015
0
citations

Ensemble Diffusion for Retrieval

ICCV 2017
0
citations

SORT: Second-Order Response Transform for Visual Recognition

ICCV 2017 (arXiv)
0
citations

Pose-Driven Deep Convolutional Model for Person Re-Identification

ICCV 2017 (arXiv)
0
citations

Multimodal Gaussian Process Latent Variable Models With Harmonization

ICCV 2017
0
citations

Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation

ICCV 2019
0
citations

Multinomial Distribution Learning for Effective Neural Architecture Search

ICCV 2019
0
citations

Co-Evolutionary Compression for Unpaired Image Translation

ICCV 2019
0
citations

Accelerate CNN via Recursive Bayesian Pruning

ICCV 2019
0
citations

Data-Free Learning of Student Networks

ICCV 2019
0
citations

Global-Local Temporal Representations for Video Person Re-Identification

ICCV 2019
0
citations

Universal Perturbation Attack Against Image Retrieval

ICCV 2019
0
citations

CenterNet: Keypoint Triplets for Object Detection

ICCV 2019
0
citations

Dynamic Points Agglomeration for Hierarchical Point Sets Learning

ICCV 2019
0
citations

AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations

ICCV 2019
0
citations

Differentiable Convolution Search for Point Cloud Processing

ICCV 2021 (arXiv)
0
citations

Foreground Activation Maps for Weakly Supervised Object Localization

ICCV 2021
0
citations

Omni-GAN: On the Secrets of cGANs and Beyond

ICCV 2021
0
citations

Greedy Gradient Ensemble for Robust Visual Question Answering

ICCV 2021 (arXiv)
0
citations

Pixel Difference Networks for Efficient Edge Detection

ICCV 2021 (arXiv)
0
citations

Visformer: The Vision-Friendly Transformer

ICCV 2021 (arXiv)
0
citations

Divide and Conquer for Single-Frame Temporal Action Localization

ICCV 2021
0
citations

IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner

CVPR 2025
0
citations

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization

ICCV 2021
0
citations

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models

ICCV 2023 (arXiv)
0
citations

Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation

ICCV 2023 (arXiv)
0
citations

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation

ICCV 2023 (arXiv)
0
citations

USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation

ICCV 2023 (arXiv)
0
citations

Corner Proposal Network for Anchor-free, Two-stage Object Detection

ECCV 2020
0
citations

Circumventing Outliers of AutoAugment with Knowledge Distillation

ECCV 2020
0
citations

Social Adaptive Module for Weakly-supervised Group Activity Recognition

ECCV 2020
0
citations

Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision

ECCV 2020
0
citations

Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery

ECCV 2020
0
citations

Video Super-Resolution with Recurrent Structure-Detail Network

ECCV 2020
0
citations

Wavelet-Based Dual-Branch Network for Image Demoiréing

ECCV 2020
0
citations

API-Net: Robust Generative Classifier via a Single Discriminator

ECCV 2020
0
citations

Reinforced Axial Refinement Network for Monocular 3D Object Detection

ECCV 2020
0
citations

FTL: A universal framework for training low-bit DNNs via Feature Transfer

ECCV 2020
0
citations

Extract and Merge: Superpixel Segmentation with Regional Attributes

ECCV 2020
0
citations

Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction

ECCV 2022
0
citations

Cornerformer: Purifying Instances for Corner-Based Detectors

ECCV 2022
0
citations

TAPE: Task-Agnostic Prior Embedding for Image Restoration

ECCV 2022
0
citations

Active Pointly-Supervised Instance Segmentation

ECCV 2022
0
citations

A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining

ECCV 2022
0
citations

SdAE: Self-Distillated Masked Autoencoder

ECCV 2022
0
citations

Vibration-Based Uncertainty Estimation for Learning from Limited Supervision

ECCV 2022
0
citations

MVP: Multimodality-Guided Visual Pre-training

ECCV 2022
0
citations

Shape Self-Correction for Unsupervised Point Cloud Understanding

ICCV 2021
0
citations

CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation

ICCV 2025
0
citations

Segment Any 3D Gaussians

AAAI 2025
0
citations

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

AAAI 2025
0
citations

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

CVPR 2024
0
citations

OVMR: Open-Vocabulary Recognition with Multi-Modal References

CVPR 2024
0
citations

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model

CVPR 2024
0
citations

Query-Adaptive Late Fusion for Image Search and Person Re-Identification

CVPR 2015
0
citations

Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition

CVPR 2015
0
citations

InterActive: Inter-Layer Activeness Propagation

CVPR 2016
0
citations

Picking Deep Filter Responses for Fine-Grained Image Recognition

CVPR 2016
0
citations

Cascaded Interactional Targeting Network for Egocentric Video Analysis

CVPR 2016
0
citations

DisturbLabel: Regularizing CNN on the Loss Layer

CVPR 2016
0
citations

Person Re-Identification in the Wild

CVPR 2017 (arXiv)
0
citations

Scalable Person Re-Identification on Supervised Smoothed Manifold

CVPR 2017 (arXiv)
0
citations

Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description

CVPR 2017
0
citations

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification

CVPR 2018 (arXiv)
0
citations

Zigzag Learning for Weakly Supervised Object Detection

CVPR 2018 (arXiv)
0
citations

Information Competing Process for Learning Diversified Representations

NeurIPS 2019
0
citations

One-bit Supervision for Image Classification

NeurIPS 2020
0
citations

Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs

NeurIPS 2020
0
citations

Rectifying the Shortcut Learning of Background for Few-Shot Learning

NeurIPS 2021
0
citations

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

NeurIPS 2021
0
citations

Fine-Grained Semantically Aligned Vision-Language Pre-Training

NeurIPS 2022
0
citations

ConfounderGAN: Protecting Image Data Privacy with Causal Confounder

NeurIPS 2022
0
citations

Parameter-efficient Tuning of Large-scale Multimodal Foundation Model

NeurIPS 2023
0
citations

Segment Anything in 3D with NeRFs

NeurIPS 2023
0
citations

AiluRus: A Scalable ViT Framework for Dense Prediction

NeurIPS 2023
0
citations

Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval

NeurIPS 2023
0
citations