Qi Tian
156
Papers
2,469
Total Citations
Papers (156)
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
CVPR 2024
1,061
citations
ControlVideo: Training-free Controllable Text-to-video Generation
ICLR 2024
331
citations
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
CVPR 2024
241
citations
Bottom-Up Temporal Action Localization with Mutual Regularization
ECCV 2020
209
citations
Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization
ECCV 2020
181
citations
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
CVPR 2024
164
citations
Towards 3D Molecule-Text Interpretation in Language Models
ICLR 2024
73
citations
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
AAAI 2024 (arXiv)
47
citations
Improving Image Restoration through Removing Degradations in Textual Representations
CVPR 2024
45
citations
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
CVPR 2024
37
citations
LION: Implicit Vision Prompt Tuning
AAAI 2024 (arXiv)
35
citations
CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing
ECCV 2020
15
citations
C-CLIP: Multimodal Continual Learning for Vision-Language Model
ICLR 2025
13
citations
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
ICLR 2024
9
citations
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
ECCV 2024
5
citations
Boosting Segment Anything Model Towards Open-Vocabulary Learning
AAAI 2025
1
citations
METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models
ICCV 2025
1
citations
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
AAAI 2025
1
citations
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
0
citations
Deep Hashing via Discrepancy Minimization
CVPR 2018
0
citations
Learning Channel-Wise Interactions for Binary Convolutional Neural Networks
CVPR 2019
0
citations
Structural Relational Reasoning of Point Clouds
CVPR 2019
0
citations
Deep Fitting Degree Scoring Network for Monocular 3D Object Detection
CVPR 2019
0
citations
BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation
CVPR 2019
0
citations
Iterative Reorganization With Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning
CVPR 2019
0
citations
Variational Convolutional Neural Network Pruning
CVPR 2019
0
citations
Towards Visual Feature Translation
CVPR 2019 (arXiv)
0
citations
Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling
CVPR 2019
0
citations
Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition
CVPR 2019
0
citations
Deep Modular Co-Attention Networks for Visual Question Answering
CVPR 2019
0
citations
Learning to Learn Image Classifiers With Visual Analogy
CVPR 2019
0
citations
GhostNet: More Features From Cheap Operations
CVPR 2020 (arXiv)
0
citations
Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction
CVPR 2020 (arXiv)
0
citations
Unsupervised Person Re-Identification via Softened Similarity Learning
CVPR 2020 (arXiv)
0
citations
Frequency Domain Compact 3D Convolutional Neural Networks
CVPR 2020
0
citations
Polishing Decision-Based Adversarial Noise With a Customized Sampling
CVPR 2020
0
citations
Joint Demosaicing and Denoising With Self Guidance
CVPR 2020
0
citations
A Semi-Supervised Assessor of Neural Architectures
CVPR 2020 (arXiv)
0
citations
Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations
CVPR 2020 (arXiv)
0
citations
Learning to Select Base Classes for Few-Shot Classification
CVPR 2020 (arXiv)
0
citations
Creating Something From Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing
CVPR 2020 (arXiv)
0
citations
CARS: Continuous Evolution for Efficient Neural Architecture Search
CVPR 2020 (arXiv)
0
citations
AdderNet: Do We Really Need Multiplications in Deep Learning?
CVPR 2020 (arXiv)
0
citations
Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification
CVPR 2020
0
citations
Projection & Probability-Driven Black-Box Attack
CVPR 2020 (arXiv)
0
citations
Transformation GAN for Unsupervised Image Synthesis and Representation Learning
CVPR 2020
0
citations
Video Super-Resolution With Temporal Group Attention
CVPR 2020 (arXiv)
0
citations
FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification
CVPR 2020
0
citations
Rethinking Performance Estimation in Neural Architecture Search
CVPR 2020 (arXiv)
0
citations
Gradually Vanishing Bridge for Adversarial Domain Adaptation
CVPR 2020 (arXiv)
0
citations
Label Decoupling Framework for Salient Object Detection
CVPR 2020 (arXiv)
0
citations
Cross-Domain Detection via Graph-Induced Prototype Alignment
CVPR 2020 (arXiv)
0
citations
Learning Temporal Co-Attention Models for Unsupervised Video Action Localization
CVPR 2020
0
citations
Noise-Aware Fully Webly Supervised Object Detection
CVPR 2020
0
citations
Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio
CVPR 2020 (arXiv)
0
citations
CondenseNet V2: Sparse Feature Reactivation for Deep Networks
CVPR 2021 (arXiv)
0
citations
UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification
CVPR 2021 (arXiv)
0
citations
Towards Compact CNNs via Collaborative Compression
CVPR 2021 (arXiv)
0
citations
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation
CVPR 2021
0
citations
A Fourier-Based Framework for Domain Generalization
CVPR 2021 (arXiv)
0
citations
DATA: Domain-Aware and Task-Aware Self-Supervised Learning
CVPR 2022 (arXiv)
0
citations
HyperDet3D: Learning a Scene-Conditioned 3D Object Detector
CVPR 2022 (arXiv)
0
citations
Contextual Similarity Distillation for Asymmetric Image Retrieval
CVPR 2022
0
citations
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
CVPR 2022
0
citations
One-Bit Active Query With Contrastive Pairs
CVPR 2022
0
citations
Partial Class Activation Attention for Semantic Segmentation
CVPR 2022
0
citations
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
CVPR 2022
0
citations
DeeCap: Dynamic Early Exiting for Efficient Image Captioning
CVPR 2022
0
citations
Learning To Learn by Jointly Optimizing Neural Architecture and Weights
CVPR 2022
0
citations
Domain-Agnostic Prior for Transfer Semantic Segmentation
CVPR 2022 (arXiv)
0
citations
Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization
CVPR 2023 (arXiv)
0
citations
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator
CVPR 2023
0
citations
Adapting Shortcut With Normalizing Flow: An Efficient Tuning Framework for Visual Recognition
CVPR 2023
0
citations
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR 2023
0
citations
Integrally Pre-Trained Transformer Pyramid Networks
CVPR 2023 (arXiv)
0
citations
Federated Domain Generalization With Generalization Adjustment
CVPR 2023
0
citations
Visual Recognition by Request
CVPR 2023 (arXiv)
0
citations
RIDE: Reversal Invariant Descriptor Enhancement
ICCV 2015
0
citations
Scalable Person Re-Identification: A Benchmark
ICCV 2015
0
citations
Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification
ICCV 2015
0
citations
Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis
ICCV 2015
0
citations
Ensemble Diffusion for Retrieval
ICCV 2017
0
citations
SORT: Second-Order Response Transform for Visual Recognition
ICCV 2017 (arXiv)
0
citations
Pose-Driven Deep Convolutional Model for Person Re-Identification
ICCV 2017 (arXiv)
0
citations
Multimodal Gaussian Process Latent Variable Models With Harmonization
ICCV 2017
0
citations
Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation
ICCV 2019
0
citations
Multinomial Distribution Learning for Effective Neural Architecture Search
ICCV 2019
0
citations
Co-Evolutionary Compression for Unpaired Image Translation
ICCV 2019
0
citations
Accelerate CNN via Recursive Bayesian Pruning
ICCV 2019
0
citations
Data-Free Learning of Student Networks
ICCV 2019
0
citations
Global-Local Temporal Representations for Video Person Re-Identification
ICCV 2019
0
citations
Universal Perturbation Attack Against Image Retrieval
ICCV 2019
0
citations
CenterNet: Keypoint Triplets for Object Detection
ICCV 2019
0
citations
Dynamic Points Agglomeration for Hierarchical Point Sets Learning
ICCV 2019
0
citations
AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations
ICCV 2019
0
citations
Differentiable Convolution Search for Point Cloud Processing
ICCV 2021 (arXiv)
0
citations
Foreground Activation Maps for Weakly Supervised Object Localization
ICCV 2021
0
citations
Omni-GAN: On the Secrets of cGANs and Beyond
ICCV 2021
0
citations
Greedy Gradient Ensemble for Robust Visual Question Answering
ICCV 2021 (arXiv)
0
citations
Pixel Difference Networks for Efficient Edge Detection
ICCV 2021 (arXiv)
0
citations
Visformer: The Vision-Friendly Transformer
ICCV 2021 (arXiv)
0
citations
Divide and Conquer for Single-Frame Temporal Action Localization
ICCV 2021
0
citations
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner
CVPR 2025
0
citations
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
ICCV 2021
0
citations
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023 (arXiv)
0
citations
Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation
ICCV 2023 (arXiv)
0
citations
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023 (arXiv)
0
citations
USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation
ICCV 2023 (arXiv)
0
citations
Corner Proposal Network for Anchor-free, Two-stage Object Detection
ECCV 2020
0
citations
Circumventing Outliers of AutoAugment with Knowledge Distillation
ECCV 2020
0
citations
Social Adaptive Module for Weakly-supervised Group Activity Recognition
ECCV 2020
0
citations
Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision
ECCV 2020
0
citations
Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery
ECCV 2020
0
citations
Video Super-Resolution with Recurrent Structure-Detail Network
ECCV 2020
0
citations
Wavelet-Based Dual-Branch Network for Image Demoiréing
ECCV 2020
0
citations
API-Net: Robust Generative Classifier via a Single Discriminator
ECCV 2020
0
citations
Reinforced Axial Refinement Network for Monocular 3D Object Detection
ECCV 2020
0
citations
FTL: A universal framework for training low-bit DNNs via Feature Transfer
ECCV 2020
0
citations
Extract and Merge: Superpixel Segmentation with Regional Attributes
ECCV 2020
0
citations
Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction
ECCV 2022
0
citations
Cornerformer: Purifying Instances for Corner-Based Detectors
ECCV 2022
0
citations
TAPE: Task-Agnostic Prior Embedding for Image Restoration
ECCV 2022
0
citations
Active Pointly-Supervised Instance Segmentation
ECCV 2022
0
citations
A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining
ECCV 2022
0
citations
SdAE: Self-Distillated Masked Autoencoder
ECCV 2022
0
citations
Vibration-Based Uncertainty Estimation for Learning from Limited Supervision
ECCV 2022
0
citations
MVP: Multimodality-Guided Visual Pre-training
ECCV 2022
0
citations
Shape Self-Correction for Unsupervised Point Cloud Understanding
ICCV 2021
0
citations
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation
ICCV 2025
0
citations
Segment Any 3D Gaussians
AAAI 2025
0
citations
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
0
citations
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data
CVPR 2024
0
citations
OVMR: Open-Vocabulary Recognition with Multi-Modal References
CVPR 2024
0
citations
Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
CVPR 2024
0
citations
Query-Adaptive Late Fusion for Image Search and Person Re-Identification
CVPR 2015
0
citations
Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition
CVPR 2015
0
citations
InterActive: Inter-Layer Activeness Propagation
CVPR 2016
0
citations
Picking Deep Filter Responses for Fine-Grained Image Recognition
CVPR 2016
0
citations
Cascaded Interactional Targeting Network for Egocentric Video Analysis
CVPR 2016
0
citations
DisturbLabel: Regularizing CNN on the Loss Layer
CVPR 2016
0
citations
Person Re-Identification in the Wild
CVPR 2017 (arXiv)
0
citations
Scalable Person Re-Identification on Supervised Smoothed Manifold
CVPR 2017 (arXiv)
0
citations
Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description
CVPR 2017
0
citations
Person Transfer GAN to Bridge Domain Gap for Person Re-Identification
CVPR 2018 (arXiv)
0
citations
Zigzag Learning for Weakly Supervised Object Detection
CVPR 2018 (arXiv)
0
citations
Information Competing Process for Learning Diversified Representations
NeurIPS 2019
0
citations
One-bit Supervision for Image Classification
NeurIPS 2020
0
citations
Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs
NeurIPS 2020
0
citations
Rectifying the Shortcut Learning of Background for Few-Shot Learning
NeurIPS 2021
0
citations
Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
NeurIPS 2021
0
citations
Fine-Grained Semantically Aligned Vision-Language Pre-Training
NeurIPS 2022
0
citations
ConfounderGAN: Protecting Image Data Privacy with Causal Confounder
NeurIPS 2022
0
citations
Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
NeurIPS 2023
0
citations
Segment Anything in 3D with NeRFs
NeurIPS 2023
0
citations
AiluRus: A Scalable ViT Framework for Dense Prediction
NeurIPS 2023
0
citations
Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval
NeurIPS 2023
0
citations