Qi Zhang

74
Papers
279
Total Citations

Papers (74)

FINER: Flexible Spectral-bias Tuning in Implicit NEural Representation by Variable-periodic Activation Functions

CVPR 2024
66
citations

Frequency Spectrum Is More Effective for Multimodal Representation and Fusion: A Multimodal Spectrum Rumor Detector

AAAI 2024 (arXiv)
38
citations

TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution

CVPR 2025
37
citations

MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction

AAAI 2025
23
citations

Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

CVPR 2025 (arXiv)
23
citations

OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

ICLR 2025
21
citations

ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis

CVPR 2024
15
citations

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

AAAI 2025
13
citations

Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting

AAAI 2025
10
citations

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

AAAI 2025
5
citations

CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models

ICCV 2025
4
citations

Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness

ICLR 2025
4
citations

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

AAAI 2024 (arXiv)
4
citations

A Learning Error Analysis for Structured Prediction with Approximate Inference

NeurIPS 2017
3
citations

EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving

NeurIPS 2025
3
citations

Mitigating Ambiguities in 3D Classification with Gaussian Splatting

CVPR 2025
2
citations

Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training

ICCV 2025
2
citations

Position-Aware Guided Point Cloud Completion with CLIP Model

AAAI 2025
2
citations

Text Diffusion with Reinforced Conditioning

AAAI 2024 (arXiv)
2
citations

Wills Aligner: Multi-Subject Collaborative Brain Visual Decoding

AAAI 2025
1
citation

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection

AAAI 2025
1
citation

Ray-Space Projection Model for Light Field Camera

CVPR 2019
0
citations

Context-Aware Attention Network for Image-Text Retrieval

CVPR 2020
0
citations

Cross-View Cross-Scene Multi-View Crowd Counting

CVPR 2021
0
citations

FENeRF: Face Editing in Neural Radiance Fields

CVPR 2022 (arXiv)
0
citations

Hallucinated Neural Radiance Fields in the Wild

CVPR 2022 (arXiv)
0
citations

Deblur-NeRF: Neural Radiance Fields From Blurry Images

CVPR 2022
0
citations

Fine-Grained Face Swapping via Regional GAN Inversion

CVPR 2023 (arXiv)
0
citations

Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields

CVPR 2023 (arXiv)
0
citations

DINER: Disorder-Invariant Implicit Neural Representation

CVPR 2023 (arXiv)
0
citations

Wide-Angle Rectification via Content-Aware Conformal Mapping

CVPR 2023
0
citations

Local Implicit Ray Function for Generalizable Radiance Field Representation

CVPR 2023 (arXiv)
0
citations

Inverting the Imaging Process by Learning an Implicit Camera Model

CVPR 2023 (arXiv)
0
citations

UV Volumes for Real-Time Rendering of Editable Free-View Human Performance

CVPR 2023 (arXiv)
0
citations

VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching

ICCV 2023
0
citations

SLAN: Self-Locator Aided Network for Vision-Language Understanding

ICCV 2023
0
citations

Calibration-Free Multi-View Crowd Counting

ECCV 2022
0
citations

PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation

ECCV 2022 (arXiv)
0
citations

Neural Color Operators for Sequential Image Retouching

ECCV 2022
0
citations

Unifying Event Detection and Captioning as Sequence Generation via Pre-training

ECCV 2022
0
citations

HDR-NeRF: High Dynamic Range Neural Radiance Fields

CVPR 2022
0
citations

Generative Hard Example Augmentation for Semantic Point Cloud Segmentation

CVPR 2025
0
citations

SU-RGS: Relightable 3D Gaussian Splatting from Sparse Views under Unconstrained Illuminations

ICCV 2025
0
citations

BokehDiff: Neural Lens Blur with One-Step Diffusion

ICCV 2025
0
citations

SEMPO: Lightweight Foundation Models for Time Series Forecasting

NeurIPS 2025
0
citations

Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning

AAAI 2025
0
citations

COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism

AAAI 2025
0
citations

A Pre-convolved Representation for Plug-and-Play Neural Illumination Fields

AAAI 2024
0
citations

Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection

AAAI 2024
0
citations

LLMEval: A Preliminary Study on How to Evaluate Large Language Models

AAAI 2024
0
citations

GS-IR: 3D Gaussian Splatting for Inverse Rendering

CVPR 2024
0
citations

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation

CVPR 2024
0
citations

HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

CVPR 2024
0
citations

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

ICML 2024
0
citations

MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization

ICML 2024
0
citations

$\mathrm{E}(3)$-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

ICML 2024
0
citations

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

ICML 2024
0
citations

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

ICML 2024
0
citations

4D Light Field Superpixel and Segmentation

CVPR 2017
0
citations

Dynamic Feature Learning for Partial Face Recognition

CVPR 2018
0
citations

Wide-Area Crowd Counting via Ground-Plane Density Maps and Multi-View Fusion CNNs

CVPR 2019
0
citations

Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control

NeurIPS 2019
0
citations

Succinct and Robust Multi-Agent Communication With Temporal Message Control

NeurIPS 2020
0
citations

Spectral Temporal Graph Neural Network for Multivariate Time-series Forecasting

NeurIPS 2020
0
citations

Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models

NeurIPS 2022
0
citations

A Neural Corpus Indexer for Document Retrieval

NeurIPS 2022
0
citations

How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders

NeurIPS 2022
0
citations

A Comprehensive Study on Text-attributed Graphs: Benchmarking and Rethinking

NeurIPS 2023
0
citations

Model-enhanced Vector Index

NeurIPS 2023
0
citations

Identifiable Contrastive Learning with Automatic Feature Importance Discovery

NeurIPS 2023
0
citations

MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers

NeurIPS 2023
0
citations

FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective

NeurIPS 2023
0
citations

Frequency-domain MLPs are More Effective Learners in Time Series Forecasting

NeurIPS 2023
0
citations

$\ell_{1,p}$-Norm Regularization: Error Bounds and Convergence Rate Analysis of First-Order Methods

ICML 2015
0
citations