Jiaya Jia

128
Papers
1,266
Total Citations

Papers (128)

LISA: Reasoning Segmentation via Large Language Model

CVPR 2024
721
citations

Video-P2P: Video Editing with Cross-attention Control

CVPR 2024
309
citations

Visual Question Answering with Question Representation Update (QRU)

NeurIPS 2016
87
citations

Unified Language-driven Zero-shot Domain Adaptation

CVPR 2024
29
citations

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

CVPR 2024
25
citations

MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers

ICCV 2025
21
citations

Generative Video Propagation

CVPR 2025
20
citations

Image Inpainting via Iteratively Decoupled Probabilistic Modeling

ICLR 2024
17
citations

DreamOmni: Unified Image Generation and Editing

CVPR 2025
16
citations

Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?

ICCV 2025
11
citations

SaCo Loss: Sample-wise Affinity Consistency for Vision-Language Pre-training

CVPR 2024
10
citations

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

CVPR 2018
0
citations

Facelet-Bank for Fast Portrait Manipulation

CVPR 2018
0
citations

Referring Image Segmentation via Recurrent Refinement Networks

CVPR 2018
0
citations

Scale-Recurrent Network for Deep Image Deblurring

CVPR 2018
0
citations

Path Aggregation Network for Instance Segmentation

CVPR 2018
0
citations

Semi-Parametric Image Synthesis

CVPR 2018
0
citations

Wide-Context Semantic Image Extrapolation

CVPR 2019
0
citations

Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation

CVPR 2019
0
citations

Amodal Instance Segmentation With KINS Dataset

CVPR 2019
0
citations

Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections

CVPR 2019
0
citations

Associatively Segmenting Instances and Semantics in Point Clouds

CVPR 2019
0
citations

Learning Shape-Aware Embedding for Scene Text Detection

CVPR 2019
0
citations

PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing

CVPR 2019
0
citations

Underexposed Photo Enhancement Using Deep Illumination Estimation

CVPR 2019
0
citations

3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

CVPR 2019
0
citations

Semantic Component Decomposition for Face Attribute Manipulation

CVPR 2019
0
citations

Domain Adaptive Image-to-Image Translation

CVPR 2020
0
citations

Attentive Normalization for Conditional Image Generation

CVPR 2020
0
citations

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

CVPR 2020
0
citations

Exploring Self-Attention for Image Recognition

CVPR 2020
0
citations

DSGN: Deep Stereo Geometry Network for 3D Object Detection

CVPR 2020
0
citations

3DSSD: Point-Based 3D Single Stage Object Detector

CVPR 2020
0
citations

Distilling Knowledge via Knowledge Review

CVPR 2021
0
citations

Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency

CVPR 2021
0
citations

Jigsaw Clustering for Unsupervised Visual Representation Learning

CVPR 2021
0
citations

Self-Supervised 3D Mesh Reconstruction From Single Images

CVPR 2021
0
citations

Multi-Scale Aligned Distillation for Low-Resolution Detection

CVPR 2021
0
citations

Scale-Aware Automatic Augmentation for Object Detection

CVPR 2021
0
citations

Fully Convolutional Networks for Panoptic Segmentation

CVPR 2021
0
citations

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution

CVPR 2021
0
citations

Bidirectional Projection Network for Cross Dimension Scene Understanding

CVPR 2021
0
citations

Improving Calibration for Long-Tailed Recognition

CVPR 2021
0
citations

TWIST: Two-Way Inter-Label Self-Training for Semi-Supervised 3D Instance Segmentation

CVPR 2022
0
citations

EfficientNeRF: Efficient Neural Radiance Fields

CVPR 2022
0
citations

Voxel Field Fusion for 3D Object Detection

CVPR 2022
0
citations

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

CVPR 2022
0
citations

A Unified Query-Based Paradigm for Point Cloud Understanding

CVPR 2022
0
citations

Generalized Few-Shot Semantic Segmentation

CVPR 2022
0
citations

Video Frame Interpolation With Transformer

CVPR 2022
0
citations

Focal Sparse Convolutional Networks for 3D Object Detection

CVPR 2022
0
citations

Multi-View Transformer for 3D Visual Grounding

CVPR 2022
0
citations

High Quality Segmentation for Ultra High-Resolution Images

CVPR 2022
0
citations

SNR-Aware Low-Light Image Enhancement

CVPR 2022
0
citations

Stratified Transformer for 3D Point Cloud Segmentation

CVPR 2022
0
citations

Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need

CVPR 2023
0
citations

Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields

CVPR 2023
0
citations

Spherical Transformer for LiDAR-Based 3D Recognition

CVPR 2023
0
citations

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

CVPR 2023
0
citations

Understanding Imbalanced Semantic Segmentation Through Neural Collapse

CVPR 2023
0
citations

LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs

CVPR 2023
0
citations

TriVol: Point Cloud Rendering via Triple Volumes

CVPR 2023
0
citations

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization

CVPR 2023
0
citations

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation

CVPR 2023
0
citations

Command-Driven Articulated Object Understanding and Manipulation

CVPR 2023
0
citations

Video Super-Resolution via Deep Draft-Ensemble Learning

ICCV 2015
0
citations

Contour Box: Rejecting Object Proposals Without Explicit Closed Contours

ICCV 2015
0
citations

Box Aggregation for Proposal Decimation: Last Mile of Object Detection

ICCV 2015
0
citations

Semantic Segmentation With Object Clique Potential

ICCV 2015
0
citations

Understanding and Diagnosing Visual Tracking Systems

ICCV 2015
0
citations

Mutual-Structure for Joint Filtering

ICCV 2015
0
citations

Zero-Order Reverse Filtering

ICCV 2017
0
citations

Unsupervised Learning of Stereo Matching

ICCV 2017
0
citations

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits

ICCV 2017
0
citations

SGN: Sequential Grouping Networks for Instance Segmentation

ICCV 2017
0
citations

Situation Recognition With Graph Neural Networks

ICCV 2017
0
citations

Detail-Revealing Deep Video Super-Resolution

ICCV 2017
0
citations

Makeup-Go: Blind Reversion of Portrait Edit

ICCV 2017
0
citations

3D Graph Neural Networks for RGBD Semantic Segmentation

ICCV 2017
0
citations

STD: Sparse-to-Dense 3D Object Detector for Point Cloud

ICCV 2019
0
citations

Aggregation via Separation: Boosting Facial Landmark Detector With Semi-Supervised Style Translation

ICCV 2019
0
citations

AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation

ICCV 2019
0
citations

Attribute-Driven Spontaneous Motion in Unpaired Image Translation

ICCV 2019
0
citations

Fast and Practical Neural Architecture Search

ICCV 2019
0
citations

View Independent Generative Adversarial Network for Novel View Synthesis

ICCV 2019
0
citations

VisionZip: Longer is Better but Not Necessary in Vision Language Models

CVPR 2025
0
citations

Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation

ICCV 2019
0
citations

Image Synthesis via Semantic Composition

ICCV 2021
0
citations

Guided Point Contrastive Learning for Semi-Supervised Point Cloud Semantic Segmentation

ICCV 2021
0
citations

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation

ICCV 2021
0
citations

Deep Structured Instance Graph for Distilling Object Detectors

ICCV 2021
0
citations

Video Instance Segmentation With a Propose-Reduce Paradigm

ICCV 2021
0
citations

Learnable Boundary Guided Adversarial Training

ICCV 2021
0
citations

Point Transformer

ICCV 2021
0
citations

Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset With Mechatronic Alignment

ICCV 2021
0
citations

Parametric Contrastive Learning

ICCV 2021
0
citations

Removing Anomalies as Noises for Industrial Defect Localization

ICCV 2023
0
citations

Mask-Attention-Free Transformer for 3D Instance Segmentation

ICCV 2023
0
citations

End-to-end 3D Tracking with Decoupled Queries

ICCV 2023
0
citations

FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

ICCV 2023
0
citations

High Quality Entity Segmentation

ICCV 2023
0
citations

Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References

ECCV 2020
0
citations

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

ECCV 2020
0
citations

CN: Channel Normalization For Point Cloud Recognition

ECCV 2020
0
citations

Memory Selection Network for Video Propagation

ECCV 2020
0
citations

VCNet: A Robust Approach to Blind Image Inpainting

ECCV 2020
0
citations

Tracking Objects As Pixel-Wise Distributions

ECCV 2022
0
citations

CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation

ECCV 2022
0
citations

DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation

ECCV 2022
0
citations

Fast Point R-CNN

ICCV 2019
0
citations

Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code

ICCV 2025
0
citations

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

ICCV 2025
0
citations

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation

CVPR 2024
0
citations

Prompt Highlighter: Interactive Control for Multi-Modal LLMs

CVPR 2024
0
citations

Just Noticeable Defocus Blur Detection and Estimation

CVPR 2015
0
citations

Deep LAC: Deep Localization, Alignment and Classification for Fine-Grained Recognition

CVPR 2015
0
citations

Handling Motion Blur in Multi-Frame Super-Resolution

CVPR 2015
0
citations

Multi-Scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation

CVPR 2016
0
citations

ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation

CVPR 2016
0
citations

Pyramid Scene Parsing Network

CVPR 2017
0
citations

Image Inpainting via Generative Multi-column Convolutional Neural Networks

NeurIPS 2018
0
citations

Sequential Context Encoding for Duplicate Removal

NeurIPS 2018
0
citations

LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond

NeurIPS 2020
0
citations

Blending Anti-Aliasing into Vision Transformer

NeurIPS 2021
0
citations

Unifying Voxel-based Representation with Transformer for 3D Object Detection

NeurIPS 2022
0
citations

Real-World Image Variation by Aligning Diffusion Inversion Chain

NeurIPS 2023
0
citations

DiffComplete: Diffusion-based Generative 3D Shape Completion

NeurIPS 2023
0
citations

Deep Edge-Aware Filters

ICML 2015
0
citations