Jiaya Jia
128
Papers
1,266
Total Citations
Papers (128)
LISA: Reasoning Segmentation via Large Language Model
CVPR 2024
721
citations
Video-P2P: Video Editing with Cross-attention Control
CVPR 2024
309
citations
Visual Question Answering with Question Representation Update (QRU)
NeurIPS 2016
87
citations
Unified Language-driven Zero-shot Domain Adaptation
CVPR 2024
29
citations
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
CVPR 2024
25
citations
MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
ICCV 2025
21
citations
Generative Video Propagation
CVPR 2025
20
citations
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024
17
citations
DreamOmni: Unified Image Generation and Editing
CVPR 2025
16
citations
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
ICCV 2025
11
citations
SaCo Loss: Sample-wise Affinity Consistency for Vision-Language Pre-training
CVPR 2024
10
citations
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
CVPR 2018
0
citations
Facelet-Bank for Fast Portrait Manipulation
CVPR 2018arXiv
0
citations
Referring Image Segmentation via Recurrent Refinement Networks
CVPR 2018
0
citations
Scale-Recurrent Network for Deep Image Deblurring
CVPR 2018arXiv
0
citations
Path Aggregation Network for Instance Segmentation
CVPR 2018arXiv
0
citations
Semi-Parametric Image Synthesis
CVPR 2018arXiv
0
citations
Wide-Context Semantic Image Extrapolation
CVPR 2019
0
citations
Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation
CVPR 2019
0
citations
Amodal Instance Segmentation With KINS Dataset
CVPR 2019
0
citations
Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections
CVPR 2019
0
citations
Associatively Segmenting Instances and Semantics in Point Clouds
CVPR 2019
0
citations
Learning Shape-Aware Embedding for Scene Text Detection
CVPR 2019
0
citations
PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing
CVPR 2019
0
citations
Underexposed Photo Enhancement Using Deep Illumination Estimation
CVPR 2019
0
citations
3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis
CVPR 2019
0
citations
Semantic Component Decomposition for Face Attribute Manipulation
CVPR 2019
0
citations
Domain Adaptive Image-to-Image Translation
CVPR 2020
0
citations
Attentive Normalization for Conditional Image Generation
CVPR 2020arXiv
0
citations
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
CVPR 2020arXiv
0
citations
Exploring Self-Attention for Image Recognition
CVPR 2020arXiv
0
citations
DSGN: Deep Stereo Geometry Network for 3D Object Detection
CVPR 2020arXiv
0
citations
3DSSD: Point-Based 3D Single Stage Object Detector
CVPR 2020arXiv
0
citations
Distilling Knowledge via Knowledge Review
CVPR 2021arXiv
0
citations
Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency
CVPR 2021
0
citations
Jigsaw Clustering for Unsupervised Visual Representation Learning
CVPR 2021arXiv
0
citations
Self-Supervised 3D Mesh Reconstruction From Single Images
CVPR 2021
0
citations
Multi-Scale Aligned Distillation for Low-Resolution Detection
CVPR 2021
0
citations
Scale-Aware Automatic Augmentation for Object Detection
CVPR 2021arXiv
0
citations
Fully Convolutional Networks for Panoptic Segmentation
CVPR 2021arXiv
0
citations
MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution
CVPR 2021
0
citations
Bidirectional Projection Network for Cross Dimension Scene Understanding
CVPR 2021arXiv
0
citations
Improving Calibration for Long-Tailed Recognition
CVPR 2021arXiv
0
citations
TWIST: Two-Way Inter-Label Self-Training for Semi-Supervised 3D Instance Segmentation
CVPR 2022
0
citations
EfficientNeRF Efficient Neural Radiance Fields
CVPR 2022arXiv
0
citations
Voxel Field Fusion for 3D Object Detection
CVPR 2022arXiv
0
citations
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
CVPR 2022arXiv
0
citations
A Unified Query-Based Paradigm for Point Cloud Understanding
CVPR 2022arXiv
0
citations
Generalized Few-Shot Semantic Segmentation
CVPR 2022arXiv
0
citations
Video Frame Interpolation With Transformer
CVPR 2022arXiv
0
citations
Focal Sparse Convolutional Networks for 3D Object Detection
CVPR 2022arXiv
0
citations
Multi-View Transformer for 3D Visual Grounding
CVPR 2022arXiv
0
citations
High Quality Segmentation for Ultra High-Resolution Images
CVPR 2022arXiv
0
citations
SNR-Aware Low-Light Image Enhancement
CVPR 2022
0
citations
Stratified Transformer for 3D Point Cloud Segmentation
CVPR 2022arXiv
0
citations
Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need
CVPR 2023arXiv
0
citations
Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields
CVPR 2023arXiv
0
citations
Spherical Transformer for LiDAR-Based 3D Recognition
CVPR 2023arXiv
0
citations
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
CVPR 2023arXiv
0
citations
Understanding Imbalanced Semantic Segmentation Through Neural Collapse
CVPR 2023arXiv
0
citations
LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs
CVPR 2023arXiv
0
citations
TriVol: Point Cloud Rendering via Triple Volumes
CVPR 2023arXiv
0
citations
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization
CVPR 2023
0
citations
Hierarchical Dense Correlation Distillation for Few-Shot Segmentation
CVPR 2023arXiv
0
citations
Command-Driven Articulated Object Understanding and Manipulation
CVPR 2023
0
citations
Video Super-Resolution via Deep Draft-Ensemble Learning
ICCV 2015
0
citations
Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
ICCV 2015
0
citations
Box Aggregation for Proposal Decimation: Last Mile of Object Detection
ICCV 2015
0
citations
Semantic Segmentation With Object Clique Potential
ICCV 2015
0
citations
Understanding and Diagnosing Visual Tracking Systems
ICCV 2015
0
citations
Mutual-Structure for Joint Filtering
ICCV 2015
0
citations
Zero-Order Reverse Filtering
ICCV 2017arXiv
0
citations
Unsupervised Learning of Stereo Matching
ICCV 2017
0
citations
High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits
ICCV 2017arXiv
0
citations
SGN: Sequential Grouping Networks for Instance Segmentation
ICCV 2017
0
citations
Situation Recognition With Graph Neural Networks
ICCV 2017arXiv
0
citations
Detail-Revealing Deep Video Super-Resolution
ICCV 2017arXiv
0
citations
Makeup-Go: Blind Reversion of Portrait Edit
ICCV 2017
0
citations
3D Graph Neural Networks for RGBD Semantic Segmentation
ICCV 2017
0
citations
STD: Sparse-to-Dense 3D Object Detector for Point Cloud
ICCV 2019
0
citations
Aggregation via Separation: Boosting Facial Landmark Detector With Semi-Supervised Style Translation
ICCV 2019
0
citations
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation
ICCV 2019
0
citations
Attribute-Driven Spontaneous Motion in Unpaired Image Translation
ICCV 2019
0
citations
Fast and Practical Neural Architecture Search
ICCV 2019
0
citations
View Independent Generative Adversarial Network for Novel View Synthesis
ICCV 2019
0
citations
VisionZip: Longer is Better but Not Necessary in Vision Language Models
CVPR 2025
0
citations
Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation
ICCV 2019
0
citations
Image Synthesis via Semantic Composition
ICCV 2021arXiv
0
citations
Guided Point Contrastive Learning for Semi-Supervised Point Cloud Semantic Segmentation
ICCV 2021arXiv
0
citations
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation
ICCV 2021arXiv
0
citations
Deep Structured Instance Graph for Distilling Object Detectors
ICCV 2021arXiv
0
citations
Video Instance Segmentation With a Propose-Reduce Paradigm
ICCV 2021arXiv
0
citations
Learnable Boundary Guided Adversarial Training
ICCV 2021arXiv
0
citations
Point Transformer
ICCV 2021arXiv
0
citations
Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset With Mechatronic Alignment
ICCV 2021
0
citations
Parametric Contrastive Learning
ICCV 2021arXiv
0
citations
Removing Anomalies as Noises for Industrial Defect Localization
ICCV 2023
0
citations
Mask-Attention-Free Transformer for 3D Instance Segmentation
ICCV 2023arXiv
0
citations
End-to-end 3D Tracking with Decoupled Queries
ICCV 2023
0
citations
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
ICCV 2023arXiv
0
citations
High Quality Entity Segmentation
ICCV 2023arXiv
0
citations
Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References
ECCV 2020
0
citations
MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
ECCV 2020
0
citations
CN: Channel Normalization For Point Cloud Recognition
ECCV 2020
0
citations
Memory Selection Network for Video Propagation
ECCV 2020
0
citations
VCNet: A Robust Approach to Blind Image Inpainting
ECCV 2020
0
citations
Tracking Objects As Pixel-Wise Distributions
ECCV 2022
0
citations
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
ECCV 2022
0
citations
DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation
ECCV 2022
0
citations
Fast Point R-CNN
ICCV 2019
0
citations
Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code
ICCV 2025
0
citations
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
ICCV 2025
0
citations
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
CVPR 2024
0
citations
Prompt Highlighter: Interactive Control for Multi-Modal LLMs
CVPR 2024
0
citations
Just Noticeable Defocus Blur Detection and Estimation
CVPR 2015
0
citations
Deep LAC: Deep Localization, Alignment and Classification for Fine-Grained Recognition
CVPR 2015
0
citations
Handling Motion Blur in Multi-Frame Super-Resolution
CVPR 2015
0
citations
Multi-Scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation
CVPR 2016
0
citations
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation
CVPR 2016
0
citations
Pyramid Scene Parsing Network
CVPR 2017arXiv
0
citations
Image Inpainting via Generative Multi-column Convolutional Neural Networks
NeurIPS 2018
0
citations
Sequential Context Encoding for Duplicate Removal
NeurIPS 2018
0
citations
LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond
NeurIPS 2020
0
citations
Blending Anti-Aliasing into Vision Transformer
NeurIPS 2021
0
citations
Unifying Voxel-based Representation with Transformer for 3D Object Detection
NeurIPS 2022
0
citations
Real-World Image Variation by Aligning Diffusion Inversion Chain
NeurIPS 2023
0
citations
DiffComplete: Diffusion-based Generative 3D Shape Completion
NeurIPS 2023
0
citations
Deep Edge-Aware Filters
ICML 2015
0
citations