Jian Yang
63
Papers
485
Total Citations
Papers (63)
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
AAAI 2025
99
citations
Frequency-Spatial Entanglement Learning for Camouflaged Object Detection
ECCV 2024
68
citations
OmniBench: Towards The Future of Universal Omni-Language Models
NeurIPS 2025
51
citations
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
AAAI 2024arXiv
47
citations
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
AAAI 2025
38
citations
McEval: Massively Multilingual Code Evaluation
ICLR 2025arXiv
30
citations
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ICCV 2025
22
citations
EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
CVPR 2025arXiv
16
citations
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
CVPR 2025arXiv
14
citations
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
CVPR 2025
13
citations
Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion
CVPR 2025arXiv
12
citations
From Words to Worth: Newborn Article Impact Prediction with LLM
AAAI 2025
11
citations
RNG: Relightable Neural Gaussians
CVPR 2025
9
citations
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
AAAI 2025
8
citations
UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset
NeurIPS 2025
7
citations
LaTexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending
CVPR 2025
5
citations
Fundamental Matrix Estimation Using Relative Depths
ECCV 2024
5
citations
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model
ICCV 2025arXiv
4
citations
SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering
CVPR 2025arXiv
4
citations
AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization
AAAI 2024arXiv
4
citations
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
NeurIPS 2025
4
citations
Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model
CVPR 2025
3
citations
Relaxed Rotational Equivariance via G-Biases in Vision
AAAI 2025
2
citations
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
CVPR 2025arXiv
2
citations
Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent
ICCV 2025
2
citations
StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors
ICCV 2025arXiv
1
citations
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
CVPR 2025arXiv
1
citations
Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection
CVPR 2025arXiv
1
citations
Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
AAAI 2025
1
citations
Reverse Convolution and Its Applications to Image Restoration
ICCV 2025arXiv
1
citations
Towards Better Spherical Sliced-Wasserstein Distance Learning with Data-Adaptive Discriminative Projection Direction
AAAI 2025
0
citations
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow
AAAI 2025
0
citations
XCOT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning
AAAI 2025
0
citations
From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective
CVPR 2025
0
citations
Fine-Tuning Language Models with Collaborative and Semantic Experts
AAAI 2025
0
citations
MCL-NER: Cross-Lingual Named Entity Recognition via Multi-View Contrastive Learning
AAAI 2024arXiv
0
citations
SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation
AAAI 2024
0
citations
Hyperbolic Graph Diffusion Model
AAAI 2024
0
citations
Divide and Conquer: Hybrid Pre-training for Person Search
AAAI 2024
0
citations
SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-resolution
AAAI 2024
0
citations
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
CVPR 2024
0
citations
Multi-Attribute Interactions Matter for 3D Visual Grounding
CVPR 2024
0
citations
Tri-Perspective View Decomposition for Geometry-Aware Depth Completion
CVPR 2024
0
citations
Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance
CVPR 2024
0
citations
LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling
CVPR 2024
0
citations
VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction
CVPR 2025
0
citations
Generative Point Cloud Registration
ICML 2025
0
citations
Sketchy Bounding-box Supervision for 3D Instance Segmentation
CVPR 2025
0
citations
HORP: Human-Object Relation Priors Guided HOI Detection
CVPR 2025
0
citations
Three-view Focal Length Recovery From Homographies
CVPR 2025
0
citations
DORNet: A Degradation Oriented and Regularized Network for Blind Depth Super-Resolution
CVPR 2025
0
citations
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
CVPR 2025
0
citations
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
0
citations
Straighten Viscous Rectified Flow via Noise Optimization
ICCV 2025
0
citations
GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views
ICCV 2025
0
citations
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation
ICCV 2025
0
citations
OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving
ICCV 2025
0
citations
Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method
ICCV 2025
0
citations
Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios
CVPR 2025
0
citations
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion
CVPR 2025
0
citations
Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis
AAAI 2025
0
citations
Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion
AAAI 2025
0
citations
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
CVPR 2025
0
citations