Jian Yang

63
Papers
485
Total Citations

Papers (63)

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

AAAI 2025
99
citations

Frequency-Spatial Entanglement Learning for Camouflaged Object Detection

ECCV 2024
68
citations

OmniBench: Towards The Future of Universal Omni-Language Models

NeurIPS 2025
51
citations

LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection

AAAI 2024arXiv
47
citations

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

AAAI 2025
38
citations

McEval: Massively Multilingual Code Evaluation

ICLR 2025arXiv
30
citations

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

ICCV 2025
22
citations

EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing

CVPR 2025arXiv
16
citations

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

CVPR 2025arXiv
14
citations

MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling

CVPR 2025
13
citations

Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

CVPR 2025arXiv
12
citations

From Words to Worth: Newborn Article Impact Prediction with LLM

AAAI 2025
11
citations

RNG: Relightable Neural Gaussians

CVPR 2025
9
citations

Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

AAAI 2025
8
citations

UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

NeurIPS 2025
7
citations

LaTexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending

CVPR 2025
5
citations

Fundamental Matrix Estimation Using Relative Depths

ECCV 2024
5
citations

DuCos: Duality Constrained Depth Super-Resolution via Foundation Model

ICCV 2025arXiv
4
citations

SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering

CVPR 2025arXiv
4
citations

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

AAAI 2024arXiv
4
citations

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

NeurIPS 2025
4
citations

Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model

CVPR 2025
3
citations

Relaxed Rotational Equivariance via G-Biases in Vision

AAAI 2025
2
citations

Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability

CVPR 2025arXiv
2
citations

Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent

ICCV 2025
2
citations

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

ICCV 2025arXiv
1
citations

Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning

CVPR 2025arXiv
1
citations

Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection

CVPR 2025arXiv
1
citations

Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation

AAAI 2025
1
citations

Reverse Convolution and Its Applications to Image Restoration

ICCV 2025arXiv
1
citations

Towards Better Spherical Sliced-Wasserstein Distance Learning with Data-Adaptive Discriminative Projection Direction

AAAI 2025
0
citations

Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow

AAAI 2025
0
citations

XCOT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning

AAAI 2025
0
citations

From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective

CVPR 2025
0
citations

Fine-Tuning Language Models with Collaborative and Semantic Experts

AAAI 2025
0
citations

MCL-NER: Cross-Lingual Named Entity Recognition via Multi-View Contrastive Learning

AAAI 2024arXiv
0
citations

SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation

AAAI 2024
0
citations

Hyperbolic Graph Diffusion Model

AAAI 2024
0
citations

Divide and Conquer: Hybrid Pre-training for Person Search

AAAI 2024
0
citations

SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-resolution

AAAI 2024
0
citations

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

CVPR 2024
0
citations

Multi-Attribute Interactions Matter for 3D Visual Grounding

CVPR 2024
0
citations

Tri-Perspective View Decomposition for Geometry-Aware Depth Completion

CVPR 2024
0
citations

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

CVPR 2024
0
citations

LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling

CVPR 2024
0
citations

VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction

CVPR 2025
0
citations

Generative Point Cloud Registration

ICML 2025
0
citations

Sketchy Bounding-box Supervision for 3D Instance Segmentation

CVPR 2025
0
citations

HORP: Human-Object Relation Priors Guided HOI Detection

CVPR 2025
0
citations

Three-view Focal Length Recovery From Homographies

CVPR 2025
0
citations

DORNet: A Degradation Oriented and Regularized Network for Blind Depth Super-Resolution

CVPR 2025
0
citations

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

CVPR 2025
0
citations

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

ICCV 2025
0
citations

Straighten Viscous Rectified Flow via Noise Optimization

ICCV 2025
0
citations

GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views

ICCV 2025
0
citations

RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation

ICCV 2025
0
citations

OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving

ICCV 2025
0
citations

Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method

ICCV 2025
0
citations

Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

CVPR 2025
0
citations

WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion

CVPR 2025
0
citations

Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis

AAAI 2025
0
citations

Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion

AAAI 2025
0
citations

One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

CVPR 2025
0
citations