Liang Lin

29 Papers
129 Total Citations

Papers (29)

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

CVPR 2024
65
citations

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

AAAI 2024
20
citations

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis

CVPR 2024
20
citations

Cross-modal Causal Relation Alignment for Video Question Grounding

CVPR 2025
7
citations

DreamFuse: Adaptive Image Fusion with Diffusion Transformer

ICCV 2025
5
citations

Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach

AAAI 2024
4
citations

PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention

CVPR 2025
3
citations

Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation

CVPR 2025
2
citations

Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference

ICCV 2025
1
citations

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

ICCV 2025
1
citations

Stripe Observation Guided Inference Cost-free Attention Mechanism

ECCV 2024
1
citations

SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks

AAAI 2025
0
citations

Monitoring Primitive Interactions During the Training of DNNs

AAAI 2025
0
citations

FacetCRS: Multi-Faceted Preference Learning for Pricking Filter Bubbles in Conversational Recommender System

AAAI 2024
0
citations

Learning Adaptive Spatial Coherent Correlations for Speech-Preserving Facial Expression Manipulation

CVPR 2024
0
citations

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

CVPR 2024
0
citations

AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios

ICML 2024
0
citations

Reproducible Vision-Language Models Meet Concepts Out of Pre-Training

CVPR 2025
0
citations

Kepler codebook

ICML 2024
0
citations

VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction

CVPR 2025
0
citations

No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition

CVPR 2025
0
citations

DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

CVPR 2025
0
citations

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

CVPR 2025
0
citations

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

CVPR 2025
0
citations

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering

ICCV 2025
0
citations

RoboPearls: Editable Video Simulation for Robot Manipulation

ICCV 2025
0
citations

Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions

ICCV 2025
0
citations

RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation

ICCV 2025
0
citations

Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy

NeurIPS 2025
0
citations