Liang Lin
29
Papers
129
Total Citations
Papers (29)
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
CVPR 2024
65
citations
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI 2024, arXiv
20
citations
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis
CVPR 2024
20
citations
Cross-modal Causal Relation Alignment for Video Question Grounding
CVPR 2025
7
citations
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
ICCV 2025
5
citations
Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach
AAAI 2024, arXiv
4
citations
PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention
CVPR 2025
3
citations
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation
CVPR 2025
2
citations
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
ICCV 2025
1
citations
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
ICCV 2025
1
citations
Stripe Observation Guided Inference Cost-free Attention Mechanism
ECCV 2024
1
citations
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks
AAAI 2025
0
citations
Monitoring Primitive Interactions During the Training of DNNs
AAAI 2025
0
citations
FacetCRS: Multi-Faceted Preference Learning for Pricking Filter Bubbles in Conversational Recommender System
AAAI 2024
0
citations
Learning Adaptive Spatial Coherent Correlations for Speech-Preserving Facial Expression Manipulation
CVPR 2024
0
citations
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
CVPR 2024
0
citations
AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios
ICML 2024
0
citations
Reproducible Vision-Language Models Meet Concepts Out of Pre-Training
CVPR 2025
0
citations
Kepler Codebook
ICML 2024
0
citations
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
CVPR 2025
0
citations
No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition
CVPR 2025
0
citations
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh
CVPR 2025
0
citations
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
CVPR 2025
0
citations
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
CVPR 2025
0
citations
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
ICCV 2025
0
citations
RoboPearls: Editable Video Simulation for Robot Manipulation
ICCV 2025
0
citations
Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions
ICCV 2025
0
citations
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation
ICCV 2025
0
citations
Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy
NeurIPS 2025
0
citations