Tsung-Yi Lin
29
Papers
270
Total Citations
Papers (29)
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
CVPR 2025
203
citations
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
CVPR 2024
33
citations
Efficient Part-level 3D Object Generation via Dual Volume Packing
NeurIPS 2025, arXiv
16
citations
Dynamic Camera Poses and Where to Find Them
CVPR 2025, arXiv
15
citations
Articulated Kinematics Distillation from Video Diffusion Models
CVPR 2025
3
citations
MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices
CVPR 2020, arXiv
0
citations
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
CVPR 2020, arXiv
0
citations
Bottleneck Transformers for Visual Recognition
CVPR 2021, arXiv
0
citations
Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation
CVPR 2021, arXiv
0
citations
Magic3D: High-Resolution Text-to-3D Content Creation
CVPR 2023, arXiv
0
citations
Focal Loss for Dense Object Detection
ICCV 2017, arXiv
0
citations
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
ICCV 2019
0
citations
Multi-Task Self-Training for Learning General Representations
ICCV 2021, arXiv
0
citations
Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval From a Single Image
ICCV 2021, arXiv
0
citations
ATT3D: Amortized Text-to-3D Object Synthesis
ICCV 2023, arXiv
0
citations
Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve
ECCV 2020
0
citations
Efficient Scale-Permuted Backbone with Learned Resource Distribution
ECCV 2020
0
citations
Learning Data Augmentation Strategies for Object Detection
ECCV 2020
0
citations
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation
ECCV 2022
0
citations
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
ECCV 2022
0
citations
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
CVPR 2025
0
citations
Learning Deep Representations for Ground-to-Aerial Geolocalization
CVPR 2015
0
citations
Feature Pyramid Networks for Object Detection
CVPR 2017, arXiv
0
citations
NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
CVPR 2019
0
citations
Class-Balanced Loss Based on Effective Number of Samples
CVPR 2019
0
citations
DropBlock: A regularization method for convolutional networks
NeurIPS 2018
0
citations
Rethinking Pre-training and Self-training
NeurIPS 2020
0
citations
Revisiting ResNets: Improved Training and Scaling Strategies
NeurIPS 2021
0
citations
A Unified Sequence Interface for Vision Tasks
NeurIPS 2022
0
citations