Tsung-Yi Lin

29
Papers
270
Total Citations

Papers (29)

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

CVPR 2025
203
citations

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

CVPR 2024
33
citations

Efficient Part-level 3D Object Generation via Dual Volume Packing

NeurIPS 2025arXiv
16
citations

Dynamic Camera Poses and Where to Find Them

CVPR 2025arXiv
15
citations

Articulated Kinematics Distillation from Video Diffusion Models

CVPR 2025
3
citations

MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices

CVPR 2020arXiv
0
citations

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

CVPR 2020arXiv
0
citations

Bottleneck Transformers for Visual Recognition

CVPR 2021arXiv
0
citations

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation

CVPR 2021arXiv
0
citations

Magic3D: High-Resolution Text-to-3D Content Creation

CVPR 2023arXiv
0
citations

Focal Loss for Dense Object Detection

ICCV 2017arXiv
0
citations

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors

ICCV 2019
0
citations

Multi-Task Self-Training for Learning General Representations

ICCV 2021arXiv
0
citations

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval From a Single Image

ICCV 2021arXiv
0
citations

ATT3D: Amortized Text-to-3D Object Synthesis

ICCV 2023arXiv
0
citations

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

ECCV 2020
0
citations

Efficient Scale-Permuted Backbone with Learned Resource Distribution

ECCV 2020
0
citations

Learning Data Augmentation Strategies for Object Detection

ECCV 2020
0
citations

A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation

ECCV 2022
0
citations

Scaling Open-Vocabulary Image Segmentation with Image-Level Labels

ECCV 2022
0
citations

HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

CVPR 2025
0
citations

Learning Deep Representations for Ground-to-Aerial Geolocalization

CVPR 2015
0
citations

Feature Pyramid Networks for Object Detection

CVPR 2017arXiv
0
citations

NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection

CVPR 2019
0
citations

Class-Balanced Loss Based on Effective Number of Samples

CVPR 2019
0
citations

DropBlock: A regularization method for convolutional networks

NeurIPS 2018
0
citations

Rethinking Pre-training and Self-training

NeurIPS 2020
0
citations

Revisiting ResNets: Improved Training and Scaling Strategies

NeurIPS 2021
0
citations

A Unified Sequence Interface for Vision Tasks

NeurIPS 2022
0
citations