All Papers

34,598 papers found

SCTNet: Single Branch CNN with Transformer Semantic Information for Real-Time Segmentation

Zhengze Xu, Dongyue Wu, Changqian Yu et al.

AAAI 2024 • paper • arXiv:2312.17071 • 129 citations

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

Tongtian Yue, Jie Cheng, Longteng Guo et al.

CVPR 2024 • poster • arXiv:2403.13263

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

Chen Cheng, Xiaofeng Yang, Fan Yang et al.

CVPR 2024 • poster • arXiv:2403.09140

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training

Yipeng Gao, Zeyu Wang, Wei-Shi Zheng et al.

CVPR 2024 • poster • arXiv:2311.01734

SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes

Soubhik Sanyal, Partha Ghosh, Jinlong Yang et al.

CVPR 2024 • poster • arXiv:2308.10638 • 5 citations

SD2Event: Self-supervised Learning of Dynamic Detectors and Contextual Descriptors for Event Cameras

Yuan Gao, Yuqing Zhu, Xinjun Li et al.

CVPR 2024 • poster

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching

Xinghui Li, Jingyi Lu, Kai Han et al.

CVPR 2024 • poster • arXiv:2310.17569

SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving

Lei Gong, Yu Zhang, Yingqing Xia et al.

AAAI 2024 • paper

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

Junsu Kim, Hoseong Cho, Jihyeon Kim et al.

CVPR 2024 • highlight • arXiv:2402.17323 • 47 citations

SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer

Rui Zhu, Yingwei Pan, Yehao Li et al.

CVPR 2024 • poster • arXiv:2403.17004

SDGAN: Disentangling Semantic Manipulation for Facial Attribute Editing

Wenmin Huang, Weiqi Luo, Jiwu Huang et al.

AAAI 2024 • paper

SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning

Yuxin Deng, Jiayi Ma

AAAI 2024 • paper • arXiv:2106.04434 • 11 citations

SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM Optimization

Zhenlong Yuan, Jiakai Cao, Zhaoxin Li et al.

AAAI 2024 • paper • arXiv:2401.06385 • 35 citations

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

Chen Sichen, Yingyi Zhang, Siming Huang et al.

CVPR 2024 • poster • arXiv:2404.03518

SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models

Yang Zhou, Yongjian Wu, Jiya Saiyin et al.

ECCV 2024 • poster • arXiv:2407.11414 • 2 citations

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

Xiaojun Hou, Jiazheng Xing, Yijie Qian et al.

CVPR 2024 • poster • arXiv:2403.16002

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Dustin Podell, Zion English, Kyle Lacey et al.

ICLR 2024 • spotlight • arXiv:2307.01952

S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes

Xingyi Li, Zhiguo Cao, Yizheng Wu et al.

CVPR 2024 • poster • arXiv:2403.06205

SE(3)-Stochastic Flow Matching for Protein Backbone Generation

Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet et al.

ICLR 2024 • spotlight • arXiv:2310.02391

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects

Abhinav Kumar, Yuliang Guo, Xinyu Huang et al.

CVPR 2024 • poster • arXiv:2403.20318

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Jiafei Lyu, Xiaoteng Ma, Le Wan et al.

ICLR 2024 • poster • arXiv:2402.03807

SEA-GWNN: Simple and Effective Adaptive Graph Wavelet Neural Network

Swakshar Deb, Sejuti Rahman, Shafin Rahman

AAAI 2024 • paper

SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution

Wenlong Zhang, Xiaohui Li, Xiangyu Chen et al.

ICLR 2024 • spotlight • arXiv:2309.03020

Seamless Human Motion Composition with Blended Positional Encodings

German Barquero, Sergio Escalera, Cristina Palmero

CVPR 2024 • poster • arXiv:2402.15509 • 58 citations

SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

Yihan Wang, Lahav Lipson, Jia Deng

ECCV 2024 • poster • arXiv:2405.14793 • 113 citations

Searching for High-Value Molecules Using Reinforcement Learning and Transformers

Raj Ghugare, Santiago Miret, Adriana Hugessen et al.

ICLR 2024 • poster • arXiv:2310.02902

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Qi Qian, Yuanhong Xu, Juhua Hu

ECCV 2024 • poster • arXiv:2408.13351

SEA: Sparse Linear Attention with Estimated Attention Mask

Heejun Lee, Jina Kim, Jeff Willette et al.

ICLR 2024 • poster • arXiv:2310.01777

SEAS: ShapE-Aligned Supervision for Person Re-Identification

Haidong Zhu, Pranav Budhwant, Zhaoheng Zheng et al.

CVPR 2024 • poster

SECap: Speech Emotion Captioning with Large Language Model

Yaoxun Xu, Hangting Chen, Jianwei Yu et al.

AAAI 2024 • paper • arXiv:2312.10381 • 56 citations

SEC: More Accurate Clustering Algorithm via Structural Entropy

Junyu Huang, Qilong Feng, Jiahui Wang et al.

AAAI 2024 • paper • 1 citation

Second-Order Uncertainty Quantification: A Distance-Based Approach

Yusuf Sale, Viktor Bengs, Michele Caprio et al.

ICML 2024 • spotlight • arXiv:2312.00995

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

Yamei Chen, Yan Di, Guangyao Zhai et al.

CVPR 2024 • poster • arXiv:2311.11125 • 54 citations

Secure Distributed Sparse Gaussian Process Models Using Multi-Key Homomorphic Encryption

Adil Nawaz, Guopeng Chen, Muhammad Umair Raza et al.

AAAI 2024 • paper

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

Bin Xie, Jiale Cao, Jin Xie et al.

CVPR 2024 • poster • arXiv:2311.15537 • 90 citations

SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models

Dongseok Shim, Hyoun Jin Kim

ECCV 2024 • poster

SeD: Semantic-Aware Discriminator for Image Super-Resolution

Bingchen Li, Xin Li, Hanxin Zhu et al.

CVPR 2024 • poster • arXiv:2402.19387

See and Think: Embodied Agent in Virtual Environment

Zhonghan Zhao, Xuan Wang, Wenhao Chai et al.

ECCV 2024 • poster • arXiv:2311.15209

SEED: A Simple and Effective 3D DETR in Point Clouds

Zhe Liu, Jinghua Hou, Xiaoqing Ye et al.

ECCV 2024 • poster • arXiv:2407.10749 • 21 citations

SEED-Bench: Benchmarking Multimodal Large Language Models

Bohao Li, Yuying Ge, Yixiao Ge et al.

CVPR 2024 • poster

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains

Yu Zhang, Yunyi Zhang, Yanzhen Shen et al.

AAAI 2024 • paper • arXiv:2401.13129 • 5 citations

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Yazhou Xing, Yingqing He, Zeyue Tian et al.

CVPR 2024 • poster • arXiv:2402.17723 • 109 citations

Seeing Dark Videos via Self-Learned Bottleneck Neural Representation

Haofeng Huang, Wenhan Yang, Lingyu Duan et al.

AAAI 2024 • paper

Seeing Faces in Things: A Model and Dataset for Pareidolia

Mark T Hamilton, Simon Stent, Vasha G DuTell et al.

ECCV 2024 • poster • arXiv:2409.16143 • 4 citations

Seeing Motion at Nighttime with an Event Camera

Haoyue Liu, Shihan Peng, Lin Zhu et al.

CVPR 2024 • poster • arXiv:2404.11884 • 30 citations

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

Shihao Zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024 • poster • arXiv:2404.00288

Seeing the Unseen: Visual Common Sense for Semantic Placement

Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra et al.

CVPR 2024 • poster • arXiv:2401.07770

Seeing the World through Your Eyes

Hadi Alzayer, Kevin Zhang, Brandon Y. Feng et al.

CVPR 2024 • poster • arXiv:2306.09348

Seeing Unseen: Discover Novel Biomedical Concepts via Geometry-Constrained Probabilistic Modeling

Jianan Fan, Dongnan Liu, Hang Chang et al.

CVPR 2024 • poster • arXiv:2403.01053

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Ming Zhong, Chenxin An, Weizhu Chen et al.

ICLR 2024 • poster • arXiv:2310.11451 • 16 citations