2024 Poster Papers


When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Haoran You, Yichao Fu, Zheng Wang et al.

ICML 2024 • poster • arXiv:2406.07368

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Yi Zhang, Wang Zeng, Sheng Jin et al.

ECCV 2024 • poster • arXiv:2407.10125 • 19 citations

When Representations Align: Universality in Representation Learning Dynamics

Loek van Rossem, Andrew Saxe

ICML 2024 • poster • arXiv:2402.09142

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Biao Zhang, Zhongtao Liu, Colin Cherry et al.

ICLR 2024 • poster • arXiv:2402.17193

When Semantic Segmentation Meets Frequency Aliasing

Linwei Chen, Lin Gu, Ying Fu

ICLR 2024 • poster • arXiv:2403.09065 • 21 citations

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024 • poster • arXiv:2305.14550

When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation

Xiaoming Li, Xinyu Hou, Chen Change Loy

CVPR 2024 • poster

When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach

Tao Ma, Bing Bai, Haozhe Lin et al.

CVPR 2024 • poster • 7 citations

When Will Gradient Regularization Be Harmful?

Yang Zhao, Hao Zhang, Xiuyuan Hu

ICML 2024 • poster • arXiv:2406.09723

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024 • poster • arXiv:2404.14565 • 13 citations

Where We Have Arrived in Proving the Emergence of Sparse Interaction Primitives in DNNs

Qihan Ren, Jiayang Gao, Wen Shen et al.

ICLR 2024 • poster

Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning

Yuxiao Wen, Arthur Jacot

ICML 2024 • poster • arXiv:2402.08010

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024 • poster • arXiv:2404.02697 • 12 citations

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models

Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.

ICML 2024 • poster • arXiv:2407.12824

Whittle Index with Multiple Actions and State Constraint for Inventory Management

Chuheng Zhang, Xiangsen Wang, Wei Jiang et al.

ICLR 2024 • poster

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024 • poster • arXiv:2402.18361

Why do Variational Autoencoders Really Promote Disentanglement?

Pratik Bhowal, Achint Soni, Sirisha Rambhatla

ICML 2024 • poster

Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition

Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.

ICML 2024 • poster • arXiv:2407.12332

Why is SAM Robust to Label Noise?

Christina Baek, J Kolter, Aditi Raghunathan

ICLR 2024 • poster • arXiv:2405.03676

Why Larger Language Models Do In-context Learning Differently?

Zhenmei Shi, Junyi Wei, Zhuoyan Xu et al.

ICML 2024 • poster • arXiv:2405.19592

Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

Kumaranage Ravindu Nagasinghe, Honglu Zhou, Malitha Gunawardhana et al.

CVPR 2024 • poster • arXiv:2403.02782

WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space

Katja Schwarz, Seung Wook Kim, Jun Gao et al.

ICLR 2024 • poster • arXiv:2311.13570

WildlifeMapper: Aerial Image Analysis for Multi-Species Detection and Identification

Satish Kumar, Bowen Zhang, Chandrakanth Gudavalli et al.

CVPR 2024 • poster

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, Peishan Cong et al.

ECCV 2024 • poster • arXiv:2304.05645 • 13 citations

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Zijian He, Peixin Chen, Guangrun Wang et al.

ECCV 2024 • poster • arXiv:2407.10625 • 20 citations

WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

Shuokang Huang, Kaihan Li, Di You et al.

ECCV 2024 • poster • arXiv:2402.09430 • 28 citations

Window Attention is Bugged: How not to Interpolate Position Embeddings

Daniel Bolya, Chaitanya Ryali, Judy Hoffman et al.

ICLR 2024 • poster • arXiv:2311.05613

WindPoly: Polygonal Mesh Reconstruction via Winding Numbers

Xin He, Chenlei Lyu, Pengdi Huang et al.

ECCV 2024 • poster • arXiv:2407.19208 • 3 citations

Winner-takes-all learners are geometry-aware conditional density estimators

Victor Letzelter, David Perera, Cédric Rommel et al.

ICML 2024 • poster • arXiv:2406.04706

WinSyn: A High Resolution Testbed for Synthetic Data

Tom Kelly, John Femiani, Peter Wonka

CVPR 2024 • poster • arXiv:2310.08471

Win-Win: Training High-Resolution Vision Transformers from Two Windows

Vincent Leroy, Jerome Revaud, Thomas Lucas et al.

ICLR 2024 • poster • arXiv:2310.00632

Wired Perspectives: Multi-View Wire Art Embraces Generative AI

Zhiyu Qu, Lan Yang, Honggang Zhang et al.

CVPR 2024 • poster • arXiv:2311.15421

WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer

Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.

ICML 2024 • poster • arXiv:2405.04078

Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence

Yutong Chen, Yifan Zhan, Zhihang Zhong et al.

ECCV 2024 • poster • arXiv:2403.19160 • 8 citations

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Ziyang Luo, Can Xu, Pu Zhao et al.

ICLR 2024 • poster • arXiv:2306.08568

WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions

Can Xu, Qingfeng Sun, Kai Zheng et al.

ICLR 2024 • poster • arXiv:2304.12244

WonderJourney: Going from Anywhere to Everywhere

Hong-Xing Yu, Haoyi Duan, Junhwa Hur et al.

CVPR 2024 • poster • arXiv:2312.03884

WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series

Irina Rish, Kartik Ahuja, Mohammad Javad Darvishi Bayazi et al.

ICLR 2024 • poster • 40 citations

WorDepth: Variational Language Prior for Monocular Depth Estimation

Ziyao Zeng, Hyoungseob Park, Fengyu Yang et al.

CVPR 2024 • poster • arXiv:2404.03635

WordRobe: Text-Guided Generation of Textured 3D Garments

Astitva Srivastava, Pranav Manu, Amit Raj et al.

ECCV 2024 • poster • arXiv:2403.17541 • 20 citations

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Alexandre Drouin, Maxime Gasse, Massimo Caccia et al.

ICML 2024 • poster • arXiv:2403.07718

Workflow Discovery from Dialogues in the Low Data Regime

David Vazquez, Stefania Raimondo, Christopher Pal et al.

ICLR 2024 • poster • 11 citations

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024 • poster • arXiv:2501.02771 • 8 citations

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

Changhoon Kim, Kyle Min, Maitreya Patel et al.

CVPR 2024 • poster • arXiv:2306.04744

Would Deep Generative Models Amplify Bias in Future Models?

Tianwei Chen, Yusuke Hirota, Mayu Otani et al.

CVPR 2024 • poster • arXiv:2404.03242 • 23 citations

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation

Jiachen Lu, Ze Huang, Zeyu Yang et al.

ECCV 2024 • poster • arXiv:2312.02934

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

Xinjian Wu, Ruisong Zhang, Jie Qin et al.

ECCV 2024 • poster • arXiv:2407.10131

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024 • poster • arXiv:2408.10624

WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering

Pingyi Chen, Chenglu Zhu, Sunyi Zheng et al.

ECCV 2024 • poster • arXiv:2407.05603 • 24 citations

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding

Quan Kong, Yuki Kawana, Rajat Saini et al.

ECCV 2024 • poster • arXiv:2407.15350 • 21 citations