Most Cited 2025 Poster Papers

22,274 papers found • Page 23 of 112

#4401

Generalization Bounds and Model Complexity for Kolmogorov–Arnold Networks

Xianyang Zhang, Huijuan Zhou

ICLR 2025 • poster • arXiv:2410.08026 • 5 citations
#4402

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

Zhuoyan Luo, Yinghao Wu, Tianheng Cheng et al.

ICCV 2025 • poster • arXiv:2405.15658 • 5 citations
#4403

Frequency-Dynamic Attention Modulation For Dense Prediction

Linwei Chen, Lin Gu, Ying Fu

ICCV 2025 • poster • arXiv:2507.12006 • 5 citations
#4404

Seeing the Arrow of Time in Large Multimodal Models

Zihui (Sherry) Xue, Romy Luo, Kristen Grauman

NEURIPS 2025 • oral • arXiv:2506.03340 • 5 citations
#4405

Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

Xingjian Ran, Yixuan Li, Linning Xu et al.

NEURIPS 2025 • poster • arXiv:2506.05341 • 5 citations
#4406

Open-World Objectness Modeling Unifies Novel Object Detection

Shan Zhang, Yao Ni, Jinhao Du et al.

CVPR 2025 • poster • 5 citations
#4407

ECHOPulse: ECG Controlled Echocardio-gram Video Generation

Yiwei Li, Sekeun Kim, Zihao Wu et al.

ICLR 2025 • poster • arXiv:2410.03143 • 5 citations
#4408

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025 • poster • arXiv:2412.04626 • 5 citations
#4409

Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

Gezheng Xu, Hui Guo, Li Yi et al.

ICLR 2025 • poster • 5 citations
#4410

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Chenting Wang, Kunchang Li, Tianxiang Jiang et al.

ICCV 2025 • poster • arXiv:2503.14237 • 5 citations
#4411

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025 • poster • arXiv:2410.04315 • 5 citations
#4412

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025 • poster • arXiv:2410.14208 • 5 citations
#4413

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

AAAI 2025 • paper • arXiv:2409.18073 • 5 citations
#4414

Few-Shot, No Problem: Descriptive Continual Relation Extraction

Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.

AAAI 2025 • paper • arXiv:2502.20596 • 5 citations
#4415

Audio-Visual Semantic Graph Network for Audio-Visual Event Localization

Liang Liu, Shuaiyong Li, Yongqiang Zhu

CVPR 2025 • poster • 5 citations
#4416

Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation

Seyedreza Mohseni, Seyedali Mohammadi, Deepa Tilwani et al.

AAAI 2025 • paper • arXiv:2412.16135 • 5 citations
#4417

Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.

AAAI 2025 • paper • arXiv:2501.17326 • 5 citations
#4418

Causally Reliable Concept Bottleneck Models

Giovanni De Felice, Arianna Casanova Flores, Francesco De Santis et al.

NEURIPS 2025 • poster • arXiv:2503.04363 • 5 citations
#4419

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

Kangjie Zheng, Siyue Liang, Junwei Yang et al.

ICLR 2025 • poster • arXiv:2412.05569 • 5 citations
#4420

MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance

Hallee Wong, Jose Javier Gonzalez Ortiz, John Guttag et al.

ICCV 2025 • poster • arXiv:2412.15058 • 5 citations
#4421

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

Peng Xie, Xingyuan Liu, Yequan Bie et al.

NEURIPS 2025 • poster • arXiv:2506.00087 • 5 citations
#4422

STAR: Stability-Inducing Weight Perturbation for Continual Learning

Masih Eskandar, Tooba Imtiaz, Davin Hill et al.

ICLR 2025 • poster • arXiv:2503.01595 • 5 citations
#4423

Strategic Classification With Externalities

Safwan Hossain, Evi Micha, Yiling Chen et al.

ICLR 2025 • poster • arXiv:2410.08032 • 5 citations
#4424

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Dongfang Li, Zetian Sun, Xinshuo Hu et al.

AAAI 2025 • paper • arXiv:2412.07393 • 5 citations
#4425

DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Junjia Huang, Pengxiang Yan, Jiyang Liu et al.

ICCV 2025 • poster • arXiv:2504.08291 • 5 citations
#4426

Many-Objective Multi-Solution Transport

Ziyue Li, Tian Li, Virginia Smith et al.

ICLR 2025 • poster • arXiv:2403.04099 • 5 citations
#4427

Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation

Yujie Zhang, Bingyang Cui, Qi Yang et al.

ICCV 2025 • poster • arXiv:2412.11170 • 5 citations
#4428

A Simple Approach to Unifying Diffusion-based Conditional Generation

Xirui Li, Charles Herrmann, Kelvin Chan et al.

ICLR 2025 • poster • arXiv:2410.11439 • 5 citations
#4429

Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving of Inequalities

Haoyu Zhao, Yihan Geng, Shange Tang et al.

NEURIPS 2025 • poster • arXiv:2505.12680 • 5 citations
#4430

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models

Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.

AAAI 2025 • paper • arXiv:2412.12865 • 5 citations
#4431

Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling

Sirui Li, Wenbin Ouyang, Yining Ma et al.

ICLR 2025 • poster • arXiv:2502.15791 • 5 citations
#4432

LaTexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending

Jian Jin, Zhenbo Yu, Yang Shen et al.

CVPR 2025 • highlight • arXiv:2503.06956 • 5 citations
#4433

MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding

Yuxiang Wei, Yanteng Zhang, Xi Xiao et al.

NEURIPS 2025 • poster • arXiv:2505.15946 • 5 citations
#4434

DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization

Wenchuan Wang, Mengqi Huang, Yijing Tu et al.

ICCV 2025 • poster • arXiv:2505.02192 • 5 citations
#4435

DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models

Lingshun Kong, Jiawei Zhang, Dongqing Zou et al.

NEURIPS 2025 • poster • 5 citations
#4436

Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning

Weidong Liu, Jiyuan Tu, Xi Chen et al.

NEURIPS 2025 • poster • arXiv:2310.02581 • 5 citations
#4437

On the Relation between Rectified Flows and Optimal Transport

Johannes Hertrich, Antonin Chambolle, Julie Delon

NEURIPS 2025 • poster • arXiv:2505.19712 • 5 citations
#4438

Episodic Novelty Through Temporal Distance

Yuhua Jiang, Qihan Liu, Yiqin Yang et al.

ICLR 2025 • oral • arXiv:2501.15418 • 5 citations
#4439

Can Students Beyond the Teacher? Distilling Knowledge from Teacher’s Bias

Jianhua Zhang, Yi Gao, Ruyu Liu et al.

AAAI 2025 • paper • arXiv:2412.09874 • 5 citations
#4440

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

AAAI 2025 • paper • arXiv:2503.17017 • 5 citations
#4441

Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities

Jiayi Kuang, Haojing Huang, Yinghui Li et al.

NEURIPS 2025 • poster • arXiv:2509.25725 • 5 citations
#4442

On Extending Direct Preference Optimization to Accommodate Ties

Jinghong Chen, Guangyu Yang, Weizhe Lin et al.

NEURIPS 2025 • poster • arXiv:2409.17431 • 5 citations
#4443

DMWM: Dual-Mind World Model with Long-Term Imagination

Lingyi Wang, Rashed Shelim, Walid Saad et al.

NEURIPS 2025 • spotlight • arXiv:2502.07591 • 5 citations
#4444

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025 • poster • arXiv:2410.23918 • 5 citations
#4445

CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs

Jinpeng Li, Haiping Wang, Jiabin Chen et al.

ICLR 2025 • poster • 5 citations
#4446

Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis

Kaiyang Ji, Ye Shi, Zichen Jin et al.

ICCV 2025 • highlight • arXiv:2508.02106 • 5 citations
#4447

Harmonizing Visual and Textual Embeddings for Zero-Shot Text-to-Image Customization

Yeji Song, Jimyeong Kim, Wonhark Park et al.

AAAI 2025 • paper • arXiv:2403.14155 • 5 citations
#4448

Difficulty-aware Balancing Margin Loss for Long-tailed Recognition

Minseok Son, Inyong Koo, Jinyoung Park et al.

AAAI 2025 • paper • arXiv:2412.15477 • 5 citations
#4449

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025 • paper • arXiv:2502.05218 • 5 citations
#4450

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Shun Iwase, Muhammad Zubair Irshad, Katherine Liu et al.

CVPR 2025 • poster • arXiv:2504.10857 • 5 citations
#4451

SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image

Dimitrije Antić, Georgios Paschalidis, Shashank Tripathi et al.

ICCV 2025 • poster • arXiv:2409.16178 • 5 citations
#4452

Thousand Voices of Trauma: A Large-Scale Synthetic Dataset for Modeling Prolonged Exposure Therapy Conversations

Suhas BN, Andrew Sherrill, Rosa I. Arriaga et al.

NEURIPS 2025 • spotlight • arXiv:2504.13955 • 5 citations
#4453

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Jingjing Jiang, Chongjie Si, Jun Luo et al.

NEURIPS 2025 • spotlight • arXiv:2505.17534 • 5 citations
#4454

FlexEvent: Towards Flexible Event-Frame Object Detection at Varying Operational Frequencies

Dongyue Lu, Lingdong Kong, Gim Hee Lee et al.

NEURIPS 2025 • oral • arXiv:2412.06708 • 5 citations
#4455

VeriThoughts: Enabling Automated Verilog Code Generation using Reasoning and Formal Verification

Patrick Yubeaton, Andre Nakkab, Weihua Xiao et al.

NEURIPS 2025 • poster • arXiv:2505.20302 • 5 citations
#4456

Thinker: Learning to Think Fast and Slow

Stephen Chung, Wenyu Du, Jie Fu

NEURIPS 2025 • poster • arXiv:2505.21097 • 5 citations
#4457

LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal

Shr-Ruei Tsai, Wei-Cheng Chang, Jie-Ying Lee et al.

ICCV 2025 • poster • arXiv:2510.15868 • 5 citations
#4458

PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores

Guangyi Wang, Yuren Cai, Lijiang Li et al.

ICLR 2025 • poster • arXiv:2408.08822 • 5 citations
#4459

On Union-Closedness of Language Generation

Steve Hanneke, Amin Karbasi, Anay Mehrotra et al.

NEURIPS 2025 • poster • arXiv:2506.18642 • 5 citations
#4460

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.

NEURIPS 2025 • oral • arXiv:2508.15720 • 5 citations
#4461

A solvable model of learning generative diffusion: theory and insights

Hugo Cui, Cengiz Pehlevan, Yue Lu

NEURIPS 2025 • poster • arXiv:2501.03937 • 5 citations
#4462

Adversarial Robust Memory-Based Continual Learner

Xiaoyue Mi, Fan Tang, Zonghan Yang et al.

ICCV 2025 • poster • arXiv:2311.17608 • 5 citations
#4463

Hierarchical Cross-modal Prompt Learning for Vision-Language Models

Hao Zheng, Shunzhi Yang, Zhuoxin He et al.

ICCV 2025 • poster • arXiv:2507.14976 • 5 citations
#4464

Video Perception Models for 3D Scene Synthesis

Rui Huang, Guangyao Zhai, Zuria Bauer et al.

NEURIPS 2025 • poster • arXiv:2506.20601 • 5 citations
#4465

Why Do Some Language Models Fake Alignment While Others Don't?

Abhay Sheshadri, John Hughes, Julian Michael et al.

NEURIPS 2025 • spotlight • arXiv:2506.18032 • 5 citations
#4466

A Simple Graph Contrastive Learning Framework for Short Text Classification

Yonghao Liu, Fausto Giunchiglia, Lan Huang et al.

AAAI 2025 • paper • arXiv:2501.09219 • 5 citations
#4467

Constrained Optimization From a Control Perspective via Feedback Linearization

Runyu Zhang, Arvind Raghunathan, Jeff Shamma et al.

NEURIPS 2025 • poster • arXiv:2503.12665 • 5 citations
#4468

Learning Graph Invariance by Harnessing Spuriosity

Tianjun Yao, Yongqiang Chen, Kai Hu et al.

ICLR 2025 • poster • 5 citations
#4469

Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On

Siqi Wan, Jingwen Chen, Yingwei Pan et al.

ICLR 2025 • poster • arXiv:2505.16977 • 5 citations
#4470

Multi-Modal View Enhanced Large Vision Models for Long-Term Time Series Forecasting

ChengAo Shen, Wenchao Yu, Ziming Zhao et al.

NEURIPS 2025 • poster • arXiv:2505.24003 • 5 citations
#4471

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model

Junjia Huang, Pengxiang Yan, Jinhang Cai et al.

ICCV 2025 • highlight • 5 citations
#4472

Exploring the Design Space of Visual Context Representation in Video MLLMs

Yifan Du, Yuqi Huo, Kun Zhou et al.

ICLR 2025 • poster • arXiv:2410.13694 • 5 citations
#4473

Real-Time Recurrent Reinforcement Learning

Julian Lemmel, Radu Grosu

AAAI 2025 • paper • arXiv:2311.04830 • 5 citations
#4474

On the Consistency of Video Large Language Models in Temporal Comprehension

Minjoon Jung, Junbin Xiao, Byoung-Tak Zhang et al.

CVPR 2025 • poster • arXiv:2411.12951 • 5 citations
#4475

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology

Wenhao Tang, Rong Qin, Heng Fang et al.

NEURIPS 2025 • poster • arXiv:2506.02408 • 5 citations
#4476

Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search

Haoran Sun, Yankai Jiang, Wenjie Lou et al.

NEURIPS 2025 • poster • arXiv:2506.16962 • 5 citations
#4477

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Liang Ma, Jiajun Wen, Min Lin et al.

NEURIPS 2025 • poster • arXiv:2506.08708 • 5 citations
#4478

Certification of Speaker Recognition Models to Additive Perturbations

Dmitrii Korzh, Elvir Karimov, Mikhail Pautov et al.

AAAI 2025 • paper • arXiv:2404.18791 • 5 citations
#4479

Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner

Runa Eschenhagen, Aaron Defazio, Tsung-Hsien Lee et al.

NEURIPS 2025 • spotlight • arXiv:2506.03595 • 5 citations
#4480

Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings

Di Wu, Siyuan Li, Chen Feng et al.

ICLR 2025 • poster • arXiv:2410.12866 • 5 citations
#4481

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.

NEURIPS 2025 • poster • arXiv:2506.16685 • 5 citations
#4482

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Xin Li, Deshui Miao, Zhenyu He et al.

ICLR 2025 • poster • arXiv:2407.07760 • 5 citations
#4483

MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond

Shenghao Ren, Yi Lu, Jiayi Huang et al.

CVPR 2025 • highlight • arXiv:2504.05046 • 5 citations
#4484

Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning

Yihong Tang, Kehai Chen, Muyun Yang et al.

NEURIPS 2025 • poster • arXiv:2506.01748 • 5 citations
#4485

Denoising with a Joint-Embedding Predictive Architecture

Chen Dengsheng, Jie Hu, Xiaoming Wei et al.

ICLR 2025 • poster • arXiv:2410.03755 • 5 citations
#4486

Frame In-N-Out: Unbounded Controllable Image-to-Video Generation

Boyang Wang, Xuweiyi Chen, Matheus Gadelha et al.

NEURIPS 2025 • oral • arXiv:2505.21491 • 5 citations
#4487

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Bingquan Dai, Luo Li, Qihong Tang et al.

NEURIPS 2025 • poster • arXiv:2508.14879 • 5 citations
#4488

Unlocking Point Processes through Point Set Diffusion

David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh et al.

ICLR 2025 • oral • arXiv:2410.22493 • 5 citations
#4489

MetaBox-v2: A Unified Benchmark Platform for Meta-Black-Box Optimization

Zeyuan Ma, Yue-Jiao Gong, Hongshu Guo et al.

NEURIPS 2025 • poster • arXiv:2505.17745 • 5 citations
#4490

HeMoRa: Unsupervised Heuristic Consensus Sampling for Robust Point Cloud Registration

Shaocheng Yan, Yiming Wang, Kaiyan Zhao et al.

CVPR 2025 • poster • 5 citations
#4491

BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes

Minkyun Seo, Hyungtae Lim, Kanghee Lee et al.

ICCV 2025 • highlight • arXiv:2503.07940 • 5 citations
#4492

Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis

Yunwei Ren, Jason Lee

NEURIPS 2025 • poster • arXiv:2410.09678 • 5 citations
#4493

QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing

Grace Zhang, Ayush Jain, Injune Hwang et al.

ICLR 2025 • oral • arXiv:2302.00671 • 5 citations
#4494

SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

Xianzhe Fan, Xuhui Zhou, Chuanyang Jin et al.

NEURIPS 2025 • poster • arXiv:2506.23046 • 5 citations
#4495

Uncertainty-Aware Global-View Reconstruction for Multi-View Multi-Label Feature Selection

Pingting Hao, Kunpeng Liu, Wanfu Gao

AAAI 2025 • paper • arXiv:2503.14024 • 5 citations
#4496

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Yijie Tang, Jiazhao Zhang, Yuqing Lan et al.

CVPR 2025 • poster • arXiv:2503.01309 • 5 citations
#4497

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training

Will Merrill, Shane Arora, Dirk Groeneveld et al.

NEURIPS 2025 • spotlight • arXiv:2505.23971 • 5 citations
#4498

On scalable and efficient training of diffusion samplers

Minkyu Kim, Kiyoung Seong, Dongyeop Woo et al.

NEURIPS 2025 • poster • arXiv:2505.19552 • 5 citations
#4499

Learning Physics Informed Neural ODEs with Partial Measurements

Paul Ghanem, Ahmet Demirkaya, Tales Imbiriba et al.

AAAI 2025 • paper • arXiv:2412.08681 • 5 citations
#4500

Stochastic Process Learning via Operator Flow Matching

Yaozhong Shi, Zachary Ross, Domniki Asimaki et al.

NEURIPS 2025 • spotlight • arXiv:2501.04126 • 5 citations
#4501

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input

Jian Wang, Rishabh Dabral, Diogo Luvizon et al.

CVPR 2025 • poster • arXiv:2504.08449 • 5 citations
#4502

Bootstrapping Heterogeneous Graph Representation Learning via Large Language Models: A Generalized Approach

Hang Gao, Chenhao Zhang, Fengge Wu et al.

AAAI 2025 • paper • arXiv:2412.08038 • 5 citations
#4503

MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation

Kaixing Yang, Xulong Tang, Ziqiao Peng et al.

NEURIPS 2025 • poster • arXiv:2505.17543 • 5 citations
#4504

Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection

Gensheng Pei, Tao Chen, Yujia Wang et al.

CVPR 2025 • poster • arXiv:2503.17080 • 5 citations
#4505

EchoONE: Segmenting Multiple Echocardiography Planes in One Model

Jiongtong Hu, Wei Zhuo, Jun Cheng et al.

CVPR 2025 • poster • arXiv:2412.02993 • 5 citations
#4506

When Are Concepts Erased From Diffusion Models?

Kevin Lu, Nicky Kriplani, Rohit Gandikota et al.

NEURIPS 2025 • poster • arXiv:2505.17013 • 5 citations
#4507

Predicting Empirical AI Research Outcomes with Language Models

Jiaxin Wen, Chenglei Si, Yueh-Han Chen et al.

NEURIPS 2025 • poster • arXiv:2506.00794 • 5 citations
#4508

Smoothness Really Matters: A Simple Yet Effective Approach for Unsupervised Graph Domain Adaptation

Wei Chen, Guo Ye, Yakun Wang et al.

AAAI 2025 • paper • arXiv:2412.11654 • 5 citations
#4509

ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish

Jan-Matthis Lueckmann, Alexander Immer, Alex Chen et al.

ICLR 2025 • poster • arXiv:2503.02618 • 5 citations
#4510

Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching

Lei Yuan, Yuqi Bian, Lihe Li et al.

ICLR 2025 • oral • 5 citations
#4511

Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising

Yuchen Wang, Hongyuan Wang, Lizhi Wang et al.

CVPR 2025 • poster • arXiv:2412.16645 • 5 citations
#4512

Estimating Model Performance Under Covariate Shift Without Labels

Jakub Białek, Juhani Kivimäki, Wojciech Kuberski et al.

NEURIPS 2025 • poster • arXiv:2401.08348 • 5 citations
#4513

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Chenxin Tao, Shiqian Su, Xizhou Zhu et al.

CVPR 2025 • poster • arXiv:2412.16158 • 5 citations
#4514

AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models

Sohan Patnaik, Rishabh Jain, Balaji Krishnamurthy et al.

CVPR 2025 • poster • arXiv:2503.00591 • 5 citations
#4515

Omnidirectional Multi-Object Tracking

Kai Luo, Hao Shi, Sheng Wu et al.

CVPR 2025 • poster • arXiv:2503.04565 • 5 citations
#4516

DLF: Extreme Image Compression with Dual-generative Latent Fusion

Naifu Xue, Zhaoyang Jia, Jiahao Li et al.

ICCV 2025 • highlight • arXiv:2503.01428 • 5 citations
#4517

On the Value of Cross-Modal Misalignment in Multimodal Representation Learning

Yichao Cai, Yuhang Liu, Erdun Gao et al.

NEURIPS 2025 • spotlight • arXiv:2504.10143 • 5 citations
#4518

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models

Dingqiang Ye, Chao Fan, Zhanbo Huang et al.

NEURIPS 2025 • poster • arXiv:2505.18132 • 5 citations
#4519

Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions

Shuai Zhou, Shizhe Zhao, Zhongqiang Ren

AAAI 2025 • paper • arXiv:2412.11678 • 5 citations
#4520

Robust Message Embedding via Attention Flow-Based Steganography

Huayuan Ye, Shenzhuo Zhang, Shiqi Jiang et al.

CVPR 2025 • poster • arXiv:2405.16414 • 5 citations
#4521

One2Any: One-Reference 6D Pose Estimation for Any Object

Mengya Liu, Siyuan Li, Ajad Chhatkuli et al.

CVPR 2025 • poster • arXiv:2505.04109 • 5 citations
#4522

When Should We Prefer State-to-Visual DAgger over Visual Reinforcement Learning?

Tongzhou Mu, Zhaoyang Li, Stanisław Wiktor Strzelecki et al.

AAAI 2025 • paper • arXiv:2412.13662 • 5 citations
#4523

Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

Shahad Albastaki, Anabia Sohail, Iyyakutti Iyappan Ganapathi et al.

CVPR 2025 • poster • arXiv:2504.18856 • 5 citations
#4524

GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts

Minwen Liao, Hao Dong, Xinyi Wang et al.

ICCV 2025 • poster • arXiv:2503.07417 • 5 citations
#4525

Efficient Quadratic Corrections for Frank-Wolfe Algorithms

Jannis Halbey, Seta Rakotomandimby, Mathieu Besançon et al.

NEURIPS 2025 • poster • arXiv:2506.02635 • 5 citations
#4526

Distillation Robustifies Unlearning

Bruce W. Lee, Addie Foote, Alex Infanger et al.

NEURIPS 2025 • spotlight • arXiv:2506.06278 • 5 citations
#4527

Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes, Mehryar Mohri, Yutao Zhong

NEURIPS 2025 • poster • arXiv:2512.23947 • 5 citations
#4528

Learning Diffusion Models with Flexible Representation Guidance

Chenyu Wang, Cai Zhou, Sharut Gupta et al.

NEURIPS 2025 • poster • arXiv:2507.08980 • 5 citations
#4529

Detect Any Mirrors: Boosting Learning Reliability on Large-Scale Unlabeled Data with an Iterative Data Engine

Zhaohu Xing, Lihao Liu, Yijun Yang et al.

CVPR 2025 • poster • 5 citations
#4530

Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression

Hsiang-Wei Huang, Fu-Chen Chen, Wenhao Chai et al.

CVPR 2025 • poster • 5 citations
#4531

Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers

Divyansh Srivastava, Xiang Zhang, He Wen et al.

ICCV 2025 • poster • arXiv:2505.04718 • 5 citations
#4532

Proportional Representation in Practice: Quantifying Proportionality in Ordinal Elections

Tuva Bardal, Markus Brill, David McCune et al.

AAAI 2025 • paper • 5 citations
#4533

Addressing Cold-Start Problem in Click-Through Rate Prediction via Supervised Diffusion Modeling

Wenqiao Zhu, Lulu Wang, Jun Wu

AAAI 2025 • paper • arXiv:2504.06270 • 5 citations
#4534

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity

Qian Yu, Peng-Tao Jiang, Hao Zhang et al.

ICLR 2025 • poster • arXiv:2410.10105 • 5 citations
#4535

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025 • oral • 5 citations
#4536

MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging

Zihuan Qiu, Yi Xu, Chiyuan He et al.

NEURIPS 2025 • poster • arXiv:2505.11883 • 5 citations
#4537

Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling

Aram Davtyan, Leello Dadi, Volkan Cevher et al.

ICLR 2025 • poster • 5 citations
#4538

SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing

Mingfei Chen, Zijun Cui, Xiulong Liu et al.

NEURIPS 2025 • oral • arXiv:2506.05414 • 5 citations
#4539

Towards RAW Object Detection in Diverse Conditions

Zhong-Yu Li, Xin Jin, Bo-Yuan Sun et al.

CVPR 2025 • highlight • arXiv:2411.15678 • 5 citations
#4540

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025 • poster • arXiv:2407.16615 • 5 citations
#4541

Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis

Zixuan Wang, Duo Peng, Feng Chen et al.

CVPR 2025 • poster • arXiv:2504.01515 • 5 citations
#4542

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICLR 2025 • poster • arXiv:2503.03595 • 5 citations
#4543

Geo-Sign: Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation

Edward Fish, Richard Bowden

NEURIPS 2025 • oral • arXiv:2506.00129 • 5 citations
#4544

Progressive Compression with Universally Quantized Diffusion Models

Yibo Yang, Justus Will, Stephan Mandt

ICLR 2025 • poster • arXiv:2412.10935 • 5 citations
#4545

AdaFisher: Adaptive Second Order Optimization via Fisher Information

Damien Gomes, Yanlei Zhang, Eugene Belilovsky et al.

ICLR 2025 • poster • arXiv:2405.16397 • 5 citations
#4546

When Selection Meets Intervention: Additional Complexities in Causal Discovery

Haoyue Dai, Ignavier Ng, Jianle Sun et al.

ICLR 2025 • poster • arXiv:2503.07302 • 5 citations
#4547

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Mathurin Videau, Badr Youbi Idrissi, Alessandro Leite et al.

NEURIPS 2025 • poster • arXiv:2506.14761 • 5 citations
#4548

In-Context Learning and Occam's Razor

Eric Elmoznino, Tom Marty, Tejas Kasetty et al.

ICML 2025 • poster • arXiv:2410.14086 • 5 citations
#4549

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025 • poster • arXiv:2505.16441 • 5 citations
#4550

A Polarization-Aided Transformer for Image Deblurring via Motion Vector Decomposition

Duosheng Chen, Shihao Zhou, Jinshan Pan et al.

CVPR 2025 • highlight • 5 citations
#4551

Contextual Online Decision Making with Infinite-Dimensional Functional Regression

Haichen Hu, Rui Ai, Stephen Bates et al.

ICML 2025 • poster • arXiv:2501.18359 • 5 citations
#4552

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy

Kaixuan Xu, Jiajun Chai, Sicheng Li et al.

ICML 2025 • poster • arXiv:2506.09655 • 5 citations
#4553

Cross-modal Multi-task Learning for Multimedia Event Extraction

Jianwei Cao, Yanli Hu, Zhen Tan et al.

AAAI 2025 • paper • 5 citations
#4554

Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models

Haolang Lu, Yilian Liu, Jingxin Xu et al.

NEURIPS 2025 • poster • arXiv:2505.13143 • 5 citations
#4555

Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

Xihong Yang, Siwei Wang, Fangdi Wang et al.

ICML 2025 • spotlight • arXiv:2505.21387 • 5 citations
#4556

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Yuxin Wang, Maresa Schröder, Dennis Frauen et al.

ICLR 2025 • poster • arXiv:2412.11511 • 5 citations
#4557

One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception

Yuchen Xia, Quan Yuan, Guiyang Luo et al.

CVPR 2025 • poster • arXiv:2411.16799 • 5 citations
#4558

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data

Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez et al.

ICLR 2025 • poster • 5 citations
#4559

Variational Search Distributions

Dan Steinberg, Rafael Oliveira, Cheng Soon Ong et al.

ICLR 2025 • poster • arXiv:2409.06142 • 5 citations
#4560

Anti-Exposure Bias in Diffusion Models

Junyu Zhang, Daochang Liu, Eunbyung Park et al.

ICLR 2025 • poster • 5 citations
#4561

What should a neuron aim for? Designing local objective functions based on information theory

Andreas C. Schneider, Valentin Neuhaus, David Ehrlich et al.

ICLR 2025 • poster • arXiv:2412.02482 • 5 citations
#4562

ReNeg: Learning Negative Embedding with Reward Guidance

Xiaomin Li, Yixuan Liu, Takashi Isobe et al.

CVPR 2025 • highlight • arXiv:2412.19637 • 5 citations
#4563

AIpparel: A Multimodal Foundation Model for Digital Garments

Kiyohiro Nakayama, Jan Ackermann, Timur Levent Kesdogan et al.

CVPR 2025 • highlight • arXiv:2412.03937 • 5 citations
#4564

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Hui Yuan, Yifan Zeng, Yue Wu et al.

ICLR 2025 • poster • arXiv:2410.13828 • 5 citations
#4565

MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models

Jing Zhao, Heliang Zheng, Chaoyue Wang et al.

AAAI 2025 • paper • arXiv:2412.14902 • 5 citations
#4566

MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading

Wenhao Zhang, Jun Wang, Yong Luo et al.

AAAI 2025 • paper • arXiv:2404.11979 • 5 citations
#4567

LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits

Duy Nguyen, Archiki Prasad, Elias Stengel-Eskin et al.

NEURIPS 2025 • poster • arXiv:2410.01735 • 5 citations
#4568

EgoBridge: Domain Adaptation for Generalizable Imitation from Egocentric Human Data

Ryan Punamiya, Dhruv Patel, Patcharapong Aphiwetsa et al.

NEURIPS 2025 • poster • arXiv:2509.19626 • 5 citations
#4569

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Qi Zang, Jiayi Yang, Shuang Wang et al.

AAAI 2025 • paper • arXiv:2412.15541 • 5 citations
#4570

Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation

Zihan Wang, Seungjun Lee, Gim Hee Lee

NEURIPS 2025 • oral • arXiv:2505.11383 • 5 citations
#4571

What Matters in Data for DPO?

Yu Pan, Zhongze Cai, Huaiyang Zhong et al.

NEURIPS 2025 • poster • arXiv:2508.18312 • 5 citations
#4572

We Should Chart an Atlas of All the World's Models

Eliahu Horwitz, Nitzan Kurer, Jonathan Kahana et al.

NEURIPS 2025 • poster • arXiv:2503.10633 • 5 citations
#4573

AtomSurf: Surface Representation for Learning on Protein Structures

Vincent Mallet, Yangyang Miao, Souhaib Attaiki et al.

ICLR 2025 • poster • arXiv:2309.16519 • 5 citations
#4574

Risk-Controlling Model Selection via Guided Bayesian Optimization

Adam Fisch, Regina Barzilay, Bracha Laufer-Goldshtein et al.

ICLR 2025 • poster • arXiv:2312.01692 • 5 citations
#4575

LuxDiT: Lighting Estimation with Video Diffusion Transformer

Ruofan Liang, Kai He, Zan Gojcic et al.

NEURIPS 2025 • poster • arXiv:2509.03680 • 5 citations
#4576

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking

Zixiang Zhao, Haowen Bai, Bingxin Ke et al.

NEURIPS 2025 • oral • arXiv:2505.19858 • 5 citations
#4577

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Fan Wang, Juyong Jiang, Chansung Park et al.

ICLR 2025 • poster • arXiv:2412.06071 • 5 citations
#4578

Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection

Houzhang Fang, Xiaolin Wang, Zengyang Li et al.

CVPR 2025 • highlight • 5 citations
#4579

LUCAS: Layered Universal Codec Avatars

Di Liu, Teng Deng, Giljoo Nam et al.

CVPR 2025 • poster • arXiv:2502.19739 • 5 citations
#4580

Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification

Jiayu Jiang, Changxing Ding, Wentao Tan et al.

CVPR 2025 • highlight • arXiv:2503.09962 • 5 citations
#4581

KAC: Kolmogorov-Arnold Classifier for Continual Learning

Yusong Hu, Zichen Liang, Fei Yang et al.

CVPR 2025 • highlight • arXiv:2503.21076 • 5 citations
#4582

Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views

Yulun Wu, Han Huang, Wenyuan Zhang et al.

AAAI 2025 • paper • arXiv:2501.01196 • 5 citations
#4583

Topological Schrödinger Bridge Matching

Maosheng Yang

ICLR 2025 • poster • arXiv:2504.04799 • 5 citations
#4584

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025 • poster • arXiv:2502.01954 • 5 citations
#4585

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

John Gkountouras, Matthias Lindemann, Phillip Lippe et al.

ICLR 2025 • oral • arXiv:2410.19923 • 5 citations
#4586

True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics

Christoph Jürgen Hemmer, Daniel Durstewitz

NEURIPS 2025 • oral • arXiv:2505.13192 • 5 citations
#4587

Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures

Tim Seizinger, Florin-Alexandru Vasluianu, Marcos Conde et al.

ICCV 2025 • highlight • arXiv:2503.16067 • 5 citations
#4588

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025 • poster • arXiv:2503.00799 • 5 citations
#4589

Interpretable Image Classification via Non-parametric Part Prototype Learning

Zhijie Zhu, Lei Fan, Maurice Pagnucco et al.

CVPR 2025 • poster • arXiv:2503.10247 • 5 citations
#4590

Mosaic of Modalities: A Comprehensive Benchmark for Multimodal Graph Learning

Jing Zhu, Yuhang Zhou, Shengyi Qian et al.

CVPR 2025 • poster • arXiv:2406.16321 • 5 citations
#4591

Towards Improving Exploration through Sibling Augmented GFlowNets

Kanika Madan, Alex Lamb, Emmanuel Bengio et al.

ICLR 2025 • poster • 5 citations
#4592

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025 • poster • arXiv:2410.04120 • 5 citations
#4593

Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference

Weizhi Fei, Xueyan Niu, Xie Guoqing et al.

NEURIPS 2025 • spotlight • arXiv:2501.12959 • 5 citations
#4594

Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts

Qizhou Chen, Chengyu Wang, Dakan Wang et al.

CVPR 2025 • poster • arXiv:2411.15432 • 5 citations
#4595

Alligat0R: Pre-Training through Covisibility Segmentation for Relative Camera Pose Regression

Thibaut Loiseau, Guillaume Bourmaud, Vincent Lepetit

NEURIPS 2025 • spotlight • 5 citations
#4596

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model

Weilin Sun, Xinran Li, Manyi Li et al.

AAAI 2025 • paper • arXiv:2502.10675 • 5 citations
#4597

Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment

Yankai Jiang, Wenhui Lei, Xiaofan Zhang et al.

ICLR 2025 • poster • arXiv:2410.15744 • 5 citations
#4598

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025 • poster • arXiv:2503.15579 • 5 citations
#4599

A Solvable Attention for Neural Scaling Laws

Bochen Lyu, Di Wang, Zhanxing Zhu

ICLR 2025 • poster • 5 citations
#4600

TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection

Qiang Qi, Xiao Wang

AAAI 2025 • paper • arXiv:2503.13903 • 5 citations