Most Cited 2025 "promptable foundation models" Papers

22,274 papers found • Page 78 of 112

#15401

Directional Label Diffusion Model for Learning from Noisy Labels

Senyu Hou, Gaoxia Jiang, Jia Zhang et al.

CVPR 2025

#15402

UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments

Dayong Su, Yafei Zhang, Huafeng Li et al.

ICCV 2025 • arXiv:2506.22736

#15403

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures

Elena Zamaraeva, Christopher Collins, George Darling et al.

NEURIPS 2025 • arXiv:2506.04195

#15404

Oracle-Efficient Combinatorial Semi-Bandits

Jung-hun Kim, Milan Vojnovic, Min-hwan Oh

NEURIPS 2025 • arXiv:2510.21431

#15405

VideoCAD: A Dataset and Model for Learning Long-Horizon 3D CAD UI Interactions from Video

King Yiu Brandon Man, Ghadi Nehme, Md Ferdous Alam et al.

NEURIPS 2025 • arXiv:2505.24838

#15406

Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression

Jiarui Jiang, Wei Huang, Miao Zhang et al.

NEURIPS 2025 • arXiv:2509.23779

#15407

Principles of Visual Tokens for Efficient Video Understanding

Xinyue Hao, Li, Shreyank Gowda et al.

ICCV 2025 • arXiv:2411.13626

#15408

FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers

Yanbing Zhang, Zhe Wang, Qin Zhou et al.

ICCV 2025 • arXiv:2507.15249

#15409

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

Fanqi Yan, Huy Nguyen, Le Dung et al.

NEURIPS 2025 • arXiv:2505.18455

#15410

On the Robustness Tradeoff in Fine-Tuning

Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley et al.

ICCV 2025 • arXiv:2503.14836

#15411

Impact of Layer Norm on Memorization and Generalization in Transformers

Rishi Singhal, Jung-Eun Kim

NEURIPS 2025 • arXiv:2511.10566

#15412

CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks

Zhixiang Guo, Siyuan Liang, Aishan Liu et al.

ICCV 2025 • arXiv:2412.01528

#15413

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Ke Zhang, Yi Huang, Wei Liu et al.

ICCV 2025 • arXiv:2504.07827

#15414

Linear Transformers Implicitly Discover Unified Numerical Algorithms

Patrick Lutz, Aditya Gangrade, Hadi Daneshmand et al.

NEURIPS 2025 • arXiv:2509.19702

#15415

Automaton Constrained Q-Learning

Anastasios Manganaris, Vittorio Giammarino, Ahmed Qureshi

NEURIPS 2025 • oral • arXiv:2510.05061

#15416

AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation

Xinbiao Wang, Yuxuan Du, Zihan Lou et al.

NEURIPS 2025 • arXiv:2509.26109

#15417

OMiSO: Adaptive optimization of state-dependent brain stimulation to shape neural population states

Yuki Minai, Joana Soldado-Magraner, Byron M Yu et al.

NEURIPS 2025 • arXiv:2507.07858

#15418

Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation

Nguyen Do, Bach Ngo, Youval Kashuv et al.

NEURIPS 2025 • arXiv:2510.17036

#15419

AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition

Parsa Rahimi, Damien Teney, Sébastien Marcel

NEURIPS 2025 • arXiv:2503.11544

#15420

GreenHyperSpectra: A multi-source hyperspectral dataset for global vegetation trait prediction

Eya Cherif, Arthur Ouaknine, Luke Brown et al.

NEURIPS 2025 • arXiv:2507.06806

#15421

SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors

Yufan Wu, Xuanhong Chen, Wen Li et al.

CVPR 2025

#15422

Neurons as Detectors of Coherent Sets in Sensory Dynamics

Joshua L Pughe-Sanford, Xuehao Ding, Jason Moore et al.

NEURIPS 2025 • oral • arXiv:2510.26955

#15423

Generative Caching for Structurally Similar Prompts and Responses

Sarthak Chakraborty, Suman Nath, Xuchao Zhang et al.

NEURIPS 2025 • arXiv:2511.17565

#15424

CaMuViD: Calibration-Free Multi-View Detection

Amir Etefaghi Daryani, M. Usman Maqbool Bhutta, Byron Hernandez et al.

CVPR 2025

#15425

Data Distributional Properties As Inductive Bias for Systematic Generalization

Felipe del Rio, Alain Raymond, Daniel Florea et al.

CVPR 2025 • arXiv:2502.20499

#15426

Dataset Ownership Verification for Pre-trained Masked Models

Yuechen Xie, Jie Song, Yicheng Shan et al.

ICCV 2025 • arXiv:2507.12022

#15427

ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion

Nissim Maruani, Wang Yifan, Matthew Fisher et al.

CVPR 2025 • arXiv:2502.02187

#15428

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi et al.

NEURIPS 2025 • spotlight • arXiv:2508.07208

#15429

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia, Christoph Lampert

NEURIPS 2025 • arXiv:2505.15496

#15430

Strategic Classification with Non-Linear Classifiers

Benyamin Trachtenberg, Nir Rosenfeld

NEURIPS 2025 • arXiv:2505.23443

#15431

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning

Pengkun Jiao, Bin Zhu, Jingjing Chen et al.

ICCV 2025 • arXiv:2411.12787

#15432

R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner

Ziyi Bai, Hanxuan Li, Bin Fu et al.

CVPR 2025

#15433

Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain

Trinity Chung, Yuchen Shen, Nathan Kong et al.

NEURIPS 2025 • oral • arXiv:2505.18361

#15434

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

Zheng Zhang, Guanchun Yin, Bo Zhang et al.

CVPR 2025

#15435

Probing Neural Combinatorial Optimization Models

Zhiqin Zhang, Yining Ma, Zhiguang Cao et al.

NEURIPS 2025 • spotlight • arXiv:2510.22131

#15436

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection

Weiqing He, Xiang Li, Tianqi Shang et al.

NEURIPS 2025 • spotlight • arXiv:2510.03944

#15437

Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization

Longshen Ou, Jingwei Zhao, Ziyu Wang et al.

NEURIPS 2025 • arXiv:2408.15176

#15438

Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation

Peng Ren, Tian Bai, Jing Sun et al.

ICCV 2025

#15439

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Zelai Xu, Ruize Zhang, Chao Yu et al.

NEURIPS 2025 • arXiv:2502.01932

#15440

Optimal community detection in dense bipartite graphs

Julien Chhor, Parker Knight

NEURIPS 2025 • arXiv:2505.18372

#15441

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

Yasar Utku Alcalar, Merve Gulle, Mehmet Akcakaya

NEURIPS 2025 • spotlight • arXiv:2411.13022

#15442

Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport

Taoran Zheng, Yan Yang, Xing Li et al.

NEURIPS 2025 • arXiv:2505.17644

#15443

Mitigating the Privacy–Utility Trade-off in Decentralized Federated Learning via f-Differential Privacy

Xiang Li, Chendi Wang, Buxin Su et al.

NEURIPS 2025 • spotlight

#15444

FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones

Manfred Georg, Garrett Tanzer, Esha Uboweja et al.

CVPR 2025 • arXiv:2407.15806

#15445

Sketchy Bounding-box Supervision for 3D Instance Segmentation

Qian Deng, Le Hui, Jin Xie et al.

CVPR 2025 • arXiv:2505.16399

#15446

Robust Distortion-Free Watermark for Autoregressive Audio Generation Models

Yihan Wu, Georgios Milis, Ruibo Chen et al.

NEURIPS 2025 • arXiv:2510.21115

#15447

RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects

Soumyaratna Debnath, Ashish Tiwari, Kaustubh Sadekar et al.

CVPR 2025 • arXiv:2504.02465

#15448

Opinion Maximization in Social Networks by Modifying Internal Opinions

Gengyu Wang, Runze Zhang, Zhongzhi Zhang

NEURIPS 2025 • arXiv:2510.17226

#15449

Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization

Shaohan Li, Yunpeng Shi, Gilad Lerman

NEURIPS 2025 • spotlight • arXiv:2511.02329

#15450

Spike-timing-dependent Hebbian learning as noisy gradient descent

Niklas Dexheimer, Sascha Gaudlitz, Johannes Schmidt-Hieber

NEURIPS 2025 • arXiv:2505.10272

#15451

Sequentially Auditing Differential Privacy

Tomás González Lara, Mateo Dulce Rubio, Aaditya Ramdas et al.

NEURIPS 2025 • arXiv:2509.07055

#15452

Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation

Joohyun Kwon, Hanbyel Cho, Junmo Kim

CVPR 2025 • arXiv:2502.02091

#15453

Robust Ego-Exo Correspondence with Long-Term Memory

Yijun Hu, Bing Fan, Xin Gu et al.

NEURIPS 2025 • arXiv:2510.11417

#15454

The third pillar of causal analysis? A measurement perspective on causal representations

Dingling Yao, Shimeng Huang, Riccardo Cadei et al.

NEURIPS 2025 • arXiv:2505.17708

#15455

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.

ICCV 2025 • arXiv:2507.15569

#15456

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

Shaofei Huang, Rui Ling, Tianrui Hui et al.

CVPR 2025 • arXiv:2506.23623

#15457

Balancing Gradient and Hessian Queries in Non-Convex Optimization

Deeksha Adil, Brian Bullins, Aaron Sidford et al.

NEURIPS 2025 • arXiv:2510.20786

#15458

Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs

Amirmohammad Izadi, Mohammadali Banayeeanzade, Fatemeh Askari et al.

NEURIPS 2025

#15459

How Many Domains Suffice for Domain Generalization? A Tight Characterization via the Domain Shattering Dimension

Cynthia Dwork, Lunjia Hu, Han Shao

NEURIPS 2025 • arXiv:2506.16704

#15460

Causal Climate Emulation with Bayesian Filtering

Sebastian H. M. Hickman, Ilija Trajković, Julia Kaltenborn et al.

NEURIPS 2025 • arXiv:2506.09891

#15461

CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D

Francis Ward, Teun van der Weij, Hanna Gábor et al.

NEURIPS 2025 • spotlight • arXiv:2511.09904

#15462

Tight Bounds on the Distortion of Randomized and Deterministic Distributed Voting

Mohammad Abam, Davoud Kareshki, Marzieh Nilipour et al.

NEURIPS 2025 • arXiv:2509.17134

#15463

Long-tailed Recognition with Model Rebalancing

Jiaan Luo, Feng Hong, Qiang Hu et al.

NEURIPS 2025 • arXiv:2510.08177

#15464

Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization

Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi

NEURIPS 2025 • oral • arXiv:2510.23485

#15465

Empower Words: DualGround for Structured Phrase and Sentence-Level Temporal Grounding

Minseok Kang, Minhyeok Lee, Minjung Kim et al.

NEURIPS 2025 • oral • arXiv:2510.20244

#15466

Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization

Dongkwan Lee, Kyomin Hwang, Nojun Kwak

CVPR 2025 • arXiv:2503.13915

#15467

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

Dong Li, Xujiang Zhao, Linlin Yu et al.

NEURIPS 2025 • arXiv:2510.16916

#15468

Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Indrashis Das, Mahmoud Safari, Steven Adriaensen et al.

NEURIPS 2025 • arXiv:2502.03654

#15469

Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment

Pengfei Zhao, Rongbo Luan, Wei Zhang et al.

NEURIPS 2025 • arXiv:2506.06970

#15470

ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation

Yuxuan Song, Zhe Zhang, Yu Pei et al.

NEURIPS 2025

#15471

MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition

Umberto Cappellazzo, Minsu Kim, Pingchuan Ma et al.

NEURIPS 2025 • arXiv:2510.04136

#15472

Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels

Chenyu Mu, Yijun Qu, Jiexi Yan et al.

ICCV 2025

#15473

Martingale Posterior Neural Networks for Fast Sequential Decision Making

Gerardo Duran-Martin, Leandro Sánchez-Betancourt, Alvaro Cartea et al.

NEURIPS 2025 • arXiv:2506.11898

#15474

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025 • arXiv:2504.04747

#15475

Estimating cognitive biases with attention-aware inverse planning

Sounak Banerjee, Daphne Cornelisse, Deepak Gopinath et al.

NEURIPS 2025 • spotlight • arXiv:2510.25951

#15476

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Jianwei Zhao, Xin Li, Fan Yang et al.

CVPR 2025 • arXiv:2503.12401

#15477

Towards Understanding Transformers in Learning Random Walks

Wei Shi, Yuan Cao

NEURIPS 2025 • arXiv:2511.23239

#15478

ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring

Xiaopeng LIN, Yulong Huang, Hongwei Ren et al.

ICCV 2025 • arXiv:2501.15808

#15479

Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings

Erel Naor, Ofir Lindenbaum

NEURIPS 2025 • arXiv:2511.06961

#15480

Decentralized Dynamic Cooperation of Personalized Models for Federated Continual Learning

Danni Yang, Zhikang Chen, Sen Cui et al.

NEURIPS 2025 • oral • arXiv:2509.23683

#15481

Value-Guided KV Compression for LLMs via Approximated CUR Decomposition

Ayan Sengupta, Siddhant Chaudhary, Tanmoy Chakraborty

NEURIPS 2025 • arXiv:2509.15038

#15482

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

Shuaiting Li, Juncan Deng, Chengxuan Wang et al.

ICCV 2025 • arXiv:2503.08668

#15483

Adaptive Riemannian ADMM for Nonsmooth Optimization: Optimal Complexity without Smoothing

Kangkang Deng, Jiachen Jin, Jiang Hu et al.

NEURIPS 2025 • arXiv:2510.18617

#15484

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models

Donghoon Ahn, Jiwon Kang, Sanghyun Lee et al.

NEURIPS 2025 • arXiv:2506.10978

#15485

Taxonomy of reduction matrices for Graph Coarsening

Antonin Joly, Nicolas Keriven, Aline Roumy

NEURIPS 2025 • arXiv:2506.11743

#15486

Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT

Guy Bar-Shalom, Fabrizio Frasca, Yaniv Galron et al.

NEURIPS 2025 • arXiv:2510.00296

#15487

Homogeneous Dynamics Space for Heterogeneous Humans

Xinpeng Liu, Junxuan Liang, Chenshuo Zhang et al.

CVPR 2025 • arXiv:2412.06146

#15488

Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN

Wei Huang, Hanchen Wang, Dong Wen et al.

NEURIPS 2025 • arXiv:2506.01977

#15489

CF3: Compact and Fast 3D Feature Fields

Hyunjoon Lee, Joonkyu Min, Jaesik Park

ICCV 2025 • arXiv:2508.05254

#15490

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

Xinye Cao, Hongcan Guo, Jiawen Qian et al.

ICCV 2025 • arXiv:2510.06040

#15491

Planning and Learning in Average Risk-aware MDPs

Weikai Wang, Erick Delage

NEURIPS 2025 • arXiv:2503.17629

#15492

UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions

Siyuan Yao, Rui Zhu, Ziqi Wang et al.

ICCV 2025 • arXiv:2507.00648

#15493

Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning

Li-Jun Zhao, Zhen-Duo Chen, Yongxin Wang et al.

CVPR 2025

#15494

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning

Jongchan Park, Mingyu Park, Donghwan Lee

NEURIPS 2025 • arXiv:2505.05701

#15495

GeoAvatar: Adaptive Geometrical Gaussian Splatting for 3D Head Avatar

SeungJun Moon, Hah Min Lew, Seungeun Lee et al.

ICCV 2025 • arXiv:2507.18155

#15496

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video

Xiao Li, Qi Chen, Xiulian Peng et al.

ICCV 2025 • arXiv:2509.08376

#15497

Fast Training of Large Kernel Models with Delayed Projections

Amirhesam Abedsoltan, Siyuan Ma, Parthe Pandit et al.

NEURIPS 2025 • spotlight • arXiv:2411.16658

#15498

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Yi Liu, Dianqing Liu, Mingye Zhu et al.

NEURIPS 2025 • arXiv:2505.19700

#15499

Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images

Elena Buglakova, Anwai Archit, Edoardo D'Imprima et al.

ICCV 2025 • highlight • arXiv:2503.19545

#15500

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo Kim et al.

ICCV 2025 • arXiv:2505.15304

#15501

Differentiable extensions with rounding guarantees for combinatorial optimization over permutations

Robert (Riley) Nerem, Zhishang Luo, Akbar Rafiey et al.

NEURIPS 2025 • arXiv:2411.10707

#15502

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Chen Tang, Xinzhu Ma, Encheng Su et al.

CVPR 2025 • arXiv:2503.20748

#15503

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Yinqi Cai, Jichang Li, Zhaolun Li et al.

ICCV 2025 • arXiv:2510.25237

#15504

Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

Matan Tsipory, Ran Levinstein, Itay Evron et al.

NEURIPS 2025 • arXiv:2510.19941

#15505

Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing

Seungjin Jung, Kanghee Lee, Yonghyun Jeong et al.

ICCV 2025 • arXiv:2507.04006

#15506

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al.

ICCV 2025 • arXiv:2512.14601

#15507

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

Xiaokun Sun, Zeyu Cai, Ying Tai et al.

ICCV 2025 • arXiv:2412.11586

#15508

Revisiting Agnostic Boosting

Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice et al.

NEURIPS 2025 • arXiv:2503.09384

#15509

SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs

Ruyue Liu, Rong Yin, Xiangzhen Bo et al.

NEURIPS 2025 • arXiv:2510.01248

#15510

T2Bs: Text-to-Character Blendshapes via Video Generation

Jiahao Luo, Chaoyang Wang, Michael Vasilkovsky et al.

ICCV 2025 • arXiv:2509.10678

#15511

ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users

Xiangyu Yin, Boyuan Yang, Weichen Liu et al.

ICCV 2025 • highlight • arXiv:2507.10223

#15512

When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners

Weixiang Zhao, Jiahe Guo, Yang Deng et al.

NEURIPS 2025 • spotlight • arXiv:2505.15257

#15513

LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.

ICCV 2025 • arXiv:2508.01152

#15514

Neural Attention Search

Difan Deng, Marius Lindauer

NEURIPS 2025 • arXiv:2502.13251

#15515

DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation

Haitao Tian

ICCV 2025 • arXiv:2509.05543

#15516

Class-wise Balancing Data Replay for Federated Class-Incremental Learning

Zhuang Qi, Ying-Peng Tang, Lei Meng et al.

NEURIPS 2025 • oral • arXiv:2507.07712

#15517

Occlusion-robust Stylization for Drawing-based 3D Animation

Sunjae Yoon, Gwanhyeong Koo, Younghwan Lee et al.

ICCV 2025 • arXiv:2508.00398

#15518

PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

Jenny Schmalfuss, Nadine Chang, Vibashan VS et al.

CVPR 2025 • arXiv:2506.14808

#15519

How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Saad, Ziad Al-Halah

ICCV 2025 • arXiv:2508.02905

#15520

HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing

Junseong Shin, Seungwoo Chung, Yunjeong Yang et al.

ICCV 2025 • arXiv:2509.18190

#15521

Concentration and excess risk bounds for imbalanced classification with synthetic oversampling

Touqeer Ahmad, Mohammadreza Mousavi Kalan, François Portier et al.

NEURIPS 2025 • arXiv:2510.20472

#15522

Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs

Xiangcheng Zhang, Yige Hong, Weina Wang

NEURIPS 2025 • spotlight • arXiv:2502.06072

#15523

IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution

Sejin Park, Sangmin Lee, Kyong Hwan Jin et al.

ICCV 2025 • arXiv:2507.09923

#15524

Towards Efficient General Feature Prediction in Masked Skeleton Modeling

Shengkai Sun, Zefan Zhang, Jianfeng Dong et al.

ICCV 2025 • arXiv:2509.03609

#15525

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

Haotian Dong, Xin WANG, Di Lin et al.

ICCV 2025 • arXiv:2504.18448

#15526

FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling

Jingting Li, Yu Qian, Lin Zhao et al.

ICCV 2025 • arXiv:2507.20557

#15527

PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Clinton A Mo, Kun Hu, Chengjiang Long et al.

ICCV 2025 • arXiv:2507.20170

#15528

VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing

Juan Luis Gonzalez Bello, Xu Yao, Alex Whelan et al.

CVPR 2025 • arXiv:2504.07146

#15529

Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark

Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.

CVPR 2025 • arXiv:2512.20174

#15530

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

Chen Liang, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025

#15531

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

ICCV 2025 • arXiv:2505.19148

#15532

The Quest for Universal Master Key Filters in DS-CNNs

Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.

NEURIPS 2025 • arXiv:2509.11711

#15533

ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy

Haejun Han, Hang Lu

ICCV 2025

#15534

VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Phu Tran Dinh, Hung Dao, Daeyoung Kim

ICCV 2025 • arXiv:2506.22762

#15535

AnimalClue: Recognizing Animals by their Traces

Risa Shinoda, Nakamasa Inoue, Iro Laina et al.

ICCV 2025 • highlight • arXiv:2507.20240

#15536

Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation

Uzay Gökay, Federico Spurio, Dominik Bach et al.

ICCV 2025 • arXiv:2508.04513

#15537

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models

Pingchuan Ma, Xiaopei Yang, Ming Gui et al.

ICCV 2025 • arXiv:2508.03402

#15538

Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies

Ziye Wang, Li Kang, Yiran Qin et al.

NEURIPS 2025 • arXiv:2511.00998

#15539

Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image

Shuang Xu, Zixiang Zhao, Haowen Bai et al.

ICCV 2025 • arXiv:2412.04201

#15540

EASEMVC: Efficient Dual Selection Mechanism for Deep Multi-View Clustering

Baili Xiao, Zhibin Dong, Ke Liang et al.

CVPR 2025

#15541

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, liqiang liqiang, Kaicheng Yang et al.

ICCV 2025 • arXiv:2507.19804

#15542

Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design

Lianghong Chen, Dongkyu Kim, Mike Domaratzki et al.

NEURIPS 2025 • arXiv:2510.21153

#15543

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu, Shengqian Li, Zuzeng Lin et al.

ICCV 2025 • arXiv:2506.23347

#15544

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma

ICCV 2025 • highlight • arXiv:2506.21233

#15545

ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology

Vishwesh Ramanathan, Tony Xu, Pushpak Pati et al.

ICCV 2025 • arXiv:2503.17564

#15546

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

Yu Wei, Jiahui Zhang, Xiaoqin Zhang et al.

ICCV 2025 • arXiv:2507.13891

#15547

Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization

Ziqi Wang, Jiashun Liu, Ling Pan

NEURIPS 2025 • arXiv:2511.01374

#15548

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025 • arXiv:2503.17539

#15549

Identifying Macro Causal Effects in C-DMGs over DMGs

Simon Ferreira, Charles Assaad

NEURIPS 2025 • arXiv:2506.19650

#15550

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025 • arXiv:2507.10340

#15551

Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration

Ting Lei, Shaofeng Yin, Qingchao Chen et al.

ICCV 2025 • arXiv:2508.03207

#15552

Robust Low-light Scene Restoration via Illumination Transition

Ze Li, Feng Zhang, Xiatian Zhu et al.

ICCV 2025 • arXiv:2507.03976

#15553

Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference

Eray Erturk, Maryam Shanechi

NEURIPS 2025 • oral • arXiv:2512.12462

#15554

Event-Driven Storytelling with Multiple Lifelike Humans in a 3D Scene

Donggeun Lim, Jinseok Bae, Inwoo Hwang et al.

ICCV 2025 • arXiv:2507.19232

#15555

Leveraging robust optimization for LLM alignment under distribution shifts

Mingye Zhu, Yi Liu, Zheren Fu et al.

NEURIPS 2025 • arXiv:2504.05831

#15556

Relaxing partition admissibility in Cluster-DAGs: a causal calculus with arbitrary variable clustering

Clément Yvernes, Emilie Devijver, Adèle Ribeiro et al.

NEURIPS 2025 • arXiv:2511.01396

#15557

TRACE: Contrastive learning for multi-trial time series data in neuroscience

Lisa Schmors, Dominic Gonschorek, Jan Niklas Böhm et al.

NEURIPS 2025 • arXiv:2506.04906

#15558

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025 • arXiv:2507.05256

#15559

Membership Inference Attacks with False Discovery Rate Control

Chenxu Zhao, Wei Qian, Aobo Chen et al.

ICCV 2025 • arXiv:2508.07066

#15560

Perturbation Bounds for Low-Rank Inverse Approximations under Noise

Phuc Tran, Nisheeth K. Vishnoi

NEURIPS 2025 • arXiv:2510.25571

#15561

VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs

Shmuel Berman, Jia Deng

NEURIPS 2025 • spotlight • arXiv:2507.13361

#15562

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025 • arXiv:2509.26231

#15563

Graph Diffusion that can Insert and Delete

Matteo Ninniri, Marco Podda, Davide Bacciu

NEURIPS 2025 • arXiv:2506.15725

#15564

Outlier-Aware Post-Training Quantization for Image Super-Resolution

Hailing Wang, Jianglin Lu, Yitian Zhang et al.

ICCV 2025 • highlight • arXiv:2511.00682

#15565

Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction

Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio et al.

ICCV 2025 • arXiv:2507.23021

#15566

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

Haoxuan Li, Ziya Erkoç, Lei Li et al.

ICCV 2025 • arXiv:2503.01425

#15567

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Kwanyoung Kim, Byeongsu Sim

ICCV 2025 • arXiv:2503.07677

#15568

PrimHOI: Compositional Human-Object Interaction via Reusable Primitives

Kai Jia, Tengyu Liu, Mingtao Pei et al.

ICCV 2025

#15569

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

Yanran Zhang, Bingyao Yu, Yu Zheng et al.

ICCV 2025

#15570

Blind Video Super-Resolution based on Implicit Kernels

Qiang Zhu, Yuxuan Jiang, Shuyuan Zhu et al.

ICCV 2025 • arXiv:2503.07856

#15571

OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning

Yuan Liu, Saihui Hou, Saijie Hou et al.

ICCV 2025 • arXiv:2503.11093

#15572

Referring Expression Comprehension for Small Objects

Kanoko Goto, Takumi Hirose, Mahiro Ukai et al.

ICCV 2025 • arXiv:2510.03701

#15573

Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation

Yue Zhang, Mingyue Bin, Yuyang Zhang et al.

CVPR 2025

#15574

PLMP - Point-Line Minimal Problems for Projective SfM

Kim Kiehn, Albin Ahlbäck, Kathlén Kohn

ICCV 2025 • highlight • arXiv:2503.04351

#15575

HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion

Lin Wu, Zhixiang Chen, Jianglin Lan

NEURIPS 2025 • arXiv:2507.01737

#15576

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution

Yong Liu, Hang Dong, Jinshan Pan et al.

ICCV 2025 • arXiv:2405.17158

#15577

Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion

Xingyu Hu, Junjun Jiang, Chenyang Wang et al.

ICCV 2025 • arXiv:2504.05164

#15578

SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition

Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.

ICCV 2025 • arXiv:2503.15986

#15579

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models

Christian Simon, Masato Ishii, Akio Hayakawa et al.

ICCV 2025 • arXiv:2508.00289

#15580

CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation

Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.

ICCV 2025 • arXiv:2509.01028

#15581

Deep Value Benchmark: Measuring Whether Models Generalize Deep Values or Shallow Preferences

Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.

NEURIPS 2025 • oral • arXiv:2511.02109

#15582

Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification

Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.

ICCV 2025

#15583

Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection

Reihaneh Zohrabi, Hosein Hasani, Mahdieh Soleymani et al.

NEURIPS 2025 • arXiv:2506.23881

#15584

Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning

Yichen Li, Xiuying Wang, Wenchao Xu et al.

NEURIPS 2025 • arXiv:2507.10348

#15585

Revisiting Generative Replay for Class Incremental Object Detection

Shizhou Zhang, Xueqiang Lv, Yinghui Xing et al.

CVPR 2025

#15586

DISCO: Disentangled Communication Steering for Large Language Models

Max Torop, Aria Masoomi, Masih Eskandar et al.

NEURIPS 2025 • arXiv:2509.16820

#15587

Latent Swap Joint Diffusion for 2D Long-Form Latent Generation

Yusheng Dai, Chenxi Wang, Chang Li et al.

ICCV 2025 • arXiv:2502.05130

#15588

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Runzhe Zhan, Zhihong Huang, Xinyi Yang et al.

NEURIPS 2025 • arXiv:2510.20780

#15589

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Yuxuan Wang, Tianwei Cao, Huayu Zhang et al.

ICCV 2025 • arXiv:2507.02714

#15590

Restoring Pruned Large Language Models via Lost Component Compensation

Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.

NEURIPS 2025 • spotlight • arXiv:2510.21834

#15591

How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?

Tuan Tran Anh, Duy M. H. Nguyen, Hoai-Chau Tran et al.

NEURIPS 2025 • arXiv:2511.05449

#15592

ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration

Andrea Conti, Matteo Poggi, Valerio Cambareri et al.

ICCV 2025 • arXiv:2504.16545

#15593

HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation

Lingxiao Li, Kaixuan Fan, Boqing Gong et al.

ICCV 2025 • arXiv:2411.17784

#15594

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Haoran Chen, Ping Wang, Zihan Zhou et al.

ICCV 2025 • arXiv:2503.07979

#15595

Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems

Ibrahim Alabdulmohsin, Xiaohua Zhai

NEURIPS 2025 • arXiv:2502.07503

#15596

Geometry-Aware Edge Pooling for Graph Neural Networks

Katharina Limbeck, Lydia Mezrag, Guy Wolf et al.

NEURIPS 2025 • arXiv:2506.11700

#15597

V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models

Jisoo Kim, Wooseok Seo, Junwan Kim et al.

ICCV 2025 • arXiv:2508.03254

#15598

From Euler to AI: Unifying Formulas for Mathematical Constants

Tomer Raz, Michael Shalyt, Elyasheev Leibtag et al.

NEURIPS 2025 • arXiv:2502.17533

#15599

Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection

Dat NGUYEN, Marcella Astrid, Anis Kacem et al.

ICCV 2025 • arXiv:2501.01184

#15600

Targeted Forgetting of Image Subgroups in CLIP Models

Zeliang Zhang, Gaowen Liu, Charles Fleming et al.

CVPR 2025 • arXiv:2506.03117
