Most Cited 2025 "promptable foundation models" Papers

22,274 papers found • Page 78 of 112

#15401

Directional Label Diffusion Model for Learning from Noisy Labels

Senyu Hou, Gaoxia Jiang, Jia Zhang et al.

CVPR 2025
1
citations
#15402

UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments

Dayong Su, Yafei Zhang, Huafeng Li et al.

ICCV 2025 • arXiv:2506.22736
1
citations
#15403

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures

Elena Zamaraeva, Christopher Collins, George Darling et al.

NEURIPS 2025 • arXiv:2506.04195
1
citations
#15404

Oracle-Efficient Combinatorial Semi-Bandits

Jung-hun Kim, Milan Vojnovic, Min-hwan Oh

NEURIPS 2025 • arXiv:2510.21431
1
citations
#15405

VideoCAD: A Dataset and Model for Learning Long‑Horizon 3D CAD UI Interactions from Video

King Yiu Brandon Man, Ghadi Nehme, Md Ferdous Alam et al.

NEURIPS 2025 • arXiv:2505.24838
1
citations
#15406

Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression

Jiarui Jiang, Wei Huang, Miao Zhang et al.

NEURIPS 2025 • arXiv:2509.23779
1
citations
#15407

Principles of Visual Tokens for Efficient Video Understanding

Xinyue Hao, Li, Shreyank Gowda et al.

ICCV 2025 • arXiv:2411.13626
1
citations
#15408

FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers

Yanbing Zhang, Zhe Wang, Qin Zhou et al.

ICCV 2025 • arXiv:2507.15249
1
citations
#15409

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

Fanqi Yan, Huy Nguyen, Le Dung et al.

NEURIPS 2025 • arXiv:2505.18455
1
citations
#15410

On the Robustness Tradeoff in Fine-Tuning

Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley et al.

ICCV 2025 • arXiv:2503.14836
1
citations
#15411

Impact of Layer Norm on Memorization and Generalization in Transformers

Rishi Singhal, Jung-Eun Kim

NEURIPS 2025 • arXiv:2511.10566
1
citations
#15412

CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks

Zhixiang Guo, Siyuan Liang, Aishan Liu et al.

ICCV 2025 • arXiv:2412.01528
1
citations
#15413

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Ke Zhang, Yi Huang, Wei Liu et al.

ICCV 2025 • arXiv:2504.07827
1
citations
#15414

Linear Transformers Implicitly Discover Unified Numerical Algorithms

Patrick Lutz, Aditya Gangrade, Hadi Daneshmand et al.

NEURIPS 2025 • arXiv:2509.19702
1
citations
#15415

Automaton Constrained Q-Learning

Anastasios Manganaris, Vittorio Giammarino, Ahmed Qureshi

NEURIPS 2025 • oral • arXiv:2510.05061
1
citations
#15416

AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation

Xinbiao Wang, Yuxuan Du, Zihan Lou et al.

NEURIPS 2025 • arXiv:2509.26109
1
citations
#15417

OMiSO: Adaptive optimization of state-dependent brain stimulation to shape neural population states

Yuki Minai, Joana Soldado-Magraner, Byron M Yu et al.

NEURIPS 2025 • arXiv:2507.07858
1
citations
#15418

Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation

Nguyen Do, Bach Ngo, Youval Kashuv et al.

NEURIPS 2025 • arXiv:2510.17036
1
citations
#15419

AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition

Parsa Rahimi, Damien Teney, Sébastien Marcel

NEURIPS 2025 • arXiv:2503.11544
1
citations
#15420

GreenHyperSpectra: A multi-source hyperspectral dataset for global vegetation trait prediction

Eya Cherif, Arthur Ouaknine, Luke Brown et al.

NEURIPS 2025 • arXiv:2507.06806
1
citations
#15421

SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors

Yufan Wu, Xuanhong Chen, Wen Li et al.

CVPR 2025
1
citations
#15422

Neurons as Detectors of Coherent Sets in Sensory Dynamics

Joshua L Pughe-Sanford, Xuehao Ding, Jason Moore et al.

NEURIPS 2025 • oral • arXiv:2510.26955
1
citations
#15423

Generative Caching for Structurally Similar Prompts and Responses

Sarthak Chakraborty, Suman Nath, Xuchao Zhang et al.

NEURIPS 2025 • arXiv:2511.17565
1
citations
#15424

CaMuViD: Calibration-Free Multi-View Detection

Amir Etefaghi Daryani, M. Usman Maqbool Bhutta, Byron Hernandez et al.

CVPR 2025
1
citations
#15425

Data Distributional Properties As Inductive Bias for Systematic Generalization

Felipe del Rio, Alain Raymond, Daniel Florea et al.

CVPR 2025 • arXiv:2502.20499
1
citations
#15426

Dataset Ownership Verification for Pre-trained Masked Models

Yuechen Xie, Jie Song, Yicheng Shan et al.

ICCV 2025 • arXiv:2507.12022
1
citations
#15427

ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion

Nissim Maruani, Wang Yifan, Matthew Fisher et al.

CVPR 2025 • arXiv:2502.02187
1
citations
#15428

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi et al.

NEURIPS 2025 • spotlight • arXiv:2508.07208
1
citations
#15429

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia, Christoph Lampert

NEURIPS 2025 • arXiv:2505.15496
1
citations
#15430

Strategic Classification with Non-Linear Classifiers

Benyamin Trachtenberg, Nir Rosenfeld

NEURIPS 2025 • arXiv:2505.23443
1
citations
#15431

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning

Pengkun Jiao, Bin Zhu, Jingjing Chen et al.

ICCV 2025 • arXiv:2411.12787
1
citations
#15432

R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner

Ziyi Bai, Hanxuan Li, Bin Fu et al.

CVPR 2025
1
citations
#15433

Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain

Trinity Chung, Yuchen Shen, Nathan Kong et al.

NEURIPS 2025 • oral • arXiv:2505.18361
1
citations
#15434

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

Zheng Zhang, Guanchun Yin, Bo Zhang et al.

CVPR 2025
1
citations
#15435

Probing Neural Combinatorial Optimization Models

Zhiqin Zhang, Yining Ma, Zhiguang Cao et al.

NEURIPS 2025 • spotlight • arXiv:2510.22131
1
citations
#15436

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection

Weiqing He, Xiang Li, Tianqi Shang et al.

NEURIPS 2025 • spotlight • arXiv:2510.03944
1
citations
#15437

Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization

Longshen Ou, Jingwei Zhao, Ziyu Wang et al.

NEURIPS 2025 • arXiv:2408.15176
1
citations
#15438

Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation

Peng Ren, Tian Bai, Jing Sun et al.

ICCV 2025
1
citations
#15439

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Zelai Xu, Ruize Zhang, Chao Yu et al.

NEURIPS 2025 • arXiv:2502.01932
1
citations
#15440

Optimal community detection in dense bipartite graphs

Julien Chhor, Parker Knight

NEURIPS 2025 • arXiv:2505.18372
1
citations
#15441

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

Yasar Utku Alcalar, Merve Gulle, Mehmet Akcakaya

NEURIPS 2025 • spotlight • arXiv:2411.13022
1
citations
#15442

Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport

Taoran Zheng, Yan Yang, Xing Li et al.

NEURIPS 2025 • arXiv:2505.17644
1
citations
#15443

Mitigating the Privacy–Utility Trade-off in Decentralized Federated Learning via f-Differential Privacy

Xiang Li, Chendi Wang, Buxin Su et al.

NEURIPS 2025 • spotlight
1
citations
#15444

FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones

Manfred Georg, Garrett Tanzer, Esha Uboweja et al.

CVPR 2025 • arXiv:2407.15806
1
citations
#15445

Sketchy Bounding-box Supervision for 3D Instance Segmentation

Qian Deng, Le Hui, Jin Xie et al.

CVPR 2025 • arXiv:2505.16399
1
citations
#15446

Robust Distortion-Free Watermark for Autoregressive Audio Generation Models

Yihan Wu, Georgios Milis, Ruibo Chen et al.

NEURIPS 2025 • arXiv:2510.21115
1
citations
#15447

RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects

Soumyaratna Debnath, Ashish Tiwari, Kaustubh Sadekar et al.

CVPR 2025 • arXiv:2504.02465
1
citations
#15448

Opinion Maximization in Social Networks by Modifying Internal Opinions

Gengyu Wang, Runze Zhang, Zhongzhi Zhang

NEURIPS 2025 • arXiv:2510.17226
1
citations
#15449

Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization

Shaohan Li, Yunpeng Shi, Gilad Lerman

NEURIPS 2025 • spotlight • arXiv:2511.02329
1
citations
#15450

Spike-timing-dependent Hebbian learning as noisy gradient descent

Niklas Dexheimer, Sascha Gaudlitz, Johannes Schmidt-Hieber

NEURIPS 2025 • arXiv:2505.10272
1
citations
#15451

Sequentially Auditing Differential Privacy

Tomás González Lara, Mateo Dulce Rubio, Aaditya Ramdas et al.

NEURIPS 2025 • arXiv:2509.07055
1
citations
#15452

Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation

Joohyun Kwon, Hanbyel Cho, Junmo Kim

CVPR 2025 • arXiv:2502.02091
1
citations
#15453

Robust Ego-Exo Correspondence with Long-Term Memory

Yijun Hu, Bing Fan, Xin Gu et al.

NEURIPS 2025 • arXiv:2510.11417
1
citations
#15454

The third pillar of causal analysis? A measurement perspective on causal representations

Dingling Yao, Shimeng Huang, Riccardo Cadei et al.

NEURIPS 2025 • arXiv:2505.17708
1
citations
#15455

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.

ICCV 2025 • arXiv:2507.15569
1
citations
#15456

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

Shaofei Huang, Rui Ling, Tianrui Hui et al.

CVPR 2025 • arXiv:2506.23623
1
citations
#15457

Balancing Gradient and Hessian Queries in Non-Convex Optimization

Deeksha Adil, Brian Bullins, Aaron Sidford et al.

NEURIPS 2025 • arXiv:2510.20786
1
citations
#15458

Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs

Amirmohammad Izadi, Mohammadali Banayeeanzade, Fatemeh Askari et al.

NEURIPS 2025
1
citations
#15459

How Many Domains Suffice for Domain Generalization? A Tight Characterization via the Domain Shattering Dimension

Cynthia Dwork, Lunjia Hu, Han Shao

NEURIPS 2025 • arXiv:2506.16704
1
citations
#15460

Causal Climate Emulation with Bayesian Filtering

Sebastian H. M. Hickman, Ilija Trajković, Julia Kaltenborn et al.

NEURIPS 2025 • arXiv:2506.09891
1
citations
#15461

CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D

Francis Ward, Teun van der Weij, Hanna Gábor et al.

NEURIPS 2025 • spotlight • arXiv:2511.09904
1
citations
#15462

Tight Bounds on the Distortion of Randomized and Deterministic Distributed Voting

Mohammad Abam, Davoud Kareshki, Marzieh Nilipour et al.

NEURIPS 2025 • arXiv:2509.17134
1
citations
#15463

Long-tailed Recognition with Model Rebalancing

Jiaan Luo, Feng Hong, Qiang Hu et al.

NEURIPS 2025 • arXiv:2510.08177
1
citations
#15464

Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization

Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi

NEURIPS 2025 • oral • arXiv:2510.23485
1
citations
#15465

Empower Words: DualGround for Structured Phrase and Sentence-Level Temporal Grounding

Minseok Kang, Minhyeok Lee, Minjung Kim et al.

NEURIPS 2025 • oral • arXiv:2510.20244
1
citations
#15466

Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization

Dongkwan Lee, Kyomin Hwang, Nojun Kwak

CVPR 2025 • arXiv:2503.13915
1
citations
#15467

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

Dong Li, Xujiang Zhao, Linlin Yu et al.

NEURIPS 2025 • arXiv:2510.16916
1
citations
#15468

Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Indrashis Das, Mahmoud Safari, Steven Adriaensen et al.

NEURIPS 2025 • arXiv:2502.03654
1
citations
#15469

Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment

Pengfei Zhao, Rongbo Luan, Wei Zhang et al.

NEURIPS 2025 • arXiv:2506.06970
1
citations
#15470

ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation

Yuxuan Song, Zhe Zhang, Yu Pei et al.

NEURIPS 2025
1
citations
#15471

MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition

Umberto Cappellazzo, Minsu Kim, Pingchuan Ma et al.

NEURIPS 2025 • arXiv:2510.04136
1
citations
#15472

Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels

Chenyu Mu, Yijun Qu, Jiexi Yan et al.

ICCV 2025
1
citations
#15473

Martingale Posterior Neural Networks for Fast Sequential Decision Making

Gerardo Duran-Martin, Leandro Sánchez-Betancourt, Alvaro Cartea et al.

NEURIPS 2025 • arXiv:2506.11898
1
citations
#15474

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025 • arXiv:2504.04747
1
citations
#15475

Estimating cognitive biases with attention-aware inverse planning

Sounak Banerjee, Daphne Cornelisse, Deepak Gopinath et al.

NEURIPS 2025 • spotlight • arXiv:2510.25951
1
citations
#15476

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Jianwei Zhao, Xin Li, Fan Yang et al.

CVPR 2025 • arXiv:2503.12401
1
citations
#15477

Towards Understanding Transformers in Learning Random Walks

Wei Shi, Yuan Cao

NEURIPS 2025 • arXiv:2511.23239
1
citations
#15478

ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring

Xiaopeng Lin, Yulong Huang, Hongwei Ren et al.

ICCV 2025 • arXiv:2501.15808
1
citations
#15479

Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings

Erel Naor, Ofir Lindenbaum

NEURIPS 2025 • arXiv:2511.06961
1
citations
#15480

Decentralized Dynamic Cooperation of Personalized Models for Federated Continual Learning

Danni Yang, Zhikang Chen, Sen Cui et al.

NEURIPS 2025 • oral • arXiv:2509.23683
1
citations
#15481

Value-Guided KV Compression for LLMs via Approximated CUR Decomposition

Ayan Sengupta, Siddhant Chaudhary, Tanmoy Chakraborty

NEURIPS 2025 • arXiv:2509.15038
1
citations
#15482

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

Shuaiting Li, Juncan Deng, Chengxuan Wang et al.

ICCV 2025 • arXiv:2503.08668
1
citations
#15483

Adaptive Riemannian ADMM for Nonsmooth Optimization: Optimal Complexity without Smoothing

Kangkang Deng, Jiachen Jin, Jiang Hu et al.

NEURIPS 2025 • arXiv:2510.18617
1
citations
#15484

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models

Donghoon Ahn, Jiwon Kang, Sanghyun Lee et al.

NEURIPS 2025 • arXiv:2506.10978
1
citations
#15485

Taxonomy of reduction matrices for Graph Coarsening

Antonin Joly, Nicolas Keriven, Aline Roumy

NEURIPS 2025 • arXiv:2506.11743
1
citations
#15486

Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT

Guy Bar-Shalom, Fabrizio Frasca, Yaniv Galron et al.

NEURIPS 2025 • arXiv:2510.00296
1
citations
#15487

Homogeneous Dynamics Space for Heterogeneous Humans

Xinpeng Liu, Junxuan Liang, Chenshuo Zhang et al.

CVPR 2025 • arXiv:2412.06146
1
citations
#15488

Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN

Wei Huang, Hanchen Wang, Dong Wen et al.

NEURIPS 2025 • arXiv:2506.01977
1
citations
#15489

CF3: Compact and Fast 3D Feature Fields

Hyunjoon Lee, Joonkyu Min, Jaesik Park

ICCV 2025 • arXiv:2508.05254
1
citations
#15490

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

Xinye Cao, Hongcan Guo, Jiawen Qian et al.

ICCV 2025 • arXiv:2510.06040
1
citations
#15491

Planning and Learning in Average Risk-aware MDPs

Weikai Wang, Erick Delage

NEURIPS 2025 • arXiv:2503.17629
1
citations
#15492

UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions

Siyuan Yao, Rui Zhu, Ziqi Wang et al.

ICCV 2025 • arXiv:2507.00648
1
citations
#15493

Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning

Li-Jun Zhao, Zhen-Duo Chen, Yongxin Wang et al.

CVPR 2025
1
citations
#15494

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning

Jongchan Park, Mingyu Park, Donghwan Lee

NEURIPS 2025 • arXiv:2505.05701
1
citations
#15495

GeoAvatar: Adaptive Geometrical Gaussian Splatting for 3D Head Avatar

SeungJun Moon, Hah Min Lew, Seungeun Lee et al.

ICCV 2025 • arXiv:2507.18155
1
citations
#15496

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video

Xiao Li, Qi Chen, Xiulian Peng et al.

ICCV 2025 • arXiv:2509.08376
1
citations
#15497

Fast Training of Large Kernel Models with Delayed Projections

Amirhesam Abedsoltan, Siyuan Ma, Parthe Pandit et al.

NEURIPS 2025 • spotlight • arXiv:2411.16658
1
citations
#15498

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Yi Liu, Dianqing Liu, Mingye Zhu et al.

NEURIPS 2025 • arXiv:2505.19700
1
citations
#15499

Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images

Elena Buglakova, Anwai Archit, Edoardo D'Imprima et al.

ICCV 2025 • highlight • arXiv:2503.19545
1
citations
#15500

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo Kim et al.

ICCV 2025 • arXiv:2505.15304
1
citations
#15501

Differentiable extensions with rounding guarantees for combinatorial optimization over permutations

Robert (Riley) Nerem, Zhishang Luo, Akbar Rafiey et al.

NEURIPS 2025 • arXiv:2411.10707
1
citations
#15502

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Chen Tang, Xinzhu Ma, Encheng Su et al.

CVPR 2025 • arXiv:2503.20748
1
citations
#15503

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Yinqi Cai, Jichang Li, Zhaolun Li et al.

ICCV 2025 • arXiv:2510.25237
1
citations
#15504

Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

Matan Tsipory, Ran Levinstein, Itay Evron et al.

NEURIPS 2025 • arXiv:2510.19941
1
citations
#15505

Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing

Seungjin Jung, Kanghee Lee, Yonghyun Jeong et al.

ICCV 2025 • arXiv:2507.04006
1
citations
#15506

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al.

ICCV 2025 • arXiv:2512.14601
1
citations
#15507

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

Xiaokun Sun, Zeyu Cai, Ying Tai et al.

ICCV 2025 • arXiv:2412.11586
1
citations
#15508

Revisiting Agnostic Boosting

Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice et al.

NEURIPS 2025 • arXiv:2503.09384
1
citations
#15509

SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs

Ruyue Liu, Rong Yin, Xiangzhen Bo et al.

NEURIPS 2025 • arXiv:2510.01248
1
citations
#15510

T2Bs: Text-to-Character Blendshapes via Video Generation

Jiahao Luo, Chaoyang Wang, Michael Vasilkovsky et al.

ICCV 2025 • arXiv:2509.10678
1
citations
#15511

ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users

Xiangyu Yin, Boyuan Yang, Weichen Liu et al.

ICCV 2025 • highlight • arXiv:2507.10223
1
citations
#15512

When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners

Weixiang Zhao, Jiahe Guo, Yang Deng et al.

NEURIPS 2025 • spotlight • arXiv:2505.15257
1
citations
#15513

LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.

ICCV 2025 • arXiv:2508.01152
1
citations
#15514

Neural Attention Search

Difan Deng, Marius Lindauer

NEURIPS 2025 • arXiv:2502.13251
1
citations
#15515

DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation

Haitao Tian

ICCV 2025 • arXiv:2509.05543
1
citations
#15516

Class-wise Balancing Data Replay for Federated Class-Incremental Learning

Zhuang Qi, Ying-Peng Tang, Lei Meng et al.

NEURIPS 2025 • oral • arXiv:2507.07712
1
citations
#15517

Occlusion-robust Stylization for Drawing-based 3D Animation

Sunjae Yoon, Gwanhyeong Koo, Younghwan Lee et al.

ICCV 2025 • arXiv:2508.00398
1
citations
#15518

PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

Jenny Schmalfuss, Nadine Chang, Vibashan VS et al.

CVPR 2025 • arXiv:2506.14808
1
citations
#15519

How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Saad, Ziad Al-Halah

ICCV 2025 • arXiv:2508.02905
1
citations
#15520

HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing

Junseong Shin, Seungwoo Chung, Yunjeong Yang et al.

ICCV 2025 • arXiv:2509.18190
1
citations
#15521

Concentration and excess risk bounds for imbalanced classification with synthetic oversampling

Touqeer Ahmad, Mohammadreza Mousavi Kalan, François Portier et al.

NEURIPS 2025 • arXiv:2510.20472
1
citations
#15522

Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs

Xiangcheng Zhang, Yige Hong, Weina Wang

NEURIPS 2025 • spotlight • arXiv:2502.06072
1
citations
#15523

IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution

Sejin Park, Sangmin Lee, Kyong Hwan Jin et al.

ICCV 2025 • arXiv:2507.09923
1
citations
#15524

Towards Efficient General Feature Prediction in Masked Skeleton Modeling

Shengkai Sun, Zefan Zhang, Jianfeng Dong et al.

ICCV 2025 • arXiv:2509.03609
1
citations
#15525

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

Haotian Dong, Xin Wang, Di Lin et al.

ICCV 2025 • arXiv:2504.18448
1
citations
#15526

FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling

Jingting Li, Yu Qian, Lin Zhao et al.

ICCV 2025 • arXiv:2507.20557
1
citations
#15527

PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Clinton A Mo, Kun Hu, Chengjiang Long et al.

ICCV 2025 • arXiv:2507.20170
1
citations
#15528

VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing

Juan Luis Gonzalez Bello, Xu Yao, Alex Whelan et al.

CVPR 2025 • arXiv:2504.07146
1
citations
#15529

Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark

Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.

CVPR 2025 • arXiv:2512.20174
1
citations
#15530

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

Chen Liang, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025
1
citations
#15531

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

ICCV 2025 • arXiv:2505.19148
1
citations
#15532

The Quest for Universal Master Key Filters in DS-CNNs

Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.

NEURIPS 2025 • arXiv:2509.11711
1
citations
#15533

ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy

Haejun Han, Hang Lu

ICCV 2025
1
citations
#15534

VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Phu Tran Dinh, Hung Dao, Daeyoung Kim

ICCV 2025 • arXiv:2506.22762
1
citations
#15535

AnimalClue: Recognizing Animals by their Traces

Risa Shinoda, Nakamasa Inoue, Iro Laina et al.

ICCV 2025 • highlight • arXiv:2507.20240
1
citations
#15536

Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation

Uzay Gökay, Federico Spurio, Dominik Bach et al.

ICCV 2025 • arXiv:2508.04513
1
citations
#15537

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models

Pingchuan Ma, Xiaopei Yang, Ming Gui et al.

ICCV 2025 • arXiv:2508.03402
1
citations
#15538

Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies

Ziye Wang, Li Kang, Yiran Qin et al.

NEURIPS 2025 • arXiv:2511.00998
1
citations
#15539

Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image

Shuang Xu, Zixiang Zhao, Haowen Bai et al.

ICCV 2025 • arXiv:2412.04201
1
citations
#15540

EASEMVC: Efficient Dual Selection Mechanism for Deep Multi-View Clustering

Baili Xiao, Zhibin Dong, Ke Liang et al.

CVPR 2025
1
citations
#15541

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, liqiang liqiang, Kaicheng Yang et al.

ICCV 2025 • arXiv:2507.19804
1
citations
#15542

Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design

Lianghong Chen, Dongkyu Kim, Mike Domaratzki et al.

NEURIPS 2025 • arXiv:2510.21153
1
citations
#15543

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu, Shengqian Li, Zuzeng Lin et al.

ICCV 2025 • arXiv:2506.23347
1
citations
#15544

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma

ICCV 2025 • highlight • arXiv:2506.21233
1
citations
#15545

ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology

Vishwesh Ramanathan, Tony Xu, Pushpak Pati et al.

ICCV 2025 • arXiv:2503.17564
1
citations
#15546

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

Yu Wei, Jiahui Zhang, Xiaoqin Zhang et al.

ICCV 2025 • arXiv:2507.13891
1
citations
#15547

Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization

Ziqi Wang, Jiashun Liu, Ling Pan

NEURIPS 2025 • arXiv:2511.01374
1
citations
#15548

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025 • arXiv:2503.17539
1
citations
#15549

Identifying Macro Causal Effects in C-DMGs over DMGs

Simon Ferreira, Charles Assaad

NEURIPS 2025 • arXiv:2506.19650
1
citations
#15550

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025 • arXiv:2507.10340
1
citations
#15551

Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration

Ting Lei, Shaofeng Yin, Qingchao Chen et al.

ICCV 2025 • arXiv:2508.03207
1
citations
#15552

Robust Low-light Scene Restoration via Illumination Transition

Ze Li, Feng Zhang, Xiatian Zhu et al.

ICCV 2025 • arXiv:2507.03976
1
citations
#15553

Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference

Eray Erturk, Maryam Shanechi

NEURIPS 2025 • oral • arXiv:2512.12462
1
citations
#15554

Event-Driven Storytelling with Multiple Lifelike Humans in a 3D Scene

Donggeun Lim, Jinseok Bae, Inwoo Hwang et al.

ICCV 2025 • arXiv:2507.19232
1
citations
#15555

Leveraging robust optimization for LLM alignment under distribution shifts

Mingye Zhu, Yi Liu, Zheren Fu et al.

NEURIPS 2025 • arXiv:2504.05831
1
citations
#15556

Relaxing partition admissibility in Cluster-DAGs: a causal calculus with arbitrary variable clustering

Clément Yvernes, Emilie Devijver, Adèle Ribeiro et al.

NEURIPS 2025 • arXiv:2511.01396
1
citations
#15557

TRACE: Contrastive learning for multi-trial time series data in neuroscience

Lisa Schmors, Dominic Gonschorek, Jan Niklas Böhm et al.

NEURIPS 2025 • arXiv:2506.04906
1
citations
#15558

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025 • arXiv:2507.05256
1
citations
#15559

Membership Inference Attacks with False Discovery Rate Control

Chenxu Zhao, Wei Qian, Aobo Chen et al.

ICCV 2025 • arXiv:2508.07066
1
citations
#15560

Perturbation Bounds for Low-Rank Inverse Approximations under Noise

Phuc Tran, Nisheeth K. Vishnoi

NEURIPS 2025 • arXiv:2510.25571
1
citations
#15561

VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs

Shmuel Berman, Jia Deng

NEURIPS 2025 • spotlight • arXiv:2507.13361
1
citations
#15562

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025 • arXiv:2509.26231
1
citations
#15563

Graph Diffusion that can Insert and Delete

Matteo Ninniri, Marco Podda, Davide Bacciu

NEURIPS 2025 • arXiv:2506.15725
1
citations
#15564

Outlier-Aware Post-Training Quantization for Image Super-Resolution

Hailing Wang, Jianglin Lu, Yitian Zhang et al.

ICCV 2025 • highlight • arXiv:2511.00682
1
citations
#15565

Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction

Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio et al.

ICCV 2025 • arXiv:2507.23021
1
citations
#15566

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

Haoxuan Li, Ziya Erkoç, Lei Li et al.

ICCV 2025 • arXiv:2503.01425
1
citations
#15567

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Kwanyoung Kim, Byeongsu Sim

ICCV 2025 • arXiv:2503.07677
1
citations
#15568

PrimHOI: Compositional Human-Object Interaction via Reusable Primitives

Kai Jia, Tengyu Liu, Mingtao Pei et al.

ICCV 2025
1
citations
#15569

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

Yanran Zhang, Bingyao Yu, Yu Zheng et al.

ICCV 2025
1
citations
#15570

Blind Video Super-Resolution based on Implicit Kernels

Qiang Zhu, Yuxuan Jiang, Shuyuan Zhu et al.

ICCV 2025 • arXiv:2503.07856
1
citations
#15571

OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning

Yuan Liu, Saihui Hou, Saijie Hou et al.

ICCV 2025 • arXiv:2503.11093
1
citations
#15572

Referring Expression Comprehension for Small Objects

Kanoko Goto, Takumi Hirose, Mahiro Ukai et al.

ICCV 2025 • arXiv:2510.03701
1
citations
#15573

Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation

Yue Zhang, Mingyue Bin, Yuyang Zhang et al.

CVPR 2025
1
citations
#15574

PLMP - Point-Line Minimal Problems for Projective SfM

Kim Kiehn, Albin Ahlbäck, Kathlén Kohn

ICCV 2025 • highlight • arXiv:2503.04351
1
citations
#15575

HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion

Lin Wu, Zhixiang Chen, Jianglin Lan

NEURIPS 2025 • arXiv:2507.01737
1
citations
#15576

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution

Yong Liu, Hang Dong, Jinshan Pan et al.

ICCV 2025 • arXiv:2405.17158
1
citations
#15577

Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion

Xingyu Hu, Junjun Jiang, Chenyang Wang et al.

ICCV 2025 • arXiv:2504.05164
1
citations
#15578

SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition

Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.

ICCV 2025 • arXiv:2503.15986
1
citations
#15579

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models

Christian Simon, Masato Ishii, Akio Hayakawa et al.

ICCV 2025 • arXiv:2508.00289
1
citations
#15580

CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation

Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.

ICCV 2025 • arXiv:2509.01028
1
citations
#15581

Deep Value Benchmark: Measuring Whether Models Generalize Deep Values or Shallow Preferences

Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.

NEURIPS 2025 • oral • arXiv:2511.02109
1
citations
#15582

Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification

Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.

ICCV 2025
1
citations
#15583

Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection

Reihaneh Zohrabi, Hosein Hasani, Mahdieh Soleymani et al.

NEURIPS 2025 • arXiv:2506.23881
1
citations
#15584

Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning

Yichen Li, Xiuying Wang, Wenchao Xu et al.

NEURIPS 2025 • arXiv:2507.10348
1
citations
#15585

Revisiting Generative Replay for Class Incremental Object Detection

Shizhou Zhang, Xueqiang Lv, Yinghui Xing et al.

CVPR 2025
1
citations
#15586

DISCO: Disentangled Communication Steering for Large Language Models

Max Torop, Aria Masoomi, Masih Eskandar et al.

NEURIPS 2025 • arXiv:2509.16820
1
citations
#15587

Latent Swap Joint Diffusion for 2D Long-Form Latent Generation

Yusheng Dai, Chenxi Wang, Chang Li et al.

ICCV 2025 • arXiv:2502.05130
1
citations
#15588

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Runzhe Zhan, Zhihong Huang, Xinyi Yang et al.

NEURIPS 2025 • arXiv:2510.20780
1
citations
#15589

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Yuxuan Wang, Tianwei Cao, Huayu Zhang et al.

ICCV 2025 • arXiv:2507.02714
1
citations
#15590

Restoring Pruned Large Language Models via Lost Component Compensation

Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.

NEURIPS 2025 • spotlight • arXiv:2510.21834
1
citations
#15591

How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?

Tuan Tran Anh, Duy M. H. Nguyen, Hoai-Chau Tran et al.

NEURIPS 2025 • arXiv:2511.05449
1
citations
#15592

ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration

Andrea Conti, Matteo Poggi, Valerio Cambareri et al.

ICCV 2025 • arXiv:2504.16545
1
citations
#15593

HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation

Lingxiao Li, Kaixuan Fan, Boqing Gong et al.

ICCV 2025 • arXiv:2411.17784
1
citations
#15594

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Haoran Chen, Ping Wang, Zihan Zhou et al.

ICCV 2025 • arXiv:2503.07979
1
citations
#15595

Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems

Ibrahim Alabdulmohsin, Xiaohua Zhai

NEURIPS 2025 • arXiv:2502.07503
1
citations
#15596

Geometry-Aware Edge Pooling for Graph Neural Networks

Katharina Limbeck, Lydia Mezrag, Guy Wolf et al.

NEURIPS 2025 • arXiv:2506.11700
1
citations
#15597

V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models

Jisoo Kim, Wooseok Seo, Junwan Kim et al.

ICCV 2025 • arXiv:2508.03254
1
citations
#15598

From Euler to AI: Unifying Formulas for Mathematical Constants

Tomer Raz, Michael Shalyt, Elyasheev Leibtag et al.

NEURIPS 2025 • arXiv:2502.17533
1
citations
#15599

Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection

Dat Nguyen, Marcella Astrid, Anis Kacem et al.

ICCV 2025 • arXiv:2501.01184
1
citations
#15600

Targeted Forgetting of Image Subgroups in CLIP Models

Zeliang Zhang, Gaowen Liu, Charles Fleming et al.

CVPR 2025 • arXiv:2506.03117
1
citations