Most Cited 2025 "underdamped langevin diffusion" Papers

22,274 papers found • Page 49 of 112

Filters:Most Cited 2025 underdamped langevin diffusion Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#9601

AGO: Adaptive Grounding for Open World 3D Occupancy Prediction

Peizheng Li, Shuxiao Ding, You Zhou et al.

ICCV 2025arXiv:2504.10117

citations

#9602

ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

Eric Xing, Pranavi Kolouju, Robert Pless et al.

CVPR 2025arXiv:2505.20764

citations

#9603

WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation

Silin Cheng, Yang Liu, Xinwei He et al.

CVPR 2025arXiv:2505.18686

citations

#9604

Local Dense Logit Relations for Enhanced Knowledge Distillation

Liuchi Xu, Kang Liu, Jinshuai Liu et al.

ICCV 2025arXiv:2507.15911

citations

#9605

Uni-LoRA: One Vector is All You Need

Kaiyang Li, Shaobo Han, Qing Su et al.

NEURIPS 2025spotlightarXiv:2506.00799

citations

#9606

Continual Release Moment Estimation with Differential Privacy

Nikita Kalinin, Jalaj Upadhyay, Christoph Lampert

NEURIPS 2025arXiv:2502.06597

citations

#9607

Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown

Emile Anand, Sarah Liaw

NEURIPS 2025arXiv:2507.15290

citations

#9608

SFDM: Robust Decomposition of Geometry and Reflectance for Realistic Face Rendering from Sparse-view Images

Daisheng Jin, Jiangbei Hu, Baixin Xu et al.

CVPR 2025arXiv:2312.06085

citations

#9609

Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis

Boming Miao, Chunxiao Li, Xiaoxiao Wang et al.

CVPR 2025arXiv:2411.16503

citations

#9610

FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

Hao Chen, Shell Xu Hu, Wayne Luk et al.

ICCV 2025arXiv:2503.12649

citations

#9611

FFR: Frequency Feature Rectification for Weakly Supervised Semantic Segmentation

Ziqian Yang, Xinqiao Zhao, Xiaolei Wang et al.

CVPR 2025

citations

#9612

TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition

yilong wang, Zilin Gao, Qilong Wang et al.

CVPR 2025arXiv:2411.19041

citations

#9613

GG-SSMs: Graph-Generating State Space Models

Nikola Zubic, Davide Scaramuzza

CVPR 2025

citations

#9614

Cheb-GR: Rethinking K-nearest Neighbor Search in Re-ranking for Person Re-identification

Jinxi Yang, He Li, Bo Du et al.

CVPR 2025

citations

#9615

Hierarchical Retrieval: The Geometry and a Pretrain-Finetune Recipe

Chong You, Rajesh Jayaram, Ananda Theertha Suresh et al.

NEURIPS 2025arXiv:2509.16411

citations

#9616

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Wentao Wang, Hang Ye, Fangzhou Hong et al.

NEURIPS 2025arXiv:2411.18624

citations

#9617

Learning to Instruct for Visual Instruction Tuning

Zhihan Zhou, Feng Hong, JIAAN LUO et al.

NEURIPS 2025arXiv:2503.22215

citations

#9618

DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Post-Capture Refocusing, Defocus Rendering and Blur Removal

Yujie Wang, Praneeth Chakravarthula, Baoquan Chen

CVPR 2025

citations

#9619

Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle

Miroslav Purkrabek, Jiri Matas

ICCV 2025arXiv:2412.01562

citations

#9620

UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References

Ming-Feng Li, Xin Yang, Fu-En Wang et al.

CVPR 2025arXiv:2506.07996

citations

#9621

3D-AVS: LiDAR-based 3D Auto-Vocabulary Segmentation

Weijie Wei, Osman Ülger, Fatemeh Karimi Nejadasl et al.

CVPR 2025arXiv:2406.09126

citations

#9622

Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?

Yihao Li, Saeed Salehi, Lyle Ungar et al.

NEURIPS 2025spotlightarXiv:2510.24709

citations

#9623

Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers

Quentin Guimard, Moreno D'Incà, Massimiliano Mancini et al.

CVPR 2025arXiv:2504.20902

citations

#9624

See Further When Clear: Curriculum Consistency Model

Yunpeng Liu, Boxiao Liu, Yi Zhang et al.

CVPR 2025arXiv:2412.06295

citations

#9625

SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation Learning

Weijian Mai, Jiamin Wu, Yu Zhu et al.

NEURIPS 2025arXiv:2508.10298

citations

#9626

RGB-to-Polarization Estimation: A New Task and Benchmark Study

Beibei Lin, Zifeng Yuan, Tingting Chen

NEURIPS 2025arXiv:2505.13050

citations

#9627

How Well Can Differential Privacy Be Audited in One Run?

Amit Keinan, Moshe Shenfeld, Katrina Ligett

NEURIPS 2025spotlightarXiv:2503.07199

citations

#9628

PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction

Eduard Poesina, Adriana Valentina Costache, Adrian-Gabriel Chifu et al.

CVPR 2025arXiv:2406.04746

citations

#9629

SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing

Sung-Hoon Yoon, Minghan Li, Gaspard Beaudouin et al.

NEURIPS 2025arXiv:2510.25970

citations

#9630

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

Kwan Yun, Seokhyeon Hong, Chaelin Kim et al.

CVPR 2025arXiv:2503.08417

citations

#9631

RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models

Yilang Zhang, Bingcong Li, Georgios Giannakis

NEURIPS 2025arXiv:2505.18877

citations

#9632

Unlearned but Not Forgotten: Data Extraction after Exact Unlearning in LLM

Xiaoyu Wu, Yifei Pang, Terrance Liu et al.

NEURIPS 2025arXiv:2505.24379

citations

#9633

GEOPARD: Geometric Pretraining for Articulation Prediction in 3D Shapes

Pradyumn Goyal, Dmitrii Petrov, Sheldon Andrews et al.

ICCV 2025arXiv:2504.02747

citations

#9634

SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios

Lingwei Dang, Ruizhi Shao, Hongwen Zhang et al.

NEURIPS 2025spotlightarXiv:2506.02444

citations

#9635

T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models

Jindong Yang, Han Fang, Weiming Zhang et al.

NEURIPS 2025arXiv:2510.22366

citations

#9636

EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching

Dongki Jung, Jaehoon Choi, Yonghan Lee et al.

CVPR 2025arXiv:2502.20685

citations

#9637

DAMamba: Vision State Space Model with Dynamic Adaptive Scan

Tanzhe Li, Caoshuo Li, Jiayi Lyu et al.

NEURIPS 2025arXiv:2502.12627

citations

#9638

DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery

Utkarsh Mall, Cheng Perng Phoo, Mia Chiquier et al.

CVPR 2025arXiv:2502.10060

citations

#9639

STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking

Sicheng Shen, Dongcheng Zhao, Linghao Feng et al.

NEURIPS 2025oralarXiv:2505.11151

citations

#9640

HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment

Armin Shafiee Sarvestani, Sheyang Tang, Zhou Wang

CVPR 2025arXiv:2412.01986

citations

#9641

MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation

Kerui Ren, Jiayang Bai, Linning Xu et al.

NEURIPS 2025arXiv:2505.21483

citations

#9642

Spectral Analysis of Representational Similarity with Limited Neurons

Hyunmo Kang, Abdulkadir Canatar, SueYeon Chung

NEURIPS 2025arXiv:2502.19648

citations

#9643

VITRIX-UniViTAR: Unified Vision Transformer with Native Resolution

Limeng Qiao, Yiyang Gan, Bairui Wang et al.

NEURIPS 2025oral

citations

#9644

ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism

Zedong Liu, Shenggan Cheng, Guangming Tan et al.

NEURIPS 2025oralarXiv:2507.10069

citations

#9645

FIction: 4D Future Interaction Prediction from Video

Kumar Ashutosh, Georgios Pavlakos, Kristen Grauman

CVPR 2025highlightarXiv:2412.00932

citations

#9646

TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression

Xinjie Wang, Yifan Zhang, Ting Liu et al.

CVPR 2025

citations

#9647

How To Make Your Cell Tracker Say "I dunno!"

Richard D Paul, Johannes Seiffarth, David Rügamer et al.

ICCV 2025

citations

#9648

MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning

Hongjia Liu, Rongzhen Zhao, Haohan Chen et al.

NEURIPS 2025arXiv:2505.20772

citations

#9649

Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning

Hongjoon Ahn, Heewoong Choi, Jisu Han et al.

NEURIPS 2025oralarXiv:2505.12737

citations

#9650

FACE: Faithful Automatic Concept Extraction

Dipkamal Bhusal, Michael Clifford, Sara Rampazzi et al.

NEURIPS 2025arXiv:2510.11675

citations

#9651

RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network

Van-Tin Luu, Yong-Lin Cai, Vu-Hoang Tran et al.

CVPR 2025arXiv:2505.22427

citations

#9652

Avoiding exp(R) scaling in RLHF through Preference-based Exploration

Mingyu Chen, Yiding Chen, Wen Sun et al.

NEURIPS 2025

citations

#9653

Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting

Zhaojie Zeng, Yuesong Wang, Chao Yang et al.

ICCV 2025arXiv:2506.23479

citations

#9654

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent

Shaojie Zhang, Ruoceng Zhang, Pei Fu et al.

NEURIPS 2025arXiv:2509.15566

citations

#9655

ATA: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting

Yizhe Tang, Zhimin Sun, Yuzhen Du et al.

CVPR 2025

citations

#9656

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Fatemeh Saleh, Sadegh Aliakbarian, Charlie Hewitt et al.

ICCV 2025arXiv:2507.15365

citations

#9657

TITAN: A Trajectory-Informed Technique for Adaptive Parameter Freezing in Large-Scale VQE

Yifeng Peng, Xinyi Li, Samuel Yen-Chi Chen et al.

NEURIPS 2025arXiv:2509.15193

citations

#9658

ReDi: Rectified Discrete Flow

Jaehoon Yoo, Wonjung Kim, Seunghoon Hong

NEURIPS 2025arXiv:2507.15897

citations

#9659

Hallucinatory Image Tokens: A Training-free EAZY Approach to Detecting and Mitigating Object Hallucinations in LVLMs

Liwei Che, Qingze T Liu, Jing Jia et al.

ICCV 2025arXiv:2503.07772

citations

#9660

SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding

Zhao Jin, Rong-Cheng Tu, Jingyi Liao et al.

NEURIPS 2025arXiv:2506.21924

citations

#9661

M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization

Ju-Hyeon Nam, Dong-Hyun Moon, Sang-Chul Lee

ICCV 2025highlightarXiv:2506.20922

citations

#9662

Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models

Taha Entesari, Arman Hatami, Rinat Khaziev et al.

NEURIPS 2025arXiv:2506.05314

citations

#9663

DFM: Differentiable Feature Matching for Anomaly Detection

Wu Sheng, Yimi Wang, Xudong Liu et al.

CVPR 2025

citations

#9664

Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach

Swetha Ganesh, Vaneet Aggarwal

NEURIPS 2025arXiv:2505.19986

citations

#9665

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Tianxiong Zhong, Xingye Tian, Boyuan Jiang et al.

NEURIPS 2025oralarXiv:2505.12053

citations

#9666

CrossAD: Time Series Anomaly Detection with Cross-scale Associations and Cross-window Modeling

Beibu Li, Qichao Shentu, Yang Shu et al.

NEURIPS 2025arXiv:2510.12489

citations

#9667

High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight

Cédric Vincent, Taehyoung Kim, Henri Meeß

CVPR 2025arXiv:2503.15676

citations

#9668

FedSPA: Generalizable Federated Graph Learning under Homophily Heterogeneity

Zihan Tan, Guancheng Wan, Wenke Huang et al.

CVPR 2025

citations

#9669

Faster and Better 3D Splatting via Group Training

Chengbo Wang, Guozheng Ma, Yizhen Lao et al.

ICCV 2025arXiv:2412.07608

citations

#9670

Towards Source-Free Machine Unlearning

Sk Miraj Ahmed, Umit Basaran, Dripta S. Raychaudhuri et al.

CVPR 2025arXiv:2508.15127

citations

#9671

Finsler Multi-Dimensional Scaling: Manifold Learning for Asymmetric Dimensionality Reduction and Embedding

Thomas Dagès, Simon Weber, Ya-Wei Eileen Lin et al.

CVPR 2025arXiv:2503.18010

citations

#9672

Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow

Ruyang Liu, Shangkun Sun, Haoran Tang et al.

ICCV 2025arXiv:2510.05836

citations

#9673

GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation

Wentao Hu, Shunkai Li, Ziqiao Peng et al.

ICCV 2025highlightarXiv:2506.21513

citations

#9674

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

Danhui Chen, Ziquan Liu, Chuxi Yang et al.

ICCV 2025arXiv:2507.15803

citations

#9675

End-to-End Multi-Modal Diffusion Mamba

Chunhao Lu, Qiang Lu, Meichen Dong et al.

ICCV 2025arXiv:2510.13253

citations

#9676

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Xiao-Hui Li, Fei Yin, Cheng-Lin Liu

CVPR 2025arXiv:2504.04085

citations

#9677

ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation

Ziyuan Luo, Yangyi Zhao, Ka Chun Cheung et al.

NEURIPS 2025arXiv:2510.12119

citations

#9678

THUNDER: Tile-level Histopathology image UNDERstanding benchmark

Pierre Marza, Leo Fillioux, Sofiène Boutaj et al.

NEURIPS 2025spotlightarXiv:2507.07860

citations

#9679

Underwater Visual SLAM with Depth Uncertainty and Medium Modeling

Rui Liu, Sheng Fan, Wenguan Wang et al.

ICCV 2025highlight

citations

#9680

Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis

Tim Büchner, Christoph Anders, Orlando Guntinas-Lichius et al.

CVPR 2025highlightarXiv:2503.09556

citations

#9681

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Zhiyuan Liang, Dongwen Tang, Yuhao Zhou et al.

NEURIPS 2025arXiv:2506.16406

citations

#9682

Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment

Chen Liu, Peike Li, Liying Yang et al.

CVPR 2025arXiv:2503.12847

citations

#9683

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

Yixiang Chen, Peiyan Li, Yan Huang et al.

ICCV 2025arXiv:2507.06224

citations

#9684

Flatness is Necessary, Neural Collapse is Not: Rethinking Generalization via Grokking

Ting Han, Linara Adilova, Henning Petzka et al.

NEURIPS 2025oralarXiv:2509.17738

citations

#9685

Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition

Junyi Wu, Yan Huang, Min Gao et al.

CVPR 2025highlight

citations

#9686

Learning from Synchronization: Self-Supervised Uncalibrated Multi-View Person Association in Challenging Scenes

Keqi Chen, vinkle srivastav, Didier MUTTER et al.

CVPR 2025arXiv:2503.13739

citations

#9687

GenM3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation

Junyu Shi, Lijiang LIU, Yong Sun et al.

ICCV 2025

citations

#9688

TRAP: Targeted Redirecting of Agentic Preferences

Hangoo Kang, Jehyeok Yeon, Gagandeep Singh

NEURIPS 2025arXiv:2505.23518

citations

#9689

Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering

Wenlong Fang, Qiaofeng Wu, Jing Chen et al.

CVPR 2025

citations

#9690

RTMap: Real-Time Recursive Mapping with Change Detection and Localization

Yuheng Du, Sheng Yang, Lingxuan Wang et al.

ICCV 2025arXiv:2507.00980

citations

#9691

Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning

Xinyao Liu, Diping Song

ICCV 2025arXiv:2507.17539

citations

#9692

Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning

Jiuyang Dong, Junjun Jiang, Kui Jiang et al.

CVPR 2025arXiv:2502.21130

citations

#9693

Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach

Dandan Liang, Jianing Zhang, Evan Chen et al.

NEURIPS 2025arXiv:2510.21155

citations

#9694

UniRes: Universal Image Restoration for Complex Degradations

Mo Zhou, Keren Ye, Mauricio Delbracio et al.

ICCV 2025arXiv:2506.05599

citations

#9695

FiRe: Fixed-points of Restoration Priors for Solving Inverse Problems

Matthieu Terris, Ulugbek Kamilov, Thomas Moreau

CVPR 2025arXiv:2411.18970

citations

#9696

Plug-and-Play Versatile Compressed Video Enhancement

Huimin Zeng, Jiacheng Li, Zhiwei Xiong

CVPR 2025arXiv:2504.15380

citations

#9697

SMMILE: An expert-driven benchmark for multimodal medical in-context learning

Melanie Rieff, Maya Varma, Ossian Rabow et al.

NEURIPS 2025arXiv:2506.21355

citations

#9698

Learning to price with resource constraints: from full information to machine-learned prices

Ruicheng Ao, Jiashuo Jiang, David Simchi-Levi

NEURIPS 2025arXiv:2501.14155

citations

#9699

StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion

Ziyu Guo, Young-Yoon Lee, Joseph Liu et al.

ICCV 2025arXiv:2503.21775

citations

#9700

HalLoc: Token-level Localization of Hallucinations for Vision Language Models

Eunkyu Park, Minyeong Kim, Gunhee Kim

CVPR 2025arXiv:2506.10286

citations

#9701

VAGUE: Visual Contexts Clarify Ambiguous Expressions

Heejeong Nam, Jinwoo Ahn, Keummin Ka et al.

ICCV 2025arXiv:2411.14137

citations

#9702

Action Detail Matters: Refining Video Recognition with Local Action Queries

Mengmeng Wang, Zeyi Huang, Xiangjie Kong et al.

CVPR 2025

citations

#9703

MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation

Sankalp Sinha, Mohammad Sadil Khan, Muhammad Usama et al.

CVPR 2025arXiv:2411.17945

citations

#9704

PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling

Minju Jo, Woojin Cho, Uvini Balasuriya Mudiyanselage et al.

NEURIPS 2025arXiv:2506.12790

citations

#9705

LightFair: Towards an Efficient Alternative for Fair T2I Diffusion via Debiasing Pre-trained Text Encoders

Boyu Han, Qianqian Xu, Shilong Bao et al.

NEURIPS 2025arXiv:2509.23639

citations

#9706

COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

Uliana Parkina, Maxim Rakhuba

NEURIPS 2025arXiv:2507.07580

citations

#9707

Attention! Your Vision Language Model Could Be Maliciously Manipulated

Xiaosen Wang, Shaokang Wang, Zhijin Ge et al.

NEURIPS 2025arXiv:2505.19911

citations

#9708

AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion

Yangyi Huang, Ye Yuan, Xueting Li et al.

ICCV 2025arXiv:2505.24877

citations

#9709

GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector

Zechuan Li, Hongshan Yu, Yihao Ding et al.

CVPR 2025arXiv:2503.15211

citations

#9710

DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution

Yuzhong Zhao, Feng Liu, Yue Liu et al.

CVPR 2025arXiv:2405.16071

citations

#9711

LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Faridoun Mehri, Mahdieh Baghshah, Mohammad Taher Pilehvar

CVPR 2025arXiv:2411.16760

citations

#9712

AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise

Dhruv Agarwal, Bodhisattwa Prasad Majumder, Reece Adamson et al.

NEURIPS 2025oralarXiv:2507.00310

citations

#9713

The Complexity of Symmetric Equilibria in Min-Max Optimization and Team Zero-Sum Games

Ioannis Anagnostides, Ioannis Panageas, Tuomas Sandholm et al.

NEURIPS 2025spotlightarXiv:2502.08519

citations

#9714

Generative Photomontage

Sean J. Liu, Nupur Kumari, Ariel Shamir et al.

CVPR 2025arXiv:2408.07116

citations

#9715

Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments

Riley Simmons-Edler, Ryan Badman, Felix Berg et al.

NEURIPS 2025oralarXiv:2506.06981

citations

#9716

GPS as a Control Signal for Image Generation

Chao Feng, Ziyang Chen, Aleksander Holynski et al.

CVPR 2025arXiv:2501.12390

citations

#9717

DMesh++: An Efficient Differentiable Mesh for Complex Shapes

Sanghyun Son, Matheus Gadelha, Yang Zhou et al.

ICCV 2025arXiv:2412.16776

citations

#9718

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Zinuo Li, Xian Zhang, Yongxin Guo et al.

NEURIPS 2025oralarXiv:2505.18110

citations

#9719

Who You Are Matters: Bridging Interests and Social Roles via LLM-Enhanced Logic Recommendation

Qing Yu, Xiaobei Wang, Shuchang Liu et al.

NEURIPS 2025oral

citations

#9720

Diving into the Fusion of Monocular Priors for Generalized Stereo Matching

Chengtang Yao, Lidong Yu, Zhidan Liu et al.

ICCV 2025arXiv:2505.14414

citations

#9721

I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models

Dongnan Gui, Xun Guo, Wengang Zhou et al.

CVPR 2025

citations

#9722

SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion

Xiyue Guo, Jiarui Hu, Junjie Hu et al.

CVPR 2025arXiv:2503.16825

citations

#9723

One Sample is Enough to Make Conformal Prediction Robust

Soroush H. Zargarbashi, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski

NEURIPS 2025arXiv:2506.16553

citations

#9724

DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

Jingyu Zhuang, Di Kang, Linchao Bao et al.

CVPR 2025arXiv:2411.15205

citations

#9725

Video Individual Counting for Moving Drones

Yaowu Fan, Jia Wan, Tao Han et al.

ICCV 2025highlightarXiv:2503.10701

citations

#9726

Moderating the Generalization of Score-based Generative Model

Wan Jiang, He Wang, Xin Zhang et al.

ICCV 2025arXiv:2412.07229

citations

#9727

SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning

Lanmiao Liu, Esam Ghaleb, asli ozyurek et al.

ICCV 2025arXiv:2507.19359

citations

#9728

Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting

Guangben Lu, Yuzhen N/A, Zhimin Sun et al.

ICCV 2025arXiv:2412.03812

citations

#9729

Rethinking Layered Graphic Design Generation with a Top-Down Approach

Jingye Chen, Zhaowen Wang, Nanxuan Zhao et al.

ICCV 2025arXiv:2507.05601

citations

#9730

Exact and Linear Convergence for Federated Learning under Arbitrary Client Participation is Attainable

Bicheng Ying, Zhe Li, Haibo Yang

NEURIPS 2025arXiv:2503.20117

citations

#9731

SceneCrafter: Controllable Multi-View Driving Scene Editing

Zehao Zhu, Yuliang Zou, Chiyu “Max” Jiang et al.

CVPR 2025arXiv:2506.19488

citations

#9732

Monitoring Risks in Test-Time Adaptation

Mona Schirmer, Metod Jazbec, Christian Andersson Naesseth et al.

NEURIPS 2025arXiv:2507.08721

citations

#9733

Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation

Yusuke Hirota, Ryo Hachiuma, Boyi Li et al.

ICCV 2025arXiv:2509.07596

citations

#9734

Traversal Verification for Speculative Tree Decoding

Yepeng Weng, Qiao Hu, Xujie Chen et al.

NEURIPS 2025arXiv:2505.12398

citations

#9735

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion

Songsong Yu, Yuxin Chen, Zhongang Qi et al.

CVPR 2025arXiv:2503.22262

citations

#9736

SHeaP: Self-supervised Head Geometry Predictor Learned via 2D Gaussians

Liam Schoneveld, Zhe Chen, Davide Davoli et al.

ICCV 2025arXiv:2504.12292

citations

#9737

TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis

Tri Ton, Ji Woo Hong, Chang Yoo

ICCV 2025arXiv:2504.05684

citations

#9738

You Think, You ACT: The New Task of Arbitrary Text to Motion Generation

Runqi Wang, Caoyuan Ma, Guopeng Li et al.

ICCV 2025arXiv:2404.14745

citations

#9739

VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions

Marko Mihajlovic, Siwei Zhang, Gen Li et al.

ICCV 2025highlightarXiv:2506.23236

citations

#9740

Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness

Beier Zhu, Jiequan Cui, Hanwang Zhang et al.

CVPR 2025highlightarXiv:2503.09487

citations

#9741

Local-Global Associative Frames for Symmetry-Preserving Crystal Structure Modeling

haowei hua, Wanyu Lin

NEURIPS 2025arXiv:2505.15315

citations

#9742

Constraint-Aware Feature Learning for Parametric Point Cloud

Xi Cheng, Ruiqi Lei, Di Huang et al.

ICCV 2025arXiv:2411.07747

citations

#9743

On the Robustness of Transformers against Context Hijacking for Linear Classification

Tianle Li, Chenyang Zhang, Xingwu Chen et al.

NEURIPS 2025arXiv:2502.15609

citations

#9744

RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case

Baihui Xiao, Chengjian Feng, Zhijian Huang et al.

ICCV 2025arXiv:2508.04642

citations

#9745

PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing

Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin et al.

ICCV 2025arXiv:2507.14826

citations

#9746

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation

Hao Zhang, Chun-Han Yao, Simon Donné et al.

NEURIPS 2025oralarXiv:2509.10687

citations

#9747

Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning

Maosen Zhao, Pengtao Chen, Chong Yu et al.

CVPR 2025arXiv:2505.21591

citations

#9748

NeuraLeaf: Neural Parametric Leaf Models with Shape and Deformation Disentanglement

Yang Yang, Dongni Mao, Hiroaki Santo et al.

ICCV 2025highlightarXiv:2507.12714

citations

#9749

Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling

Yinuo Wang, Yanbo Fan, Xuan Wang et al.

CVPR 2025highlight

citations

#9750

Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab

Haonan Duan, Stephen Lu, Caitlin F Harrigan et al.

NEURIPS 2025arXiv:2507.02083

citations

#9751

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter

Yaohua Zha, Yanzi Wang, Hang Guo et al.

CVPR 2025arXiv:2505.20941

citations

#9752

Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads

Yingjie Zhou, Jiezhang Cao, Zicheng Zhang et al.

ICCV 2025arXiv:2507.23343

citations

#9753

Revisiting Image Fusion for Multi-Illuminant White-Balance Correction

David Serrano, Aditya Arora, Luis Herranz et al.

ICCV 2025arXiv:2503.14774

citations

#9754

Breaking the Encoder Barrier for Seamless Video-Language Understanding

Handong Li, Yiyuan Zhang, Longteng Guo et al.

ICCV 2025arXiv:2503.18422

citations

#9755

Open-ended Hierarchical Streaming Video Understanding with Vision Language Models

Hyolim Kang, Yunsu Park, Youngbeom Yoo et al.

ICCV 2025arXiv:2509.12145

citations

#9756

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing

Niu Lian, Jun Li, Jinpeng Wang et al.

CVPR 2025arXiv:2504.03587

citations

#9757

SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency

Yangyang Guo, Mohan Kankanhalli

ICCV 2025arXiv:2411.09126

citations

#9758

Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction

Zhensheng Yuan, Haozhi Huang, Zhen Xiong et al.

ICCV 2025arXiv:2507.23006

citations

#9759

CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding

hongyong han, Wei Wang, Gaowei Zhang et al.

NEURIPS 2025oralarXiv:2507.10449

citations

#9760

Empowering Large Language Models with 3D Situation Awareness

Zhihao Yuan, Yibo Peng, Jinke Ren et al.

CVPR 2025arXiv:2503.23024

citations

#9761

CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection

Hanzhi Zhong, Zhiyu Xiang, Ruoyu Xu et al.

ICCV 2025arXiv:2507.04587

citations

#9762

X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

Weihao Yu, Yuanhao Cai, Ruyi Zha et al.

ICCV 2025

citations

#9763

Emergence of Linear Truth Encodings in Language Models

Shauli Ravfogel, Gilad Yehudai, Tal Linzen et al.

NEURIPS 2025arXiv:2510.15804

citations

#9764

Parameterized Blur Kernel Prior Learning for Local Motion Deblurring

Zhenxuan Fang, Fangfang Wu, Tao Huang et al.

CVPR 2025

citations

#9765

Compositional Caching for Training-free Open-vocabulary Attribute Detection

Marco Garosi, Alessandro Conti, Gaowen Liu et al.

CVPR 2025highlightarXiv:2503.19145

citations

#9766

Signs as Tokens: A Retrieval-Enhanced Multilingual Sign Language Generator

Ronglai Zuo, Rolandos Alexandros Potamias, Evangelos Ververas et al.

ICCV 2025arXiv:2411.17799

citations

#9767

Face Forgery Video Detection via Temporal Forgery Cue Unraveling

Zonghui Guo, YingJie Liu, Jie Zhang et al.

CVPR 2025

citations

#9768

Social Debiasing for Fair Multi-modal LLMs

Harry Cheng, Yangyang Guo, Qingpei Guo et al.

ICCV 2025arXiv:2408.06569

citations

#9769

CAP: Evaluation of Persuasive and Creative Image Generation

Aysan Aghazadeh, Adriana Kovashka

ICCV 2025arXiv:2412.10426

citations

#9770

GT-Loc: Unifying When and Where in Images through a Joint Embedding Space

David G. Shatwell, Ishan Rajendrakumar Dave, Swetha Sirnam et al.

ICCV 2025arXiv:2507.10473

citations

#9771

GraSS: Scalable Data Attribution with Gradient Sparsification and Sparse Projection

Pingbang Hu, Joseph Melkonian, Weijing Tang et al.

NEURIPS 2025arXiv:2505.18976

citations

#9772

I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions

Shuhong Liu, Lin Gu, Ziteng Cui et al.

NEURIPS 2025arXiv:2510.22161

citations

#9773

DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion

Maksim Siniukov, Di Chang, Minh Tran et al.

ICCV 2025arXiv:2504.04010

citations

#9774

Contrastive Representations for Temporal Reasoning

Alicja Ziarko, Michał Bortkiewicz, Michał Zawalski et al.

NEURIPS 2025oralarXiv:2508.13113

citations

#9775

SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model

Chongkai Yu, Ting Liu, Li Anqi et al.

CVPR 2025arXiv:2408.11535

citations

#9776

Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction

Jeffrey Willette, Heejun Lee, Sung Ju Hwang

NEURIPS 2025arXiv:2505.11254

citations

#9777

PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention

Weicheng Wang, Guoli Jia, Zhongqi Zhang et al.

CVPR 2025

citations

#9778

Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis

Hengyuan Cao, Yutong Feng, Biao Gong et al.

NEURIPS 2025oralarXiv:2505.23325

citations

#9779

Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling

Chao Zhou, Tianyi Wei, Nenghai Yu

ICCV 2025arXiv:2507.16240

citations

#9780

Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks

Artem Nikonorov, Georgy Perevozchikov, Andrei Korepanov et al.

ICCV 2025arXiv:2503.11781

citations

#9781

A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets

Zexi Jia, Chuanwei Huang, Yeshuang Zhu et al.

ICCV 2025arXiv:2507.04699

citations

#9782

Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations

Conghao Wong, Ziqian Zou, Beihao Xia

ICCV 2025arXiv:2412.02447

citations

#9783

CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning

Man Ho Lam, Chaozheng Wang, Jen-Tse Huang et al.

NEURIPS 2025arXiv:2504.14119

citations

#9784

On the creation of narrow AI: hierarchy and nonlocality of neural network skills

Eric Michaud, Asher Parker-Sartori, Max Tegmark

NEURIPS 2025arXiv:2505.15811

citations

#9785

Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration

Yuyang Hu, Kangfu Mei, Mojtaba Ardakani et al.

NEURIPS 2025arXiv:2507.05604

citations

#9786

DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

CVPR 2025arXiv:2503.00746

citations

#9787

Generalizable Object Re-Identification via Visual In-Context Prompting

Zhizhong Huang, Xiaoming Liu

ICCV 2025arXiv:2508.21222

citations

#9788

Is `Right' Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning

JiHyeok Jung, EunTae Kim, SeoYeon Kim et al.

CVPR 2025arXiv:2411.16761

citations

#9789

Grouped Speculative Decoding for Autoregressive Image Generation

Junhyuk So, Juncheol Shin, Hyunho Kook et al.

ICCV 2025arXiv:2508.07747

citations

#9790

Online Language Splatting

Saimouli Katragadda, Cho-Ying Wu, Yuliang Guo et al.

ICCV 2025arXiv:2503.09447

citations

#9791

PiKE: Adaptive Data Mixing for Large-Scale Multi-Task Learning Under Low Gradient Conflicts

Zeman Li, Yuan Deng, Peilin Zhong et al.

NEURIPS 2025spotlightarXiv:2502.06244

citations

#9792

Invisible Backdoor Attack against Self-supervised Learning

Hanrong Zhang, Zhenting Wang, Boheng Li et al.

CVPR 2025arXiv:2405.14672

citations

#9793

VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting

Hao Chen, Tao Han, Song Guo et al.

ICCV 2025arXiv:2412.02503

citations

#9794

HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis

Xiaoyuan Wang, Yizhou Zhao, Botao Ye et al.

NEURIPS 2025arXiv:2506.19291

citations

#9795

Topology-Aware Conformal Prediction for Stream Networks

Jifan Zhang, Fangxin Wang, Zihe Song et al.

NEURIPS 2025oralarXiv:2503.04981

citations

#9796

Spherical Manifold Guided Diffusion Model for Panoramic Image Generation

Xiancheng Sun, Mai Xu, Shengxi Li et al.

CVPR 2025

citations

#9797

Performative Validity of Recourse Explanations

Gunnar König, Hidde Fokkema, Timo Freiesleben et al.

NEURIPS 2025arXiv:2506.15366

citations

#9798

PARCO: Parallel AutoRegressive Models for Multi-Agent Combinatorial Optimization

Federico Berto, Chuanbo Hua, Laurin Luttmann et al.

NEURIPS 2025arXiv:2409.03811

citations

#9799

BlockScan: Detecting Anomalies in Blockchain Transactions

Jiahao Yu, Xian Wu, Hao Liu et al.

NEURIPS 2025arXiv:2410.04039

citations

#9800

Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers

Andrew Nam, Henry Conklin, Yukang Yang et al.

NEURIPS 2025arXiv:2505.13737

citations

← Previous

1...47 48 49 50 51...112