Most Cited 2025 "numerical reconstruction" Papers

22,274 papers found • Page 104 of 112

#20601

Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging

Ibrahim Ethem Hamamci, Sezgin Er, Suprosanna Shit et al.

NEURIPS 2025posterarXiv:2510.20639
#20602

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders

jiajun cao, Yuan Zhang, Tao Huang et al.

CVPR 2025posterarXiv:2501.01709
#20603

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Yang Yue, Yulin Wang, Haojun Jiang et al.

CVPR 2025posterarXiv:2504.13065
#20604

PALQO: Physics-informed model for Accelerating Large-scale Quantum Optimization

Yiming Huang, Yajie Hao, Yuxuan Du et al.

NEURIPS 2025posterarXiv:2509.20733
#20605

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Jianing "Jed" Yang, Alexander Sax, Kevin Liang et al.

CVPR 2025posterarXiv:2501.13928
#20606

Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction

Dong Li, Wenqi Zhong, Wei Yu et al.

CVPR 2025posterarXiv:2505.16980
#20607

CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs

Sijia Chen, Xiaomin Li, mengxue zhang et al.

NEURIPS 2025posterarXiv:2505.11413
#20608

A Unified Image-Dense Annotation Generation Model for Underwater Scenes

Hongkai Lin, Dingkang Liang, Zhenghao Qi et al.

CVPR 2025posterarXiv:2503.21771
#20609

3D-SLNR: A Super Lightweight Neural Representation for Large-scale 3D Mapping

Chenhui Shi, Fulin Tang, Ning An et al.

CVPR 2025poster
#20610

STINR: Deciphering Spatial Transcriptomics via Implicit Neural Representation

Yisi Luo, Xile Zhao, Kai Ye et al.

CVPR 2025poster
#20611

Multi-Modal Contrastive Masked Autoencoders: A Two-Stage Progressive Pre-training Approach for RGBD Datasets

Muhammad Abdullah Jamal, Omid Mohareri

CVPR 2025poster
#20612

Font-Agent: Enhancing Font Understanding with Large Language Models

Yingxin Lai, Cuijie Xu, Haitian Shi et al.

CVPR 2025poster
#20613

RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models

Greg Heinrich, Mike Ranzinger, Danny Yin et al.

CVPR 2025posterarXiv:2412.07679
#20614

Stabilizing and Accelerating Autofocus with Expert Trajectory Regularized Deep Reinforcement Learning

Shouhang Zhu, Chenglin Li, Yuankun Jiang et al.

CVPR 2025poster
#20615

Flow Field Reconstruction with Sensor Placement Policy Learning

Ruoyan Li, Guancheng Wan, Zijie Huang et al.

NEURIPS 2025poster
#20616

GeoMM: On Geodesic Perspective for Multi-modal Learning

Shibin Mei, Hang Wang, Bingbing Ni

CVPR 2025posterarXiv:2505.11216
#20617

Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy

Zesen Cheng, Hang Zhang, Kehan Li et al.

CVPR 2025highlight
#20618

Optimal Minimum Width for the Universal Approximation of Continuously Differentiable Functions by Deep Narrow MLPs

Geonho Hwang

NEURIPS 2025poster
#20619

GeoAvatar: Geometrically-Consistent Multi-Person Avatar Reconstruction from Sparse Multi-View Videos

Soohyun Lee, SeoYeon Kim, HeeKyung Lee et al.

CVPR 2025poster
#20620

Reliable Lifelong Multimodal Editing: Conflict-Aware Retrieval Meets Multi-Level Guidance

Qiang Zhang, Fanrui Zhang, Jiawei Liu et al.

NEURIPS 2025poster
#20621

DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration

Hebaixu Wang, Jing Zhang, Haonan Guo et al.

NEURIPS 2025posterarXiv:2504.21487
#20622

Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion

Saad Lahlali, Sandra Kara, Hejer AMMAR et al.

CVPR 2025posterarXiv:2503.15022
#20623

MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations

Kyungho Bae, Jinhyung Kim, Sihaeng Lee et al.

CVPR 2025highlightarXiv:2503.15871
#20624

QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction

Sicheng Zuo, Wenzhao Zheng, Xiaoyong Han et al.

NEURIPS 2025posterarXiv:2506.10977
#20625

DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models

Simone Carnemolla, Matteo Pennisi, Sarinda Samarasinghe et al.

NEURIPS 2025spotlightarXiv:2510.14741
#20626

Wasserstein Convergence of Critically Damped Langevin Diffusions

Stanislas Strasman, Sobihan Surendran, Claire Boyer et al.

NEURIPS 2025posterarXiv:2511.02419
#20627

Robust learning of halfspaces under log-concave marginals

Jane Lange, Arsen Vasilyan

NEURIPS 2025spotlightarXiv:2505.13708
#20628

GPVK-VL: Geometry-Preserving Virtual Keyframes for Visual Localization under Large Viewpoint Changes

Yunxuan Li, Lei Fan, Xiaoying Xing et al.

CVPR 2025poster
#20629

Be More Specific: Evaluating Object-centric Realism in Synthetic Images

Anqi Liang, Ciprian Adrian Corneanu, Qianli Feng et al.

CVPR 2025poster
#20630

Adaptive Gradient Masking for Balancing ID and MLLM-based Representations in Recommendation

Yidong Wu, Siyuan Chen, Binrui Wu et al.

NEURIPS 2025poster
#20631

Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations

Jungin Park, Jiyoung Lee, Kwanghoon Sohn

CVPR 2025posterarXiv:2503.19706
#20632

Generalizable, real-time neural decoding with hybrid state-space models

Avery Hee-Woon Ryoo, Nanda H Krishna, Ximeng Mao et al.

NEURIPS 2025posterarXiv:2506.05320
#20633

CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization

Junhao Xu, Yanan Zhang, Zhi Cai et al.

CVPR 2025posterarXiv:2503.03430
#20634

Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation

Qiang Zhang, Mengsheng Zhao, Jiawei Liu et al.

CVPR 2025poster
#20635

DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation

Ziyu Zhao, Xiaoguang Li, Lingjia Shi et al.

CVPR 2025posterarXiv:2505.11676
#20636

Rethinking the Role of Verbatim Memorization in LLM Privacy

Tom Sander, Bargav Jayaraman, Mark Ibrahim et al.

NEURIPS 2025poster
#20637

Hazy Low-Quality Satellite Video Restoration Via Learning Optimal Joint Degradation Patterns and Continuous-Scale Super-Resolution Reconstruction

Ning Ni, Libao Zhang

CVPR 2025poster
#20638

Visual Prompting for One-shot Controllable Video Editing without Inversion

Zhengbo Zhang, Yuxi Zhou, DUO PENG et al.

CVPR 2025posterarXiv:2504.14335
#20639

Segment Any Motion in Videos

Nan Huang, Wenzhao Zheng, Chenfeng Xu et al.

CVPR 2025posterarXiv:2503.22268
#20640

ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation

Haoqi Wu, Wei Dai, Ming Xu et al.

NEURIPS 2025oralarXiv:2510.04153
#20641

Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages

Matteo Farina, Massimiliano Mancini, Giovanni Iacca et al.

CVPR 2025posterarXiv:2503.11609
#20642

TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model

Zhichao Zhai, Guikun Chen, Wenguan Wang et al.

CVPR 2025poster
#20643

MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing

Shuo Wang, Wanting Li, Yongcai Wang et al.

CVPR 2025posterarXiv:2412.20082
#20644

MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation

Zhenyu Wu, Yuheng Zhou, Xiuwei Xu et al.

CVPR 2025posterarXiv:2503.13446
#20645

Mitigating Forgetting in LLM Fine-Tuning via Low-Perplexity Token Learning

Chao-Chung Wu, Zhi Rui Tam, Chieh-Yen Lin et al.

NEURIPS 2025posterarXiv:2501.14315
#20646

Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding

Qian Ma, Ruoxiang Xu, Yongqiang Cai

NEURIPS 2025posterarXiv:2511.06376
#20647

Beyond Single-Modal Boundary: Cross-Modal Anomaly Detection through Visual Prototype and Harmonization

Kai Mao, Ping Wei, Yiyang Lian et al.

CVPR 2025poster
#20648

Blameless Users in a Clean Room: Defining Copyright Protection for Generative Models

Aloni Cohen

NEURIPS 2025spotlightarXiv:2506.19881
#20649

Augmenting Perceptual Super-Resolution via Image Quality Predictors

Fengjia Zhang, Samrudhdhi Rangrej, Tristan T Aumentado-Armstrong et al.

CVPR 2025posterarXiv:2504.18524
#20650

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Mahtab Bigverdi, Zelun Luo, Cheng-Yu Hsieh et al.

CVPR 2025posterarXiv:2412.03548
#20651

FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation

Jiacheng Cui, Xinyue Bi, Yaxin Luo et al.

NEURIPS 2025posterarXiv:2506.24125
#20652

HYPERION: Fine-Grained Hypersphere Alignment for Robust Federated Graph Learning

Guancheng Wan, Xiaoran Shang, Yuxin Wu et al.

NEURIPS 2025spotlight
#20653

EyeBench: Predictive Modeling from Eye Movements in Reading

Omer Shubi, David Robert Reich, Keren Gruteke Klein et al.

NEURIPS 2025poster
#20654

ViKIENet: Towards Efficient 3D Object Detection with Virtual Key Instance Enhanced Network

Zhuochen Yu, Bijie Qiu, Andy W. H. Khong

CVPR 2025poster
#20655

A Unified Framework for Fair Graph Generation: Theoretical Guarantees and Empirical Advances

Zichong Wang, Zhipeng Yin, Wenbin Zhang

NEURIPS 2025poster
#20656

From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning

Ziang Li, Hongguang Zhang, Juan Wang et al.

CVPR 2025posterarXiv:2503.16266
#20657

Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging

Ping Wang, Lishun Wang, Gang Qu et al.

CVPR 2025posterarXiv:2505.23180
#20658

Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems

Elfarouk Harb, Yousef Yassin, Chandra Chekuri

NEURIPS 2025spotlightarXiv:2505.17443
#20659

Learning Multi-Source and Robust Representations for Continual Learning

Fei Ye, Yongcheng Zhong, Qihe Liu et al.

NEURIPS 2025poster
#20660

Compositional Targeted Multi-Label Universal Perturbations

Hassan Mahmood, Ehsan Elhamifar

CVPR 2025poster
#20661

Valid Selection among Conformal Sets

Mahmoud Hegazy, Liviu Aolaritei, Michael Jordan et al.

NEURIPS 2025posterarXiv:2506.20173
#20662

CGMatch: A Different Perspective of Semi-supervised Learning

Bo Cheng, Jueqing Lu, Yuan Tian et al.

CVPR 2025posterarXiv:2503.02231
#20663

Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection

Jinhyung Park, Navyata Sanghvi, Hiroki Adachi et al.

CVPR 2025poster
#20664

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Jianzong Wu, Chao Tang, Jingbo Wang et al.

CVPR 2025posterarXiv:2412.07589
#20665

GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection

Jiaming Li, Zhijia Liang, Weikai Chen et al.

NEURIPS 2025poster
#20666

MoEMeta: Mixture-of-Experts Meta Learning for Few-Shot Relational Learning

Han Wu, Jie Yin

NEURIPS 2025posterarXiv:2510.23013
#20667

CocoER: Aligning Multi-Level Feature by Competition and Coordination for Emotion Recognition

Xuli Shen, Hua Cai, Weilin Shen et al.

CVPR 2025poster
#20668

Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging

Hongjin Qian, Zheng Liu

NEURIPS 2025spotlightarXiv:2505.09316
#20669

Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos

Xuankai Zhang, Junjin Xiao, Qing Zhang

NEURIPS 2025posterarXiv:2510.10691
#20670

Noise-conditioned Energy-based Annealed Rewards (NEAR): A Generative Framework for Imitation Learning from Observation

Anish Abhijit Diwan, Julen Urain, Jens Kober et al.

ICLR 2025posterarXiv:2501.14856
#20671

Dynamic Motion Blending for Versatile Motion Editing

Nan Jiang, Hongjie Li, Ziye Yuan et al.

CVPR 2025posterarXiv:2503.20724
#20672

Semi-Supervised Regression with Heteroscedastic Pseudo-Labels

Xueqing Sun, Renzhen Wang, Quanziang Wang et al.

NEURIPS 2025posterarXiv:2510.15266
#20673

A Unified Approach to Interpreting Self-supervised Pre-training Methods for 3D Point Clouds via Interactions

Qiang Li, Jian Ruan, Fanghao Wu et al.

CVPR 2025highlight
#20674

SVFR: A Unified Framework for Generalized Video Face Restoration

Zhiyao Wang, Xu Chen, Chengming Xu et al.

CVPR 2025posterarXiv:2501.01235
#20675

Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution

Huan Zheng, Wencheng Han, Jianbing Shen

CVPR 2025posterarXiv:2411.03239
#20676

Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds

Mohamed Abdelsamad, Michael Ulrich, Claudius Glaeser et al.

CVPR 2025posterarXiv:2502.20316
#20677

CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image

Jingshun Huang, Haitao Lin, Tianyu Wang et al.

CVPR 2025highlightarXiv:2504.11230
#20678

Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater

Xueyu Liu, Rui Wang, Yexin Lai et al.

CVPR 2025poster
#20679

Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception

ruotian peng, Haiying He, Yake Wei et al.

CVPR 2025posterarXiv:2504.06666
#20680

Partition to Evolve: Niching-enhanced Evolution with LLMs for Automated Algorithm Discovery

Qinglong Hu, Qingfu Zhang

NEURIPS 2025poster
#20681

Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback

Orin Levy, Liad Erez, Alon Peled-Cohen et al.

NEURIPS 2025spotlightarXiv:2510.09127
#20682

SCAP: Transductive Test-Time Adaptation via Supportive Clique-based Attribute Prompting

Chenyu Zhang, Kunlun Xu, Zichen Liu et al.

CVPR 2025posterarXiv:2503.12866
#20683

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

Zhenpeng Huang, Jiaqi Li, zihan jia et al.

NEURIPS 2025posterarXiv:2602.02341
#20684

Riemannian Proximal Sampler for High-accuracy Sampling on Manifolds

Yunrui Guan, Krishnakumar Balasubramanian, Shiqian Ma

NEURIPS 2025posterarXiv:2502.07265
#20685

Neuro-3D: Towards 3D Visual Decoding from EEG Signals

Zhanqiang Guo, Jiamin Wu, Yonghao Song et al.

CVPR 2025posterarXiv:2411.12248
#20686

Fourier Clouds: Fast Bias Correction for Imbalanced Semi-Supervised Learning

Jiawei Gu, Yidi Wang, Qingqiang Sun et al.

NEURIPS 2025poster
#20687

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Jonas Kulhanek, Marie-Julie Rakotosaona, Fabian Manhardt et al.

NEURIPS 2025spotlightarXiv:2505.23158
#20688

SE-GUI: Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

Xinbin Yuan, Jian Zhang, Kaixin Li et al.

NEURIPS 2025poster
#20689

Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents

Yun Hua, Haosheng Chen, Shiqin Wang et al.

NEURIPS 2025posterarXiv:2506.07388
#20690

Adventurer: Optimizing Vision Mamba Architecture Designs for Efficiency

Feng Wang, Timing Yang, Yaodong Yu et al.

CVPR 2025posterarXiv:2410.07599
#20691

Revolutionizing Training-Free NAS: Towards Efficient Automatic Proxy Discovery via Large Language Models

Haidong Kang, Lihong Lin, Hanling Wang

NEURIPS 2025poster
#20692

SAINT: Sequence-Aware Integration for Spatial Transcriptomics Multi-View Clustering

Zeyu Zhu, KE LIANG, Lingyuan Meng et al.

NEURIPS 2025poster
#20693

Vector Database Watermarking

Zhiwen Ren, Wei Fan, Qiyi Yao et al.

NEURIPS 2025poster
#20694

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

Zhipeng Huang, Shaobin Zhuang, Canmiao Fu et al.

CVPR 2025posterarXiv:2503.01115
#20695

Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds

Huitong Chen, Yu Wang, Yan Fan et al.

CVPR 2025posterarXiv:2503.17677
#20696

GAMMA: Gated Multi-hop Message Passing for Homophily-Agnostic Node Representation in GNNs

Amir Ghazizadeh, Rickard Ewetz, Hao Zheng

NEURIPS 2025poster
#20697

Solving the Asymmetric Traveling Salesman Problem via Trace-Guided Cost Augmentation

Zhen Zhang, Prof Javen Qinfeng Shi, Wee Sun Lee

NEURIPS 2025poster
#20698

Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning

Hairui Ren, Fan Tang, He Zhao et al.

CVPR 2025posterarXiv:2504.11930
#20699

TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation

Ruineng Li, Daitao Xing, Huiming Sun et al.

CVPR 2025posterarXiv:2504.08181
#20700

UniteFormer: Unifying Node and Edge Modalities in Transformers for Vehicle Routing Problems

Dian Meng, Zhiguang Cao, Jie Gao et al.

NEURIPS 2025spotlight
#20701

Task-Aware Clustering for Prompting Vision-Language Models

Fusheng Hao, Fengxiang He, Fuxiang Wu et al.

CVPR 2025poster
#20702

GenIR: Generative Visual Feedback for Mental Image Retrieval

Diji Yang, Minghao Liu, Chung-Hsiang Lo et al.

NEURIPS 2025posterarXiv:2506.06220
#20703

Audits Under Resource, Data, and Access Constraints: Scaling Laws For Less Discriminatory Alternatives

Sarah Cen, Salil Goyal, Zaynah Javed et al.

NEURIPS 2025posterarXiv:2509.05627
#20704

Hunyuan-Portrait: Implicit Condition Control for Enhanced Portrait Animation

Zunnan Xu, Zhentao Yu, Zixiang Zhou et al.

CVPR 2025poster
#20705

MeshArt: Generating Articulated Meshes with Structure-Guided Transformers

Daoyi Gao, Mohd Yawar Nihal Siddiqui, Lei Li et al.

CVPR 2025posterarXiv:2412.11596
#20706

Non-Natural Image Understanding with Advancing Frequency-based Vision Encoders

Wang Lin, Qingsong Wang, Yueying Feng et al.

CVPR 2025poster
#20707

Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features

Wenhuan Huang, Yi JI, guiqian zhu et al.

CVPR 2025poster
#20708

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Qifan Yu, Wei Chow, Zhongqi Yue et al.

CVPR 2025posterarXiv:2411.15738
#20709

Compress Large Language Models via Collaboration Between Learning and Matrix Approximation

Yuesen Liao, Zhiwei Li, Binrui Wu et al.

NEURIPS 2025poster
#20710

RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects

Jaeguk Kim, Jaewoo Park, Keuntek Lee et al.

CVPR 2025posterarXiv:2505.10841
#20711

GRAE-3DMOT: Geometry Relation-Aware Encoder for Online 3D Multi-Object Tracking

Hyunseop Kim, Hyo-Jun Lee, Yonguk Lee et al.

CVPR 2025poster
#20712

Generative Modeling of Class Probability for Multi-Modal Representation Learning

JungKyoo Shin, Bumsoo Kim, Eunwoo Kim

CVPR 2025highlightarXiv:2503.17417
#20713

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Yao Huang, Huanran Chen, Shouwei Ruan et al.

NEURIPS 2025posterarXiv:2505.22411
#20714

Unified Medical Lesion Segmentation via Self-referring Indicator

Shijie Chang, Xiaoqi Zhao, Lihe Zhang et al.

CVPR 2025poster
#20715

FlowNet: Modeling Dynamic Spatio-Temporal Systems via Flow Propagation

Yutong Feng, Xu Liu, Yutong Xia et al.

NEURIPS 2025oralarXiv:2511.05595
#20716

SGSST: Scaling Gaussian Splatting Style Transfer

Bruno Galerne, Jianling WANG, Lara Raad et al.

CVPR 2025poster
#20717

A Cramér–von Mises Approach to Incentivizing Truthful Data Sharing

Alex Clinton, Thomas Zeng, Yiding Chen et al.

NEURIPS 2025posterarXiv:2506.07272
#20718

DIO: Decomposable Implicit 4D Occupancy-Flow World Model

Christopher Diehl, Quinlan Sykora, Ben Agro et al.

CVPR 2025poster
#20719

HERA: Hybrid Explicit Representation for Ultra-Realistic Head Avatars

Hongrui Cai, Yuting Xiao, Xuan Wang et al.

CVPR 2025poster
#20720

Hierarchical Adaptive Filtering Network for Text Image Specular Highlight Removal

Zhi Jiang, Jingbo Hu, Ling Zhang et al.

CVPR 2025poster
#20721

Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Sayak Nag, Udita Ghosh, Calvin-Khang Ta et al.

CVPR 2025posterarXiv:2503.13947
#20722

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

Zewei Zhou, Tianhui Cai, Seth Zhao et al.

NEURIPS 2025posterarXiv:2506.13757
#20723

Protein Inverse Folding From Structure Feedback

Junde Xu, Zijun Gao, Xinyi Zhou et al.

NEURIPS 2025posterarXiv:2506.03028
#20724

Move-in-2D: 2D-Conditioned Human Motion Generation

Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang et al.

CVPR 2025posterarXiv:2412.13185
#20725

ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation

Jiawen Yu, Hairuo Liu, Qiaojun Yu et al.

NEURIPS 2025posterarXiv:2505.22159
#20726

FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies

Shuqiao Liang, Jian Liu, Chen Renzhang et al.

NEURIPS 2025posterarXiv:2509.20890
#20727

High-Order Flow Matching: Unified Framework and Sharp Statistical Rates

Maojiang Su, Jerry Yao-Chieh Hu, Yi-Chen Lee et al.

NEURIPS 2025poster
#20728

SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification

Zhenglin Lai, Mengyao Liao, Bingzhe Wu et al.

NEURIPS 2025posterarXiv:2506.17368
#20729

Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack

Yukun Chen, Boheng Li, Yu Yuan et al.

NEURIPS 2025posterarXiv:2509.23871
#20730

ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models

Weifei Jin, Yuxin Cao, Junjie Su et al.

NEURIPS 2025posterarXiv:2510.26096
#20731

Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need

Qiang Wang, Xiang Song, Yuhang He et al.

CVPR 2025posterarXiv:2505.23744
#20732

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

Yanggan Gu, Yuanyi Wang, Zhaoyi Yan et al.

NEURIPS 2025spotlightarXiv:2505.13878
#20733

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction

Wenke Xia, Ruoxuan Feng, Dong Wang et al.

CVPR 2025posterarXiv:2504.14588
#20734

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Wanhua Li, Renping Zhou, Jiawei Zhou et al.

CVPR 2025posterarXiv:2503.10437
#20735

Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes

Zaiwei Chen

NEURIPS 2025posterarXiv:2504.18743
#20736

How to Learn a Star: Binary Classification with Starshaped Polyhedral Sets

Marie-Charlotte Brandenburg, Katharina Jochemko

NEURIPS 2025posterarXiv:2505.01346
#20737

Quaffure: Real-Time Quasi-Static Neural Hair Simulation

Tuur Stuyck, Gene Wei-Chin Lin, Egor Larionov et al.

CVPR 2025posterarXiv:2412.10061
#20738

Implicit Bias Injection Attacks against Text-to-Image Diffusion Models

Huayang Huang, Xiangye Jin, Jiaxu Miao et al.

CVPR 2025posterarXiv:2504.01819
#20739

Reversing Flow for Image Restoration

Haina Qin, Wenyang Luo, Bing Li et al.

CVPR 2025posterarXiv:2506.16961
#20740

MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting

jun huang, Ting Liu, Yihang Wu et al.

CVPR 2025posterarXiv:2506.23482
#20741

DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

Jingyu Zhuang, Di Kang, Linchao Bao et al.

CVPR 2025posterarXiv:2411.15205
#20742

Open-Canopy: Towards Very High Resolution Forest Monitoring

Fajwel Fogel, Yohann PERRON, Nikola Besic et al.

CVPR 2025highlightarXiv:2407.09392
#20743

S2D-LFE: Sparse-to-Dense Light Field Event Generation

Yutong Liu, Wenming Weng, Yueyi Zhang et al.

CVPR 2025poster
#20744

Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images

Nan Zhong, Haoyu Chen, Yiran Xu et al.

CVPR 2025poster
#20745

URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model

Zhe Li, Xiang Bai, Jieyu Zhang et al.

NEURIPS 2025spotlightarXiv:2511.00940
#20746

Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation

Pu Cao, Feng Zhou, Lu Yang et al.

CVPR 2025posterarXiv:2312.08195
#20747

MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations

Ziyang Zhang, Yang Yu, Yucheng Chen et al.

CVPR 2025posterarXiv:2503.01019
#20748

Fit the Distribution: Cross-Image/Prompt Adversarial Attacks on Multimodal Large Language Models

Hai Yan, Haijian Ma, Xiaowen Cai et al.

NEURIPS 2025poster
#20749

Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications

Tong Bu, Maohua Li, Zhaofei Yu

CVPR 2025posterarXiv:2409.03368
#20750

You Can Trust Your Clustering Model: A Parameter-free Self-Boosting Plug-in for Deep Clustering

Hanyang Li, Yuheng Jia, Hui LIU et al.

NEURIPS 2025posterarXiv:2511.21193
#20751

GD$^2$: Robust Graph Learning under Label Noise via Dual-View Prediction Discrepancy

Kailai Li, Jiong Lou, Jiawei Sun et al.

NEURIPS 2025poster
#20752

Spectral Estimation with Free Decompression

Siavash Ameli, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025spotlightarXiv:2506.11994
#20753

GazeGene: Large-scale Synthetic Gaze Dataset with 3D Eyeball Annotations

Yiwei Bao, Zhiming Wang, Feng Lu

CVPR 2025poster
#20754

Multirate Neural Image Compression with Adaptive Lattice Vector Quantization

Hao Xu, Xiaolin Wu, Xi Zhang

CVPR 2025highlight
#20755

VideoGEM: Training-free Action Grounding in Videos

Felix Vogel, Walid Bousselham, Anna Kukleva et al.

CVPR 2025posterarXiv:2503.20348
#20756

SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation

Aleksei Bokhovkin, Quan Meng, Shubham Tulsiani et al.

CVPR 2025posterarXiv:2412.01801
#20757

DefMamba: Deformable Visual State Space Model

Leiye Liu, Miao Zhang, Jihao Yin et al.

CVPR 2025posterarXiv:2504.05794
#20758

PEER Pressure: Model-to-Model Regularization for Single Source Domain Generalization

Dongkyu Cho, Inwoo Hwang, Sanghack Lee

CVPR 2025posterarXiv:2505.12745
#20759

Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks

Han Wang, Gang Wang, Huan Zhang

CVPR 2025posterarXiv:2411.16721
#20760

Less is More: Efficient Image Vectorization with Adaptive Parameterization

Kaibo Zhao, Liang Bao, Yufei Li et al.

CVPR 2025poster
#20761

Diffusion-Driven Progressive Target Manipulation for Source-Free Domain Adaptation

Yuyang Huang, Yabo Chen, Junyu Zhou et al.

NEURIPS 2025posterarXiv:2510.25279
#20762

Online Portfolio Selection with ML Predictions

Ziliang Zhang, Tianming Zhao, Albert Zomaya

NEURIPS 2025poster
#20763

Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Wenbin An, Feng Tian, Sicong Leng et al.

CVPR 2025posterarXiv:2406.12718
#20764

Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules

Tianwei Wang, Xinhui Ma, Wei Pang

NEURIPS 2025posterarXiv:2510.16607
#20765

PERSE: Personalized 3D Generative Avatars from A Single Portrait

Hyunsoo Cha, Inhee Lee, Hanbyul Joo

CVPR 2025posterarXiv:2412.21206
#20766

Towards Explainable and Unprecedented Accuracy in Matching Challenging Finger Crease Patterns

Zhenyu Zhou, Chengdong Dong, Ajay Kumar

CVPR 2025highlight
#20767

Animate and Sound an Image

Xihua Wang, Ruihua Song, Chongxuan Li et al.

CVPR 2025poster
#20768

Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration

Yiyang Chen, Tianyu Ding, Lei Wang et al.

CVPR 2025poster
#20769

Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning

Xueyi Ke, Satoshi Tsutsui, Yayun Zhang et al.

CVPR 2025posterarXiv:2501.05205
#20770

BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance

Xin Ye, Burhan Yaman, Sheng Cheng et al.

CVPR 2025highlightarXiv:2502.19694
#20771

Unbalanced Optimal Total Variation Transport: A Theoretical Approach to Spatial Resource Allocation Problems

Nhan-Phu Chung, Jinhui Han, Bohan Li et al.

NEURIPS 2025poster
#20772

Path-specific effects for pulse-oximetry guided decisions in critical care

Kevin Zhang, Yonghan Jung, Divyat Mahajan et al.

NEURIPS 2025posterarXiv:2506.12371
#20773

Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target.

Dokyoon Yoon, Youngsook Song, Woomyoung Park

CVPR 2025posterarXiv:2506.11417
#20774

LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians

Jiamin WU, Kenkun Liu, Han Gao et al.

CVPR 2025posterarXiv:2404.16323
#20775

Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction

Dubing Chen, Huan Zheng, Jin Fang et al.

CVPR 2025posterarXiv:2504.12959
#20776

Star with Bilinear Mapping

Zelin Peng, Yu Huang, Zhengqin Xu et al.

CVPR 2025poster
#20777

OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction

Gehui Li, Bin Chen, Chen Zhao et al.

CVPR 2025posterarXiv:2411.15255
#20778

BMW: Bidirectionally Memory bank reWriting for Unsupervised Person Re-Identification

Xiaobin Liu, Jianing Li, Baiwei Guo et al.

NEURIPS 2025poster
#20779

ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting

Guo Junfu, Yu Xin, Gaoyi Liu et al.

CVPR 2025posterarXiv:2503.08135
#20780

Complete Structure Guided Point Cloud Completion via Cluster- and Instance-Level Contrastive Learning

Yang Chen, Yirun Zhou, Weizhong Zhang et al.

NEURIPS 2025spotlight
#20781

Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture

Kenkun Liu, Yurong Fu, Weihao Yuan et al.

CVPR 2025poster
#20782

Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration

Yiyuan Pan, Zhe Liu, Hesheng Wang

NEURIPS 2025posterarXiv:2509.20648
#20783

NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification

Mélodie Monod, Alessandro Micheli, Samir Bhatt

NEURIPS 2025posterarXiv:2505.11054
#20784

Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels

Jiyuan Liu, Xinwang Liu, chuankun Li et al.

CVPR 2025poster
#20785

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation

Andrea Maracani, Savas Ozkan, Sijun Cho et al.

CVPR 2025posterarXiv:2503.16184
#20786

RANK++LETR: Learn to Rank and Optimize Candidates for Line Segment Detection

Xin Tong, Baojie Tian, Yufei Guo et al.

NEURIPS 2025poster
#20787

Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation

Zhuoran ZHAO, Linlin Yang, Pengzhan Sun et al.

CVPR 2025posterarXiv:2503.19307
#20788

Zero-Shot 4D Lidar Panoptic Segmentation

Yushan Zhang, Aljoša Ošep, Laura Leal-Taixe et al.

CVPR 2025posterarXiv:2504.00848
#20789

Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation

Yao Teng, Fu-Yun Wang, Xian Liu et al.

NEURIPS 2025posterarXiv:2510.08994
#20790

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation

Lanyun Zhu, Tianrun Chen, Qianxiong Xu et al.

CVPR 2025posterarXiv:2504.00640
#20791

HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories

Eric Hedlin, Munawar Hayat, Fatih Porikli et al.

CVPR 2025posterarXiv:2412.17040
#20792

RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection

Yunfei Long, Abhinav Kumar, Xiaoming Liu et al.

CVPR 2025posterarXiv:2504.09086
#20793

Efficient Training of Minimal and Maximal Low-Rank Recurrent Neural Networks

Anushri Arora, Jonathan Pillow

NEURIPS 2025poster
#20794

Towards Visualization-of-Thought Jailbreak Attack against Large Visual Language Models

HongQiong Zhong, Qingyang Teng, Baolin Zheng et al.

NEURIPS 2025poster
#20795

IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification

Yuhao Wang, Yongfeng Lv, Pingping Zhang et al.

CVPR 2025posterarXiv:2503.10324
#20796

MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices

Jianwen Jiang, Gaojie Lin, Zhengkun Rong et al.

CVPR 2025posterarXiv:2407.05712
#20797

Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising

Yongli Xiang, Ziming Hong, Lina Yao et al.

CVPR 2025posterarXiv:2503.17198
#20798

Anatomically inspired digital twins capture hierarchical object representations in visual cortex

Emanuele Luconi, Dario Liscai, Carlo Baldassi et al.

NEURIPS 2025poster
#20799

Joint Vision-Language Social Bias Removal for CLIP

Haoyu Zhang, Yangyang Guo, Mohan Kankanhalli

CVPR 2025posterarXiv:2411.12785
#20800

InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention

Qiang Xiang, Shuang Sun, Binglei Li et al.

NEURIPS 2025posterarXiv:2509.16691