Most Cited 2025 "exploration rate" Papers
22,274 papers found • Page 107 of 112
Conference
GTR-Loc: Geospatial Text Regularization Assisted Outdoor LiDAR Localization
Shangshu Yu, Wen Li, Xiaotian Sun et al.
Continual Release Moment Estimation with Differential Privacy
Nikita Kalinin, Jalaj Upadhyay, Christoph Lampert
SparseMVC: Probing Cross-view Sparsity Variations for Multi-view Clustering
Ruimeng Liu, Xin Zou, Chang Tang et al.
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis
Jiahe Li, Feiyu Wang, Xiaochao Qu et al.
Optical-Flow Guided Prompt Optimization for Coherent Video Generation
Hyelin Nam, Jaemin Kim, Dohun Lee et al.
Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts
Feng Liang, Haoyu Ma, Zecheng He et al.
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo, Yutao Zeng, Ya Wang et al.
FORLA: Federated Object-centric Representation Learning with Slot Attention
Guiqiu Liao, Matjaz Jogan, Eric Eaton et al.
Uncertainty-Informed Meta Pseudo Labeling for Surrogate Modeling with Limited Labeled Data
Xingyu Ren, Pengwei Liu, Pengkai Wang et al.
Detecting Open World Objects via Partial Attribute Assignment
Muli Yang, Gabriel James Goenawan, Huaiyuan Qin et al.
OpenMIBOOD: Open Medical Imaging Benchmarks for Out-Of-Distribution Detection
Max Gutbrod, David Rauber, Danilo Weber Nunes et al.
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Aniket Rajiv Didolkar, Andrii Zadaianchuk, Rabiul Awal et al.
MMCSBench: A Fine-Grained Benchmark for Large Vision-Language Models in Camouflage Scenes
Jin Zhang, Ruiheng Zhang, Zhe Cao et al.
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
Jeimin Jeon, Youngmin Oh, Junghyup Lee et al.
DeltaFormer: Unlock the state space of Transformer
Mingyu Xu, Tenglong Ao, Jiaao He et al.
MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation
Sankalp Sinha, Mohammad Sadil Khan, Muhammad Usama et al.
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
Qiyao Xue, Xiangyu Yin, Boyuan Yang et al.
Spiking Meets Attention: Efficient Remote Sensing Image Super-Resolution with Attention Spiking Neural Networks
Yi Xiao, Qiangqiang Yuan, Kui Jiang et al.
Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis
Jing Hao, Yuxuan Fan, Yanpeng Sun et al.
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
Niu Lian, Jun Li, Jinpeng Wang et al.
Entropy-Calibrated Label Distribution Learning
Yunan Lu, Bowen Xue, Xiuyi Jia et al.
Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment
Chen Liu, Wenfang Yao, Kejing Yin et al.
The Rashomon Set Has It All: Analyzing Trustworthiness of Trees under Multiplicity
Ethan Hsu, Tony Cao, Lesia Semenova et al.
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Haotian Wang, Yuzhe Weng, Yueyan Li et al.
Unbiased Prototype Consistency Learning for Multi-Modal and Multi-Task Object Re-Identification
Zhongao Zhou, Bin Yang, Wenke Huang et al.
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
Vladan Stojnić, Yannis Kalantidis, Jiri Matas et al.
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM
Vladimir Yugay, Theo Gevers, Martin R. Oswald
AHa-Bench: Benchmarking Audio Hallucinations in Large Audio-Language Models
Xize Cheng, Dongjie Fu, Chenyuhao Wen et al.
Task-Specific Data Selection for Instruction Tuning via Monosemantic Neuronal Activations
Da Ma, Gonghu Shang, Zhi Chen et al.
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Zongsheng Yue, Kang Liao, Chen Change Loy
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
Chenhui Zhu, Yilu Wu, Shuai Wang et al.
Inductive Domain Transfer In Misspecified Simulation-Based Inference
Ortal Senouf, Antoine Wehenkel, Cédric Vincent-Cuaz et al.
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning
Yuheng Xu, Shijie Yang, Xin Liu et al.
Leader360V: A Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment
WEIMING ZHANG, Dingwen Xiao, Aobotao DAI et al.
Improving Monte Carlo Tree Search for Symbolic Regression
Zhengyao Huang, Daniel Huang, Tiannan Xiao et al.
Fuse2Match: Training-Free Fusion of Flow, Diffusion, and Contrastive Models for Zero-Shot Semantic Matching
Jing Zuo, Jiaqi Wang, Yonggang Qi et al.
Prompt-Guided Alignment with Information Bottleneck Makes Image Compression Also a Restorer
Xuelin Shen, Quan Liu, Jiayin Xu et al.
Reinforcement Learning with Action Chunking
Qiyang Li, Zhiyuan (Paul) Zhou, Sergey Levine
Structure-Aware Cooperative Ensemble Evolutionary Optimization on Combinatorial Problems with Multimodal Large Language Models
Jie Zhao, Kang Cheong
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
Henghui Du, Guangyao Li, Chang Zhou et al.
Exploring Landscapes for Better Minima along Valleys
Tong Zhao, Jiacheng Li, Yuanchang Zhou et al.
EgoLife: Towards Egocentric Life Assistant
Jingkang Yang, Shuai Liu, Hongming Guo et al.
A Hierarchy of Graphical Models for Counterfactual Inferences
Hongshuo Yang, Elias Bareinboim
Counterfactual Image Editing with Disentangled Causal Latent Space
Yushu Pan, Elias Bareinboim
Sound Logical Explanations for Mean Aggregation Graph Neural Networks
Matthew Morris, Ian Horrocks
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li, Junzhe Zhang, Elias Bareinboim
Smooth Quadratic Prediction Markets
Enrique Nueve, Bo Waggoner
Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL
Juan Formanek, Omayma Mahjoub, Louay Nessir et al.
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Zhewei Kang, Xuandong Zhao, Dawn Song
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World
Bangyan Liao, Zhenjun Zhao, Haoang Li et al.
StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
Mingkun Lei, Xue Song, Beier Zhu et al.
Invisible Backdoor Attack against Self-supervised Learning
Hanrong Zhang, Zhenting Wang, Boheng Li et al.
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain et al.
CamSAM2: Segment Anything Accurately in Camouflaged Videos
Yuli Zhou, Yawei Li, Yuqian Fu et al.
Spatially-aware Weights Tokenization for NeRF-Language Models
Andrea Amaduzzi, Pierluigi Zama Ramirez, Giuseppe Lisanti et al.
Siegel Neural Networks
Xuan Son Nguyen, Aymeric Histace, Nistor Grozavu
PMLF: A Physics-Guided Multiscale Loss Framework for Structurally Heterogeneous Time Series
Xinghong Chen, Weilin Wu, Kunping Yang et al.
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
Yueqi Zhang, Peiwen Yuan, Yiwei Li et al.
CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis
Youngkyoon Jang, Eduardo Pérez-Pellitero
Contextual Tokenization for Graph Inverted Indices
Pritish Chakraborty, Indradyumna Roy, Soumen Chakrabarti et al.
Multi-Objective One-Shot Pruning for Large Language Models
Weiyu Chen, Hansi Yang, Yunhao Gou et al.
Interactive Medical Image Analysis with Concept-based Similarity Reasoning
Ta Duc Huy, Sen Kim Tran, Phan Nguyen et al.
SGAR: Structural Generative Augmentation for 3D Human Motion Retrieval
Jiahang Zhang, Lilang Lin, Shuai Yang et al.
ViSPLA: Visual Iterative Self-Prompting for Language-Guided 3D Affordance Learning
Hritam Basak, Zhaozheng Yin
Steepest Descent Density Control for Compact 3D Gaussian Splatting
Peihao Wang, Yuehao Wang, Dilin Wang et al.
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
Sirui Xu, Hung Yu Ling, Yu-Xiong Wang et al.
Tabula: A Tabular Self-Supervised Foundation Model for Single-Cell Transcriptomics
Jiayuan Ding, Jianhui Lin, Shiyu Jiang et al.
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning
Junhao Shen, Haiteng Zhao, Yuzhe Gu et al.
EdgeMovingNet: Edge-preserving Point Cloud Reconstruction via Joint Geometry Features
Xinran Yang, Donghao Ji, Yuanqi Li et al.
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Fanqi Pu, Yifan Wang, Jiru Deng et al.
Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining
Raghuveer Thirukovalluru, Rui Meng, Ye Liu et al.
Staggered Environment Resets Improve Massively Parallel On-Policy Reinforcement Learning
Sid Bharthulwar, Stone Tao, Hao Su
Depth-Supervised Fusion Network for Seamless-Free Image Stitching
Zhiying Jiang, Ruhao Yan, Zengxi Zhang et al.
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness
Yijun Liang, Ming Li, Chenrui Fan et al.
Online Strategic Classification With Noise and Partial Feedback
Tianrun Zhao, Xiaojie Mao, Yong Liang
Imitation Learning with Temporal Logic Constraints
Zining Fan, He Zhu
Enhancing Facial Privacy Protection via Weakening Diffusion Purification
Ali Salar, Qing Liu, Yingli Tian et al.
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
Weijie Zhou, Manli Tao, Chaoyang Zhao et al.
Generative Distribution Embeddings
Nic Fishman, Gokul Gowri, Peng Yin et al.
Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need
Kecheng Chen, Pingping Zhang, Hui Liu et al.
Improving Progressive Generation with Decomposable Flow Matching
Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.
Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation
Long Tung Vuong, Hoang Phan, Vy Vo et al.
Controllable Human Image Generation with Personalized Multi-Garments
Yisol Choi, Sangkyung Kwak, Sihyun Yu et al.
Beyond Last-Click: An Optimal Mechanism for Ad Attribution
Nan An, Weian Li, Qi Qi et al.
Sampled Estimators For Softmax Must Be Biased
Li-Chung Lin, Yaxu Liu, Chih-Jen Lin
Reconstructing Animals and the Wild
Peter Kulits, Michael J. Black, Silvia Zuffi
Venus-MAXWELL: Efficient Learning of Protein-Mutation Stability Landscapes using Protein Language Models
Yuanxi Yu, Fan Jiang, Xinzhu Ma et al.
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
Sai Kumar Dwivedi, Dimitrije Antić, Shashank Tripathi et al.
Subgraph Federated Learning via Spectral Methods
Javad Aliakbari, Johan Oestman, Ashkan Panahi et al.
Sequential Monte Carlo for Policy Optimization in Continuous POMDPs
Hany Abdulsamad, Sahel Mohammad Iqbal, Simo Sarkka
Vector Quantization in the Brain: Grid-like Codes in World Models
Xiangyuan Peng, Xingsi Dong, Si Wu
Optimal Graph Clustering without Edge Density Signals
Maximilien Dreveton, Elaine Liu, Matthias Grossglauser et al.
PREAMBLE: Private and Efficient Aggregation via Block Sparse Vectors
Hilal Asi, Vitaly Feldman, Hannah Keller et al.
Eulerian Neural Network Informed by Chemical Transport for Air Quality Forecasting
Eluder dimension: localise it!
Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.
Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach
Chen-Chen Zong, Sheng-Jun Huang
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Wenbo Hu, Xiangjun Gao, Xiaoyu Li et al.
DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting
Moonsoo Jeong, Dongbeen Kim, Minseong Kim et al.
Rethinking Entropy in Test-Time Adaptation: The Missing Piece from Energy Duality
Mincheol Park, Heeji Won, Won Woo Ro et al.
SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation
Dekai Zhu, Yan Di, Stefan Gavranovic et al.
LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling
Yang Xiao, Jiashuo WANG, Ruifeng Yuan et al.
ActiveGAMER: Active GAussian Mapping through Efficient Rendering
Liyan Chen, Huangying Zhan, Kevin Chen et al.
PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval
Qiang Zou, Shuli Cheng, Jiayi Chen
CoLT: The conditional localization test for assessing the accuracy of neural posterior estimates
Tianyu Chen, Vansh Bansal, James Scott
Noisy Multi-Label Learning through Co-Occurrence-Aware Diffusion
Senyu Hou, Yuru Ren, Gaoxia Jiang et al.
Role-aware Multi-agent Reinforcement Learning for Coordinated Emergency Traffic Control
Ming Cheng, Hao Chen, Zhiqing Li et al.
Correcting misinterpretations of additive models
Benedict Clark, Rick Wilming, Hjalmar Schulz et al.
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
Enguang Wang, Zhimao Peng, Zhengyuan Xie et al.
Exploration-Driven Generative Interactive Environments
Nedko Savov, Naser Kazemi, Mohammad Mahdi et al.
DBLoss: Decomposition-based Loss Function for Time Series Forecasting
Xiangfei Qiu, Xingjian Wu, Hanyin Cheng et al.
BioCG: Constrained Generative Modeling for Biochemical Interaction Prediction
Amitay Sicherman, Kira Radinsky
FedEL: Federated Elastic Learning for Heterogeneous Devices
Letian Zhang, Bo Chen, Jieming Bian et al.
E2E-VGuard: Adversarial Prevention for Production LLM-based End-To-End Speech Synthesis
Zhisheng Zhang, Derui Wang, Yifan Mi et al.
Dynamic Shadow Unveils Invisible Semantics for Video Outpainting
Ruilin Li, Hang Yu, Jiayan Qiu
Extreme Rotation Estimation in the Wild
Hana Bezalel, Dotan Ankri, Ruojin Cai et al.
Simple and Optimal Sublinear Algorithms for Mean Estimation
Beatrice Bertolotti, Matteo Russo, Chris Schwiegelshohn et al.
An Efficient Orlicz-Sobolev Approach for Transporting Unbalanced Measures on a Graph
Tam Le, Truyen Nguyen, Hideitsu Hino et al.
Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality
Liyan Chen, Gregory P. Meyer, Zaiwei Zhang et al.
ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks
Erik Wallin, Fredrik Kahl, Lars Hammarstrand
Motion Prompting: Controlling Video Generation with Motion Trajectories
Daniel Geng, Charles Herrmann, Junhwa Hur et al.
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee, Hyeongcheol Park, Jiyoung Seo et al.
Improved Algorithms for Fair Matroid Submodular Maximization
Sepideh Mahabadi, Sherry Sarkar, Jakub Tarnawski
Defining and Discovering Hyper-meta-paths for Heterogeneous Hypergraphs
Yaming Yang, Ziyu Zheng, Weigang Lu et al.
Many Minds, One Goal: Time Series Forecasting via Sub-task Specialization and Inter-agent Cooperation
Qihe Huang, Zhengyang Zhou, Yangze Li et al.
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction
Yuanbin Man, Ying Huang, Chengming Zhang et al.
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Hongjie Wang, Chih-Yao Ma, Yen-Cheng Liu et al.
Volumetrically Consistent 3D Gaussian Rasterization
Chinmay Talegaonkar, Yash Belhe, Ravi Ramamoorthi et al.
DPAIL: Training Diffusion Policy for Adversarial Imitation Learning without Policy Optimization
Yunseon Choi, Minchan Jeong, Soobin Um et al.
From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective
Chen Zhao, Zhizhou Chen, Yunzhe Xu et al.
Finite-Time Bounds for Average-Reward Fitted Q-Iteration
Jongmin Lee, Ernest Ryu
Clustering via Hedonic Games: New Concepts and Algorithms
Gergely Csáji, Alexander Gundert, Jörg Rothe et al.
DevFD : Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces
Tianshuo Zhang, Li Gao, Siran Peng et al.
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sili Chen, Hengkai Guo, Shengnan Zhu et al.
vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models
Leonardo Zini, Elia Frigieri, Sebastiano Aloscari et al.
FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling
Hang Ye, Xiaoxuan Ma, Hai Ci et al.
Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion
Jona Ballé, Luca Versari, Emilien Dupont et al.
Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness
Stephen Pfohl, Natalie Harris, Chirag Nagpal et al.
Progressive Focused Transformer for Single Image Super-Resolution
Wei Long, Xingyu Zhou, Leheng Zhang et al.
DictPFL: Efficient and Private Federated Learning on Encrypted Gradients
Jiaqi Xue, Mayank Kumar, Yuzhang Shang et al.
The Boundaries of Fair AI in Medical Image Prognosis: A Causal Perspective
Thai-Hoang Pham, Jiayuan Chen, Seungyeon Lee et al.
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception
Yunpeng Qu, Kun Yuan, Qizhi Xie et al.
Tight analyses of first-order methods with error feedback
Daniel Berg Thomsen, Adrien Taylor, Aymeric Dieuleveut
GOATex: Geometry & Occlusion-Aware Texturing
Hyunjin Kim, Kunho Kim, Adam Lee et al.
Multimodal LiDAR-Camera Novel View Synthesis with Unified Pose-free Neural Fields
Weiyi Xue, Fan Lu, Yunwei Zhu et al.
Multi-dataset Joint Pre-training of Emotional EEG Enables Generalizable Affective Computing
Qingzhu Zhang, Jiani Zhong, Zongsheng Li et al.
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming
Han Deng, Yuan Meng, SHIXIANG TANG et al.
PINN Balls: Scaling Second-Order Methods for PINNs with Domain Decomposition and Adaptive Sampling
Andrea Bonfanti, Ismael Medina, Roman List et al.
FuncGenFoil: Airfoil Generation and Editing Model in Function Space
Jinouwen Zhang, Junjie Ren, Ma Qianhong et al.
Efficient Personalization of Quantized Diffusion Model without Backpropagation
Hoigi Seo, Wongi Jeong, Kyungryeol Lee et al.
From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models
German Barquero, Nadine Bertsch, Manojkumar Marramreddy et al.
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Yunseok Jang, Yeda Song, Sungryull Sohn et al.
Optimal Rates for Generalization of Gradient Descent for Deep ReLU Classification
Yuanfan Li, Yunwen Lei, Zheng-Chu Guo et al.
Each Complexity Deserves a Pruning Policy
Hanshi Wang, Yuhao Xu, Zekun Xu et al.
Aha! - Predicting What Matters Next: Online Highlight Detection Without Looking Ahead
Aiden Chang, Celso de Melo, Stephanie Lukin
RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes
Fang Li, Hao Zhang, Narendra Ahuja
Table2LaTeX-RL: High-Fidelity LaTeX Code Generation from Table Images via Reinforced Multimodal Language Models
Jun Ling, Yao Qi, Tao Huang et al.
Which Algorithms Have Tight Generalization Bounds?
Michael Gastpar, Ido Nachum, Jonathan Shafer et al.
Ultra-high Resolution Watermarking Framework Resistant to Extreme Cropping and Scaling
Nan Sun, LuYu Yuan, Han Fang et al.
Enhancing Consistency of Flow-Based Image Editing through Kalman Control
Haozhe Chi, Zhicheng Sun, Yang Jin et al.
Variance-Reduced Long-Term Rehearsal Learning with Quadratic Programming Reformulation
Wen-Bo Du, Tian Qin, Tian-Zuo Wang et al.
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior
Haitao Wu, Qing Li, Changqing Zhang et al.
Multi-View Oriented GPLVM: Expressiveness and Efficiency
Zi Yang, Ying Li, Zhidi Lin et al.
Free Lunch Enhancements for Multi-modal Crowd Counting
Haoliang Meng, Xiaopeng Hong, Zhengqin Lai et al.
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
Cathy Jiao, Yijun Pan, Emily Xiao et al.
Learning Interestingness in Automated Mathematical Theory Formation
George Tsoukalas, Rahul Saha, Amitayush Thakur et al.
Unfolding the Black Box of Recurrent Neural Networks for Path Integration
Tianhao Chu, Yuling Wu, Neil Burgess et al.
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
Zining Wang, Tongkun Guan, Pei Fu et al.
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Alex Hanson, Allen Tu, Vasu Singla et al.
Directed-Tokens: A Robust Multi-Modality Alignment Approach to Large Language-Vision Models
Thanh-Dat Truong, Huu-Thien Tran, Tran Son et al.
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Soumya Suvra Ghosal, Souradip Chakraborty, Vaibhav Singh et al.
Prior Forgetting and In-Context Overfitting
Sungyoon Lee
OOD-Barrier: Build a Middle-Barrier for Open-Set Single-Image Test Time Adaptation via Vision Language Models
Boyang Peng, Sanqing Qu, Tianpei Zou et al.
SEGA: Shaping Semantic Geometry for Robust Hashing under Noisy Supervision
Yiyang Gu, Bohan Wu, Qinghua Ran et al.
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
Chenyu Yang, Xuan Dong, Xizhou Zhu et al.
From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review
Yaohui Zhang, Haijing ZHANG, Wenlong Ji et al.
Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs
Wanyun Cui, Mingwei Xu
MultiMorph: On-demand Atlas Construction
Mazdak Abulnaga, Andrew Hoopes, Neel Dey et al.
Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
Phuc Nguyen, Minh Luu, Anh Tran et al.
Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning
Remco Leijenaar, Hamidreza Kasaei
DarkIR: Robust Low-Light Image Restoration
Daniel Feijoo, Juan C. Benito, Alvaro Garcia et al.
ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning
Haoyuan Yang, Xiaoou Li, Jiaming Lv et al.
Localist Topographic Expert Routing: A Barrel Cortex-Inspired Modular Network for Sensorimotor Processing
Tianfang Zhu, Dongli Hu, Jiandong Zhou et al.
Stackelberg Learning with Outcome-based Payment
Tom Yan, Chicheng Zhang
Controlled Visual Hallucination via Thalamus-Driven Decoupling Network for Domain Adaptation of Black-Box Predictors
Yuwu Lu, Chunzhi Liu
Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning
Ezgi Korkmaz
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
Liang Pan, Zeshi Yang, Zhiyang Dou et al.
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu, Songyou Peng, Fangjinhua Wang et al.
Effects of Dropout on Performance in Long-range Graph Learning Tasks
Jasraj Singh, Keyue Jiang, Brooks Paige et al.
Multitwine: Multi-Object Compositing with Text and Layout Control
Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.
RelationField: Relate Anything in Radiance Fields
Sebastian Koch, Johanna Wald, Mirco Colosi et al.
MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching
Liang Yue, Yihong Tang, Kehai Chen et al.
SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism
Beitao Chen, Xinyu Lyu, shengming yuan et al.
Consistent Normal Orientation for 3D Point Clouds via Least Squares on Delaunay Graph
Rao Fu, Jianmin Zheng, Liang Yu
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
Hao Tan, Zichang Tan, Jun Li et al.
DeCaFlow: A deconfounding causal generative model
Alejandro Almodóvar, Adrián Javaloy, Juan Parras et al.
BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology
Amaya Gallagher-Syed, Henry Senior, Omnia Alwazzan et al.
Disentangled Representation Learning via Modular Compositional Bias
whie jung, Dong Hoon Lee, Seunghoon Hong
The Price of Opportunity Fairness in Matroid Allocation Problems
Rémi Castera, Felipe Garrido-Lucero, Patrick Loiseau et al.
Geometric Algebra-Enhanced Bayesian Flow Network for RNA Inverse Design
Rubo Wang, Xingyu Gao, Peilin Zhao
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
Bo-Wen Yin, Jiao-Long Cao, Ming-Ming Cheng et al.