Most Cited 2025 "microtransactions" Papers

22,274 papers found • Page 61 of 112

#12001

On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding

Haoyuan Wu, Rui Ming, Jilong Gao et al.

NEURIPS 2025arXiv:2505.12723
2
citations
#12002

Enhancing Adversarial Transferability with Checkpoints of a Single Model’s Training

Shixin Li, Chaoxiang He, Xiaojing Ma et al.

CVPR 2025
2
citations
#12003

Fast constrained sampling in pre-trained diffusion models

Alexandros Graikos, Nebojsa Jojic, Dimitris Samaras

NEURIPS 2025arXiv:2410.18804
2
citations
#12004

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation

Andrea Maracani, Savas Ozkan, Sijun Cho et al.

CVPR 2025arXiv:2503.16184
2
citations
#12005

Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation

Divyanshu Mishra, Mohammadreza Salehi, Pramit Saha et al.

NEURIPS 2025oralarXiv:2506.11777
2
citations
#12006

GeoCAD: Local Geometry-Controllable CAD Generation with Large Language Models

Zhanwei Zhang, kaiyuan liu, Junjie Liu et al.

NEURIPS 2025arXiv:2506.10337
2
citations
#12007

Approximately Aligned Decoding

Daniel Melcer, Sujan Kumar Gonugondla, Pramuditha Perera et al.

NEURIPS 2025arXiv:2410.01103
2
citations
#12008

Learning on Model Weights using Tree Experts

Eliahu Horwitz, Bar Cavia, Jonathan Kahana et al.

CVPR 2025arXiv:2410.13569
2
citations
#12009

Table as a Modality for Large Language Models

Liyao Li, Chao Ye, Wentao Ye et al.

NEURIPS 2025arXiv:2512.00947
2
citations
#12010

Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS

Tao Wang, Mengyu Li, Geduo Zeng et al.

NEURIPS 2025spotlightarXiv:2506.09534
2
citations
#12011

Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning

Sanghyun Ahn, Wonje Choi, Junyong Lee et al.

NEURIPS 2025spotlightarXiv:2510.21302
2
citations
#12012

Recovering Dynamic 3D Sketches from Videos

Jaeah Lee, Changwoon Choi, Young Min Kim et al.

CVPR 2025arXiv:2503.20321
2
citations
#12013

AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping

Haonan Dong, Wenhao Zhu, Guojie Song et al.

NEURIPS 2025spotlightarXiv:2505.18738
2
citations
#12014

TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems

Jiahao Yu, Haozhuang Liu, Yeqiu Yang et al.

NEURIPS 2025arXiv:2505.13881
2
citations
#12015

Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling

Dehao Zhang, Malu Zhang, Shuai Wang et al.

NEURIPS 2025oralarXiv:2509.17186
2
citations
#12016

Preference Optimization by Estimating the Ratio of the Data Distribution

Yeongmin Kim, HeeSun Bae, Byeonghu Na et al.

NEURIPS 2025arXiv:2505.19601
2
citations
#12017

LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians

Jiamin WU, Kenkun Liu, Han Gao et al.

CVPR 2025arXiv:2404.16323
2
citations
#12018

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

Yanggan Gu, Yuanyi Wang, Zhaoyi Yan et al.

NEURIPS 2025spotlightarXiv:2505.13878
2
citations
#12019

PocketSR: The Super-Resolution Expert in Your Pocket Mobiles

Haoze Sun, Linfeng Jiang, Fan Li et al.

NEURIPS 2025arXiv:2510.03012
2
citations
#12020

Integral Fast Fourier Color Constancy

Wenjun Wei, Yanlin Qian, Huaian Chen et al.

CVPR 2025arXiv:2502.03494
2
citations
#12021

Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning

Xueyi Ke, Satoshi Tsutsui, Yayun Zhang et al.

CVPR 2025arXiv:2501.05205
2
citations
#12022

FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies

Shuqiao Liang, Jian Liu, Chen Renzhang et al.

NEURIPS 2025arXiv:2509.20890
2
citations
#12023

VITED: Video Temporal Evidence Distillation

Yujie Lu, Yale Song, Lorenzo Torresani et al.

CVPR 2025arXiv:2503.12855
2
citations
#12024

VinaBench: Benchmark for Faithful and Consistent Visual Narratives

Silin Gao, Sheryl Mathew, Li Mi et al.

CVPR 2025arXiv:2503.20871
2
citations
#12025

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Cheng Chen, Yunpeng Zhai, Yifan Zhao et al.

CVPR 2025arXiv:2506.09473
2
citations
#12026

Deep Tree Tensor Networks

Chang Nie

NEURIPS 2025
2
citations
#12027

Fast Data Attribution for Text-to-Image Models

Sheng-Yu Wang, Aaron Hertzmann, Alexei Efros et al.

NEURIPS 2025arXiv:2511.10721
2
citations
#12028

Preconditioners for the Stochastic Training of Neural Fields

Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey

CVPR 2025arXiv:2402.08784
2
citations
#12029

Multiplication-Free Parallelizable Spiking Neurons with Efficient Spatio-Temporal Dynamics

Peng Xue, Wei Fang, Zhengyu Ma et al.

NEURIPS 2025oralarXiv:2501.14490
2
citations
#12030

Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning

Ziheng Cheng, Tianyu Xie, Shiyue Zhang et al.

NEURIPS 2025arXiv:2502.04491
2
citations
#12031

Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences

Jing-An Sun, Hang Fan, Junchao Gong et al.

NEURIPS 2025arXiv:2505.22008
2
citations
#12032

Stealthy Yet Effective: Distribution-Preserving Backdoor Attacks on Graph Classification

Xiaobao Wang, Ruoxiao Sun, Yujun Zhang et al.

NEURIPS 2025arXiv:2509.26032
2
citations
#12033

Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding

Zaiquan Yang, Yuhao LIU, Gerhard Hancke et al.

NEURIPS 2025oralarXiv:2509.15178
2
citations
#12034

AmorLIP: Efficient Language-Image Pretraining via Amortization

Haotian Sun, Yitong Li, Yuchen Zhuang et al.

NEURIPS 2025arXiv:2505.18983
2
citations
#12035

Environment Inference for Learning Generalizable Dynamical System

Shixuan Liu, Yue He, Haotian Wang et al.

NEURIPS 2025spotlightarXiv:2510.19784
2
citations
#12036

Riemannian Proximal Sampler for High-accuracy Sampling on Manifolds

Yunrui Guan, Krishnakumar Balasubramanian, Shiqian Ma

NEURIPS 2025arXiv:2502.07265
2
citations
#12037

Toward Relative Positional Encoding in Spiking Transformers

Changze Lv, Yansen Wang, Dongqi Han et al.

NEURIPS 2025oralarXiv:2501.16745
2
citations
#12038

Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback

Orin Levy, Liad Erez, Alon Peled-Cohen et al.

NEURIPS 2025spotlightarXiv:2510.09127
2
citations
#12039

MS-BART: Unified Modeling of Mass Spectra and Molecules for Structure Elucidation

Yang Han, Pengyu Wang, Kai Yu et al.

NEURIPS 2025arXiv:2510.20615
2
citations
#12040

Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training

Hong Wang, Haiyang Xin, Jie Wang et al.

NEURIPS 2025arXiv:2510.25803
2
citations
#12041

FuXi-Ocean: A Global Ocean Forecasting System with Sub-Daily Resolution

Qiusheng Huang, Yuan Niu, Xiaohui Zhong et al.

NEURIPS 2025oralarXiv:2506.03210
2
citations
#12042

PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description

Ziqi Cai, Shuchen Weng, Yifei Xia et al.

CVPR 2025
2
citations
#12043

Hand-held Object Reconstruction from RGB Video with Dynamic Interaction

Shijian Jiang, Qi Ye, Rengan Xie et al.

CVPR 2025
2
citations
#12044

ProReflow: Progressive Reflow with Decomposed Velocity

Lei Ke, Haohang Xu, Xuefei Ning et al.

CVPR 2025arXiv:2503.04824
2
citations
#12045

Evaluating Model Perception of Color Illusions in Photorealistic Scenes

Lingjun Mao, Zineng Tang, Alane Suhr

CVPR 2025arXiv:2412.06184
2
citations
#12046

DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning

Sikai Bai, Haoxi Li, Jie ZHANG et al.

NEURIPS 2025arXiv:2509.16105
2
citations
#12047

Intrinsic Benefits of Categorical Distributional Loss: Uncertainty-aware Regularized Exploration in Reinforcement Learning

Ke Sun, Yingnan Zhao, Enze Shi et al.

NEURIPS 2025arXiv:2110.03155
2
citations
#12048

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

Xinran Gu, Kaifeng Lyu, Jiazheng Li et al.

NEURIPS 2025spotlightarXiv:2505.18091
2
citations
#12049

SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG

Xiaonan Si, Meilin Zhu, Simeng Qin et al.

NEURIPS 2025arXiv:2510.09710
2
citations
#12050

KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities

Tianyi Liu, Haochuan Jiang, Kaizhu Huang

CVPR 2025
2
citations
#12051

Causal Graphical Models for Vision-Language Compositional Understanding

Fiorenzo Parascandolo, Nicholas Moratelli, Enver Sangineto et al.

ICLR 2025arXiv:2412.09353
2
citations
#12052

STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation

Hossein Goli, Michael Gimelfarb, Nathan de Lara et al.

NEURIPS 2025spotlightarXiv:2505.20781
2
citations
#12053

Learnable Sampler Distillation for Discrete Diffusion Models

Feiyang Fu, Tongxian Guo, Zhaoqiang Liu

NEURIPS 2025spotlightarXiv:2509.19962
2
citations
#12054

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Jinkun Hao, Naifu Liang, Zhen Luo et al.

NEURIPS 2025spotlightarXiv:2509.22281
2
citations
#12055

ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation

Lingfeng Wang, Hualing Lin, Senda Chen et al.

NEURIPS 2025arXiv:2505.16495
2
citations
#12056

Listwise Preference Diffusion Optimization for User Behavior Trajectories Prediction

Hongtao Huang, Chengkai Huang, Junda Wu et al.

NEURIPS 2025arXiv:2511.00530
2
citations
#12057

Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection

Yu Guo, Shengfeng He, Yuxu Lu et al.

NEURIPS 2025spotlightarXiv:2509.20745
2
citations
#12058

Online Statistical Inference in Decision Making with Matrix Context

Qiyu Han, Will Wei Sun, Yichen Zhang

NEURIPS 2025arXiv:2212.11385
2
citations
#12059

Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition

Fan LIU, Jindong Han, Tengfei Lyu et al.

NEURIPS 2025arXiv:2510.15280
2
citations
#12060

Semantic-guided Diverse Decoding for Large Language Model

Weijie Shi, Yue Cui, Yaguang Wu et al.

NEURIPS 2025arXiv:2506.23601
2
citations
#12061

Reversing Flow for Image Restoration

Haina Qin, Wenyang Luo, Bing Li et al.

CVPR 2025arXiv:2506.16961
2
citations
#12062

DiffCAM: Data-Driven Saliency Maps by Capturing Feature Differences

Xingjian Li, Qiming Zhao, Neelesh Bisht et al.

CVPR 2025highlight
2
citations
#12063

The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine Learning

Toby Boyne, Juan Campos, Rebecca Langdon et al.

NEURIPS 2025arXiv:2506.07619
2
citations
#12064

Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need

Qiang Wang, Xiang Song, Yuhang He et al.

CVPR 2025arXiv:2505.23744
2
citations
#12065

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs

Guiyao Tie, Zenghui Yuan, Zeli Zhao et al.

NEURIPS 2025arXiv:2510.16062
2
citations
#12066

UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation

Xiaoqi Zhao, Youwei Pang, Chenyang Yu et al.

NEURIPS 2025arXiv:2509.16170
2
citations
#12067

Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism

Junfei Zhou, Penglin Dai, Quanmin Wei et al.

NEURIPS 2025arXiv:2510.19618
2
citations
#12068

GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation

Zhengqiang ZHANG, Rongyuan Wu, Lingchen Sun et al.

NEURIPS 2025arXiv:2509.01109
2
citations
#12069

Linear Attention for Efficient Bidirectional Sequence Modeling

Arshia Afzal, Elias Abad Rocamora, Leyla Candogan et al.

NEURIPS 2025arXiv:2502.16249
2
citations
#12070

Analog Foundation Models

Julian Büchel, Iason Chalas, Giovanni Acampa et al.

NEURIPS 2025arXiv:2505.09663
2
citations
#12071

Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs

Yusheng Zhao, Qixin Zhang, Xiao Luo et al.

NEURIPS 2025arXiv:2505.17599
2
citations
#12072

The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models

Lijun Sheng, Jian Liang, Ran He et al.

NEURIPS 2025arXiv:2506.24000
2
citations
#12073

CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes

ziteng xue, Mingzhe Guo, Heng Fan et al.

CVPR 2025
2
citations
#12074

VideoLucy: Deep Memory Backtracking for Long Video Understanding

Jialong Zuo, Yongtai Deng, Lingdong Kong et al.

NEURIPS 2025oralarXiv:2510.12422
2
citations
#12075

A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees

Yuhao Zhou, Jintao Xu, Bingrui Li et al.

NEURIPS 2025arXiv:2502.04799
2
citations
#12076

PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation

Uyoung Jeong, Jonathan Freer, Seungryul Baek et al.

CVPR 2025arXiv:2505.17475
2
citations
#12077

Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Yongqiang Yao, Jingru Tan, Kaihuan Liang et al.

NEURIPS 2025
2
citations
#12078

Instance-Level Composed Image Retrieval

Bill Psomas, George Retsinas, Nikos Efthymiadis et al.

NEURIPS 2025arXiv:2510.25387
2
citations
#12079

AdaptGrad: Adaptive Sampling to Reduce Noise

Linjiang Zhou, Chao Ma, Zepeng Wang et al.

NEURIPS 2025arXiv:2410.07711
2
citations
#12080

Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang et al.

NEURIPS 2025arXiv:2502.21309
2
citations
#12081

GenSpace: Benchmarking Spatially-Aware Image Generation

Zehan Wang, Jiayang Xu, Ziang Zhang et al.

NEURIPS 2025arXiv:2505.24870
2
citations
#12082

Generative Modeling of Class Probability for Multi-Modal Representation Learning

JungKyoo Shin, Bumsoo Kim, Eunwoo Kim

CVPR 2025highlightarXiv:2503.17417
2
citations
#12083

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Youliang Yuan, Wenxiang Jiao, Yuejin Xie et al.

NEURIPS 2025arXiv:2505.17455
2
citations
#12084

MedChain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence

Jie Liu, Wenxuan Wang, Zizhan Ma et al.

NEURIPS 2025spotlightarXiv:2412.01605
2
citations
#12085

MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives

Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu

NEURIPS 2025oralarXiv:2501.04184
2
citations
#12086

Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation

Seokil Ham, Hee-Seon Kim, Sangmin Woo et al.

CVPR 2025arXiv:2411.15224
2
citations
#12087

RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects

Jaeguk Kim, Jaewoo Park, Keuntek Lee et al.

CVPR 2025arXiv:2505.10841
2
citations
#12088

Variance-Reducing Couplings for Random Features

Isaac Reid, Stratis Markou, Krzysztof Choromanski et al.

ICLR 2025arXiv:2405.16541
2
citations
#12089

Compass Control: Multi Object Orientation Control for Text-to-Image Generation

Rishubh Parihar, Vaibhav Agrawal, Sachidanand VS et al.

CVPR 2025arXiv:2504.06752
2
citations
#12090

DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning

Kun Zhang, Jingyu Li, Zhe Li et al.

CVPR 2025
2
citations
#12091

Towards a Unified and Verified Understanding of Group-Operation Networks

Wilson Wu, Louis Jaburi, jacob drori et al.

ICLR 2025arXiv:2410.07476
2
citations
#12092

Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers

Runyi Zhao, Sheng Xu, Bo Yue et al.

ICLR 2025
2
citations
#12093

Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment

Guanglu Dong, Xiangyu Liao, Mingyang Li et al.

CVPR 2025arXiv:2503.19295
2
citations
#12094

Learning mirror maps in policy mirror descent

Carlo Alfano, Sebastian Towers, Silvia Sapora et al.

ICLR 2025arXiv:2402.05187
2
citations
#12095

LOCORE: Image Re-ranking with Long-Context Sequence Modeling

Zilin Xiao, Pavel Suma, Ayush Sachdeva et al.

CVPR 2025arXiv:2503.21772
2
citations
#12096

Sample- and Parameter-Efficient Auto-Regressive Image Models

Elad Amrani, Leonid Karlinsky, Alex M. Bronstein

CVPR 2025arXiv:2411.15648
2
citations
#12097

Rethinking Token Reduction with Parameter-Efficient Fine-Tuning in ViT for Pixel-Level Tasks

Cheng Lei, Ao Li, Hu Yao et al.

CVPR 2025
2
citations
#12098

Reasoning Mamba: Hypergraph-Guided Region Relation Calculating for Weakly Supervised Affordance Grounding

Yuxuan Wang, Aming Wu, Muli Yang et al.

CVPR 2025
2
citations
#12099

Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI

Won Jun Kim, Hyungjin Chung, Jaemin Kim et al.

CVPR 2025arXiv:2411.15265
2
citations
#12100

Elucidating the Preconditioning in Consistency Distillation

Kaiwen Zheng, Guande He, Jianfei Chen et al.

ICLR 2025arXiv:2502.02922
2
citations
#12101

Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data

Zhiyuan Ma, Xinyue Liang, Rongyuan Wu et al.

CVPR 2025arXiv:2503.21694
2
citations
#12102

Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene

Tai-Yu Daniel Pan, Sooyoung Jeon, Mengdi Fan et al.

CVPR 2025arXiv:2502.06682
2
citations
#12103

Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation

Rong Qin, Xingyu Liu, Jinglei Shi et al.

CVPR 2025
2
citations
#12104

GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection

Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.

CVPR 2025arXiv:2503.08639
2
citations
#12105

InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation

Zhixin Lai, Keqiang Sun, Fu-Yun Wang et al.

ICLR 2025
2
citations
#12106

STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection

Divya Velayudhan, Abdelfatah Ahmed, Mohamad Alansari et al.

CVPR 2025highlightarXiv:2504.02823
2
citations
#12107

Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression

Juno Kim, Dimitri Meunier, Arthur Gretton et al.

ICLR 2025arXiv:2501.04898
2
citations
#12108

Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision

Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura et al.

CVPR 2025highlightarXiv:2506.03605
2
citations
#12109

Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

Luke Chen, Junyao Wang, Trier Mortlock et al.

CVPR 2025arXiv:2503.20011
2
citations
#12110

UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units

Huakun Liu, Hiroki Ota, Xin Wei et al.

CVPR 2025highlightarXiv:2505.09393
2
citations
#12111

JPEG Inspired Deep Learning

Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.

ICLR 2025arXiv:2410.07081
2
citations
#12112

GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras

Hanzhang Tu, Zhanfeng Liao, Boyao Zhou et al.

CVPR 2025
2
citations
#12113

SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs

Guibiao Liao, Qing Li, Zhenyu Bao et al.

CVPR 2025arXiv:2503.12535
2
citations
#12114

Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding

Tianyu Chen, Xingcheng Fu, Yisen Gao et al.

CVPR 2025highlightarXiv:2503.18578
2
citations
#12115

Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond

Costin-Andrei Oncescu, Sanket Jayant Purandare, Stratos Idreos et al.

ICLR 2025arXiv:2410.12982
2
citations
#12116

Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability

Unki Park, Seongmoon Jeong, Jang Youngchan et al.

CVPR 2025
2
citations
#12117

An Auditing Test to Detect Behavioral Shift in Language Models

Leo Richter, Xuanli He, Pasquale Minervini et al.

ICLR 2025oralarXiv:2410.19406
2
citations
#12118

Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment

Jinwoo Choi, Seung-Woo Seo

ICLR 2025oralarXiv:2504.14805
2
citations
#12119

CustAny: Customizing Anything from A Single Example

Lingjie Kong, Kai WU, Chengming Xu et al.

CVPR 2025arXiv:2406.11643
2
citations
#12120

SerialGen: Personalized Image Generation by First Standardization Then Personalization

Cong Xie, Han Zou, Ruiqi Yu et al.

CVPR 2025arXiv:2412.01485
2
citations
#12121

Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds

Eitan Shaar, Ariel Shaulov, Gal Chechik et al.

CVPR 2025arXiv:2503.13693
2
citations
#12122

CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images

Jungho Lee, Suhwan Cho, Taeoh Kim et al.

CVPR 2025arXiv:2412.16028
2
citations
#12123

Understanding Multi-layered Transmission Matrices

Marina Alterman, Anat Levin

CVPR 2025highlightarXiv:2410.23864
2
citations
#12124

Augmenting Perceptual Super-Resolution via Image Quality Predictors

Fengjia Zhang, Samrudhdhi Rangrej, Tristan T Aumentado-Armstrong et al.

CVPR 2025arXiv:2504.18524
2
citations
#12125

Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping

Tianhao Wu, Jing Yang, Zhilin Guo et al.

ICLR 2025arXiv:2405.12069
2
citations
#12126

Exploring the Camera Bias of Person Re-identification

Myungseo Song, Jin-Woo Park, Jong-Seok Lee

ICLR 2025arXiv:2502.10195
2
citations
#12127

Wavelet-based Positional Representation for Long Context

Yui Oka, Taku Hasegawa, Kyosuke Nishida et al.

ICLR 2025arXiv:2502.02004
2
citations
#12128

SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model

Yucheng Mao, Boyang Wang, Nilesh Kulkarni et al.

CVPR 2025arXiv:2503.14463
2
citations
#12129

ADAM Optimization with Adaptive Batch Selection

Gyu Yeol Kim, Min-hwan Oh

ICLR 2025arXiv:2512.06795
2
citations
#12130

PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows

Huaguan Chen, Yang Liu, Hao Sun

ICLR 2025oralarXiv:2504.06070
2
citations
#12131

Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations

Jungin Park, Jiyoung Lee, Kwanghoon Sohn

CVPR 2025arXiv:2503.19706
2
citations
#12132

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement

Bryan Bo Cao, Lawrence OGorman, Michael Coss et al.

ICLR 2025arXiv:2411.01099
2
citations
#12133

Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective

Yushun Dong, Patrick Soga, Yinhan He et al.

ICLR 2025oralarXiv:2412.07188
2
citations
#12134

GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing

Tong Wang, Ting Liu, Xiaochao Qu et al.

CVPR 2025arXiv:2505.04915
2
citations
#12135

Towards Human-Understandable Multi-Dimensional Concept Discovery

Arne Grobrügge, Niklas Kühl, Gerhard Satzger et al.

CVPR 2025arXiv:2503.18629
2
citations
#12136

FLOPS: Forward Learning with OPtimal Sampling

Tao Ren, Zishi Zhang, Jinyang Jiang et al.

ICLR 2025arXiv:2410.05966
2
citations
#12137

Neural Functions for Learning Periodic Signal

Woojin Cho, Minju Jo, Kookjin Lee et al.

ICLR 2025oralarXiv:2506.09526
2
citations
#12138

Teaching Human Behavior Improves Content Understanding Abilities Of VLMs

SOMESH SINGH, Harini S I, Yaman Singla et al.

ICLR 2025
2
citations
#12139

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Pei Zhou, Ruizhe Liu, Qian Luo et al.

ICLR 2025
2
citations
#12140

Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity

Cedar Site Bai, Brian Bullins

ICLR 2025arXiv:2409.10773
2
citations
#12141

KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting

Ronghua Zheng, Hanru Bai, Weiyang Ding

ICLR 2025oral
2
citations
#12142

Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

Hang Shao, lei luo, Jianjun Qian et al.

CVPR 2025arXiv:2503.11465
2
citations
#12143

A Unified Image-Dense Annotation Generation Model for Underwater Scenes

Hongkai Lin, Dingkang Liang, Zhenghao Qi et al.

CVPR 2025arXiv:2503.21771
2
citations
#12144

Weakly Supervised Video Scene Graph Generation via Natural Language Supervision

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2025oralarXiv:2502.15370
2
citations
#12145

Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration

yuxuan Gu, Huaian Chen, Yi Jin et al.

CVPR 2025
2
citations
#12146

H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection

Yuhang Liu, Wenjie Zhao, Yunhui Guo

CVPR 2025arXiv:2503.14832
2
citations
#12147

Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models

Hao-Chien Hsueh, Wen-Hsiao Peng, Ching-Chun Huang

ICLR 2025arXiv:2511.16904
2
citations
#12148

Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection

Yante Li, Hanwen Qi, Haoyu Chen et al.

CVPR 2025highlightarXiv:2503.00643
2
citations
#12149

GLOMA: Global Video Text Spotting with Morphological Association

Han Wang, Yanjie Wang, Yang Li et al.

ICLR 2025oral
2
citations
#12150

ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way

Jiazi Bu, Pengyang Ling, Pan Zhang et al.

CVPR 2025arXiv:2410.06241
2
citations
#12151

Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition

Wuyou Xia, Guoli Jia, Sicheng Zhao et al.

CVPR 2025
2
citations
#12152

SparsyFed: Sparse Adaptive Federated Learning

Adriano Guastella, Lorenzo Sani, Alex Iacob et al.

ICLR 2025
2
citations
#12153

Discrete Distribution Networks

Lei Yang

ICLR 2025arXiv:2401.00036
2
citations
#12154

beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation

Ming Hu, Jianfu Yin, Zhuangzhuang Ma et al.

CVPR 2025
2
citations
#12155

FLAVC: Learned Video Compression with Feature Level Attention

Chun Zhang, Heming Sun, Jiro Katto

CVPR 2025
2
citations
#12156

Endowing Visual Reprogramming with Adversarial Robustness

Shengjie Zhou, Xin Cheng, Haiyang Xu et al.

ICLR 2025
2
citations
#12157

Robust Multi-Object 4D Generation for In-the-wild Videos

Wen-Hsuan Chu, Lei Ke, Jianmeng Liu et al.

CVPR 2025
2
citations
#12158

Dual-Granularity Semantic Guided Sparse Routing Diffusion Model for General Pansharpening

Yinghui Xing, Qu Li Tao, Shizhou Zhang et al.

CVPR 2025
2
citations
#12159

BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation

Shengze Wang, Jiefeng Li, Tianye Li et al.

CVPR 2025
2
citations
#12160

Coherent 3D Portrait Video Reconstruction via Triplane Fusion

Shengze Wang, Xueting Li, Chao Liu et al.

CVPR 2025arXiv:2405.00794
2
citations
#12161

Generative Map Priors for Collaborative BEV Semantic Segmentation

Jiahui Fu, Yue Gong, Luting Wang et al.

CVPR 2025
2
citations
#12162

FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering

Jingqiu Zhou, Lue Fan, Linjiang Huang et al.

CVPR 2025
2
citations
#12163

Data-centric Prediction Explanation via Kernelized Stein Discrepancy

Mahtab Sarvmaili, Hassan Sajjad, Ga Wu

ICLR 2025arXiv:2403.15576
2
citations
#12164

Global Convergence of Policy Gradient in Average Reward MDPs

Navdeep Kumar, Yashaswini Murthy, Itai Shufaro et al.

ICLR 2025
2
citations
#12165

EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstruction

Dongrui Dai, Yuxiang Xing

CVPR 2025
2
citations
#12166

Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization

Zixuan Gong, Xiaolin Hu, Huayi Tang et al.

ICLR 2025arXiv:2502.17024
2
citations
#12167

RDD: Robust Feature Detector and Descriptor using Deformable Transformer

Gonglin Chen, Tianwen Fu, Haiwei Chen et al.

CVPR 2025arXiv:2505.08013
2
citations
#12168

DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer

Ho-Joong Kim, Yearang Lee, Jung-Ho Hong et al.

CVPR 2025arXiv:2505.05711
2
citations
#12169

Person De-reidentification: A Variation-guided Identity Shift Modeling

Yi-Xing Peng, Yu-Ming Tang, Kun-Yu Lin et al.

CVPR 2025
2
citations
#12170

Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives

Zeliang Zhang, Susan Liang, Daiki Shimada et al.

ICLR 2025oralarXiv:2502.11858
2
citations
#12171

Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning

Xinsong Feng, Zihan Yu, Yanhai Xiong et al.

ICLR 2025arXiv:2502.05537
2
citations
#12172

LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate

Haoyan Gong, Zhenrong Zhang, Yuzheng Feng et al.

CVPR 2025highlight
2
citations
#12173

Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model

Siyu Chen, Beining Wu, Miao Lu et al.

ICLR 2025
2
citations
#12174

Neural Inverse Rendering from Propagating Light

Anagh Malik, Benjamin Attal, Andrew Xie et al.

CVPR 2025arXiv:2506.05347
2
citations
#12175

Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm

Mathieu Chevalley, Patrick Schwab, Arash Mehrjou

ICLR 2025arXiv:2405.18314
2
citations
#12176

Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability

Lei Wang, Senmao Li, Fei Yang et al.

CVPR 2025arXiv:2505.03097
2
citations
#12177

COFlowNet: Conservative Constraints on Flows Enable High-Quality Candidate Generation

Yudong Zhang, Xuan Yu, Xu Wang et al.

ICLR 2025
2
citations
#12178

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

Yuhan Wang, Fangzhou Hong, Shuai Yang et al.

CVPR 2025arXiv:2503.08664
2
citations
#12179

Continuous Exposure Learning for Low-light Image Enhancement using Neural ODEs

Donggoo Jung, Daehyun Kim, Tae Hyun Kim

ICLR 2025
2
citations
#12180

Nested Diffusion Models Using Hierarchical Latent Priors

Xiao Zhang, Ruoxi Jiang, Rebecca Willett et al.

CVPR 2025arXiv:2412.05984
2
citations
#12181

REVISITING MULTI-PERMUTATION EQUIVARIANCE THROUGH THE LENS OF IRREDUCIBLE REPRESENTATIONS

Yonatan Sverdlov, Ido Springer, Nadav Dym

ICLR 2025arXiv:2410.06665
2
citations
#12182

Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models

Hoang Khoi Nguyen Do, Truc Nguyen, Malik Hassanaly et al.

ICLR 2025arXiv:2503.06413
2
citations
#12183

An Asynchronous Bundle Method for Distributed Learning Problems

Daniel Cederberg, Xuyang Wu, Stephen Boyd et al.

ICLR 2025
2
citations
#12184

MIND over Body: Adaptive Thinking using Dynamic Computation

Mrinal Mathur, Barak Pearlmutter, Sergey Plis

ICLR 2025
2
citations
#12185

UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation

Yinqiao Wang, Hao Xu, Pheng-Ann Heng et al.

CVPR 2025arXiv:2503.13303
2
citations
#12186

No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, HwiJeong Lee, Inha Kang et al.

CVPR 2025arXiv:2503.15910
2
citations
#12187

Leave-One-Out Stable Conformal Prediction

Kiljae Lee, Yuan Zhang

ICLR 2025arXiv:2504.12189
2
citations
#12188

RaSS: Improving Denoising Diffusion Samplers with Reinforced Active Sampling Scheduler

Xin Ding, Lei Yu, Xin Li et al.

CVPR 2025
2
citations
#12189

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

Rasoul Shafipour, David Harrison, Maxwell Horton et al.

ICLR 2025arXiv:2410.10714
2
citations
#12190

EntitySAM: Segment Everything in Video

Mingqiao Ye, Seoung Wug Oh, Lei Ke et al.

CVPR 2025
2
citations
#12191

Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration

Chao Wang, Hehe Fan, Huichen Yang et al.

CVPR 2025
2
citations
#12192

DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes

Ashish Kumar, A. N. Rajagopalan

CVPR 2025
2
citations
#12193

VISTREAM: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network

Kang You, Ziling Wei, Jing Yan et al.

CVPR 2025
2
citations
#12194

AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data

Zengqun Zhao, Ziquan Liu, Yu Cao et al.

CVPR 2025arXiv:2503.05665
2
citations
#12195

v-CLR: View-Consistent Learning for Open-World Instance Segmentation

Chang-Bin Zhang, Jinhong Ni, Yujie Zhong et al.

CVPR 2025highlightarXiv:2504.01383
2
citations
#12196

Progressive Correspondence Regenerator for Robust 3D Registration

Guiyu Zhao, Sheng Ao, Ye Zhang et al.

CVPR 2025arXiv:2502.02163
2
citations
#12197

Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition

ZHANG LINTONG, Kang Yin, Seong-Whan Lee

CVPR 2025arXiv:2511.07974
2
citations
#12198

Shapley-Guided Utility Learning for Effective Graph Inference Data Valuation

Hongliang Chi, Qiong Wu, Zhengyi Zhou et al.

ICLR 2025arXiv:2503.18195
2
citations
#12199

NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics

Kun Yang, Yuxiang Liu, Zeyu Cui et al.

CVPR 2025arXiv:2503.03115
2
citations
#12200

Order-aware Interactive Segmentation

Bin Wang, Anwesa Choudhuri, Meng Zheng et al.

ICLR 2025arXiv:2410.12214
2
citations