Most Cited 2025 "feature dimension selection" Papers
22,274 papers found • Page 61 of 112
Conference
On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding
Haoyuan Wu, Rui Ming, Jilong Gao et al.
Enhancing Adversarial Transferability with Checkpoints of a Single Model’s Training
Shixin Li, Chaoxiang He, Xiaojing Ma et al.
Fast constrained sampling in pre-trained diffusion models
Alexandros Graikos, Nebojsa Jojic, Dimitris Samaras
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Andrea Maracani, Savas Ozkan, Sijun Cho et al.
Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation
Divyanshu Mishra, Mohammadreza Salehi, Pramit Saha et al.
GeoCAD: Local Geometry-Controllable CAD Generation with Large Language Models
Zhanwei Zhang, kaiyuan liu, Junjie Liu et al.
Approximately Aligned Decoding
Daniel Melcer, Sujan Kumar Gonugondla, Pramuditha Perera et al.
Learning on Model Weights using Tree Experts
Eliahu Horwitz, Bar Cavia, Jonathan Kahana et al.
Table as a Modality for Large Language Models
Liyao Li, Chao Ye, Wentao Ye et al.
Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS
Tao Wang, Mengyu Li, Geduo Zeng et al.
Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning
Sanghyun Ahn, Wonje Choi, Junyong Lee et al.
Recovering Dynamic 3D Sketches from Videos
Jaeah Lee, Changwoon Choi, Young Min Kim et al.
AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping
Haonan Dong, Wenhao Zhu, Guojie Song et al.
TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems
Jiahao Yu, Haozhuang Liu, Yeqiu Yang et al.
Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling
Dehao Zhang, Malu Zhang, Shuai Wang et al.
Preference Optimization by Estimating the Ratio of the Data Distribution
Yeongmin Kim, HeeSun Bae, Byeonghu Na et al.
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
Jiamin WU, Kenkun Liu, Han Gao et al.
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
Yanggan Gu, Yuanyi Wang, Zhaoyi Yan et al.
PocketSR: The Super-Resolution Expert in Your Pocket Mobiles
Haoze Sun, Linfeng Jiang, Fan Li et al.
Integral Fast Fourier Color Constancy
Wenjun Wei, Yanlin Qian, Huaian Chen et al.
Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning
Xueyi Ke, Satoshi Tsutsui, Yayun Zhang et al.
FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
Shuqiao Liang, Jian Liu, Chen Renzhang et al.
VITED: Video Temporal Evidence Distillation
Yujie Lu, Yale Song, Lorenzo Torresani et al.
VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Silin Gao, Sheryl Mathew, Li Mi et al.
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
Cheng Chen, Yunpeng Zhai, Yifan Zhao et al.
Deep Tree Tensor Networks
Chang Nie
Fast Data Attribution for Text-to-Image Models
Sheng-Yu Wang, Aaron Hertzmann, Alexei Efros et al.
Preconditioners for the Stochastic Training of Neural Fields
Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey
Multiplication-Free Parallelizable Spiking Neurons with Efficient Spatio-Temporal Dynamics
Peng Xue, Wei Fang, Zhengyu Ma et al.
Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning
Ziheng Cheng, Tianyu Xie, Shiyue Zhang et al.
Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
Jing-An Sun, Hang Fan, Junchao Gong et al.
Stealthy Yet Effective: Distribution-Preserving Backdoor Attacks on Graph Classification
Xiaobao Wang, Ruoxiao Sun, Yujun Zhang et al.
Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding
Zaiquan Yang, Yuhao LIU, Gerhard Hancke et al.
AmorLIP: Efficient Language-Image Pretraining via Amortization
Haotian Sun, Yitong Li, Yuchen Zhuang et al.
Environment Inference for Learning Generalizable Dynamical System
Shixuan Liu, Yue He, Haotian Wang et al.
Riemannian Proximal Sampler for High-accuracy Sampling on Manifolds
Yunrui Guan, Krishnakumar Balasubramanian, Shiqian Ma
Toward Relative Positional Encoding in Spiking Transformers
Changze Lv, Yansen Wang, Dongqi Han et al.
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
Orin Levy, Liad Erez, Alon Peled-Cohen et al.
MS-BART: Unified Modeling of Mass Spectra and Molecules for Structure Elucidation
Yang Han, Pengyu Wang, Kai Yu et al.
Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
Hong Wang, Haiyang Xin, Jie Wang et al.
FuXi-Ocean: A Global Ocean Forecasting System with Sub-Daily Resolution
Qiusheng Huang, Yuan Niu, Xiaohui Zhong et al.
PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description
Ziqi Cai, Shuchen Weng, Yifei Xia et al.
Hand-held Object Reconstruction from RGB Video with Dynamic Interaction
Shijian Jiang, Qi Ye, Rengan Xie et al.
ProReflow: Progressive Reflow with Decomposed Velocity
Lei Ke, Haohang Xu, Xuefei Ning et al.
Evaluating Model Perception of Color Illusions in Photorealistic Scenes
Lingjun Mao, Zineng Tang, Alane Suhr
DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning
Sikai Bai, Haoxi Li, Jie ZHANG et al.
Intrinsic Benefits of Categorical Distributional Loss: Uncertainty-aware Regularized Exploration in Reinforcement Learning
Ke Sun, Yingnan Zhao, Enze Shi et al.
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
Xinran Gu, Kaifeng Lyu, Jiazheng Li et al.
SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
Xiaonan Si, Meilin Zhu, Simeng Qin et al.
KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities
Tianyi Liu, Haochuan Jiang, Kaizhu Huang
Causal Graphical Models for Vision-Language Compositional Understanding
Fiorenzo Parascandolo, Nicholas Moratelli, Enver Sangineto et al.
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli, Michael Gimelfarb, Nathan de Lara et al.
Learnable Sampler Distillation for Discrete Diffusion Models
Feiyang Fu, Tongxian Guo, Zhaoqiang Liu
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Jinkun Hao, Naifu Liang, Zhen Luo et al.
ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Lingfeng Wang, Hualing Lin, Senda Chen et al.
Listwise Preference Diffusion Optimization for User Behavior Trajectories Prediction
Hongtao Huang, Chengkai Huang, Junda Wu et al.
Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection
Yu Guo, Shengfeng He, Yuxu Lu et al.
Online Statistical Inference in Decision Making with Matrix Context
Qiyu Han, Will Wei Sun, Yichen Zhang
Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition
Fan LIU, Jindong Han, Tengfei Lyu et al.
Semantic-guided Diverse Decoding for Large Language Model
Weijie Shi, Yue Cui, Yaguang Wu et al.
Reversing Flow for Image Restoration
Haina Qin, Wenyang Luo, Bing Li et al.
DiffCAM: Data-Driven Saliency Maps by Capturing Feature Differences
Xingjian Li, Qiming Zhao, Neelesh Bisht et al.
The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine Learning
Toby Boyne, Juan Campos, Rebecca Langdon et al.
Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need
Qiang Wang, Xiang Song, Yuhang He et al.
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
Guiyao Tie, Zenghui Yuan, Zeli Zhao et al.
UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation
Xiaoqi Zhao, Youwei Pang, Chenyang Yu et al.
Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism
Junfei Zhou, Penglin Dai, Quanmin Wei et al.
GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation
Zhengqiang ZHANG, Rongyuan Wu, Lingchen Sun et al.
Linear Attention for Efficient Bidirectional Sequence Modeling
Arshia Afzal, Elias Abad Rocamora, Leyla Candogan et al.
Analog Foundation Models
Julian Büchel, Iason Chalas, Giovanni Acampa et al.
Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs
Yusheng Zhao, Qixin Zhang, Xiao Luo et al.
The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models
Lijun Sheng, Jian Liang, Ran He et al.
CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes
ziteng xue, Mingzhe Guo, Heng Fan et al.
VideoLucy: Deep Memory Backtracking for Long Video Understanding
Jialong Zuo, Yongtai Deng, Lingdong Kong et al.
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
Yuhao Zhou, Jintao Xu, Bingrui Li et al.
PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation
Uyoung Jeong, Jonathan Freer, Seungryul Baek et al.
Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM
Yongqiang Yao, Jingru Tan, Kaihuan Liang et al.
Instance-Level Composed Image Retrieval
Bill Psomas, George Retsinas, Nikos Efthymiadis et al.
AdaptGrad: Adaptive Sampling to Reduce Noise
Linjiang Zhou, Chao Ma, Zepeng Wang et al.
Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling
Yihong Dong, Ge Li, Xue Jiang et al.
GenSpace: Benchmarking Spatially-Aware Image Generation
Zehan Wang, Jiayang Xu, Ziang Zhang et al.
Generative Modeling of Class Probability for Multi-Modal Representation Learning
JungKyoo Shin, Bumsoo Kim, Eunwoo Kim
Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
Youliang Yuan, Wenxiang Jiao, Yuejin Xie et al.
MedChain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence
Jie Liu, Wenxuan Wang, Zizhan Ma et al.
MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives
Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Seokil Ham, Hee-Seon Kim, Sangmin Woo et al.
RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects
Jaeguk Kim, Jaewoo Park, Keuntek Lee et al.
Variance-Reducing Couplings for Random Features
Isaac Reid, Stratis Markou, Krzysztof Choromanski et al.
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
Rishubh Parihar, Vaibhav Agrawal, Sachidanand VS et al.
DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning
Kun Zhang, Jingyu Li, Zhe Li et al.
Towards a Unified and Verified Understanding of Group-Operation Networks
Wilson Wu, Louis Jaburi, jacob drori et al.
Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers
Runyi Zhao, Sheng Xu, Bo Yue et al.
Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment
Guanglu Dong, Xiangyu Liao, Mingyang Li et al.
Learning mirror maps in policy mirror descent
Carlo Alfano, Sebastian Towers, Silvia Sapora et al.
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
Zilin Xiao, Pavel Suma, Ayush Sachdeva et al.
Sample- and Parameter-Efficient Auto-Regressive Image Models
Elad Amrani, Leonid Karlinsky, Alex M. Bronstein
Rethinking Token Reduction with Parameter-Efficient Fine-Tuning in ViT for Pixel-Level Tasks
Cheng Lei, Ao Li, Hu Yao et al.
Reasoning Mamba: Hypergraph-Guided Region Relation Calculating for Weakly Supervised Affordance Grounding
Yuxuan Wang, Aming Wu, Muli Yang et al.
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI
Won Jun Kim, Hyungjin Chung, Jaemin Kim et al.
Elucidating the Preconditioning in Consistency Distillation
Kaiwen Zheng, Guande He, Jianfei Chen et al.
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
Zhiyuan Ma, Xinyue Liang, Rongyuan Wu et al.
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Daniel Pan, Sooyoung Jeon, Mengdi Fan et al.
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation
Rong Qin, Xingyu Liu, Jinglei Shi et al.
GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection
Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.
InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation
Zhixin Lai, Keqiang Sun, Fu-Yun Wang et al.
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
Divya Velayudhan, Abdelfatah Ahmed, Mohamad Alansari et al.
Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression
Juno Kim, Dimitri Meunier, Arthur Gretton et al.
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura et al.
Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception
Luke Chen, Junyao Wang, Trier Mortlock et al.
UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units
Huakun Liu, Hiroki Ota, Xin Wei et al.
JPEG Inspired Deep Learning
Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.
GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras
Hanzhang Tu, Zhanfeng Liao, Boyao Zhou et al.
SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
Guibiao Liao, Qing Li, Zhenyu Bao et al.
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding
Tianyu Chen, Xingcheng Fu, Yisen Gao et al.
Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond
Costin-Andrei Oncescu, Sanket Jayant Purandare, Stratos Idreos et al.
Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability
Unki Park, Seongmoon Jeong, Jang Youngchan et al.
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter, Xuanli He, Pasquale Minervini et al.
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment
Jinwoo Choi, Seung-Woo Seo
CustAny: Customizing Anything from A Single Example
Lingjie Kong, Kai WU, Chengming Xu et al.
SerialGen: Personalized Image Generation by First Standardization Then Personalization
Cong Xie, Han Zou, Ruiqi Yu et al.
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds
Eitan Shaar, Ariel Shaulov, Gal Chechik et al.
CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images
Jungho Lee, Suhwan Cho, Taeoh Kim et al.
Understanding Multi-layered Transmission Matrices
Marina Alterman, Anat Levin
Augmenting Perceptual Super-Resolution via Image Quality Predictors
Fengjia Zhang, Samrudhdhi Rangrej, Tristan T Aumentado-Armstrong et al.
Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping
Tianhao Wu, Jing Yang, Zhilin Guo et al.
Exploring the Camera Bias of Person Re-identification
Myungseo Song, Jin-Woo Park, Jong-Seok Lee
Wavelet-based Positional Representation for Long Context
Yui Oka, Taku Hasegawa, Kyosuke Nishida et al.
SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model
Yucheng Mao, Boyang Wang, Nilesh Kulkarni et al.
ADAM Optimization with Adaptive Batch Selection
Gyu Yeol Kim, Min-hwan Oh
PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows
Huaguan Chen, Yang Liu, Hao Sun
Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Jungin Park, Jiyoung Lee, Kwanghoon Sohn
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
Bryan Bo Cao, Lawrence OGorman, Michael Coss et al.
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective
Yushun Dong, Patrick Soga, Yinhan He et al.
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
Tong Wang, Ting Liu, Xiaochao Qu et al.
Towards Human-Understandable Multi-Dimensional Concept Discovery
Arne Grobrügge, Niklas Kühl, Gerhard Satzger et al.
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren, Zishi Zhang, Jinyang Jiang et al.
Neural Functions for Learning Periodic Signal
Woojin Cho, Minju Jo, Kookjin Lee et al.
Teaching Human Behavior Improves Content Understanding Abilities Of VLMs
SOMESH SINGH, Harini S I, Yaman Singla et al.
AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations
Pei Zhou, Ruizhe Liu, Qian Luo et al.
Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity
Cedar Site Bai, Brian Bullins
KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting
Ronghua Zheng, Hanru Bai, Weiyang Ding
Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios
Hang Shao, lei luo, Jianjun Qian et al.
A Unified Image-Dense Annotation Generation Model for Underwater Scenes
Hongkai Lin, Dingkang Liang, Zhenghao Qi et al.
Weakly Supervised Video Scene Graph Generation via Natural Language Supervision
Kibum Kim, Kanghoon Yoon, Yeonjun In et al.
Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration
yuxuan Gu, Huaian Chen, Yi Jin et al.
H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection
Yuhang Liu, Wenjie Zhao, Yunhui Guo
Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
Hao-Chien Hsueh, Wen-Hsiao Peng, Ching-Chun Huang
Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection
Yante Li, Hanwen Qi, Haoyu Chen et al.
GLOMA: Global Video Text Spotting with Morphological Association
Han Wang, Yanjie Wang, Yang Li et al.
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
Jiazi Bu, Pengyang Ling, Pan Zhang et al.
Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition
Wuyou Xia, Guoli Jia, Sicheng Zhao et al.
SparsyFed: Sparse Adaptive Federated Learning
Adriano Guastella, Lorenzo Sani, Alex Iacob et al.
Discrete Distribution Networks
Lei Yang
beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation
Ming Hu, Jianfu Yin, Zhuangzhuang Ma et al.
FLAVC: Learned Video Compression with Feature Level Attention
Chun Zhang, Heming Sun, Jiro Katto
Endowing Visual Reprogramming with Adversarial Robustness
Shengjie Zhou, Xin Cheng, Haiyang Xu et al.
Robust Multi-Object 4D Generation for In-the-wild Videos
Wen-Hsuan Chu, Lei Ke, Jianmeng Liu et al.
Dual-Granularity Semantic Guided Sparse Routing Diffusion Model for General Pansharpening
Yinghui Xing, Qu Li Tao, Shizhou Zhang et al.
BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation
Shengze Wang, Jiefeng Li, Tianye Li et al.
Coherent 3D Portrait Video Reconstruction via Triplane Fusion
Shengze Wang, Xueting Li, Chao Liu et al.
Generative Map Priors for Collaborative BEV Semantic Segmentation
Jiahui Fu, Yue Gong, Luting Wang et al.
FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering
Jingqiu Zhou, Lue Fan, Linjiang Huang et al.
Data-centric Prediction Explanation via Kernelized Stein Discrepancy
Mahtab Sarvmaili, Hassan Sajjad, Ga Wu
Global Convergence of Policy Gradient in Average Reward MDPs
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro et al.
EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstruction
Dongrui Dai, Yuxiang Xing
Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization
Zixuan Gong, Xiaolin Hu, Huayi Tang et al.
RDD: Robust Feature Detector and Descriptor using Deformable Transformer
Gonglin Chen, Tianwen Fu, Haiwei Chen et al.
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim, Yearang Lee, Jung-Ho Hong et al.
Person De-reidentification: A Variation-guided Identity Shift Modeling
Yi-Xing Peng, Yu-Ming Tang, Kun-Yu Lin et al.
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives
Zeliang Zhang, Susan Liang, Daiki Shimada et al.
Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning
Xinsong Feng, Zihan Yu, Yanhai Xiong et al.
LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate
Haoyan Gong, Zhenrong Zhang, Yuzheng Feng et al.
Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model
Siyu Chen, Beining Wu, Miao Lu et al.
Neural Inverse Rendering from Propagating Light
Anagh Malik, Benjamin Attal, Andrew Xie et al.
Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm
Mathieu Chevalley, Patrick Schwab, Arash Mehrjou
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Lei Wang, Senmao Li, Fei Yang et al.
COFlowNet: Conservative Constraints on Flows Enable High-Quality Candidate Generation
Yudong Zhang, Xuan Yu, Xu Wang et al.
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Yuhan Wang, Fangzhou Hong, Shuai Yang et al.
Continuous Exposure Learning for Low-light Image Enhancement using Neural ODEs
Donggoo Jung, Daehyun Kim, Tae Hyun Kim
Nested Diffusion Models Using Hierarchical Latent Priors
Xiao Zhang, Ruoxi Jiang, Rebecca Willett et al.
REVISITING MULTI-PERMUTATION EQUIVARIANCE THROUGH THE LENS OF IRREDUCIBLE REPRESENTATIONS
Yonatan Sverdlov, Ido Springer, Nadav Dym
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Hoang Khoi Nguyen Do, Truc Nguyen, Malik Hassanaly et al.
An Asynchronous Bundle Method for Distributed Learning Problems
Daniel Cederberg, Xuyang Wu, Stephen Boyd et al.
MIND over Body: Adaptive Thinking using Dynamic Computation
Mrinal Mathur, Barak Pearlmutter, Sergey Plis
UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation
Yinqiao Wang, Hao Xu, Pheng-Ann Heng et al.
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
Junsung Park, HwiJeong Lee, Inha Kang et al.
Leave-One-Out Stable Conformal Prediction
Kiljae Lee, Yuan Zhang
RaSS: Improving Denoising Diffusion Samplers with Reinforced Active Sampling Scheduler
Xin Ding, Lei Yu, Xin Li et al.
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour, David Harrison, Maxwell Horton et al.
EntitySAM: Segment Everything in Video
Mingqiao Ye, Seoung Wug Oh, Lei Ke et al.
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
Chao Wang, Hehe Fan, Huichen Yang et al.
DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes
Ashish Kumar, A. N. Rajagopalan
VISTREAM: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network
Kang You, Ziling Wei, Jing Yan et al.
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Zengqun Zhao, Ziquan Liu, Yu Cao et al.
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
Chang-Bin Zhang, Jinhong Ni, Yujie Zhong et al.
Progressive Correspondence Regenerator for Robust 3D Registration
Guiyu Zhao, Sheng Ao, Ye Zhang et al.
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition
ZHANG LINTONG, Kang Yin, Seong-Whan Lee
Shapley-Guided Utility Learning for Effective Graph Inference Data Valuation
Hongliang Chi, Qiong Wu, Zhengyi Zhou et al.
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
Kun Yang, Yuxiang Liu, Zeyu Cui et al.
Order-aware Interactive Segmentation
Bin Wang, Anwesa Choudhuri, Meng Zheng et al.