Most Cited 2025 "universal inverted bottleneck" Papers
Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation
Wenxuan Bao, Zhichen Zeng, Zhining Liu et al.
Discriminating image representations with principal distortions
Jenelle Feather, David Lipshutz, Sarah Harvey et al.
Collapse-Proof Non-Contrastive Self-Supervised Learning
Emanuele Sansone, Tim Lebailly, Tinne Tuytelaars
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
Xianliang Li, Jun Luo, Zhiwei Zheng et al.
Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes
Dongjae Jeon, Dueun Kim, Albert No
TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation
Gwen Yidou-Weng, Benjie Wang, Guy Van den Broeck
MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
Loris Gaven, Thomas Carta, Clément Romac et al.
Representations Shape Weak-to-Strong Generalization: Theoretical Insights and Empirical Predictions
Yihao Xue, Jiping Li, Baharan Mirzasoleiman
ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning
Chau Pham, Juan C. Caicedo, Bryan Plummer
Position: Lifetime tuning is incompatible with continual reinforcement learning
Golnaz Mesbahi, Parham Mohammad Panahi, Olya Mastikhina et al.
Epistemic Monte Carlo Tree Search
Yaniv Oren, Viliam Vadocz, Matthijs T. J. Spaan et al.
Point-Level Topological Representation Learning on Point Clouds
Vincent P. Grande, Michael Schaub
ParaSolver: A Hierarchical Parallel Integral Solver for Diffusion Models
Jianrong Lu, Zhiyu Zhu, Junhui Hou
GridMix: Exploring Spatial Modulation for Neural Fields in PDE Modeling
Honghui Wang, Shiji Song, Gao Huang
SPEX: Scaling Feature Interaction Explanations for LLMs
Justin S. Kang, Landon Butler, Abhineet Agarwal et al.
Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning
Armin Behnamnia, Gholamali Aminian, Alireza Aghaei et al.
Relating Misfit to Gain in Weak-to-Strong Generalization Beyond the Squared Loss
Abhijeet Mulgund, Chirag Pabbaraju
Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates
Connor Mooney, Zhongjian Wang, Jack Xin et al.
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model
Jincheng Zhong, XiangCheng Zhang, Jianmin Wang et al.
SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups
Yongxing Zhang, Donglin Yang, Renjie Liao
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong, Shufan Wang, Daniel Jiang et al.
A Riemannian Framework for Learning Reduced-order Lagrangian Dynamics
Katharina Friedl, Noémie Jaquier, Jens Lundell et al.
How Far Are We from True Unlearnability?
Kai Ye, Liangcai Su, Chenxiong Qian
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
Yu Bo, Weian Mao, Daniel Shao et al.
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
Shaozhe Hao, Xuantong Liu, Xianbiao Qi et al.
Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF
Nuoya Xiong, Aarti Singh
Bridging the Semantic Gap Between Text and Table: A Case Study on NL2SQL
Lin Long, Xijun Gu, Xinjie Sun et al.
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li, Wenzhao Zheng, Xiaonan Huang et al.
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang, Howe Tissue, Lu Wang et al.
High-Dimensional Bayesian Optimisation with Gaussian Process Prior Variational Autoencoders
Siddharth Ramchandran, Manuel Haussmann, Harri Lähdesmäki
Exposure Bracketing Is All You Need For A High-Quality Image
Zhilu Zhang, Shuohao Zhang, Renlong Wu et al.
Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning
Emile Anand, Ishani Karmarkar, Guannan Qu
Range, not Independence, Drives Modularity in Biologically Inspired Representations
Will Dorrell, Kyle Hsu, Luke Hollingsworth et al.
On the Hölder Stability of Multiset and Graph Neural Networks
Yair Davidson, Nadav Dym
3D StreetUnveiler with Semantic-aware 2DGS - a simple baseline
Jingwei Xu, Yikai Wang, Yiqun Zhao et al.
Lightspeed Geometric Dataset Distance via Sliced Optimal Transport
Khai Nguyen, Hai Nguyen, Tuan Pham et al.
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong, Guozheng Ma, Qi Zhao et al.
Human-Aligned Chess With a Bit of Search
Yiming Zhang, Athul Jacob, Vivian Lai et al.
SPDIM: Source-Free Unsupervised Conditional and Label Shift Adaptation in EEG
Shanglin Li, Motoaki Kawanabe, Reinmar Kobler
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng, Winnie Lin, Lingxiao Li et al.
Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model
Yaxuan Huang, Xili Dai, Jianan Wang et al.
Visually Consistent Hierarchical Image Classification
Seulki Park, Youren Zhang, Stella Yu et al.
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
Yi Zhou, Yilai Li, Jing Yuan et al.
Learning Mask Invariant Mutual Information for Masked Image Modeling
Tao Huang, Yanxiang Ma, Shan You et al.
CARTS: Advancing Neural Theorem Proving with Diversified Tactic Calibration and Bias-Resistant Tree Search
Xiao-Wen Yang, Zhi Zhou, Haiming Wang et al.
Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning
Yunyue Wei, Shanning Zhuang, Vincent Zhuang et al.
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving
Jin Zhang, Flood Sung, Zhilin Yang et al.
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo, Fan Ma, Kaixin Shen et al.
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat et al.
Noisy Test-Time Adaptation in Vision-Language Models
Chentao Cao, Zhun Zhong, (Andrew) Zhanke Zhou et al.
Geometric Hyena Networks for Large-scale Equivariant Learning
Artem Moskalev, Mangal Prakash, Junjie Xu et al.
Learning Neural Exposure Fields for View Synthesis
Michael Niemeyer, Fabian Manhardt, Marie-Julie Rakotosaona et al.
Random Forest Autoencoders for Guided Representation Learning
Adrien Aumon, Shuang Ni, Myriam Lizotte et al.
Training-Free Message Passing for Learning on Hypergraphs
Bohan Tang, Zexi Liu, Keyue Jiang et al.
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Xiaoyu Ma, Hao Chen, Yongjian Deng
Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry
Ziheng Chen, Yue Song, Xiaojun Wu et al.
Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning
Laixi Shi, Jingchu Gai, Eric Mazumdar et al.
Unlocking the Potential of Model Calibration in Federated Learning
Yun-Wei Chu, Dong-Jun Han, Seyyedali Hosseinalipour et al.
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
Shangyu Chen, Zizheng Pan, Jianfei Cai et al.
MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction
Cheng Tan, Zhenxiao Cao, Zhangyang Gao et al.
Small Models are LLM Knowledge Triggers for Medical Tabular Prediction
Jiahuan Yan, Jintai Chen, Chaowen Hu et al.
Learning Equivariant Non-Local Electron Density Functionals
Nicholas Gao, Eike Eberhard, Stephan Günnemann
No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs
Krzysztof Kacprzyk, Mihaela van der Schaar
Controlled Generation with Equivariant Variational Flow Matching
Floor Eijkelboom, Heiko Zimmermann, Sharvaree Vadgama et al.
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
Jan Robine, Marc Höftmann, Stefan Harmeling
Connecting Federated ADMM to Bayes
Siddharth Swaroop, Mohammad Emtiyaz Khan, Finale Doshi-Velez
MARS: A Malignity-Aware Backdoor Defense in Federated Learning
Wei Wan, Ning Yuxuan, Zhicong Huang et al.
Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning
Fanrui Zhang, Dian Li, Qiang Zhang et al.
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)
Subba Reddy Oota, Akshett Rai Jindal, Ishani Mondal et al.
Spectral Convolutional Conditional Neural Process
Peiman Mohseni, Nick Duffield
Graph Data Selection for Domain Adaptation: A Model-Free Approach
Ting-Wei Li, Ruizhong Qiu, Hanghang Tong
Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory
Jan Drgona, Mahantesh Halappanavar, Frank Liu et al.
Copyright-Protected Language Generation via Adaptive Model Fusion
Javier Abad, Konstantin Donhauser, Francesco Pinto et al.
The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation
Fredrik Carlsson, Fangyu Liu, Daniel Ward et al.
ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning
Ruiyang Zhou, Shuozhe Li, Amy Zhang et al.
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation
Alexander Liu, Sang-gil Lee, Chao-Han Huck Yang et al.
Blink of an eye: a simple theory for feature localization in generative models
Marvin Li, Aayush Karan, Sitan Chen
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs
Shibo Jie, Yehui Tang, Kai Han et al.
CALM: Consensus-Aware Localized Merging for Multi-Task Learning
Kunda Yan, Min Zhang, Sen Cui et al.
Training-Free Dataset Pruning for Instance Segmentation
Yalun Dai, Lingao Xiao, Ivor Tsang et al.
Nested Expectations with Kernel Quadrature
Zonghao Chen, Masha Naslidnyk, Francois-Xavier Briol
Extending Mercer's expansion to indefinite and asymmetric kernels
Sungwoo Jeong, Alex Townsend
Breaking the Reclustering Barrier in Centroid-based Deep Clustering
Lukas Miklautz, Timo Klein, Kevin Sidak et al.
Second-Order Min-Max Optimization with Lazy Hessians
Lesi Chen, Chengchang Liu, Jingzhao Zhang
Certifying Counterfactual Bias in LLMs
Isha Chaudhary, Qian Hu, Manoj Kumar et al.
A Black Swan Hypothesis: The Role of Human Irrationality in AI Safety
Hyunin Lee, Chanwoo Park, David Abel et al.
BAMDP Shaping: a Unified Framework for Intrinsic Motivation and Reward Shaping
Aly Lidayan, Michael Dennis, Stuart Russell
Controllable Blur Data Augmentation Using 3D-Aware Motion Estimation
Insoo Kim, Hana Lee, Hyong-Euk Lee et al.
Self-supervised contrastive learning performs non-linear system identification
Rodrigo Gonzalez Laiz, Tobias Schmidt, Steffen Schneider
WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network
Zhendong Liu, Le Zhang, Bing Li et al.
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization
Pedro Orvalho, Mikoláš Janota, Vasco M. Manquinho
Tensor Product Neural Networks for Functional ANOVA Model
Seokhun Park, Insung Kong, Yongchan Choi et al.
END^2: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions
Nan Sun, Han Fang, Yuxing Lu et al.
ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks
Renshuai Tao, Manyi Le, Chuangchuang Tan et al.
DRoP: Distributionally Robust Data Pruning
Artem Vysogorets, Kartik Ahuja, Julia Kempe
Collaborative Evolution: Multi-Round Learning Between Large and Small Language Models for Emergent Fake News Detection
Ziyi Zhou, Xiaoming Zhang, Shenghan Tan et al.
Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery
Amin Soleimani Abyaneh, Mahrokh Boroujeni, Hsiu-Chin Lin et al.
Sable: a Performant, Efficient and Scalable Sequence Model for MARL
Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock et al.
PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders
Tianyu Xie, David Harry Tyensoung Richman, Jiansi Gao et al.
Solving Inverse Problems with Model Mismatch using Untrained Neural Networks within Model-based Architectures
Peimeng Guan, Naveed Iqbal, Mark Davenport et al.
Understanding protein function with a multimodal retrieval-augmented foundation model
Timothy Truong Jr, Tristan Bepler
WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction
Fanmeng Wang, Minjie Cheng, Hongteng Xu
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Mingyang Sun, Pengxiang Ding, Weinan Zhang et al.
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning
Zijian Li, Shunxing Fan, Yujia Zheng et al.
Affine Steerable Equivariant Layer for Canonicalization of Neural Networks
Yikang Li, Yeqing Qiu, Yuxuan Chen et al.
Discovering Influential Neuron Path in Vision Transformers
Yifan Wang, Yifei Liu, Yingdong Shi et al.
Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval
Cheol-Ho Cho, WonJun Moon, WooJin Jun et al.
Enhancing Robustness in Incremental Learning with Adversarial Training
Seungju Cho, Hongsin Lee, Changick Kim
Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set
Xinyu Liu, Zixuan Xie, Shangtong Zhang
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
Chaeyeon Chung, Sunghyun Park, Jeongho Kim et al.
DCSF-KD: Dynamic Channel-wise Spatial Feature Knowledge Distillation for Object Detection
Tao Dai, Yang Lin, Hang Guo et al.
InstructOCR: Instruction Boosting Scene Text Spotting
Chen Duan, Qianyi Jiang, Pei Fu et al.
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
Tianze Wang, Dongnan Gui, Yifan Hu et al.
Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs
Faisal Hamman, Sachindra P Dissanayake, Saumitra Mishra et al.
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
Seongyun Lee, Geewook Kim, Jiyeon Kim et al.
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Jihan Yao, Wenxuan Ding, Shangbin Feng et al.
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
Yuwen He, Wei Wang, Wanyu Wu et al.
HUANG: A Robust Diffusion Model-based Targeted Adversarial Attack Against Deep Hashing Retrieval
Chihan Huang, Xiaobo Shen
Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages
Michael Sun, Weize Yuan, Gang Liu et al.
GenDataAgent: On-the-fly Dataset Augmentation with Synthetic Data
Zhiteng Li, Lele Chen, Jerone Andrews et al.
Enhancing Decision-Making of Large Language Models via Actor-Critic
Heng Dong, Kefei Duan, Chongjie Zhang
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo
Jianfei Jiang, Liyong Wang, Haochen Yu et al.
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement
Nan Jiang, Shanchao Liang, Chengxiao Wang et al.
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos
Yingying Jiao, Zhigang Wang, Sifan Wu et al.
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Junyi Chen, Di Huang, Weicai Ye et al.
C2PD: Continuity-Constrained Pixelwise Deformation for Guided Depth Super-Resolution
Jiahui Kang, Qing Cai, Runqing Tan et al.
HiCM²: Hierarchical Compact Memory Modeling for Dense Video Captioning
Minkuk Kim, Hyeon Bae Kim, Jinyoung Moon et al.
MHBench: Demystifying Motion Hallucination in VideoLLMs
Ming Kong, Xianzhou Zeng, Luyuan Chen et al.
Unified Breakdown Analysis for Byzantine Robust Gossip
Renaud Gaucher, Aymeric Dieuleveut, Hadrien Hendrikx
Subgraph Federated Learning for Local Generalization
Sungwon Kim, Yoonho Lee, Yunhak Oh et al.
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence
Hao Li, Hao Fei, Zechao Hu et al.
AIF-SFDA: Autonomous Information Filter Driven Source-Free Domain Adaptation for Medical Image Segmentation
Haojin Li, Heng Li, Jianyu Chen et al.
Revisiting CAD Model Generation by Learning Raster Sketch
Pu Li, Wenhao Zhang, Jianwei Guo et al.
A Theoretical Framework For Overfitting In Energy-based Modeling
Giovanni Catania, Aurélien Decelle, Cyril Furtlehner et al.
Hyperbolic-Constraint Point Cloud Reconstruction from Single RGB-D Images
Wenrui Li, Zhe Yang, Wei Han et al.
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
Sungnyun Kim, Sungwoo Cho, Sangmin Bae et al.
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
Yayuan Li, Jintao Guo, Lei Qi et al.
Gradient Boosting Reinforcement Learning
Benjamin Fuhrer, Chen Tessler, Gal Dalal
Forte: Finding Outliers with Representation Typicality Estimation
Debargha Ganguly, Warren Morningstar, Andrew Yu et al.
BoA: Attention-aware Post-training Quantization without Backpropagation
Junhan Kim, Ho-young Kim, Eulrang Cho et al.
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation
Min Lin, Gangwei Xu, Yun Wang et al.
Text to Point Cloud Localization with Multi-Level Negative Contrastive Learning
Dunqiang Liu, Shujun Huang, Wen Li et al.
SplineGS: Learning Smooth Trajectories in Gaussian Splatting for Dynamic Scene Reconstruction
Jihwan Yoon, Sangbeom Han, Jaeseok Oh et al.
ProtoCar: Learning 3D Vehicle Prototypes from Single-View and Unconstrained Driving Scene Images
Hongyuan Liu, Haochen Yu, Bochao Zou et al.
Bridge Diffusion Model: Bridge Chinese Text-to-Image Diffusion Model with English Communities
Shanyuan Liu, Bo Cheng, Yuhang Ma et al.
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar, Christian Gumbsch, Andrii Zadaianchuk et al.
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Tao Liu, Rongjie Li, Chongyu Wang et al.
Training Matting Models Without Alpha Labels
Wenze Liu, Zixuan Ye, Hao Lu et al.
Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer
Hao Luo, Zongqing Lu
Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment
Chengting Yu, Xiaochen Zhao, Lei Liu et al.
Towards Generalizable Multi-Camera 3D Object Detection via Perspective Rendering
Hao Lu, Yunpeng Zhang, Guoqing Wang et al.
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw, Eli Bronstein, Timothy Guo et al.
Rethinking U-Net: Task-Adaptive Mixture of Skip Connections for Enhanced Medical Image Segmentation
Zichen Luo, Xinshan Zhu, Lan Zhang et al.
CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models
Junbo Yin, Chao Zha, Wenjia He et al.
CAKE: Category Aware Knowledge Extraction for Open-Vocabulary Object Detection
Shiyuan Ma, Donglin Qian, Kai Ye et al.
MAST: model-agnostic sparsified training
Yury Demidovich, Grigory Malinovsky, Egor Shulgin et al.
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
Suho Park, SuBeen Lee, Hyun Seok Seong et al.
ANaGRAM: A Natural Gradient Relative to Adapted Model for efficient PINNs learning
Nilo Schwencke, Cyril Furtlehner
SlimLLM: Accurate Structured Pruning for Large Language Models
Jialong Guo, Xinghao Chen, Yehui Tang et al.
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
Ruifei Zhang, Wei Zhang, Xiao Tan et al.
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs
Yibo Wang, Hai-Long Sun, Guangda Huzhang et al.
UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation
Emmanuelle Bourigault, Amir Jamaludin, Abdullah Hamdi
State-Covering Trajectory Stitching for Diffusion Planners
Kyowoon Lee, Jaesik Choi
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
Senmao Li, Lei Wang, Kai Wang et al.
ESCAPE: Equivariant Shape Completion via Anchor Point Encoding
Burak Bekci, Nassir Navab, Federico Tombari et al.
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning
Sherry X. Chen, Misha Sra, Pradeep Sen
ViiNeuS: Volumetric Initialization for Implicit Neural Surface Reconstruction of Urban Scenes with Limited Image Overlap
Hala Djeghim, Nathan Piasco, Moussab Bennehar et al.
GCC: Generative Color Constancy via Diffusing a Color Checker
Chen-Wei Chang, Cheng-De Fan, Chia-Che Chang et al.
Audio-Sync Video Generation with Multi-Stream Temporal Control
Shuchen Weng, Haojie Zheng, Zheng Chang et al.
AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation
Zeyi Xu, Jinfan Liu, Kuangxu Chen et al.
FlowDAS: A Stochastic Interpolant-based Framework for Data Assimilation
Siyi Chen, Yixuan Jia, Qing Qu et al.
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Nan Chen, Mengqi Huang, Yihao Meng et al.
Backward Conformal Prediction
Etienne Gauthier, Francis Bach, Michael Jordan
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images
Jiuchen Chen, Xinyu Yan, Qizhi Xu et al.
Color Conditional Generation with Sliced Wasserstein Guidance
Alexander Lobashev, Maria Larchenko, Dmitry Guskov
3D-RAD: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
Xiaotang Gai, Jiaxiang Liu, Yichen Li et al.
iManip: Skill-Incremental Learning for Robotic Manipulation
Zexin Zheng, Jia-Feng Cai, Xiao-Ming Wu et al.
Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation
Wenbo Zhang, Tianrun Hu, Hanbo Zhang et al.
Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms
Baran Hashemi, Kurt Pasque, Chris Teska et al.
Noise-Resistant Video Anomaly Detection via RGB Error-Guided Multiscale Predictive Coding and Dynamic Memory
Han Hu, Wenli Du, Peng Liao et al.
DNF: Unconditional 4D Generation with Dictionary-based Neural Fields
Xinyi Zhang, Naiqi Li, Angela Dai
Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization
Li, Yang Xiao, Jie Ji et al.
ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation
Pengcheng Huang, Zhenghao Liu, Yukun Yan et al.
High-Dimensional Calibration from Swap Regret
Maxwell Fishelson, Noah Golowich, Mehryar Mohri et al.
UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models
Qizhou Chen, Dakan Wang, Taolin Zhang et al.
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
Dongming Wu, Yanping Fu, Saike Huang et al.
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation
Wenjia Wang, Liang Pan, Zhiyang Dou et al.
Context-Aware Multimodal Pretraining
Karsten Roth, Zeynep Akata, Dima Damen et al.
Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity
Sung Ju Lee, Nam Ik Cho
Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction
Yifei Wang, Weimin Bai, Colin Zhang et al.
SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents
Wanxin Tian, Shijie Zhang, Kevin Zhang et al.
GoRA: Gradient-driven Adaptive Low Rank Adaptation
Haonan He, Peng Ye, Yuchen Ren et al.
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion
Bardienus Duisterhof, Jan Oberst, Bowen Wen et al.
I2V3D: Controllable Image-to-video Generation with 3D Guidance
Zhiyuan Zhang, Dongdong Chen, Jing Liao
Neural Shell Texture Splatting: More Details and Fewer Primitives
Xin Zhang, Anpei Chen, Jincheng Xiong et al.
MikuDance: Animating Character Art with Mixed Motion Dynamics
Jiaxu Zhang, Xianfang Zeng, Xin Chen et al.
Geometry in Style: 3D Stylization via Surface Normal Deformation
Nam Anh Dinh, Itai Lang, Hyunwoo Kim et al.
Towards Higher Effective Rank in Parameter-Efficient Fine-tuning using Khatri-Rao Product
Paul Albert, Frederic Zhang, Hemanth Saratchandran et al.
RAGRouter: Learning to Route Queries to Multiple Retrieval-Augmented Language Models
Jiarui Zhang, Xiangyu Liu, Yong Hu et al.