Most Cited 2025 "treewidth hardness" Papers
22,274 papers found • Page 97 of 112
Conference
Causal-R: A Causal-Reasoning Geometry Problem Solver for Optimized Solution Exploration
Wenjun Wu, Lingling Zhang, Bo Zhao et al.
HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation
Hongwei Zheng, Han Li, Wenrui Dai et al.
Towards Continual Universal Segmentation
Zihan Lin, Zilei Wang, Xu Wang
Don’t Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu guo, Zhi Zheng, Donghao Ying et al.
DeformCL: Learning Deformable Centerline Representation for Vessel Extraction in 3D Medical Image
Ziwei Zhao, Zhixing Zhang, Yuhang Liu et al.
SPOT-Trip: Dual-Preference Driven Out-of-Town Trip Recommendation
Yinghui Liu, Hao Miao, Guojiang Shen et al.
Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models
Konstantinos Dafnis, Dimitris Metaxas
Cross-Modal 3D Representation with Multi-View Images and Point Clouds
Ziyang Zhou, Pinghui Wang, Zi Liang et al.
Continual SFT Matches Multimodal RLHF with Negative Supervision
Ke Zhu, Yu Wang, Yanpeng Sun et al.
Visual Lexicon: Rich Image Features in Language Space
XuDong Wang, Xingyi Zhou, Alireza Fathi et al.
Less is More: Efficient Model Merging with Binary Task Switch
Biqing Qi, Fangyuan Li, Zhen Wang et al.
Joint‑Embedding vs Reconstruction: Provable Benefits of Latent Space Prediction for Self‑Supervised Learning
Hugues Van Assel, Mark Ibrahim, Tommaso Biancalani et al.
On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm
Huan Li, Yiming Dong, Zhouchen Lin
Unboxed: Geometrically and Temporally Consistent Video Outpainting
Zhongrui Yu, Martina Megaro-Boldini, Robert Sumner et al.
Measure-Theoretic Anti-Causal Representation Learning
Arman Behnam, Binghui Wang
When Can Model-Free Reinforcement Learning be Enough for Thinking?
Josiah Hanna, Nicholas Corrado
Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning
Lei Wang, Jieming Bian, Letian Zhang et al.
ARIA: Training Language Agents with Intention-driven Reward Aggregation
Ruihan Yang, yikai zhang, Aili Chen et al.
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun, Huazhang Hu, Yidong Ma et al.
Split conformal classification with unsupervised calibration
Santiago Mazuelas
3D Gaussian Splatting based Scene-independent Relocalization with Unidirectional and Bidirectional Feature Fusion
Junyi Wang, Yuze Wang, Wantong Duan et al.
UCM-VeID V2: A Richer Dataset and A Pre-training Method for UAV Cross-Modality Vehicle Re-Identification
Xingyue Liu, Jiahao Qi, Chen Chen et al.
CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving
Dongkun Zhang, Jiaming Liang, Ke Guo et al.
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
Ziheng Ouyang, Zhen Li, Qibin Hou
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
Shuang Zeng, Xinyuan Chang, Mengwei Xie et al.
STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data
Maximilian Forstenhäusler, Daniel Külzer, Christos Anagnostopoulos et al.
Stochastic Optimization in Semi-Discrete Optimal Transport: Convergence Analysis and Minimax Rate
Ferdinand Genans, Antoine Godichon-Baggioni, François-Xavier Vialard et al.
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals
Stefan Stojanov, David Wendt, Seungwoo Kim et al.
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
Dingcheng Zhen, Shunshun Yin, Shiyang Qin et al.
Graph-Embedded Structure-Aware Perceptual Hashing for Neural Network Protection and Piracy Detection
Ruiheng Liu, Haozhe Chen, Boyao Zhao et al.
Normalizing Flows are Capable Models for Continuous Control
Raj Ghugare, Benjamin Eysenbach
RrED: Black-box Unsupervised Domain Adaptation via Rectifying-reasoning Errors of Diffusion
Yuwu Lu, Chunzhi Liu
Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data
Lilin Zhang, Chengpei Wu, Ning Yang
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
Yifan Zhou, Zeqi Xiao, Shuai Yang et al.
Domain Generalization in CLIP via Learning with Diverse Text Prompts
Changsong Wen, Zelin Peng, Yu Huang et al.
A Simple Data Augmentation for Feature Distribution Skewed Federated Learning
Yunlu Yan, Huazhu Fu, Yuexiang Li et al.
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
Yuheng Zhang, Dian Yu, Tao Ge et al.
Think before Recommendation: Autonomous Reasoning-enhanced Recommender
Xiaoyu Kong, Junguang Jiang, Bin Liu et al.
RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing
Zhipeng Huang, Wangbo Yu, Xinhua Cheng et al.
Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text
Yize Cheng, Vinu Sankar Sadasivan, Mehrdad Saberi et al.
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Wei Shen, Guanlin Liu, Yu Yue et al.
Computational Efficiency under Covariate Shift in Kernel Ridge Regression
Andrea Della Vecchia, Arnaud Mavakala Watusadisi, Ernesto De Vito et al.
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
Yiming Dou, Wonseok Oh, Yuqing Luo et al.
Hybrid-Collaborative Augmentation and Contrastive Sample Adaptive-Differential Awareness for Robust Attributed Graph Clustering
Tianxiang Zhao, Youqing Wang, Jinlu Wang et al.
Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition
Zhiyuan Chen, Keyi Li, Yifan Jia et al.
Automatic Auxiliary Task Selection and Adaptive Weighting Boost Molecular Property Prediction
Zhiqiang Zhong, Davide Mottin
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing
Yunqi Gu, Ian Huang, Jihyeon Je et al.
World-aware Planning Narratives Enhance Large Vision-Language Model Planner
Junhao Shi, Zhaoye Fei, Siyin Wang et al.
ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data
Patryk Marszałek, Tomasz Kuśmierczyk, Witold Wydmański et al.
Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Nimrod Berman, Omkar Joglekar, Eitan Kosman et al.
BrainFlow: A Holistic Pathway of Dynamic Neural System on Manifold
Zhixuan Zhou, Tingting Dan, Guorong Wu
STAR: Spatial-Temporal Tracklet Matching for Multi-Object Tracking
Xuewei Bai, Yongcai Wang, Deying Li et al.
Explainable Saliency: Articulating Reasoning with Contextual Prioritization
Nuo Chen, Ming Jiang, Qi Zhao
Structured Initialization for Vision Transformers
Jianqiao Zheng, Xueqian Li, Hemanth Saratchandran et al.
Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression
Zhenqi Dai, Ting Liu, Yanning Zhang
Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing
Adel Javanmard, Rudrajit Das, Alessandro Epasto et al.
Revitalizing SVD for Global Covariance Pooling: Halley’s Method to Overcome Over-Flattening
Jiawei Gu, Ziyue Qiao, Xinming Li et al.
Distributed mediation analysis with communication efficiency
Shaomin Li
DL2G: Degradation-guided Local-to-Global Restoration for Eyeglass Reflection Removal
Yizhilv, Xiao Lu, Hong Ding et al.
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows
Shentong Mo, Yibing Song
Robustly Learning Monotone Single-Index Models
Puqian Wang, Nikos Zarifis, Ilias Diakonikolas et al.
Refining Norms: A Post-hoc Framework for OOD Detection in Graph Neural Networks
Jiawei Gu, Ziyue Qiao, Zechao Li
Meta CLIP 2: A Worldwide Scaling Recipe
Yung-Sung Chuang, Yang Li, Dong Wang et al.
Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification
Gaozheng Pei, Shaojie Lyu, Gong Chen et al.
Adaptive Cannistraci-Hebb Network Automata Modelling of Complex Networks for Path-based Link Prediction
Jialin Zhao, Alessandro Muscoloni, Umberto Michieli et al.
Fair Minimum Labeling: Efficient Temporal Network Activations for Reachability and Equity
Lutz Oettershagen, Othon Michail
Tracking and Understanding Object Transformations
Yihong Sun, Xinyu Yang, Jennifer Sun et al.
Cloud4D: Estimating Cloud Properties at a High Spatial and Temporal Resolution
Jacob Lin, Edward Gryspeerdt, Ronald Clark
BIPNN: Learning to Solve Binary Integer Programming via Hypergraph Neural Networks
Sen Bai, Chunqi Yang, Xin Bai et al.
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Zixuan Ye, Huijuan Huang, Xintao Wang et al.
REOrdering Patches Improves Vision Models
Declan Kutscher, David Chan, Yutong Bai et al.
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao liang, Baoquan Zhang, Zhiyuan Wen et al.
Exposure-slot: Exposure-centric Representations Learning with Slot-in-Slot Attention for Region-aware Exposure Correction
Donggoo Jung, DAEHYUN KIM, Guanghui Wang et al.
Boosting Resilience of Large Language Models through Causality-Driven Robust Optimization
Xiaoling Zhou, Mingjie Zhang, Zhemg Lee et al.
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
Mohan Zhang, Yihua Zhang, Jinghan Jia et al.
YOLOv12: Attention-Centric Real-Time Object Detectors
Yunjie Tian, Qixiang Ye, DAVID DOERMANN
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction
Yiyi Liu, Chunyang Liu, Bohan Wang et al.
Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization
Dongkwan Lee, Kyomin Hwang, Nojun Kwak
Tapered Off-Policy REINFORCE - Stable and efficient reinforcement learning for large language models
Nicolas Le Roux, Marc Bellemare, Jonathan Lebensold et al.
Coupling Generative Modeling and an Autoencoder with the Causal Bridge
Ruolin Meng, Ming-Yu Chung, Dhanajit Brahma et al.
MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization
Rizhen Hu, Yutong He, Ran Yan et al.
ProjAttacker: A Configurable Physical Adversarial Attack for Face Recognition via Projector
Yuanwei Liu, Hui Wei, Chengyu Jia et al.
The World Is Bigger: A Computationally-Embedded Perspective on the Big World Hypothesis
Alex Lewandowski, Aditya Ramesh, Edan Meyer et al.
Flash Invariant Point Attention
Andrew Liu, Axel Elaldi, Nicholas Franklin et al.
Does Stochastic Gradient really succeed for bandits?
Dorian Baudry, Emmeran Johnson, Simon Vary et al.
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
Xiaoyan Xing, Konrad Groh, Sezer Karaoglu et al.
TokenSwap: A Lightweight Method to Disrupt Memorized Sequences in LLMs
Parjanya Prashant, Kaustubh Ponkshe, Babak Salimi
Minding Fuzzy Regions: A Data-driven Alternating Learning Paradigm for Stable Lesion Segmentation
Lexin Fang, Yunyang Xu, Xiang Ma et al.
Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image
Junkun Chen, Aayush Bansal, Minh Vo et al.
Science-T2I: Addressing Scientific Illusions in Image Synthesis
Jialuo Li, Wenhao Chai, XINGYU FU et al.
Geometric Logit Decoupling for Energy-Based Graph Out-of-distribution Detection
Min Wang, Hao Yang, Qing Cheng et al.
TractoTransformer: Diffusion MRI Streamline Tractography using CNN and Transformer Networks
Itzik Waizman, Yakov Gusakov, Itay Benou et al.
AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning
Kaixuan Wu, Xinde Li, Xinglin Li et al.
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Li Lin, Santosh Santosh, Mingyang Wu et al.
Real-DRL: Teach and Learn in Reality
Yanbing Mao, Yihao Cai, Lui Sha
Statistical Analysis of the Sinkhorn Iterations for Two-Sample Schr\"{o}dinger Bridge Estimation
Ibuki Maeda, Yao, Atsushi Nitanda
Sketch-Augmented Features Improve Learning Long-Range Dependencies in Graph Neural Networks
Ryien Hosseini, Filippo Simini, Venkatram Vishwanath et al.
Non-Clairvoyant Scheduling with Progress Bars
Ziyad Benomar, Romain Cosson, Alexander Lindermayr et al.
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni, Yuxin Guo, Yichen Liu et al.
Z-Magic: Zero-shot Multiple Attributes Guided Image Creator
Yingying Deng, Xiangyu He, Fan Tang et al.
Towards Fully FP8 GEMM LLM Training at Scale
Alejandro Hernández Cano, Dhia Garbaya, Imanol Schlag et al.
Private Online Learning against an Adaptive Adversary: Realizable and Agnostic Settings
Bo Li, Wei Wang, Peng Ye
RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting
Qiyu Dai, Xingyu Ni, Qianfan Shen et al.
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Aashish Rai, Dilin Wang, Mihir Jain et al.
Investigating the Role of Weight Decay in Enhancing Nonconvex SGD
Tao Sun, Yuhao Huang, Li Shen et al.
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu, Sili Huang, Zhejian Yang et al.
A Closer Look to Positive-Unlabeled Learning from Fine-grained Perspectives: An Empirical Study
Yuanchao Dai, Zhengzhang Hou, Changchun Li et al.
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Zhiyang Guo, Jinxu Xiang, Kai Ma et al.
Cross-modal Associations in Vision and Language Models: Revisiting the Bouba-Kiki Effect
Tom Kouwenhoven, Kiana Shahrasbi, Tessa Verhoef
Timely Clinical Diagnosis through Active Test Selection
Silas Ruhrberg Estévez, Nicolás Astorga, Mihaela van der Schaar
Training-free Online Video Step Grounding
Luca Zanella, Massimiliano Mancini, Yiming Wang et al.
DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery
Utkarsh Mall, Cheng Perng Phoo, Mia Chiquier et al.
WKV-sharing embraced random shuffle RWKV high-order modeling for pan-sharpening
man zhou, Xuanhua He, Danfeng Hong et al.
Towards Accurate Time Series Forecasting via Implicit Decoding
Xinyu Li, Yuchen Luo, Hao Wang et al.
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone, Marco Bagatella, Jonas Hübotter et al.
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization
Shogo Iwazaki
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
Runpeng Xie, Quanwei Wang, Hao Hu et al.
A Sustainable AI Economy Needs Data Deals That Work for Generators
Ruoxi Jia, Luis Oala, Wenjie Xiong et al.
Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering
yuyang Hong, Jiaqi Gu, Yang Qi et al.
DePass: Unified Feature Attributing by Simple Decomposed Forward Pass
Xiangyu Hong, Che Jiang, Kai Tian et al.
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
Wenlin Zhang, Xiangyang Li, Kuicai Dong et al.
Poly-Autoregressive Prediction for Modeling Interactions
Neerja Thakkar, Tara Sadjadpour, Jathushan Rajasegaran et al.
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
Zhongwei Wan, Zhihao Dou, Che Liu et al.
TinyFusion: Diffusion Transformers Learned Shallow
Gongfan Fang, Kunjun Li, Xinyin Ma et al.
CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections
Thomas Walker, Salvatore Esposito, Daniel Rebain et al.
Decentralized Diffusion Models
David McAllister, Matthew Tancik, Jiaming Song et al.
ALIEN: Implicit Neural Representations for Human Motion Prediction under Arbitrary Latency
Dong Wei, Xiaoning Sun, Xizhan Gao et al.
S$^2$NN: Sub-bit Spiking Neural Networks
Wenjie Wei, Malu Zhang, Jieyuan (Eric) Zhang et al.
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
Yu Qi, Yuanchen Ju, Tianming Wei et al.
Differentiable Generalized Sliced Wasserstein Plans
Laetitia Chapel, Romain Tavenard, Samuel Vaiter
You Only Communicate Once: One-shot Federated Low-Rank Adaptation of MLLM
Binqian Xu, Haiyang Mei, Zechen Bai et al.
GIF: Generative Inspiration for Face Recognition at Scale
Mohammad Saadabadi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.
Nonlinear Laplacians: Tunable principal component analysis under directional prior information
Yuxin Ma, Dmitriy Kunisky
On the Sample Complexity Bounds of Bilevel Reinforcement Learning
Mudit Gaur, Utsav Singh, Amrit Singh Bedi et al.
Statistical Analysis of an Adversarial Bayesian Weak Supervision Method
Steven An
Vicinal Label Supervision for Reliable Aleatoric and Epistemic Uncertainty Estimation
Linye Li, Yufei Chen, Xiaodong Yue
Redundancy-Aware Test-Time Graph Out-of-Distribution Detection
Yue Hou, He Zhu, Ruomei Liu et al.
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
Yifan Wei, Xiaoyan Yu, Tengfei Pan et al.
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
Yang Chen, Jingcai Guo, Song Guo et al.
On the Surprising Effectiveness of Large Learning Rates under Standard Width Scaling
Moritz Haas, Sebastian Bordt, Ulrike Luxburg et al.
Functional Virtual Adversarial Training for Semi-Supervised Time Series Classification
Qingyi Pan, Yicheng Li
Navigating Image Restoration with VAR’s Distribution Alignment Prior
Siyang Wang, Naishan Zheng, Jie Huang et al.
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion
Qitao Zhao, Amy Lin, Jeff Tan et al.
Viewpoint Rosetta Stone: Unlocking Unpaired Ego-Exo Videos for View-invariant Representation Learning
Mi Luo, Zihui Xue, Alex Dimakis et al.
AnimateQR: Bridging Aesthetics and Functionality in Dynamic QR Code Generation
Guangyang Wu, Huayu Zheng, Siqi Luo et al.
PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-forward Planar Splatting
Changkun Liu, Bin Tan, Zeran Ke et al.
Efficiently Escaping Saddle Points under Generalized Smoothness via Self-Bounding Regularity
Daniel Cao, August Chen, Karthik Sridharan et al.
Toward Human Deictic Gesture Target Estimation
Xu Cao, Pranav Virupaksha, Sangmin Lee et al.
Less Greedy Equivalence Search
Adiba Ejaz, Elias Bareinboim
TokMan:Tokenize Manhattan Mask Optimization for Inverse Lithography
Yiwen Wu, Yuyang Chen, Ye Xia et al.
Optimal Nuisance Function Tuning for Estimating a Doubly Robust Functional under Proportional Asymptotics
Sean McGrath, Debarghya Mukherjee, Rajarshi Mukherjee et al.
Optimistic Online-to-Batch Conversions for Accelerated Convergence and Universality
Yu-Hu Yan, Peng Zhao, Zhi-Hua Zhou
Gaussian-Augmented Physics Simulation and System Identification with Complex Colliders
Federico Vasile, Ri-Zhao Qiu, Lorenzo Natale et al.
D2SA: Dual-Stage Distribution and Slice Adaptation for Efficient Test-Time Adaptation in MRI Reconstruction
Lipei Zhang, Rui Sun, Zhongying Deng et al.
VoxDet: Rethinking 3D Semantic Scene Completion as Dense Object Detection
Wuyang Li, Zhu Yu, Alexandre Alahi
Effective Neural Approximations for Geometric Optimization Problems
Samantha Chen, Oren Ciolli, Anastasios Sidiropoulos et al.
ZoomLDM: Latent Diffusion Model for Multi-scale Image Generation
Srikar Yellapragada, Alexandros Graikos, Kostas Triaridis et al.
AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
Xiangyu Chang, Fahim Faisal Niloy, Sk Miraj Ahmed et al.
Perturb a Model, Not an Image: Towards Robust Privacy Protection via Anti-Personalized Diffusion Models
Tae-Young Lee, Juwon Seo, Jong Hwan Ko et al.
Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding
Duy-Tung Pham, An Nguyen The, Viet-Hoang Tran et al.
Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound
Tal Fiskus, Uri Shaham
Fairness under Competition
Ronen Gradwohl, Eilam Shapira, Moshe Tennenholtz
Fortifying Federated Learning Towards Trustworthiness via Auditable Data Valuation and Verifiable Client Contribution
Naveen Kumar Kummari, Ranjeet Ranjan Jha, Krishna Mohan Chalavadi et al.
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos
Lorenzo Mur-Labadia, Jose J. Guerrero, Ruben Martinez-Cantin
VCM: Vision Concept Modeling with Adaptive Vision Token Compression via Instruction Fine-Tuning
Run Luo, Renke Shan, Longze Chen et al.
Towards Unsupervised Open-Set Graph Domain Adaptation via Dual Reprogramming
Zhen Zhang, Bingsheng He
Regret Lower Bounds for Decentralized Multi-Agent Stochastic Shortest Path Problems
Utkarsh Chavan, Prashant Trivedi, Nandyala Hemachandra
Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings
Erel Naor, Ofir Lindenbaum
KaRF: Weakly-Supervised Kolmogorov-Arnold Networks-based Radiance Fields for Local Color Editing
Wudi Chen, Zhiyuan Zha, Shigang Wang et al.
Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware Minimization for Federated Learning
Debora Caldarola, Pietro Cagnasso, Barbara Caputo et al.
Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems
Alejandro Castañeda Garcia, Jan Warchocki, Jan van Gemert et al.
Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Johanna Vielhaben, Dilyara Bareeva, Jim Berend et al.
Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Feifei Li, Mi Zhang, Yiming Sun et al.
OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
Yuanhao Cai, HE Zhang, Xi Chen et al.
AnyMap: Learning a General Camera Model for Structure-from-Motion with Unknown Distortion in Dynamic Scenes
Andrea Porfiri Dal Cin, Georgi Dikov, Jihong Ju et al.
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Aodi Li, Liansheng Zhuang, Xiao Long et al.
ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains
Guillaume Vray, Devavrat Tomar, Xufeng Gao et al.
Unraveling Metameric Dilemma for Spectral Reconstruction: A High-Fidelity Approach via Semi-Supervised Learning
Xingxing Yang, Jie Chen, Zaifeng Yang
Dynamic Configuration for Cutting Plane Separators via Reinforcement Learning on Incremental Graph
Mingxuan Ye, Jie Wang, Fangzhou et al.
FASTer: Focal token Acquiring-and-Scaling Transformer for Long-term 3D Objection Detection
Chenxu Dang, Pei An, Xinmin Zhang et al.
ZeroPatcher: Training-free Sampler for Video Inpainting and Editing
Shaoshu Yang, Yingya Zhang, Ran He
STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds
Zikuan Li, Honghua Chen, Yuecheng Wang et al.
Product Distribution Learning with Imperfect Advice
Arnab Bhattacharyya, XianJun, Davin Choo, Philips George John et al.
TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop
Yushan Jiang, Wenchao Yu, Geon Lee et al.
Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes
Hyeonggon Ryu, Seongyu Kim, Joon Chung et al.
When Lower-Order Terms Dominate: Adaptive Expert Algorithms for Heavy-Tailed Losses
Antoine Moulin, Emmanuel Esposito, Dirk van der Hoeven
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
Junjie Chen, Weilong Chen, Yifan Zuo et al.
Learning Shared Representations from Unpaired Data
Amitai Yacobi, Nir Ben-Ari, Ronen Talmon et al.
Heterogeneous Skeleton-Based Action Representation Learning
Xiaoyan Ma, jidong kuang, Hongsong Wang et al.
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation
Abdelrahman Eldesokey, Aleksandar Cvejić, Bernard Ghanem et al.
AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models
Seunghoon Lee, Jeongwoo Choi, Byunggwan Son et al.
Non-Line-of-Sight 3D Reconstruction with Radar
Haowen Lai, Zitong Lan, Mingmin Zhao
Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants
Chong Yu, Tao Chen, Zhongxue Gan
U-REPA: Aligning Diffusion U-Nets to ViTs
Yuchuan Tian, Hanting Chen, Mengyu Zheng et al.
Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection
Wenxi Chen, Raymond A. Yeh, Shaoshuai Mou et al.
DecompNet: Enhancing Time Series Forecasting Models with Implicit Decomposition
Donghao Luo, Xue Wang
Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty
Valia Efthymiou, Chara Podimata, Diptangshu Sen et al.
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient
Niklas Lauffer, Ameesh Shah, Micah Carroll et al.
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation
Rui Tian, Mingfei Gao, Mingze Xu et al.