Most Cited 2025 "object-proposal association" Papers
Active Target Discovery under Uninformative Priors: The Power of Permanent and Transient Memory
Anindya Sarkar, Binglin Ji, Yevgeniy Vorobeychik
Multi-agent Markov Entanglement
Shuze Chen, Tianyi Peng
Diffusion Guided Adversarial State Perturbations in Reinforcement Learning
Xiaolin Sun, Feidi Liu, Zhengming Ding et al.
How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?
Wei Huang, Andi Han, Yujin Song et al.
PolarQuant: Leveraging Polar Transformation for Key Cache Quantization and Decoding Acceleration
Songhao Wu, Ang Lv, xiao feng et al.
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models
Vittorio Pipoli, Alessia Saporita, Federico Bolelli et al.
Off-policy Reinforcement Learning with Model-based Exploration Augmentation
Likun Wang, Xiangteng Zhang, Yinuo Wang et al.
Transstratal Adversarial Attack: Compromising Multi-Layered Defenses in Text-to-Image Models
Chunlong Xie, Kangjie Chen, Shangwei Guo et al.
StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models
Jun Jiang, Zijin Yang, Weiming Zhang et al.
Unified all-atom molecule generation with neural fields
Matthieu Kirchmeyer, Pedro O. Pinheiro, Emma Willett et al.
MoleBridge: Synthetic Space Projecting with Discrete Markov Bridges
Rongchao Zhang, Yu Huang, Yongzhi Cao et al.
FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift
yong zhang, Feng Liang, Guanghu Yuan et al.
Factorized Learning for Temporally Grounded Video-Language Models
Wenzheng Zeng, Difei Gao, Mike Zheng Shou et al.
Constrained Best Arm Identification
Tyron Lardy, Christina Katsimerou, Wouter Koolen
Harnessing Feature Resonance under Arbitrary Target Alignment for Out-of-Distribution Node Detection
Shenzhi Yang, Junbo Zhao, Sharon Li et al.
DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion
Hossein Mirzaei, Zeinab Taghavi, Sepehr Rezaee et al.
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Aleksandar Jevtić, Christoph Reich, Felix Wimbauer et al.
ART: Adaptive Relation Tuning for Generalized Relation Prediction
Gopika Sudhakaran, Hikaru Shindo, Patrick Schramowski et al.
Mars-Bench: A Benchmark for Evaluating Foundation Models for Mars Science Tasks
Mirali Purohit, Bimal Gajera, Vatsal Malaviya et al.
BridgePure: Limited Protection Leakage Can Break Black-Box Data Protection
Yihan Wang, Yiwei Lu, Xiao-Shan Gao et al.
Sparsity Outperforms Low-Rank Projections in Few-Shot Adaptation
Nairouz Mrabah, Nicolas Richet, Ismail Ayed et al.
SMARTraj$^2$: A Stable Multi-City Adaptive Method for Multi-View Spatio-Temporal Trajectory Representation Learning
Tangwen Qian, Junhe Li, Yile Chen et al.
Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization
Peirong Liu, Ana Lawry Aguila, Juan Iglesias
The Cost of Robustness: Tighter Bounds on Parameter Complexity for Robust Memorization in ReLU Nets
Yujun Kim, Chaewon Moon, Chulhee Yun
Agents Robust to Distribution Shifts Learn Causal World Models Even Under Mediation
Matteo Ceriscioli, Karthika Mohan
WINS: Winograd Structured Pruning for Fast Winograd Convolution
Cheonjun Park, Hyunjae Oh, Mincheol Park et al.
Multi-Class Support Vector Machine with Differential Privacy
Jinseong Park, Yujin Choi, Jaewook Lee
Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.
Efficient Last-Iterate Convergence in Solving Extensive-Form Games
Linjian Meng, Tianpei Yang, Youzhi Zhang et al.
Contimask: Explaining Irregular Time Series via Perturbations in Continuous Time
Max Moebus, Björn Braun, Christian Holz
Instance-Optimality for Private KL Distribution Estimation
Jiayuan Ye, Vitaly Feldman, Kunal Talwar
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic
Munish Monga, Vishal Chudasama, Pankaj Wasnik et al.
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang, Yining Ma, Zhiguang Cao et al.
From Likelihood to Fitness: Improving Variant Effect Prediction in Protein and Genome Language Models
Charles W J Pugh, Paulina Nuñez-Valencia, Mafalda Dias et al.
Understanding and Rectifying Safety Perception Distortion in VLMs
Xiaohan Zou, Jian Kang, George Kesidis et al.
Semi-supervised Concept Bottleneck Models
Lijie Hu, Tianhao Huang, Huanyi Xie et al.
SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures
Yi Qin, Rui Wang, Tao Huang et al.
Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths
Sounak Mondal, Naveen Sendhilnathan, Ting Zhang et al.
TRoVe: Discovering Error-Inducing Static Feature Biases in Temporal Vision-Language Models
Maya Varma, Jean-Benoit Delbrouck, Sophie Ostmeier et al.
Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
Zi Liang, Qingqing Ye, Xuan Liu et al.
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
Zhaoxi Chen, Jiaxiang Tang, Yuhao Dong et al.
LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization
Hongjin Lyu, Bo Li, Paul Rosin et al.
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Chenhao Zheng, Jieyu Zhang, Mohammadreza Salehi et al.
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh, Woohyun Cho, Siyeol Kim et al.
Convex Potential Mirror Langevin Algorithm for Efficient Sampling of Energy-Based Models
Zitao Yang, Amin Ullah, Shuai Li et al.
RESPIN-S1.0: A read speech corpus of 10000+ hours in dialects of nine Indian Languages
Saurabh Kumar, Abhayjeet Singh, DEEKSHITHA G et al.
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu, Xinyi Yang, Jiayu Zhan et al.
Flat Channels to Infinity in Neural Loss Landscapes
Flavio Martinelli, Alexander van Meegen, Berfin Simsek et al.
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
Yuanhe Guo, Linxi Xie, Zhuoran Chen et al.
Learning Visual Proxy for Compositional Zero-Shot Learning
Shiyu Zhang, Cheng Yan, Yang Liu et al.
UnZipLoRA: Separating Content and Style from a Single Image
Chang Liu, Viraj Shah, Aiyu Cui et al.
Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions
Mengyu Yang, Yiming Chen, Haozheng Pei et al.
RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation
Songhao Han, Boxiang Qiu, Yue Liao et al.
How Does Topology Bias Distort Message Passing in Graph Recommender? A Dirichlet Energy Perspective
Yanbiao Ji, Yue Ding, Dan Luo et al.
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Rui Song, Chenwei Liang, Yan Xia et al.
OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation
Bo-Wen Yin, Jiao-Long Cao, Xuying Zhang et al.
Diffusion Feature Field for Text-based 3D Editing with Gaussian Splatting
Eunseo Koh, Sangeek Hyun, MinKyu Lee et al.
VRM: Knowledge Distillation via Virtual Relation Matching
Weijia Zhang, Fei Xie, Weidong Cai et al.
Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning
Zhifang Zhang, Shuo He, Haobo Wang et al.
Tree of Preferences for Diversified Recommendation
Hanyang Yuan, Ning Tang, Tongya Zheng et al.
Asymmetric Dual-Lens Video Deblurring
Zeyu Xiao, Xinchao Wang
Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation
Jeongin Kim, Wonho Bae, YouLee Han et al.
Discovering Divergent Representations between Text-to-Image Models
Lisa Dunlap, Trevor Darrell, Joseph Gonzalez et al.
Cross-Modal Representational Knowledge Distillation for Enhanced Spike-informed LFP Modeling
Eray Erturk, Saba Hashemi, Maryam Shanechi
Zero-Shot Blind-Spot Image Denoising via Cross-Scale Non-Local Pixel Refilling
Qilong Guo, Tianjing Zhang, Zhiyuan Ma et al.
BaRISTA: Brain Scale Informed Spatiotemporal Representation of Human Intracranial Neural Activity
Lucine L Oganesian, Saba Hashemi, Maryam Shanechi
Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning
Ankan Deria, Adinath Dukre, feilong tang et al.
Understanding Personal Concept in Open-Vocabulary Semantic Segmentation
Sunghyun Park, Jungsoo Lee, Shubhankar Borse et al.
Generalized Linear Mode Connectivity for Transformers
Alexander Theus, Alessandro Cabodi, Sotiris Anagnostidis et al.
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Dongwon Kim, Ju He, Qihang Yu et al.
Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution
Shijun Shi, Jing Xu, Lijing Lu et al.
CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding
hongyong han, Wei Wang, Gaowei Zhang et al.
QuanDA: Quantile-Based Discriminant Analysis for High-Dimensional Imbalanced Classification
Qian Tang, Yuwen Gu, Boxiang Wang
Continuous Domain Generalization
Zekun CAI, Yiheng YAO, Guangji Bai et al.
Advancing Interpretability of CLIP Representations with Concept Surrogate Model
Nhat Hoang-Xuan, Xiyuan Wei, Wanli Xing et al.
Do Language Models Use Their Depth Efficiently?
Róbert Csordás, Christopher D Manning, Chris Potts
D-Attn: Decomposed Attention for Large Vision-and-Language Model
Chia-Wen Kuo, Sijie Zhu, Fan Chen et al.
Partial Correlation Network Estimation by Semismooth Newton Methods
DongWon Kim, Sungdong Lee, Joong-Ho (Johann) Won
Are Pixel-Wise Metrics Reliable for Computerized Tomography Reconstruction?
Tianyu Lin, Xinran Li, Chuntung Zhuang et al.
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
qiusheng huang, Xiaohui Zhong, Xu Fan et al.
Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests
Fitim Abdullahu, Helmut Grabner
An Ellipsoid Algorithm for Online Convex Optimization
Zakaria Mhammedi
NormFit: A Lightweight Solution for Few-Shot Federated Learning with Non-IID Data
Azadeh Motamedi, Jae-Mo Kang, Il-Min Kim
Towards Performance Consistency in Multi-Level Model Collaboration
Qi Li, Runpeng Yu, Xinchao Wang
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
Hengjia Li, Lifan Jiang, Xi Xiao et al.
REMI: Reconstructing Episodic Memory During Internally Driven Path Planning
Zhaoze Wang, Genela Morris, Dori Derdikman et al.
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino, Ruiqi Ni, Ahmed Qureshi
Towards Robust Uncertainty Calibration for Composed Image Retrieval
Yifan Wang, Wuliang Huang, Yufan Wen et al.
Learning Personalized Ad Impact via Contextual Reinforcement Learning under Delayed Rewards
Yuwei Cheng, Zifeng Zhao, Haifeng Xu
SummDiff: Generative Modeling of Video Summarization with Diffusion
Kwanseok Kim, Jaehoon Hahm, Sumin Kim et al.
OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation
Mohamed Abdelfattah, Kaouther Messaoud, Alexandre Alahi
HQA-VLAttack: Towards High Quality Adversarial Attack on Vision-Language Pre-Trained Models
Han Liu, Jiaqi Li, Zhi Xu et al.
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology
Siyuan Yan, Ming Hu, Yiwen Jiang et al.
RALoc: Enhancing Outdoor LiDAR Localization via Rotation Awareness
Yuyang Yang, Wen Li, Sheng Ao et al.
Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data
Weinan He, Yixin Zhang, Zilei Wang
Imbalance in Balance: Online Concept Balancing in Generation Models
Yukai Shi, Jiarong Ou, Rui Chen et al.
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
YUANTIAN SHAO, Yuanteng Chen, Peisong Wang et al.
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
Sihang Li, Zeyu Jiang, Grace Chen et al.
PanSt3R: Multi-view Consistent Panoptic Segmentation
Lojze Zust, Yohann Cabon, Juliette Marrie et al.
PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation
Zhihao ZHU, Yifan Zheng, Siyu Pan et al.
Learning Latent Variable Models via Jarzynski-adjusted Langevin Algorithm
James Cuin, Davide Carbone, O. Deniz Akyildiz
RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo
Jueun Ko, Hyewon Park, Hyesong Choi et al.
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
Stefan Andreas Baumann, Felix Krause, Michael Neumayr et al.
Flow based approach for Dynamic Temporal Causal models with non-Gaussian or Heteroscedastic Noises
Abdellah Rahmani, Pascal Frossard
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu, Ruize Zhang, Chao Yu et al.
Parameter-free Algorithms for the Stochastically Extended Adversarial Model
Shuche Wang, Adarsh Barik, Peng Zhao et al.
Learning Without Augmenting: Unsupervised Time Series Representation Learning via Frame Projections
Berken Utku Demirel, Christian Holz
Removing Cost Volumes from Optical Flow Estimators
Simon Kiefhaber, Stefan Roth, Simone Schaub-Meyer
MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation
Prerit Gupta, Jason Alexander Fotso-Puepi, Zhengyuan Li et al.
Scalable Exploration via Ensemble++
Yingru Li, Jiawei Xu, Baoxiang Wang et al.
BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning
Xuechen Zhang, Zijian Huang, Yingcong Li et al.
Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery
Jizhou Han, Shaokun Wang, Yuhang He et al.
ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation
Jimyeong Kim, Jungwon Park, Yeji Song et al.
GraphTOP: Graph Topology-Oriented Prompting for Graph Neural Networks
Xingbo Fu, Zhenyu Lei, Zihan Chen et al.
Algorithm- and Data-Dependent Generalization Bounds for Diffusion Models
Benjamin Dupuis, Dario Shariatian, Maxime Haddouche et al.
On Vanishing Gradients, Over-Smoothing, and Over-Squashing in GNNs: Bridging Recurrent and Graph Learning
Alvaro Arroyo, Alessio Gravina, Benjamin Gutteridge et al.
Erasing Conceptual Knowledge from Language Models
Rohit Gandikota, Sheridan Feucht, Samuel Marks et al.
Can Agent Fix Agent Issues?
Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen et al.
SpecTRe-GS: Modeling Highly Specular Surfaces with Reflected Nearby Objects by Tracing Rays in 3D Gaussian Splatting
Jiajun Tang, Fan Fei, Zhihao Li et al.
MixAT: Combining Continuous and Discrete Adversarial Training for LLMs
Csaba Dékány, Stefan Balauca, Dimitar I. Dimitrov et al.
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
Zijian Guo, İlker Işık, H M Sabbir Ahmad et al.
V-CECE: Visual Counterfactual Explanations via Conceptual Edits
Nikolaos Spanos, Maria Lymperaiou, Giorgos Filandrianos et al.
Learning from Demonstrations via Capability-Aware Goal Sampling
Yuanlin Duan, Yuning Wang, Wenjie Qiu et al.
Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation
Zhi-Kai Chen, Jun-Peng Jiang, Han-Jia Ye et al.
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
Ibtihel Amara, Ahmed Imtiaz Humayun, Ivana Kajic et al.
KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles
Chaoyong Yang, Jia-Li Yin, Bin Chen et al.
MI-TRQR: Mutual Information-Based Temporal Redundancy Quantification and Reduction for Energy-Efficient Spiking Neural Networks
Dengfeng Xue, Wenjuan Li, Yifan Lu et al.
Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information
Zhaoxin Yuan, Shuang Yang, Shiguang Shan et al.
Approximate Domain Unlearning for Vision-Language Models
Kodai Kawamura, Yuta Goto, Rintaro Yanagi et al.
The Good, the Bad and the Ugly: Meta-Analysis of Watermarks, Transferable Attacks and Adversarial Defenses
Greg Gluch, Berkant Turan, Sai Ganesh Nagarajan et al.
Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model
Kai Tong, Kang Pan, Xiao Zhang et al.
VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting
Hoonhee Cho, Jae-Young Kang, Giwon Lee et al.
Embodied Crowd Counting
Runling Long, Yunlong Wang, Jia Wan et al.
URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training
Dongyang Fan, Vinko Sabolčec, Martin Jaggi
GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination
Chengwei REN, Fan Zhang, Liangchao Xu et al.
The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation
Sara Ahmadian, Edith Cohen, Uri Stemmer
Latency NMS Attacks: Is It Real Life or Is It Just Fantasy?
Jean-Philippe Monteuuis, Cong Chen, Jonathan Petit
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
Siyuan Bian, Chenghao Xu, Yuliang Xiu et al.
Enhanced Event-based Dense Stereo via Cross-Sensor Knowledge Distillation
Haihao Zhang, Yunjian Zhang, Jianing Li et al.
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
Yuseung Lee, Jihyeon Je, Chanho Park et al.
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback
Jing Dong, Baoxiang Wang, Yaoliang Yu
Chimera: Improving Generalist Model with Domain-Specific Experts
Tianshuo Peng, Mingsheng Li, Jiakang Yuan et al.
Your Text Encoder Can Be An Object-Level Watermarking Controller
Naresh Kumar Devulapally, Mingzhen Huang, Vishal Asnani et al.
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment
Zhenbang Du, Yonggan Fu, Lifu Wang et al.
Music Grounding by Short Video
Zijie Xin, Minquan Wang, Jingyu Liu et al.
Continual Gaussian Mixture Distribution Modeling for Class Incremental Semantic Segmentation
Guilin Zhu, Runmin Wang, Yuanjie Shao et al.
Overcoming Challenges of Long-Horizon Prediction in Driving World Models
Arian Mousakhan, Sudhanshu Mittal, Silvio Galesso et al.
IOSTOM: Offline Imitation Learning from Observations via State Transition Occupancy Matching
Quang Anh Pham, Janaka Brahmanage, Tien Mai et al.
Training-Free Test-Time Adaptation via Shape and Style Guidance for Vision-Language Models
Shenglong Zhou, Manjiang Yin, Leiyu Sun et al.
Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts
Chen Li, Huiying Xu, Changxin Gao et al.
Fair Continuous Resource Allocation with Equality of Impact
Blossom Metevier, Dennis Wei, Karthikeyan Natesan Ramamurthy et al.
DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization
Zihan Ding, Chi Jin, Difan Liu et al.
Scaling and Taming Adversarial Training with Synthetic Data
Juntao Wu, Xianting Huang, Yu Chen et al.
DisCoPatch: Taming Adversarially-driven Batch Statistics for Improved Out-of-Distribution Detection
Francisco Caetano, Christiaan Viviers, Luis Zavala-Mondragón et al.
HubGT: Fast Graph Transformer with Decoupled Hierarchy Labeling
Ningyi Liao, Zihao Yu, Siqiang Luo et al.
Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment
Kaixun Jiang, Zhaoyu Chen, HaiJing Guo et al.
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning
Wooseong Jeong, Kuk-Jin Yoon
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding
Nuoye Xiong, Anqi Dong, Ning Wang et al.
Single-Step Operator Learning for Conditioned Time-Series Diffusion Models
Hui Chen, Vikas Singh
MVTrajecter: Multi-View Pedestrian Tracking with Trajectory Motion Cost and Trajectory Appearance Cost
Taiga Yamane, Ryo Masumura, Satoshi Suzuki et al.
SPOT: Scalable Policy Optimization with Trees for Markov Decision Processes
Xuyuan Xiong, Pedro Chumpitaz-Flores, Kaixun Hua et al.
DualFocus: Depth from Focus with Spatio-Focal Dual Variational Constraints
Sungmin Woo, Sangyoun Lee
HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs
Saleh Ashkboos, Mahdi Nikdan, Rush Tabesh et al.
Multimodal Negative Learning
Baoquan Gong, Xiyuan Gao, Pengfei Zhu et al.
Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning
Yunpeng Jiang, Jianshu Hu, Paul Weng et al.
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz, Urvi Bhuwania, Ayush Jain et al.
Optimal Transport for Brain-Image Alignment: Unveiling Redundancy and Synergy in Neural Information Processing
Yang Xiao, Wang Lu, Jie Ji et al.
Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks
Jiawei Wang, Yushen Zuo, Yuanjun Chai et al.
Optimal community detection in dense bipartite graphs
Julien Chhor, Parker Knight
Adaptive Data-Borrowing for Improving Treatment Effect Estimation using External Controls
Qinwei Yang, Jingyi Li, Peng Wu
CoT Information: Improved Sample Complexity under Chain-of-Thought Supervision
Awni Altabaa, Omar Montasser, John Lafferty
EventMG: Efficient Multilevel Mamba-Graph Learning for Spatiotemporal Event Representation
Sheng Wu, Lin Jin, Hui Feng et al.
Controllable Human-centric Keyframe Interpolation with Generative Prior
Zujin Guo, Size Wu, Zhongang Cai et al.
The Curse of Depth in Large Language Models
Wenfang Sun, Xinyuan Song, Pengxiang Li et al.
Active Learning Meets Foundation Models: Fast Remote Sensing Data Annotation for Object Detection
Marvin Burges, Philipe Dias, Dalton Lunga et al.
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
Daniel Winter, Asaf Shul, Matan Cohen et al.
MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation
Zhenwen Liang, Linfeng Song, Yang Li et al.
Safely Learning Controlled Stochastic Dynamics
Luc Brogat-Motte, Alessandro Rudi, Riccardo Bonalli
Geminio: Language-Guided Gradient Inversion Attacks in Federated Learning
Junjie Shan, Ziqi Zhao, Jialin Lu et al.
ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation
Jiatong Shi, Yifan Cheng, Bo-Hao Su et al.
Jacobian-Based Interpretation of Nonlinear Neural Encoding Model
Xiaohui Gao, Haoran Yang, cheng yue et al.
Hierarchical Information Aggregation for Incomplete Multimodal Alzheimer's Disease Diagnosis
Chengliang Liu, Que Yuanxi, Qihao Xu et al.
Learning Dynamics of RNNs in Closed-Loop Environments
Yoav Ger, Omri Barak
StruDiCO: Structured Denoising Diffusion with Gradient-free Inference-stage Boosting for Memory and Time Efficient Combinatorial Optimization
Yu Wang, Yang Li, Junchi Yan et al.
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models
Stathis Galanakis, Alexandros Lattas, Stylianos Moschoglou et al.
Is Visual in-Context Learning for Compositional Medical Tasks within Reach?
Simon Reiß, Zdravko Marinov, Alexander Jaus et al.
InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling
Xiaoxue Chen, Bhargav Chandaka, Chih-Hao Lin et al.
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Wei Chen, Xin Yan, Bin Wen et al.
Improving Task-Specific Multimodal Sentiment Analysis with General MLLMs via Prompting
Haoyu Zhang, Yinan Zhang, Chaolong Ying et al.
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework
Qirui Mi, Mengyue Yang, Xiangning Yu et al.
MIND: Material Interface Generation from UDFs for Non-Manifold Surface Reconstruction
Xuhui Chen, Fei Hou, Wencheng Wang et al.
UDC-VIT: A Real-World Video Dataset for Under-Display Cameras
Kyusu Ahn, JiSoo Kim, Sangik Lee et al.
SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models
Sen Wang, Jingyi Tian, Le Wang et al.
Task-Aware Prompt Gradient Projection for Parameter-Efficient Tuning Federated Class-Incremental Learning
Hualong Ke, Yachao Zhang, Jiangming Shi et al.
ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints
Debasmit Das, Hyoungwoo Park, Munawar Hayat et al.
Multi-Modal Multi-Task Unified Embedding Model (M3T-UEM): A Task-Adaptive Representation Learning Framework
Rohan Sharma, Changyou Chen, Feng-Ju Chang et al.
Implicit Generative Property Enhancer
Pedro O. Pinheiro, Pan Kessel, Aya Ismail et al.
Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies
Jing Wang, Weiting Peng, Jing Tang et al.
3D-GSRD: 3D Molecular Graph Auto-Encoder with Selective Re-mask Decoding
Chang Wu, ZHIYUAN LIU, Wen Shu et al.
Handling Missing Responses under Cluster Dependence with Applications to Language Model Evaluation
Zhenghao Zeng, David Arbour, Avi Feller et al.