Most Cited 2025 "data efficiency" Papers
22,274 papers found • Page 84 of 112
Conference
Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration
俊琪 高, Zhichang Guo, Dazhi Zhang et al.
Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
Kenkun Liu, Yurong Fu, Weihao Yuan et al.
Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation
Yao Teng, Fu-Yun Wang, Xian Liu et al.
RANK++LETR: Learn to Rank and Optimize Candidates for Line Segment Detection
Xin Tong, Baojie Tian, Yufei Guo et al.
NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification
Mélodie Monod, Alessandro Micheli, Samir Bhatt
Accelerating 3D Molecule Generative Models with Trajectory Diagnosis
Zhilong Zhang, Yuxuan Song, Yichun Wang et al.
Learning to Control Free-Form Soft Swimmers
Changyu Hu, Yanke Qu, Qiuan Yang et al.
Reproducing Kernel Banach Space Models for Neural Networks with Application to Rademacher Complexity Analysis
Alistair Shilton, Sunil Gupta, Santu Rana et al.
Exploring Weather-aware Aggregation and Adaptation for Semantic Segmentation under Adverse Conditions
Yuwen Pan, Rui Sun, Wangkai Li et al.
Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration
Yiyuan Pan, Zhe Liu, Hesheng Wang
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
Huakai Lai, Guoxin Xiong, Huayu Mai et al.
CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition
Kaixiang Yang, Xin Li, Qiang Li et al.
CroPe: Cross-Modal Semantic Compensation Adaptation for All Adverse Scene Understanding
Qin Xu, Qihang Wu, Lu Hongtao et al.
CLIPSym: Delving into Symmetry Detection with CLIP
Tinghan Yang, Md Ashiqur Rahman, Raymond A. Yeh
Complete Structure Guided Point Cloud Completion via Cluster- and Instance-Level Contrastive Learning
Yang Chen, Yirun Zhou, Weizhong Zhang et al.
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
Georgios Papoudakis, Thomas Coste, Jianye Hao et al.
BMW: Bidirectionally Memory bank reWriting for Unsupervised Person Re-Identification
Xiaobin Liu, Jianing Li, Baiwei Guo et al.
Implicit-ARAP: Efficient Handle-Guided Neural Field Deformation via Local Patch Meshing
Daniele Baieri, Filippo Maggioli, Emanuele Rodolà et al.
Online Portfolio Selection with ML Predictions
Ziliang Zhang, Tianming Zhao, Albert Zomaya
Normalize Filters! Classical Wisdom for Deep Vision
Gustavo Perez, Stella X. Yu
Conflict-Aware Knowledge Editing in the Wild: Semantic-Augmented Graph Representation for Unstructured Text
Zhange Zhang, Zhicheng Geng, Yuqing Ma et al.
Rig3R: Rig-Aware Conditioning and Discovery for 3D Reconstruction
Samuel Li, Pujith Kachana, Prajwal Chidananda et al.
Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning
Buzhen Huang, Chen Li, Chongyang Xu et al.
Diffusion-Driven Progressive Target Manipulation for Source-Free Domain Adaptation
Yuyang Huang, Yabo Chen, Junyu Zhou et al.
Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection
Bowen Fu, Wei Wei, Jiaqi Tang et al.
GD$^2$: Robust Graph Learning under Label Noise via Dual-View Prediction Discrepancy
Kailai Li, Jiong Lou, Jiawei Sun et al.
You Can Trust Your Clustering Model: A Parameter-free Self-Boosting Plug-in for Deep Clustering
Hanyang Li, Yuheng Jia, Hui LIU et al.
Toward Better Out-painting: Improving the Image Composition with Initialization Policy Model
Xuan Han, Yihao Zhao, Yanhao Ge et al.
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings
Qingzheng Xu, Ru Cao, Xin Shen et al.
In-Context Fully Decentralized Cooperative Multi-Agent Reinforcement Learning
Chao Li, Bingkun BAO, Yang Gao
Star with Bilinear Mapping
Zelin Peng, Yu Huang, Zhengqin Xu et al.
A Unified Analysis of Stochastic Gradient Descent with Arbitrary Data Permutations and Beyond
Yipeng Li, Xinchen Lyu, Zhenyu Liu
LLM Thought Divergence and Convergence for Dialogue-Based Image Generation Control
Hui Li
Image Stitching in Adverse Condition: A Bidirectional-Consistency Learning Framework and Benchmark
Zengxi Zhang, Junchen Ge, Zhiying Jiang et al.
Fit the Distribution: Cross-Image/Prompt Adversarial Attacks on Multimodal Large Language Models
Hai Yan, Haijian Ma, Xiaowen Cai et al.
Self-Perturbed Anomaly-Aware Graph Dynamics for Multivariate Time-Series Anomaly Detection
Jinyu Cai, Yuan Xie, Glynnis Lim et al.
Curriculum Model Merging: Harmonizing Chemical LLMs for Enhanced Cross-Task Generalization
Baoyi He, Luotian Yuan, Ying Wei et al.
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
Runtao Liu, I Chen, Jindong Gu et al.
ToF-IP: Time-of-Flight Enhanced Sparse Inertial Poser for Real-time Human Motion Capture
Yuan Yao, Shifan Jiang, Yangqing Hou et al.
Do LVLMs Truly Understand Video Anomalies? Revealing Hallucination via Co-Occurrence Patterns
Menghao Zhang, Huazheng Wang, Pengfei Ren et al.
Scalable Extraction of Training Data from Aligned, Production Language Models
Milad Nasr, Javier Rando, Nicholas Carlini et al.
Whose Instructions Count? Resolving Preference Bias in Instruction Fine-Tuning
Jiayu Zhang, Changbang Li, Yinan Peng et al.
Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need
Yongchuan Cui, Peng Liu, HUI ZHANG
How to Learn a Star: Binary Classification with Starshaped Polyhedral Sets
Marie-Charlotte Brandenburg, Katharina Jochemko
SUV: Suppressing Undesired Video Content via Semantic Modulation Based on Text Embeddings
Xiang Lv, Mingwen Shao, Lingzhuang Meng et al.
Dual Domain Control via Active Learning for Remote Sensing Domain Incremental Object Detection
Jiachen Sun, De Cheng, Xi Yang et al.
HOT: Hadamard-based Optimized Training
Seonggon Kim, Juncheol Shin, Seung-taek Woo et al.
OVG-HQ: Online Video Grounding with Hybrid-modal Queries
Runhao Zeng, Jiaqi Mao, Minghao Lai et al.
Thinking vs. Doing: Improving Agent Reasoning by Scaling Test-Time Interaction
Junhong Shen, Hao Bai, Lunjun Zhang et al.
Federated Continuous Category Discovery and Learning
Lixu Wang, Chenxi Liu, Junfeng Guo et al.
SGN: Shifted Window-Based Hierarchical Variable Grouping for Multivariate Time Series Classification
Zenan Ying, Zhi Zheng, huijun hou et al.
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning
Muhammad Anwar Ma'sum, Mahardhika Pratama, Savitha Ramasamy et al.
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
QiXu, Dongxu Wei, Lingzhe Zhao et al.
Continual Slow-and-Fast Adaptation of Latent Neural Dynamics (CoSFan): Meta-Learning What-How & When to Adapt
Ryan Missel, Linwei Wang
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Sandro Papais, Letian Wang, Brian Cheong et al.
Towards the Resistance of Neural Network Fingerprinting to Fine-tuning
Ling Tang, YueFeng Chen, Hui Xue' et al.
MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans
Ran Zhao, Xinxin Dai, Pengpeng Hu et al.
An Effective Levelling Paradigm for Unlabeled Scenarios
Fangming Cui, Zhou Yu, Di Yang et al.
FreeControl: Efficient, Training-Free Structural Control via One-Step Attention Extraction
Jiang Lin, Xinyu Chen, Song Wu et al.
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement
Qianhan Feng, Wenshuo Li, Tong Lin et al.
Proxy-SPEX: Sample-Efficient Interpretability via Sparse Feature Interactions in LLMs
Landon Butler, Abhineet Agarwal, Justin Kang et al.
Policy Optimized Text-to-Image Pipeline Design
Uri Gadot, Rinon Gal, Yftah Ziser et al.
A Geometry-Aware Metric for Mode Collapse in Time Series Generative Models
Yassine ABBAHADDOU, Amine Aboussalah
GLID$^2$E: A Gradient-Free Lightweight Fine-tune Approach for Discrete Biological Sequence Design
Hanqun Cao, Haosen Shi, Chenyu Wang et al.
Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention
Haijing Liu, Zhiyuan Song, Hefeng Wu et al.
Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition
Yandan Wang, Chenqi Guo, Yinglong Ma et al.
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
Aoxiong Yin, Kai Shen, Yichong Leng et al.
GenAssets: Generating in-the-wild 3D Assets in Latent Space
Ze Yang, Jingkang Wang, Haowei Zhang et al.
High-Order Flow Matching: Unified Framework and Sharp Statistical Rates
Maojiang Su, Jerry Yao-Chieh Hu, Yi-Chen Lee et al.
Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation
Zhenhua Ning, Zhuotao Tian, Shaoshuai Shi et al.
Shortcutting Pre-trained Flow Matching Diffusion Models is Almost Free Lunch
Xu Cai, Yang Wu, Qianli Chen et al.
Agreement aware and dissimilarity oriented GLOM
Ru Zeng, Yan Song, Yang ZHANG et al.
Learning Textual Prompts for Open-World Semi-Supervised Learning
Yuxin Fan, Junbiao Cui, Jiye Liang
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
Kang Zhang, Trung X. Pham, Suyeon Lee et al.
Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter
JianHui Zhang, Shen Cheng, Qirui Sun et al.
Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification
Zeqi Ye, Minshuo Chen
CMoB: Modality Valuation via Causal Effect for Balanced Multimodal Learning
Jun Wang, Fuyuan CAO, ZhixinXue et al.
Automatic Visual Instrumental Variable Learning for Confounding-Resistant Domain Generalization
Fuyuan CAO, Shichang Qiao, Kui Yu et al.
Automated Red Teaming for Text-to-Image Models through Feedback-Guided Prompt Iteration with Vision-Language Models
Wei Xu, Kangjie Chen, Jiawei Qiu et al.
KSP: Kolmogorov-Smirnov metric-based Post-Hoc Calibration for Survival Analysis
Jeongho Park, Daheen Kim, Cheoljun Kim et al.
ErrorTrace: A Black-Box Traceability Mechanism Based on Model Family Error Space
Chuanchao Zang, Xiangtao Meng, Wenyu Chen et al.
FlowNet: Modeling Dynamic Spatio-Temporal Systems via Flow Propagation
Yutong Feng, Xu Liu, Yutong Xia et al.
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Shraman Pramanick, Effrosyni Mavroudi, Yale Song et al.
Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation
Junhao Xiao, Yang Wei, Jingyu Wang et al.
Compress Large Language Models via Collaboration Between Learning and Matrix Approximation
Yuesen Liao, Zhiwei Li, Binrui Wu et al.
Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection
Xiaoxiao Wang, Chunxiao Li, Peng Sun et al.
MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation
Bin Xie, Hao Tang, Bin Duan et al.
Self-Assembling Graph Perceptrons
Jialong Chen, Tong Wang, Bowen Deng et al.
Spatial-Temporal Forgery Trace based Forgery Image Identification
Yilin Wang, Zunlei Feng, Jiachi Wang et al.
Democratizing Clinical Risk Prediction with Cross-Cohort Cross-Modal Knowledge Transfer
Qiannan Zhang, Manqi Zhou, Zilong Bai et al.
SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout
Wen Yang, Guodong Liu, Di Ming
ContextFace: Generating Facial Expressions from Emotional Contexts
minjung kim, Minsang Kim, Seung Jun Baek
Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span
Heeseung Yun, Joonil Na, Jaeyeon Kim et al.
Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration
Yiyang Chen, Tianyu Ding, Lei Wang et al.
Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation
Fan Li, Xuanbin Wang, Xuan Wang et al.
DiffIP: Representation Fingerprints for Robust IP Protection of Diffusion Models
Zhuoling Li, Haoxuan Qu, Jason Kuen et al.
UniSite: The First Cross-Structure Dataset and Learning Framework for End-to-End Ligand Binding Site Detection
Jigang Fan, Quanlin Wu, Shengjie Luo et al.
GenIR: Generative Visual Feedback for Mental Image Retrieval
Diji Yang, Minghao Liu, Chung-Hsiang Lo et al.
UniteFormer: Unifying Node and Edge Modalities in Transformers for Vehicle Routing Problems
Dian Meng, Zhiguang Cao, Jie Gao et al.
Pruning-Robust Mamba with Asymmetric Multi-Scale Scanning Paths
Jindi Lv, Yuhao Zhou, Mingjia Shi et al.
Text to Sketch Generation with Multi-Styles
Tengjie Li, Shikui Tu, Lei Xu
QSCA: Quantization with Self-Compensating Auxiliary for Monocular Depth Estimation
Jincheol Yang, Jaemin Choi, Matti Zinke et al.
Solving the Asymmetric Traveling Salesman Problem via Trace-Guided Cost Augmentation
Zhen Zhang, Prof Javen Qinfeng Shi, Wee Sun Lee
Animate and Sound an Image
Xihua Wang, Ruihua Song, Chongxuan Li et al.
GAMMA: Gated Multi-hop Message Passing for Homophily-Agnostic Node Representation in GNNs
Amir Ghazizadeh, Rickard Ewetz, Hao Zheng
Vector Database Watermarking
Zhiwen Ren, Wei Fan, Qiyi Yao et al.
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
Ryan Ramos, Vladan Stojnić, Giorgos Kordopatis-Zilos et al.
SAINT: Sequence-Aware Integration for Spatial Transcriptomics Multi-View Clustering
Zeyu Zhu, KE LIANG, Lingyuan Meng et al.
The Devil is in the Spurious Correlations: Boosting Moment Retrieval with Dynamic Learning
Xinyang Zhou, Fanyue Wei, Lixin Duan et al.
Polyline Path Masked Attention for Vision Transformer
Zhongchen Zhao, Chaodong Xiao, Hui LIN et al.
P-Law: Predicting Quantitative Scaling Law with Entropy Guidance in Large Recommendation Models
Tingjia Shen, Hao Wang, Chuhan Wu et al.
MEH: A Multi-Style Dataset and Toolkit for Advancing Egyptian Hieroglyph Recognition
Maksim Golyadkin, Rubanova Alexandrovna, Aleksandr Utkov et al.
Revolutionizing Training-Free NAS: Towards Efficient Automatic Proxy Discovery via Large Language Models
Haidong Kang, Lihong Lin, Hanling Wang
Towards Explainable and Unprecedented Accuracy in Matching Challenging Finger Crease Patterns
Zhenyu Zhou, Chengdong Dong, Ajay Kumar
Transforming Gaps into Gains: Bridging Model and Data Heterogeneity in Federated Learning via Knowledge Weak-Aware Zones
Ke Li, Yan Ding, Zhiqin Zhu et al.
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval
Bangxiang Lan, Ruobing Xie, Ruixiang Zhao et al.
Unbiased Missing-modality Multimodal Learning
Ruiting Dai, Chenxi Li, Yandong Yan et al.
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild
Jiaying Ying, Heming Du, Kaihao Zhang et al.
Focus-Then-Reuse: Fast Adaptation in Visual Perturbation Environments
Jiahui Wang, Chao Chen, Jiacheng Xu et al.
SE-GUI: Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
Xinbin Yuan, Jian Zhang, Kaixin Li et al.
Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching
Zhankai Li, Weiping Wang, jie li et al.
Generalizable Hand-Object Modeling from Monocular RGB Images via 3D Gaussians
Xingyu Liu, Pengfei Ren, Qi Qi et al.
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba
Juncan Deng, Shuaiting Li, Zeyu Wang et al.
Unified Multi-Agent Trajectory Modeling with Masked Trajectory Diffusion
songru Yang, Zhenwei Shi, Zhengxia Zou
Beyond Average Value Function in Precision Medicine: Maximum Probability-Driven Reinforcement Learning for Survival Analysis
Jianqi Feng, Chengchun Shi, Zhenke Wu et al.
Fourier Clouds: Fast Bias Correction for Imbalanced Semi-Supervised Learning
Jiawei Gu, Yidi Wang, Qingqiang Sun et al.
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Zhenpeng Huang, Jiaqi Li, zihan jia et al.
DM-EFS: Dynamically Multiplexed Expanded Features Set Form for Robust and Efficient Small Object Detection
Aashish Sharma
LVLM-Driven Attribute-Aware Modeling for Visible-Infrared Person Re-Identification
Zhiqi Pang, Lingling Zhao, Junjie Wang et al.
No Experts, No Problem: Avoidance Learning from Bad Demonstrations
Huy Hoang, Tien Mai, Pradeep Varakantham
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
Zonglin Di, Jing Shi, Yifei Fan et al.
Bi-Level Decision-Focused Causal Learning for Large-Scale Marketing Optimization: Bridging Observational and Experimental Data
Shuli Zhang, Hao Zhou, Jiaqi Zheng et al.
Toward Fair and Accurate Cross-Domain Medical Image Segmentation: A VLM-Driven Active Domain Adaptation Paradigm
Hongqiu Wang, Wu Chen, Xiangde Luo et al.
AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
Siyoon Jin, Jisu Nam, Jiyoung Kim et al.
Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations
jing Yang, Qunliang Xing, Mai Xu et al.
Partition to Evolve: Niching-enhanced Evolution with LLMs for Automated Algorithm Discovery
Qinglong Hu, Qingfu Zhang
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher
Yong Guo, Shulian Zhang, Haolin Pan et al.
Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code
WU Sitong, Haoru Tan, Yukang Chen et al.
Semi-Supervised Regression with Heteroscedastic Pseudo-Labels
Xueqing Sun, Renzhen Wang, Quanziang Wang et al.
Personalized Federated Conformal Prediction with Localization
Yinjie Min, Chuchen Zhang, Liuhua Peng et al.
Dual Prototype-Enhanced Contrastive Framework for Class-Imbalanced Graph Domain Adaptation
Xin Ma, Yifan Wang, Siyu Yi et al.
Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos
Xuankai Zhang, Junjin Xiao, Qing Zhang
Towards Reliable and Holistic Visual In-Context Learning Prompt Selection
Wenxiao Wu, Jing-Hao Xue, Chengming Xu et al.
GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection
Jiaming Li, Zhijia Liang, Weikai Chen et al.
Hybrid Re-matching for Continual Learning with Parameter-Efficient Tuning
Weicheng Wang, Guoli Jia, Xialei Liu et al.
Unified Adversarial Augmentation for Improving Palmprint Recognition
Jianlong Jin, Chenglong Zhao, Ruixin Zhang et al.
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation
Chuandong Liu, Xingxing Weng, Shuguo Jiang et al.
Corporate Needs You to Find the Difference: Revisiting Submodular and Supermodular Ratio Optimization Problems
Elfarouk Harb, Yousef Yassin, Chandra Chekuri
NopeRoomGS: Indoor 3D Gaussian Splatting Optimization without Camera Pose Input
Wenbo Li, Yan Xu, Mingde Yao et al.
Enhancing Bioactivity Prediction via Spatial Emptiness Representation of Protein-ligand Complex and Union of Multiple Pockets
Zhiyuan Zhou, Yueming Yin, Yiming Yang et al.
Adversarial Reconstruction Feedback for Robust Fine-grained Generalization
Shijie Wang, Jian Shi, Haojie Li
A Unified Framework for Fair Graph Generation: Theoretical Guarantees and Empirical Advances
Zichong Wang, Zhipeng Yin, Wenbin Zhang
MGUP: A Momentum-Gradient Alignment Update Policy for Stochastic Optimization
Da Chang, Ganzhao Yuan
EyeBench: Predictive Modeling from Eye Movements in Reading
Omer Shubi, David Robert Reich, Keren Gruteke Klein et al.
HYPERION: Fine-Grained Hypersphere Alignment for Robust Federated Graph Learning
Guancheng Wan, Xiaoran Shang, Yuxin Wu et al.
MoFRR: Mixture of Diffusion Models for Face Retouching Restoration
Jiaxin Liu, Qichao Ying, Zhenxing Qian et al.
Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding
Qian Ma, Ruoxiang Xu, Yongqiang Cai
SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models
Sudong Wang, Yunjian Zhang, Yao Zhu et al.
Multi-agent KTO: Enhancing Strategic Interactions of Large Language Model in Language Game
Rong Ye, Yongxin Zhang, yikai zhang et al.
TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs
Yuxiang Zhang, Zhengxu Yu, Weihang Pan et al.
Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection
Yingsong Huang, Hui Guo, Jing Huang et al.
ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation
Haoqi Wu, Wei Dai, Ming Xu et al.
Is the acquisition worth the cost? Surrogate losses for Consistent Two-stage Classifiers
florence regol, Joseph Cotnareanu, Theodore Glavas et al.
Less is More: Efficient Image Vectorization with Adaptive Parameterization
Kaibo Zhao, Liang Bao, Yufei Li et al.
Inverse Image-Based Rendering for Light Field Generation from Single Images
Hyunjun Jung, Hae-Gon Jeon
Rethinking the Role of Verbatim Memorization in LLM Privacy
Tom Sander, Bargav Jayaraman, Mark Ibrahim et al.
Adaptive Gradient Masking for Balancing ID and MLLM-based Representations in Recommendation
Yidong Wu, Siyuan Chen, Binrui Wu et al.
Joint Scheduling of Causal Prompts and Tasks for Multi-Task Learning
Chaoyang Li, Jianyang Qin, Jinhao Cui et al.
DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models
Simone Carnemolla, Matteo Pennisi, Sarinda Samarasinghe et al.
LOPT: Learning Optimal Pigovian Tax in Sequential Social Dilemmas
Yun Hua, Shang Gao, Wenhao Li et al.
CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs
Zhiyuan Ning, Jiawei Shao, Ruge Xu et al.
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
Feichen Gan, Lu Youcun, Yingying Zhang et al.
RiboFlow: Conditional De Novo RNA Co-Design via Synergistic Flow Matching
Runze Ma, Zhongyue Zhang, Zichen Wang et al.
Data-Free Model Extraction for Black-box Recommender Systems via Graph Convolutions
Zeyu Wang, Yidan Song, Shihao Qin et al.
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities
Haoning Wu, Ziheng Zhao, Ya Zhang et al.
IneqSearch: Hybrid Reasoning for Olympiad Inequality Proofs
Zhaoqun Li, Beishui Liao, Qiwei Ye
Enhanced Expert Merging for Mixture-of-Experts in Graph Foundation Models
Lei Liu, Xingyu Xia, Qianqian Xie et al.
Jury-and-Judge Chain-of-Thought for Uncovering Toxic Data in 3D Visual Grounding
Kaixiang Huang, Qifeng Zhang, Jin Wang et al.
Reliable Lifelong Multimodal Editing: Conflict-Aware Retrieval Meets Multi-Level Guidance
Qiang Zhang, Fanrui Zhang, Jiawei Liu et al.
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI
Sangmin Lee, Sungyong Park, Heewon Kim
Optimal Minimum Width for the Universal Approximation of Continuously Differentiable Functions by Deep Narrow MLPs
Geonho Hwang
CLIP-driven Coarse-to-fine Semantic Guidance for Fine-grained Open-set Semi-supervised Learning
Xiaokun Li, Yaping Huang, Qingji Guan
AdaTS: Learning Adaptive Time Series Representations via Dynamic Soft Contrasts
Denizhan Kara, Tomoyoshi Kimura, Jinyang Li et al.
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Eric Slyman, Mehrab Tanjim, Kushal Kafle et al.
VideoGEM: Training-free Action Grounding in Videos
Felix Vogel, Walid Bousselham, Anna Kukleva et al.
AegisGuard: RL-Guided Adapter Tuning for TEE-Based Efficient & Secure On-Device Inference
CHE WANG, Ziqi Zhang, Yinggui Wang et al.
Local-Global Coupling Spiking Graph Transformer for Brain Disorders Diagnosis from Two Perspectives
Geng Zhang, Jiangrong Shen, Kaizhong Zheng et al.
Region-aware Anchoring Mechanism for Efficient Referring Visual Grounding
Shuyi Ouyang, Ziwei Niu, Hongyi Wang et al.
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
Yitong Jiang, Jinwei Gu, Tianfan Xue et al.
SmartCache: Context-aware Semantic Cache for Efficient Multi-turn LLM Inference
Chengye Yu, Tianyu Wang, Zili Shao et al.
Targeted Maximum Likelihood Learning: An Optimization Perspective
Diyang Li, Kyra Gan
Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior
Ruoyu Feng, Yunpeng Qi, Jinming Liu et al.
Don’t Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models
Zitong Shi, Guancheng Wan, Haixin Wang et al.
Event-Equalized Dense Video Captioning
Kangyi Wu, Pengna Li, Jingwen Fu et al.
Leaving No OOD Instance Behind: Instance-Level OOD Fine-Tuning for Anomaly Segmentation
Yuxuan Zhang, Zhenbo Shi, han ye et al.
Axis-level Symmetry Detection with Group-Equivariant Representation
Wongyun Yu, Ahyun Seo, Minsu Cho
STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints
Xiaohang Yang, Qing Wang, Jiahao Yang et al.
GazeGene: Large-scale Synthetic Gaze Dataset with 3D Eyeball Annotations
Yiwei Bao, Zhiming Wang, Feng Lu
Boosting Knowledge Utilization in Multimodal Large Language Models via Adaptive Logits Fusion and Attention Reallocation
Wenbin An, Jiahao Nie, Feng Tian et al.
FAST: Foreground‑aware Diffusion with Accelerated Sampling Trajectory for Segmentation‑oriented Anomaly Synthesis
xichen xu, Yanshu Wang, Jinbao Wang et al.