Most Cited 2025 "space-to-object regression" Papers
22,274 papers found • Page 108 of 112
Conference
A Computational Framework for Modeling Emergence of Color Vision in the Human Brain
Atsunobu Kotani, Yi-Ren Ng
Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation
Yan Sun, Stanley Kok
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
Ze Peng, Jian Zhang, Yisen Wang et al.
Flaws of ImageNet, Computer Vision's Favourite Dataset
Nikita Kisel, Illia Volkov, Kateřina Hanzelková et al.
Lossy Compression with Pretrained Diffusion Models
jeremy vonderfecht, Feng Liu
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma, Zhengding Luo, Thanh Vinh Vo et al.
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Jintao Zhang, Jia wei, Pengle Zhang et al.
PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations
Viraj Dhananjaya Bandara Jayasundara Jayasundara Mudiyanselage, Heng Zhao, Demetrio Labate et al.
DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning
Yifan Lu, Jiajun Le, Zizhuo Li et al.
Extendable and Iterative Structure Learning Strategy for Bayesian Networks
Hamid Kalantari, Russell Greiner, Pouria Ramazi
Transformers Provably Solve Parity Efficiently with Chain of Thought
Juno Kim, Taiji Suzuki
Generalized Behavior Learning from Diverse Demonstrations
Varshith Sreeramdass, Rohan Paleja, Letian Chen et al.
ILLUSION: Unveiling Truth with a Comprehensive Multi-Modal, Multi-Lingual Deepfake Dataset
Kartik Thakral, Rishabh Ranjan, Akanksha Singh et al.
RuAG: Learned-rule-augmented Generation for Large Language Models
Yudi Zhang, Pei Xiao, Lu Wang et al.
Improving Deep Regression with Tightness
Shihao Zhang, Yuguang Yan, Angela Yao
GSE: Group-wise Sparse and Explainable Adversarial Attacks
Shpresim Sadiku, Moritz Wagner, Sebastian Pokutta
The impact of allocation strategies in subset learning on the expressive power of neural networks
Ofir Schlisselberg, Ran Darshan
Wavelet Diffusion Neural Operator
Peiyan Hu, Rui Wang, Xiang Zheng et al.
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
Tianyu Zhang, Suyuchen Wang, Lu Li et al.
OccProphet: Pushing the Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with an Observer-Forecaster-Refiner Framework
Junliang Chen, Huaiyuan Xu, Yi Wang et al.
Agree to Disagree: Demystifying Homogeneous Deep Ensembles through Distributional Equivalence
Yipei Wang, Xiaoqian Wang
Discovering Clone Negatives via Adaptive Contrastive Learning for Image-Text Matching
Renjie Pan, Jihao Dong, Hua Yang
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization
Juntao Dai, Taiye Chen, Yaodong Yang et al.
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Xinchen Zhang, Ling Yang, Guohao Li et al.
Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks
Wangjia Yu, Xiaomeng Fu, Qiao Li et al.
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations
Katie Matton, Robert Ness, John Guttag et al.
ProtPainter: Draw or Drag Protein via Topology-guided Diffusion
Zhengxi Lu, Shizhuo Cheng, Yuru Jiang et al.
Redefining the task of Bioactivity Prediction
Yanwen Huang, Bowen Gao, Yinjun JIA et al.
CtD: Composition through Decomposition in Emergent Communication
Boaz Carmeli, Ron Meir, Yonatan Belinkov
Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation
Lun Wang
Model Risk-sensitive Offline Reinforcement Learning
Gwangpyo Yoo, Honguk Woo
Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Rui Hu, Yifan Zhang, Zhuoran Li et al.
Simple yet Effective Incomplete Multi-view Clustering: Similarity-level Imputation and Intra-view Hybrid-group Prototype Construction
Shengju Yu, Zhibin Dong, Siwei Wang et al.
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
Peiwen Yuan, Shaoxiong Feng, Yiwei Li et al.
Personality Alignment of Large Language Models
Minjun Zhu, Yixuan Weng, Linyi Yang et al.
UniRestore3D: A Scalable Framework For General Shape Restoration
Yuang Wang, Yujian Zhang, Sida Peng et al.
Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation
Hossein Mirzaei Sadeghlou, Mojtaba Nafez, Jafar Habibi et al.
From Your Block to Our Block: How to Find Shared Structure Between Stochastic Block Models over Multiple Graphs
Iiro Kumpulainen, Sebastian Dalleiger, Jilles Vreeken et al.
Disentangling, Amplifying, and Debiasing: Learning Disentangled Representations for Fair Graph Neural Networks
Yeon-Chang Lee, Hojung Shin, Sang-Wook Kim
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Peng Xia, Kangyu Zhu, Haoran Li et al.
Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension
Jiahan Li, Tong Chen, Shitong Luo et al.
Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning
Tian-Shuo Liu, Xu-Hui Liu, Ruifeng Chen et al.
Enhancing Pre-trained Representation Classifiability can Boost its Interpretability
Reassessing How to Compare and Improve the Calibration of Machine Learning Models
Muthu Chidambaram, Rong Ge
Bridging the Gap Between f-divergences and Bayes Hilbert Spaces
Linus Lach, Alexander Fottner, Yarema Okhrin
DeepTAGE: Deep Temporal-Aligned Gradient Enhancement for Optimizing Spiking Neural Networks
Wei Liu, Li Yang, Mingxuan Zhao et al.
SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems
Patrick Emami, Zhaonan Li, Saumya Sinha et al.
Revisit the Open Nature of Open Vocabulary Semantic Segmentation
Qiming Huang, Han Hu, Jianbo Jiao
Multi-Scale Fusion for Object Representation
Rongzhen Zhao, Vivienne Huiling Wang, Juho Kannala et al.
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Jiajie Li, Brian Quaranto, Chenhui Xu et al.
SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations
Zhaorun Chen, Francesco Pinto, Minzhou Pan et al.
Improving Long-Text Alignment for Text-to-Image Diffusion Models
Luping Liu, Chao Du, Tianyu Pang et al.
On the Computation of the Fisher Information in Continual Learning
Gido van de Ven
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Xiuyuan Hu, Guoqing Liu, Can Chen et al.
A Geometric Framework for Understanding Memorization in Generative Models
Brendan Ross, Hamidreza Kamkari, Tongzi Wu et al.
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection
Guangsheng Bao, Yanbin Zhao, Juncai He et al.
Investigating Pattern Neurons in Urban Time Series Forecasting
Chengxin Wang, Yiran Zhao, shaofeng cai et al.
Can Watermarks be Used to Detect LLM IP Infringement For Free?
Zhengyue Zhao, Xiaogeng Liu, Somesh Jha et al.
Neural Approximate Mirror Maps for Constrained Diffusion Models
Berthy Feng, Ricardo Baptista, Katherine Bouman
GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment
Aishwarya Jayagopal, Yanrong Zhang, Robert Walsh et al.
On the Fourier analysis in the SO(3) space : the EquiLoPO Network
Dmitrii Zhemchuzhnikov, Sergei Grudinin
Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach
Jason Piquenot, Maxime Berar, Romain Raveaux et al.
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere
Hatef Otroshi Shahreza, Sébastien Marcel
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park, Sebin Kim, Taehong Moon et al.
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou, Demi Ruohan Wang, Boyuan Zheng et al.
Decentralized Optimization with Coupled Constraints
Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.
A Visual Dive into Conditional Flow Matching
Anne Gagneux, Ségolène Martin, Rémi Emonet et al.
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev, Nina Konovalova, Daniil Selikhanovych et al.
Long Context Compression with Activation Beacon
Peitian Zhang, Zheng Liu, Shitao Xiao et al.
K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models
Jaehyung Seo, Heuiseok Lim
CipherPrune: Efficient and Scalable Private Transformer Inference
Yancheng Zhang, Jiaqi Xue, Mengxin Zheng et al.
Data Selection via Optimal Control for Language Models
Yuxian Gu, Li Dong, Hongning Wang et al.
VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems
Xudong Gong, Feng Dawei, Kele Xu et al.
Scaling Laws for Downstream Task Performance in Machine Translation
Berivan Isik, NATALIA PONOMAREVA, Hussein Hazimeh et al.
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon et al.
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Xin Wang, Yu Zheng, Zhongwei Wan et al.
Federated Continual Learning Goes Online: Uncertainty-Aware Memory Management for Vision Tasks and Beyond
Giuseppe Serra, Florian Buettner
Diversity-Rewarded CFG Distillation
Geoffrey Cideron, Andrea Agostinelli, Johan Ferret et al.
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee, Rajarshi Roy, Mengyao Xu et al.
GenXD: Generating Any 3D and 4D Scenes
Yuyang Zhao, Chung-Ching Lin, Kevin Lin et al.
Meta-Continual Learning of Neural Fields
Seungyoon Woo, Junhyeog Yun, Gunhee Kim
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
Xiang Li, Pengfei Li, Yupeng Zheng et al.
DPLM-2: A Multimodal Diffusion Protein Language Model
Xinyou Wang, Zaixiang Zheng, Fei YE et al.
DisCo: Graph-Based Disentangled Contrastive Learning for Cold-Start Cross-Domain Recommendation
Hourun Li, Yifan Wang, Zhiping Xiao et al.
Disentangled Contrastive Bundle Recommendation with Conditional Diffusion
Jiuqiang Li
Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis
Weiwei Lin, Chenhang HE
Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context
Spencer Frei, Gal Vardi
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang, Mingfei Gao, Zhe Gan et al.
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative
Asmar Nadeem, Faegheh Sardari, Robert Dawes et al.
Exploring Local Memorization in Diffusion Models via Bright Ending Attention
Chen Chen, Daochang Liu, Mubarak Shah et al.
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.
Towards Generalization Bounds of GCNs for Adversarially Robust Node Classification
Wen Wen, Han Li, Tieliang Gong et al.
TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics
Lu Yi, Jie Peng, Yanping Zheng et al.
Process Reward Model with Q-value Rankings
Wendi Li, Yixuan Li
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu, Jinjin Gu, Jinfan Hu et al.
Efficient Cross-Episode Meta-RL
Gresa Shala, André Biedenkapp, Pierre Krack et al.
Rethinking Neural Multi-Objective Combinatorial Optimization via Neat Weight Embedding
Jinbiao Chen, Zhiguang Cao, Jiahai Wang et al.
RB-Modulation: Training-Free Stylization using Reference-Based Modulation
Litu Rout, Yujia Chen, Nataniel Ruiz et al.
Single Teacher, Multiple Perspectives: Teacher Knowledge Augmentation for Enhanced Knowledge Distillation
Md Imtiaz Hossain, Sharmen Akhter, Choong Seon Hong et al.
Your Weak LLM is Secretly a Strong Teacher for Alignment
Leitian Tao, Yixuan Li
CONDA: Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
Jihye Choi, Jayaram Raghuram, Yixuan Li et al.
Lean-STaR: Learning to Interleave Thinking and Proving
Haohan Lin, Zhiqing Sun, Sean Welleck et al.
Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Yiming Chen, Yuan Zhang, Liyuan Cao et al.
4K4DGen: Panoramic 4D Generation at 4K Resolution
Renjie Li, Panwang Pan, Bangbang Yang et al.
Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning
Chongjie Si, Zhiyi Shi, Shifan Zhang et al.
Diffusing to the Top: Boost Graph Neural Networks with Minimal Hyperparameter Tuning
Lequan Lin, Dai Shi, Andi Han et al.
When Graph Neural Networks Meet Dynamic Mode Decomposition
Dai Shi, Lequan Lin, Andi Han et al.
DINOv2: Learning Robust Visual Features without Supervision
Pierre Fernandez, Piotr Bojanowski, Gabriel Synnaeve et al.
Uncertainty Herding: One Active Learning Method for All Label Budgets
Wonho Bae, Danica Sutherland, Gabriel Oliveira
$q$-exponential family for policy optimization
Lingwei Zhu, Haseeb Shah, Han Wang et al.
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
Han Shen, Pin-Yu Chen, Payel Das et al.
Addressing Label Shift in Distributed Learning via Entropy Regularization
Zhiyuan Wu, Changkyu Choi, Xiangcheng Cao et al.
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Aiwei Liu, Haoping Bai, Zhiyun Lu et al.
Cross-Entropy Is All You Need To Invert the Data Generating Process
Patrik Reizinger, Alice Bizeul, Attila Juhos et al.
In Search of Forgotten Domain Generalization
Prasanna Mayilvahanan, Roland Zimmermann, Thaddäus Wiedemer et al.
Towards Hierarchical Rectified Flow
Yichi Zhang, Yici Yan, Alex Schwing et al.
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning
Prajwal Koirala, Zhanhong Jiang, Soumik Sarkar et al.
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh, Reza Shirkavand, Shangqian Gao et al.
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan, Andranik Sargsyan, Barsegh Atanyan et al.
Relax and Merge: A Simple Yet Effective Framework for Solving Fair $k$-Means and $k$-sparse Wasserstein Barycenter Problems
Shihong Song, Guanlin Mo, Hu Ding
To Tackle Adversarial Transferability: A Novel Ensemble Training Method with Fourier Transformation
Wanlin Zhang, Weichen Lin, Ruomin Huang et al.
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang, Quanwei Wang, Chenghao Li et al.
VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning
Yichao Liang, Nishanth Kumar, Hao Tang et al.
Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning
Giseung Park, Youngchul Sung
SPD Attack - Prevention of AI Powered Image Editing by Image Immunization
Parth Badgujar, Shorya Singhal, Devansh Bhardwaj
Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization
Xingyu Jiang, Ning Gao, Xiuhui Zhang et al.
Hybrid Regularization Improves Diffusion-based Inverse Problem Solving
Hongkun Dou, Zeyu Li, Jinyang Du et al.
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data
Shubham Toshniwal, Wei Du, Ivan Moshkov et al.
HGM³: Hierarchical Generative Masked Motion Modeling with Hard Token Mining
Minjae Jeong, Yechan Hwang, Jaejin Lee et al.
Logic-Logit: A Logic-Based Approach to Choice Modeling
Shuhan Zhang, Wendi Ren, Shuang Li
Learning Evolving Tools for Large Language Models
Guoxin Chen, Zhong Zhang, Xin Cong et al.
Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds
Shuangqi Li, Hieu Le, Jingyi Xu et al.
Robust Representation Consistency Model via Contrastive Denoising
jiachen lei, Julius Berner, Jiongxiao Wang et al.
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning
Amin Karimi Monsefi, Mengxi Zhou, Nastaran Monsefi et al.
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
Kaiyan Zhang, Jiayuan Zhang, Haoxin Li et al.
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
Guanting Dong, Keming Lu, Chengpeng Li et al.
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Khyathi Chandu, Linjie Li, Anas Awadalla et al.
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Jiahao Cui, Hui Li, Yao Yao et al.
GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians
Shuyi Jiang, Qihao Zhao, Hossein Rahmani et al.
Flat Reward in Policy Parameter Space Implies Robust Reinforcement Learning
HyunKyu Lee, Sung Whan Yoon
Integral Performance Approximation for Continuous-Time Reinforcement Learning Control
Brent Wallace, Jennie Si
A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules
Shih-Hsin Wang, Yuhao Huang, Justin Baker et al.
Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models
Etrit Haxholli, Yeti Z. Gurbuz, Oğul Can et al.
Utility-Directed Conformal Prediction: A Decision-Aware Framework for Actionable Uncertainty Quantification
Santiago Cortes-Gomez, Carlos Patiño, Yewon Byun et al.
Uncovering Overfitting in Large Language Model Editing
Mengqi Zhang, Xiaotian Ye, Qiang Liu et al.
ParetoFlow: Guided Flows in Multi-Objective Optimization
Ye Yuan, Can Chen, Christopher Pal et al.
HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks
Jiuding Sun, Jing Huang, Sidharth Baskaran et al.
Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification
Yunzhen Feng, Elvis Dohmatob, Pu Yang et al.
GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Sarkar Snigdha Sarathi Das, Ryo Kamoi, Bo Pang et al.
Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos
Gengshan Yang, Andrea Bajcsy, Shunsuke Saito et al.
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
Erle Zhu, Yadi Liu, Zhe Zhang et al.
Scaling Long Context Training Data by Long-Distance Referrals
Yonghao Zhuang, Lanxiang Hu, Longfei Yun et al.
AutoBencher: Towards Declarative Benchmark Construction
XIANG LI, Farzaan Kaiyom, Evan Liu et al.
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study
Shawn Tan, Songlin Yang, Aaron Courville et al.
Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Han Wang, Yilin Zhao, Dian Li et al.
How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework
Yinuo Ren, Haoxuan Chen, Grant Rotskoff et al.
ODE-based Smoothing Neural Network for Reinforcement Learning Tasks
Yinuo Wang, Wenxuan Wang, Xujie Song et al.
What is Wrong with Perplexity for Long-context Language Modeling?
Lizhe Fang, Yifei Wang, Zhaoyang Liu et al.
Is Factuality Enhancement a Free Lunch For LLMs? Better Factuality Can Lead to Worse Context-Faithfulness
Baolong Bi, Shenghua Liu, Yiwei Wang et al.
Residual Kernel Policy Network: Enhancing Stability and Robustness in RKHS-Based Reinforcement Learning
Yixian Zhang, Huaze Tang, Huijing Lin et al.
Doubly robust identification of treatment effects from multiple environments
Piersilvio De Bartolomeis, Julia Kostin, Javier Abad et al.
Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-Based Decision-Making Systems
Ruochen Jiao, Shaoyuan Xie, Justin Yue et al.
Factor Graph-based Interpretable Neural Networks
Yicong Li, Kuanjiu Zhou, Shuo Yu et al.
ProteinBench: A Holistic Evaluation of Protein Foundation Models
Fei YE, Zaixiang Zheng, Dongyu Xue et al.
Enhancing Prediction Performance through Influence Measure
Shuguang Yu, Wenqian Xu, Xinyi Zhou et al.
Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
Chenhang Cui, An Zhang, Yiyang Zhou et al.
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia, Siwei Han, Shi Qiu et al.
Learning LLM-as-a-Judge for Preference Alignment
Ziyi Ye, Xiangsheng Li, Qiuchi Li et al.
Reliable and Diverse Evaluation of LLM Medical Knowledge Mastery
Yuxuan Zhou, Xien Liu, Chen Ning et al.
Step-Calibrated Diffusion for Biomedical Optical Image Restoration
Yiwei Lyu, Sung Jik Cha, Cheng Jiang et al.
Learning Graph Quantized Tokenizers
Limei Wang, Kaveh Hassani, Si Zhang et al.
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
Yuxuan Sun, Yunlong Zhang, Yixuan Si et al.
From Decoupling to Adaptive Transformation: a Wider Optimization Space for PTQ
Zhaojing Wen, Qiulin Zhang, Yuan Zhang et al.
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian, Zihao Xie, YiFei Wang et al.
Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models
Lin Zhu, Xinbing Wang, Chenghu Zhou et al.
When Prompt Engineering Meets Software Engineering: CNL-P as Natural and Robust "APIs'' for Human-AI Interaction
Zhenchang Xing, Yang Liu, Zhuo Cheng et al.
Quantifying Generalization Complexity for Large Language Models
Zhenting Qi, Hongyin Luo, Xuliang Huang et al.
Multi-Reward as Condition for Instruction-based Image Editing
Xin Gu, Ming Li, Libo Zhang et al.
A Tight Convergence Analysis of Inexact Stochastic Proximal Point Algorithm for Stochastic Composite Optimization Problems
Shulan Zhu, Chenglong Bao, Defeng Sun et al.
A Benchmark for Semantic Sensitive Information in LLMs Outputs
Qingjie Zhang, Han Qiu, Di Wang et al.
Lipschitz Bandits in Optimal Space
Xiaoyi Zhu, Zengfeng Huang
TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction
Yunfei Liu, Lei Zhu, Lijian Lin et al.
Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization
Shuang Liu, Yihan Wang, Yifan Zhu et al.
GROOT-2: Weakly Supervised Multimodal Instruction Following Agents
Shaofei Cai, Bowei Zhang, Zihao Wang et al.
Computing Circuits Optimization via Model-Based Circuit Genetic Evolution
Zhihai Wang, Jie Wang, Xilin Xia et al.
Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement
Xueyao Zhang, Xiaohui Zhang, Kainan Peng et al.
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
zehan wang, Ziang Zhang, Minjie Hong et al.
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Tanqiu Jiang, Zian Wang, Jiacheng Liang et al.
Sort-free Gaussian Splatting via Weighted Sum Rendering
Qiqi Hou, Randall Rauwendaal, Zifeng Li et al.
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou, Shudong Liu, Maizhen Ning et al.
Gyrogroup Batch Normalization
Ziheng Chen, Yue Song, Xiaojun Wu et al.
ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning
Zihan Ye, Shreyank Gowda, Shiming Chen et al.
GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision
Zihui Zhang, Yafei YANG, Hongtao Wen et al.
QP-SNN: Quantized and Pruned Spiking Neural Networks
Wenjie Wei, Malu Zhang, Zijian Zhou et al.
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Lijie Yang, Zhihao Zhang, Zhuofu Chen et al.
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen, Ziming You, Ran Li et al.
Adam-mini: Use Fewer Learning Rates To Gain More
Yushun Zhang, Congliang Chen, Ziniu Li et al.
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
Zirui Zhao, Hanze Dong, Amrita Saha et al.
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Ziru Chen, Shijie Chen, Yuting Ning et al.
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae, Adam Fisch, Hrayr Harutyunyan et al.