Most Cited 2024 "semantic causal graphs" Papers
12,324 papers found • Page 58 of 62
Conference
InstructDET: Diversifying Referring Object Detection with Generalized Instructions
Ronghao Dang, Jiangyan Feng, Haodong Zhang et al.
Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Zheng Ding, Mengqi Zhang, Jiajun Wu et al.
Teach LLMs to Phish: Stealing Private Information from Language Models
Ashwinee Panda, Christopher Choquette-Choo, Zhengming Zhang et al.
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations
Tianyu Guo, Wei Hu, Song Mei et al.
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei GUO, Ceyuan Yang, Anyi Rao et al.
Efficient Integrators for Diffusion Generative Models
Kushagra Pandey, Maja Rudolph, Stephan Mandt
AttEXplore: Attribution for Explanation with model parameters eXploration
Zhiyu Zhu, Huaming Chen, Jiayu Zhang et al.
Symmetric Basis Convolutions for Learning Lagrangian Fluid Mechanics
Rene Winchenbach, Nils Thuerey
You Only Query Once: An Efficient Label-Only Membership Inference Attack
Yutong Wu, Han Qiu, Shangwei Guo et al.
The Marginal Value of Momentum for Small Learning Rate SGD
Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Xiangyu Qi, Yi Zeng, Tinghao Xie et al.
CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?
Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner et al.
On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters
Matthias Lanzinger, Pablo Barcelo
Multisize Dataset Condensation
Yang He, Lingao Xiao, Joey Tianyi Zhou et al.
Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations
Giovanni De Felice, Andrea Cini, Daniele Zambon et al.
Point2SSM: Learning Morphological Variations of Anatomies from Point Clouds
Jadie Adams, Shireen Elhabian
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models
Ashutosh Baheti, Ximing Lu, Faeze Brahman et al.
Gradual Optimization Learning for Conformational Energy Minimization
Artem Tsypin, Leonid A. Ugadiarov, Kuzma Khrabrov et al.
Communication-Efficient Gradient Descent-Accent Methods for Distributed Variational Inequalities: Unified Analysis and Local Updates
Siqi Zhang, Sayantan Choudhury, Sebastian Stich et al.
Reasoning with Latent Diffusion in Offline Reinforcement Learning
Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella et al.
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Sihan Chen, Xingjian He, Handong Li et al.
FedWon: Triumphing Multi-domain Federated Learning Without Normalization
Weiming Zhuang, Lingjuan Lyu
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq, Qingfeng Lan, Pan Xu et al.
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding, Ankur Mallick, Chi Wang et al.
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jing Liu, Ruihao Gong, Xiuying Wei et al.
InfoCon: Concept Discovery with Generative and Discriminative Informativeness
Ruizhe Liu, Qian Luo, Yanchao Yang
Sparse Autoencoders Find Highly Interpretable Features in Language Models
Robert Huben, Hoagy Cunningham, Logan Smith et al.
Fixed Non-negative Orthogonal Classifier: Inducing Zero-mean Neural Collapse with Feature Dimension Separation
Hoyong Kim, Kangil Kim
Self-supervised Representation Learning from Random Data Projectors
Yi Sui, Tongzi Wu, Jesse Cresswell et al.
Dual-Encoders for Extreme Multi-label Classification
Nilesh Gupta, Fnu Devvrit, Ankit Singh Rawat et al.
Privileged Sensing Scaffolds Reinforcement Learning
Edward Hu, James Springer, Oleh Rybkin et al.
Fully Hyperbolic Convolutional Neural Networks for Computer Vision
Ahmad Bdeir, Kristian Schwethelm, Niels Landwehr
Cameras as Rays: Pose Estimation via Ray Diffusion
Jason Zhang, Amy Lin, Moneish Kumar et al.
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
Simon Ging, Maria A. Bravo, Thomas Brox
ResFields: Residual Neural Fields for Spatiotemporal Signals
Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys et al.
Prompt Gradient Projection for Continual Learning
Jingyang Qiao, Zhizhong Zhang, Xin Tan et al.
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini et al.
Single Motion Diffusion
Sigal Raab, Inbal Leibovitch, Guy Tevet et al.
DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation
Driton Salihu, Adam Misik, Yuankai Wu et al.
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Shengyi Huang, Jiayi Weng, Rujikorn Charakorn et al.
SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs
Jaehyung Kim, Jaehyun Nam, Sangwoo Mo et al.
Feature Collapse
Thomas Laurent, James von Brecht, Xavier Bresson
HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs
Sunwoo Kim, Shinhwan Kang, Fanchen Bu et al.
Multi-Resolution Diffusion Models for Time Series Forecasting
Lifeng Shen, Weiyu Chen, James Kwok
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai, Federico Tomasi, Sina Ghiassian
Non-negative Contrastive Learning
Yifei Wang, Qi Zhang, Yaoyu Guo et al.
Model Merging by Uncertainty-Based Gradient Matching
Nico Daheim, Thomas Möllenhoff, Edoardo M. Ponti et al.
Idempotence and Perceptual Image Compression
Tongda Xu, Ziran Zhu, Dailan He et al.
Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data?
Megan Richards, Polina Kirichenko, Diane Bouchacourt et al.
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila, Changho Shin, Linrong Cai et al.
Towards image compression with perfect realism at ultra-low bitrates
Marlene Careil, Matthew J Muckley, Jakob Verbeek et al.
Learning Optimal Contracts: How to Exploit Small Action Spaces
Francesco Bacchiocchi, Matteo Castiglioni, Alberto Marchesi et al.
Transferring Labels to Solve Annotation Mismatches Across Object Detection Datasets
Yuan-Hong Liao, David Acuna, Rafid Mahmood et al.
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande et al.
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
Zhilu Zhang, Haoyu Wang, Shuai Liu et al.
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization
Fei Kong, Jinhao Duan, ruipeng ma et al.
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
Seonghyeon Ye, Doyoung Kim, Sungdong Kim et al.
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
Siqi Kou, Lei Gan, Dequan Wang et al.
Test-Time Training on Nearest Neighbors for Large Language Models
Moritz Hardt, Yu Sun
Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization
Mohammad Pedramfar, Yididiya Nadew, Chris Quinn et al.
Critical Learning Periods Emerge Even in Deep Linear Networks
Michael Kleinman, Alessandro Achille, Stefano Soatto
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency
Tianhong Li, Sangnie Bhardwaj, Yonglong Tian et al.
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
Ziheng Qin, Kai Wang, Zangwei Zheng et al.
PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks
Junwei Su, Difan Zou, Chuan Wu
Elastic Feature Consolidation For Cold Start Exemplar-Free Incremental Learning
Simone Magistri, Tomaso Trinci, Albin Soutif--Cormerais et al.
Investigating the Benefits of Projection Head for Representation Learning
Yihao Xue, Eric Gan, Jiayi Ni et al.
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
Suhyeon Lee, Won Jun Kim, Jinho Chang et al.
Stochastic Modified Equations and Dynamics of Dropout Algorithm
Zhongwang Zhang, Yuqing Li, Tao Luo et al.
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao, Yanbin Zhao, Zhiyang Teng et al.
Conformal Inductive Graph Neural Networks
Soroush H. Zargarbashi, Aleksandar Bojchevski
Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation
Jae-Hong Lee, Joon-Hyuk Chang
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Zihan Ding, Chi Jin
FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity
Kai Yi, Nidham Gazagnadou, Peter Richtarik et al.
LRM: Large Reconstruction Model for Single Image to 3D
Yicong Hong, Kai Zhang, Jiuxiang Gu et al.
Generative Sliced MMD Flows with Riesz Kernels
Johannes Hertrich, Christian Wald, Fabian Altekrüger et al.
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan, Li Dong, Shaohan Huang et al.
BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Tinghao Xie, Xiangyu Qi, Ping He et al.
Self-Supervised Dataset Distillation for Transfer Learning
Dong Bok Lee, Seanie Lee, Joonho Ko et al.
Large-scale Training of Foundation Models for Wearable Biosignals
Salar Abbaspourazad, Oussama Elachqar, Andrew Miller et al.
Provably Robust Conformal Prediction with Improved Efficiency
Ge Yan, Yaniv Romano, Tsui-Wei Weng
Accelerating Distributed Stochastic Optimization via Self-Repellent Random Walks
Jie Hu, Vishwaraj Doshi, Do Young Eun
Adaptive Federated Learning with Auto-Tuned Clients
Junhyung Lyle Kim, Mohammad Taha Toghani, Cesar Uribe et al.
LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Weidi Xu, Jingwei Wang, Lele Xie et al.
OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie, Varun Jampani, Lei Zhong et al.
Leave-one-out Distinguishability in Machine Learning
Jiayuan Ye, Anastasia Borovykh, Soufiane Hayou et al.
In defense of parameter sharing for model-compression
Aditya Desai, Anshumali Shrivastava
Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting
Rong Dai, Yonggang Zhang, Ang Li et al.
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?
Wenxuan Li, Alan Yuille, Zongwei Zhou
Revisit and Outstrip Entity Alignment: A Perspective of Generative Models
Lingbing Guo, Zhuo Chen, Jiaoyan Chen et al.
OMNI: Open-endedness via Models of human Notions of Interestingness
Jenny Zhang, Joel Lehman, Kenneth Stanley et al.
Risk Bounds of Accelerated SGD for Overparameterized Linear Regression
Xuheng Li, Yihe Deng, Jingfeng Wu et al.
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
Yucheng Yang, Tianyi Zhou, Qiang HE et al.
Consistent Multi-Class Classification from Multiple Unlabeled Datasets
Zixi Wei, Senlin Shu, Yuzhou Cao et al.
Enhancing Instance-Level Image Classification with Set-Level Labels
Renyu Zhang, Aly Khan, Yuxin Chen et al.
Coeditor: Leveraging Repo-level Diffs for Code Auto-editing
Jiayi Wei, Greg Durrett, Isil Dillig
Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence
Sunghwan Hong, Seokju Cho, Seungryong Kim et al.
Causality-Inspired Spatial-Temporal Explanations for Dynamic Graph Neural Networks
Kesen Zhao, Liang Zhang
Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty
Changbin Li, Kangshuo Li, Yuzhe Ou et al.
Leveraging Generative Models for Unsupervised Alignment of Neural Time Series Data
Ayesha Vermani, Il Memming Park, Josue Nassar
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal, Bidisha Samanta, Siddharth Dalmia et al.
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
Yu Chen, Yihan Du, Pihe Hu et al.
Domain-Agnostic Molecular Generation with Chemical Feedback
Yin Fang, Ningyu Zhang, Zhuo Chen et al.
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
Qinyu Zhao, Ming Xu, Kartik Gupta et al.
Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data
Yongsheng Mei, Mahdi Imani, Tian Lan
Safe and Robust Watermark Injection with a Single OoD Image
Shuyang Yu, Junyuan Hong, Haobo Zhang et al.
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi, Jianan Wang, He CAO et al.
Elucidating the design space of classifier-guided diffusion generation
Jiajun Ma, Tianyang Hu, Wenjia Wang et al.
Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer
Youn-Yeol Yu, Jeongwhan Choi, Woojin Cho et al.
Periodicity Decoupling Framework for Long-term Series Forecasting
Tao Dai, Beiliang Wu, Peiyuan Liu et al.
General Stability Analysis for Zeroth-Order Optimization Algorithms
Xinyue Liu, Hualin Zhang, Bin Gu et al.
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning
Tian Jin, Nolan Clement, Xin Dong et al.
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
Junyuan Hong, Jiachen (Tianhao) Wang, Chenhui Zhang et al.
Context is Environment
Sharut Gupta, Stefanie Jegelka, David Lopez-Paz et al.
Denoising Diffusion Step-aware Models
Shuai Yang, Yukang Chen, Luozhou WANG et al.
Initializing Models with Larger Ones
Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu, Jian Ren, Aliaksandr Siarohin et al.
Language-Interfaced Tabular Oversampling via Progressive Imputation and Self-Authentication
June Yong Yang, Geondo Park, Joowon Kim et al.
Counterfactual Density Estimation using Kernel Stein Discrepancies
Diego Martinez-Taboada, Edward Kennedy
EQA-MX: Embodied Question Answering using Multimodal Expression
Md Mofijul Islam, Alexi Gladstone, Riashat Islam et al.
Proper Laplacian Representation Learning
Diego Gomez, Michael Bowling, Marlos C. Machado
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
Huangjie Zheng, Zhendong Wang, Jianbo Yuan et al.
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genomes
Zhihan Zhou, Yanrong Ji, Weijian Li et al.
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Dominique Beaini, Shenyang(Andy) Huang, Joao Cunha et al.
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
Qiang HE, Tianyi Zhou, Meng Fang et al.
PanoDiffusion: 360-degree Panorama Outpainting via Diffusion
Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
Bang An, Sicheng Zhu, Michael-Andrei Panaitescu-Liess et al.
Making LLaMA SEE and Draw with SEED Tokenizer
Yuying Ge, Sijie Zhao, Ziyun Zeng et al.
Removing Biases from Molecular Representations via Information Maximization
Chenyu Wang, Sharut Gupta, Caroline Uhler et al.
Online Continual Learning for Interactive Instruction Following Agents
Byeonghwi Kim, Minhyuk Seo, Jonghyun Choi
Closing the Curious Case of Neural Text Degeneration
Matthew Finlayson, John Hewitt, Alexander Koller et al.
Stable Anisotropic Regularization
William Rudman, Carsten Eickhoff
A Framework for Inference Inspired by Human Memory Mechanisms
Xiangyu Zeng, Jie Lin, Piao Hu et al.
SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation
Uiwon Hwang, Jonghyun Lee, Juhyeon Shin et al.
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association
Zhongpai Gao, Huayi Zhou, Abhishek Sharma et al.
Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors
Jonghyun Lee, Dahuin Jung, Saehyung Lee et al.
A Characterization Theorem for Equivariant Networks with Point-wise Activations
Marco Pacini, Xiaowen Dong, Bruno Lepri et al.
Exploring the Common Appearance-Boundary Adaptation for Nighttime Optical Flow
Hanyu Zhou, Yi Chang, Haoyue Liu et al.
Faster Approximation of Probabilistic and Distributional Values via Least Squares
Weida Li, Yaoliang Yu
Accelerating Sinkhorn algorithm with sparse Newton iterations
Xun Tang, Michael Shavlovsky, Holakou Rahmanian et al.
Circuit Component Reuse Across Tasks in Transformer Language Models
Jack Merullo, Carsten Eickhoff, Ellie Pavlick
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Dan Fu, Hermann Kumbong, Eric Nguyen et al.
Continuous Invariance Learning
LIN Yong, Fan Zhou, Lu Tan et al.
Learning to solve Class-Constrained Bin Packing Problems via Encoder-Decoder Model
Hanni Cheng, Ya Cong, Weihao Jiang et al.
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
Koichi Namekata, Amirmojtaba Sabour, Sanja Fidler et al.
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
Kaifeng Lyu, Jikai Jin, Zhiyuan Li et al.
IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs
Yuzhen Mao, Martin Ester, Ke Li
Dynamic Discounted Counterfactual Regret Minimization
Hang Xu, Kai Li, Haobo Fu et al.
Graphical Multioutput Gaussian Process with Attention
Yijue Dai, Wenzhong Yan, Feng Yin
RetroBridge: Modeling Retrosynthesis with Markov Bridges
Ilia Igashov, Arne Schneuing, Marwin Segler et al.
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
Sam Toyer, Olivia Watkins, Ethan Mendes et al.
Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model
Liyang Zhu, Meng Ding, Vaneet Aggarwal et al.
Visual Data-Type Understanding does not emerge from scaling Vision-Language Models
Vishaal Udandarao, Max F. Burg, Samuel Albanie et al.
Enhancing Tail Performance in Extreme Classifiers by Label Variance Reduction
Anirudh Buvanesh, Rahul Chand, Jatin Prakash et al.
Reward Design for Justifiable Sequential Decision-Making
Aleksa Sukovic, Goran Radanovic
Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors
Hang Yin, Zihao Wang, Yangqiu Song
Generative Pre-training for Speech with Flow Matching
Alexander Liu, Matthew Le, Apoorv Vyas et al.
Enhancing Neural Training via a Correlated Dynamics Model
Jonathan Brokman, Roy Betser, Rotem Turjeman et al.
Modeling state-dependent communication between brain regions with switching nonlinear dynamical systems
Orren Karniol-Tambour, David Zoltowski, E. Mika Diamanti et al.
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
Yuanhao Xiong, Long Zhao, Boqing Gong et al.
Certified Adversarial Robustness for Rate Encoded Spiking Neural Networks
Bhaskar Mukhoty, Hilal AlQuabeh, Giulia De Masi et al.
Graph Metanetworks for Processing Diverse Neural Architectures
Derek Lim, Haggai Maron, Marc T Law et al.
Structuring Representation Geometry with Rotationally Equivariant Contrastive Learning
Sharut Gupta, Joshua Robinson, Derek Lim et al.
TEDDY: Trimming Edges with Degree-based Discrimination Strategy
Hyunjin Seo, Jihun Yun, Eunho Yang
A differentiable brain simulator bridging brain simulation and brain-inspired computing
Chaoming Wang, Tianqiu Zhang, Sichao He et al.
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems
Tianyang Liu, Canwen Xu, Julian McAuley
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong, Yifu Yuan, Jianye HAO et al.
Physics-Regulated Deep Reinforcement Learning: Invariant Embeddings
Hongpeng Cao, Yanbing Mao, Lui Sha et al.
Efficient Planning with Latent Diffusion
Wenhao Li
Structural Fairness-aware Active Learning for Graph Neural Networks
Haoyu Han, Xiaorui Liu, Li Ma et al.
Scalable Neural Network Kernels
Arijit Sehanobish, Krzysztof Choromanski, YUNFAN ZHAO et al.
Language Model Detectors Are Easily Optimized Against
Charlotte Nicks, Eric Mitchell, Rafael Rafailov et al.
How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models
Pascal Chang, Jingwei Tang, Markus Gross et al.
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Fuxiao Liu, Kevin Lin, Linjie Li et al.
Zero-Mean Regularized Spectral Contrastive Learning: Implicitly Mitigating Wrong Connections in Positive-Pair Graphs
Xiong Zhou, Xianming Liu, feilong zhang et al.
Neural Field Classifiers via Target Encoding and Classification Loss
Xindi Yang, Zeke Xie, Xiong Zhou et al.
Variance-enlarged Poisson Learning for Graph-based Semi-Supervised Learning with Extremely Sparse Labeled Data
Xiong Zhou, Xianming Liu, Hao Yu et al.
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution
Shanqi Liu, Dong Xing, Pengjie Gu et al.
Coordinate-Aware Modulation for Neural Fields
Joo Chan Lee, Daniel Rho, Seungtae Nam et al.
Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid Interface Prediction
Ziyang Yu, Wenbing Huang, Yang Liu
An improved analysis of per-sample and per-update clipping in federated learning
Bo Li, Xiaowen Jiang, Mikkel N. Schmidt et al.
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Xinyu Hu, Pengfei Tang, Simiao Zuo et al.
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
Junyan Li, Delin Chen, Yining Hong et al.
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo, Can Xu, Pu Zhao et al.
Numerical Accounting in the Shuffle Model of Differential Privacy
Antti Koskela, Antti Honkela, Mikko Heikkilä
Language Model Decoding as Direct Metrics Optimization
Haozhe Ji, Pei Ke, Hongning Wang et al.
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
Rishabh Agarwal, Nino Vieillard, Yongchao Zhou et al.
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Ruiquan Huang, Yuan Cheng, Jing Yang et al.
Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds
Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, Sicheng Zhu et al.
Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks
Sung Moon Ko, Sumin Lee, Dae-Woong Jeong et al.
Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping
Enming Liang, Minghua Chen
Knowledge Fusion of Large Language Models
Fanqi Wan, Xinting Huang, Deng Cai et al.
Learning Multi-Agent Communication from Graph Modeling Perspective
Shengchao Hu, Li Shen, Ya Zhang et al.
Evaluating Language Model Agency Through Negotiations
Tim R. Davidson, Veniamin Veselovsky, Michal Kosinski et al.
SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos
Yulei Niu, Wenliang Guo, Long Chen et al.
Overthinking the Truth: Understanding how Language Models Process False Demonstrations
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Qingqing Cao, Sewon Min, Yizhong Wang et al.
The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy Nallamala
A Probabilistic Framework for Modular Continual Learning
Lazar Valkov, Akash Srivastava, Swarat Chaudhuri et al.
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun LEI, Zhengmao He, Chenhao Lu et al.
Universal Backdoor Attacks
Benjamin Schneider, Nils Lukas, Florian Kerschbaum