Most Cited 2025 "automated modeling" Papers
22,274 papers found
Learning Conditional Space-Time Prompt Distributions for Video Class-Incremental Learning
Xiaohan Zou, Wenchao Ma, Shu Zhao
PixPerfect: Seamless Latent Diffusion Local Editing with Discriminative Pixel-Space Refinement
Haitian Zheng, Yuan Yao, Yongsheng Yu et al.
IndEgo: A Dataset of Industrial Scenarios and Collaborative Work for Egocentric Assistants
Vivek Chavan, Yasmina Imgrund, Tung Dao et al.
Neural Networks Generalize on Low Complexity Data
Sourav Chatterjee, Timothy Sudijono
SmokeViz: A Large-Scale Satellite Dataset for Wildfire Smoke Detection and Segmentation
Rey Koki, Michael McCabe, Dhruv Kedar et al.
Learning from Interval Targets
Rattana Pukdee, Ziqi Ke, Chirag Gupta
PhySense: Sensor Placement Optimization for Accurate Physics Sensing
Yuezhou Ma, Haixu Wu, Hang Zhou et al.
Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling
Hao Chen, Guanxi Lu, Yasuyuki Okoshi et al.
Rotation-Equivariant Self-Supervised Method in Image Denoising
Hanze Liu, Jiahong Fu, Qi Xie et al.
Aligning Text to Image in Diffusion Models is Easier Than You Think
Jaa-Yeon Lee, ByungHee Cha, Jeongsol Kim et al.
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
Jiazi Bu, Pengyang Ling, Pan Zhang et al.
Disentangling misreporting from genuine adaptation in strategic settings: a causal approach
Dylan Zapzalka, Trenton Chang, Lindsay Warrenburg et al.
MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Zhi-Yuan Zhang, Xiaofan Li, Zhihao Xu et al.
Dimensionality Mismatch Between Brains and Artificial Neural Networks
Santiago Galella, Maren Wehrheim, Matthias Kaschube
Conformal Mixed-Integer Constraint Learning with Feasibility Guarantees
Daniel Ovalle, Lorenz Biegler, Ignacio Grossmann et al.
Generalizable Reasoning through Compositional Energy Minimization
Alexandru Oarga, Yilun Du
Meta-learning how to Share Credit among Macro-Actions
Ionel-Alexandru Hosu, Traian Rebedea, Razvan Pascanu
Disentangled Pose and Appearance Guidance for Multi-Pose Generation
Tengfei Xiao, Yue Wu, Yuelong Li et al.
A Controllable Examination for Long-Context Language Models
Yijun Yang, Zeyu Huang, Wenhao Zhu et al.
ProfiX: Improving Profile-Guided Optimization in Compilers with Graph Neural Networks
Huiri Tan, Juyong Jiang, Jiasi Shen
Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training
Tony Bonnaire, Raphaël Urfin, Giulio Biroli et al.
Reasoning Models Sometimes Output Illegible Chains of Thought
Arun Jose
VI$^3$NR: Variance Informed Initialization for Implicit Neural Representations
Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Sameera Ramasinghe et al.
REVE: A Foundation Model for EEG - Adapting to Any Setup with Large-Scale Pretraining on 25,000 Subjects
Yassine El Ouahidi, Jonathan Lys, Philipp Thölke et al.
Efficient Diffusion as Low Light Enhancer
Guanzhou Lan, Qianli Ma, Yuqi Yang et al.
GliaNet: Adaptive Neural Network Structure Learning with Glia-Driven
Mengqiao Han, Liyuan Pan, Xiabi Liu
SeerAttention: Self-distilled Attention Gating for Efficient Long-context Prefilling
Yizhao Gao, Zhichen Zeng, DaYou Du et al.
Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion
Xiangfeng Xu, Pinyi Zhang, Wenxuan Huang et al.
Hybrid Boundary Physics-Informed Neural Networks for Solving Navier-Stokes Equations with Complex Boundary
ChuYu Zhou, Tianyu Li, Chenxi Lan et al.
VidSeg: Training-free Video Semantic Segmentation based on Diffusion Models
Qian Wang, Abdelrahman Eldesokey, Mohit Mendiratta et al.
Fast attention mechanisms: a tale of parallelism
Jingwen Liu, Hantao Yu, Clayton Sanford et al.
MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework
Ping Guo, Cheng Gong, Fei Liu et al.
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
Huaxin Zhang, Xiaohao Xu, Xiang Wang et al.
Pairwise Calibrated Rewards for Pluralistic Alignment
Daniel Halpern, Evi Micha, Ariel Procaccia et al.
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck, Florian Grötschla, Luca Lanzendörfer et al.
SuperLightNet: Lightweight Parameter Aggregation Network for Multimodal Brain Tumor Segmentation
Feng Yu, Jiacheng Cao, Li Liu et al.
Robust and Scalable Autonomous Reinforcement Learning in Irreversible Environments
Sang-Hyun Lee
Learning from Streaming Video with Orthogonal Gradients
Tengda Han, Dilara Gokay, Joseph Heyward et al.
The Omni-Expert: A Computationally Efficient Approach to Achieve a Mixture of Experts in a Single Expert Model
Sohini Saha, Mezisashe Ojuba, Leslie Collins et al.
Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations
Ahmad Rahimi, Po-Chien Luan, Yuejiang Liu et al.
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation
Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.
MuSLR: Multimodal Symbolic Logical Reasoning
Jundong Xu, Hao Fei, Yuhui Zhang et al.
Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
Zihui Cheng, Qiguang Chen, Xiao Xu et al.
Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization
Shaohan Li, Yunpeng Shi, Gilad Lerman
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
Zichen Tang, Yuan Yao, Miaomiao Cui et al.
CDI: Copyrighted Data Identification in Diffusion Models
Jan Dubiński, Antoni Kowalczuk, Franziska Boenisch et al.
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok, Skander Moalla, Caglar Gulcehre
Bridging Gait Recognition and Large Language Models Sequence Modeling
Shaopeng Yang, Jilong Wang, Saihui Hou et al.
Towards Practical Real-Time Neural Video Compression
Zhaoyang Jia, Bin Li, Jiahao Li et al.
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Feng Liu, Shiwei Zhang, Xiaofeng Wang et al.
SynTSBench: Rethinking Temporal Pattern Learning in Deep Learning Models for Time Series
Qitai Tan, Yiyun Chen, Mo Li et al.
Cross-Rejective Open-Set SAR Image Registration
Shasha Mao, Shiming Lu, Zhaolong Du et al.
TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
Xin Wang, Kai Chen, Jiaming Zhang et al.
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class
James Roggeveen, Erik Wang, David Ettel et al.
Taxonomy-Aware Evaluation of Vision-Language Models
Vésteinn Snæbjarnarson, Kevin Du, Niklas Stoehr et al.
Conformal Prediction under Lévy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations
Liviu Aolaritei, Julie Zhu, Oliver Wang et al.
SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection
Hyo-Jun Lee, Yeong Jun Koh, Hanul Kim et al.
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Haoyang Li, Liang Wang, Chao Wang et al.
Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation
Yuxin Liu, Zhenghao (Mark) Peng, Xuanhao Cui et al.
Learning to Factorize Spatio-Temporal Foundation Models
Siru Zhong, Junjie Qiu, Yangyu Wu et al.
Feasibility-Aware Decision-Focused Learning for Predicting Parameters in the Constraints
Jayanta Mandi, Marianne Defresne, Senne Berden et al.
Protocols for Verifying Smooth Strategies in Bandits and Games
Miranda Christ, Daniel Reichman, Jonathan Shafer
A Unified Approach to Submodular Maximization Under Noise
Kshipra Bhawalkar, Yang Cai, Zhe Feng et al.
Is Noise Conditioning Necessary? A Unified Theory of Unconditional Graph Diffusion Models
Jipeng Li, Yanning Shen
FedCS: Coreset Selection for Federated Learning
Chenhe Hao, Weiying Xie, Daixun Li et al.
GraphI2P: Image-to-Point Cloud Registration with Exploring Pattern of Correspondence via Graph Learning
Lin Bie, Shouan Pan, Siqi Li et al.
FlexUOD: The Answer to Real-world Unsupervised Image Outlier Detection
Zhonghang Liu, Kun Zhou, Changshuo Wang et al.
Samba: A Unified Mamba-based Framework for General Salient Object Detection
Jiahao He, Keren Fu, Xiaohong Liu et al.
Learning to Solve Complex Problems via Dataset Decomposition
Wanru Zhao, Lucas Page-Caccia, Zhengyan Shi et al.
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Alejandro Lozano, Min Woo Sun, James Burgess et al.
$\text{G}^2\text{M}$: A Generalized Gaussian Mirror Method to Boost Feature Selection Power
Hongyu Shen, Zhizhen Jane Zhao
Radial Attention: $\mathcal O(n \log n)$ Sparse Attention for Long Video Generation
Xingyang Li, Muyang Li, Tianle Cai et al.
Energy Loss Functions for Physical Systems
Oumar Kaba, Kusha Sareen, Daniel Levy et al.
Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration
Lizheng Zu, Lin Lin, Song Fu et al.
Constructing an Optimal Behavior Basis for the Option Keyboard
Lucas N. Alegre, Ana Bazzan, Andre Barreto et al.
Accelerated Evolving Set Processes for Local PageRank Computation
Binbin Huang, Luo Luo, Yanghua Xiao et al.
SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation
Claudia Cuttano, Gabriele Trivigno, Giuseppe Averta et al.
Titans: Learning to Memorize at Test Time
Ali Behrouz, Peilin Zhong, Vahab Mirrokni
Compositional Neural Network Verification via Assume-Guarantee Reasoning
Hai Duong, David Shriver, ThanhVu Nguyen et al.
Generating and Checking DNN Verification Proofs
Hai Duong, ThanhVu Nguyen, Matthew Dwyer
Fast Projection-Free Approach (without Optimization Oracle) for Optimization over Compact Convex Set
Chenghao Liu, Enming Liang, Minghua Chen
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error
Beilin Chu, Xuan Xu, Xin Wang et al.
CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Chongjian Ge, Chenfeng Xu, Yuanfeng Ji et al.
Dual Exposure Stereo for Extended Dynamic Range 3D Imaging
Juhyung Choi, Jinneyong Kim, Seokjun Choi et al.
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
Kyo Kuroki, Yasuyuki Okoshi, Thiem Van Chu et al.
Improved Monocular Depth Prediction Using Distance Transform Over Pre-semantic Contours with Self-supervised Neural Networks
Marwane Hariat, Antoine Manzanera, David Filliat
Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
James Oldfield, Shawn Im, Sharon Li et al.
ReCon-GS: Continuum-Preserved Gaussian Streaming for Fast and Compact Reconstruction of Dynamic Scenes
Jiaye Fu, Qiankun Gao, Chengxiang Wen et al.
Orochi: Versatile Biomedical Image Processor
Gaole Dai, Chenghao Zhou, Yu Zhou et al.
Conditional Diffusion Anomaly Modeling on Graphs
Chunyu Wei, Haozhe Lin, Yueguo Chen et al.
SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers
Nikaan Nikzad, Yi Liao, Yongsheng Gao et al.
A$^3$E: Towards Compositional Model Editing
Hongming Piao, Hao Wang, Dapeng Wu et al.
ERUPT: Efficient Rendering with Unposed Patch Transformer
Maxim Shugaev, Vincent Chen, Maxim Karrenbach et al.
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary
Kevin Qinghong Lin, Mike Zheng Shou
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Fu Feng, Yucheng Xie, Xu Yang et al.
ActiveVOO: Value of Observation Guided Active Knowledge Acquisition for Open-World Embodied Lifted Regression Planning
Xiaotian Liu, Ali Pesaranghader, Jaehong Kim et al.
A faster training algorithm for regression trees with linear leaves, and an analysis of its complexity
Kuat Gazizov, Miguel A. Carreira-Perpinan
Knowledge Starts with Practice: Knowledge-Aware Exercise Generative Recommendation with Adaptive Multi-Agent Cooperation
Yangtao Zhou, Hua Chu, chen et al.
Optimality and NP-Hardness of Transformers in Learning Markovian Dynamical Functions
Yanna Ding, Songtao Lu, Yingdong Lu et al.
PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series in Typhoon Forecasting
Haobo Li, Eunseo Jung, Zixin Chen et al.
$\text{S}^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation
Weilun Feng, Haotong Qin, Chuanguang Yang et al.
Variance-Based Membership Inference Attacks Against Large-Scale Image Captioning Models
Daniel Samira, Edan Habler, Yuval Elovici et al.
Generalizing Experience for Language Agents with Hierarchical MetaFlows
Shengda Fan, Xin Cong, Zhong Zhang et al.
Stratify or Die: Rethinking Data Splits in Image Segmentation
Naga Venkata Sai Jitin Jami, Thomas Altstidl, Jonas Mueller et al.
NEED: Cross-Subject and Cross-Task Generalization for Video and Image Reconstruction from EEG Signals
Shuai Huang, Huan Luo, Haodong Jing et al.
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation
Libiao Chen, Dong Nie, Junjun Pan et al.
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Changyao Tian, Hao Li, Gen Luo et al.
Decreasing Entropic Regularization Averaged Gradient for Semi-Discrete Optimal Transport
Ferdinand Genans, Antoine Godichon-Baggioni, François-Xavier Vialard et al.
Are Language Models Efficient Reasoners? A Perspective from Logic Programming
Andreas Opedal, Yanick Zengaffinen, Haruki Shirakami et al.
Understanding Differential Transformer Unchains Pretrained Self-Attentions
Chaerin Kong, Jiho Jang, Nojun Kwak
Enhanced Self-Distillation Framework for Efficient Spiking Neural Network Training
Xiaochen Zhao, Chengting Yu, Kairong Yu et al.
Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models
Sai Niranjan Ramachandran, Manish Krishan Lal, Suvrit Sra
Monoculture or Multiplicity: Which Is It?
Mila Gorecki, Moritz Hardt
A Principled Path to Fitted Distributional Evaluation
Sungee Hong, Jiayi Wang, Zhengling Qi et al.
HuMoCon: Concept Discovery for Human Motion Understanding
Qihang Fang, Chengcheng Tang, Bugra Tekin et al.
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning
Sheng Wang, Pengan Chen, Jingqi Zhou et al.
TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation
Zhekai Chen, Ruihang Chu, Yukang Chen et al.
Camera Resection from Known Line Pencils and a Radially Distorted Scanline
Juan Carlos Dibene Simental, Enrique Dunn
Inpainting the Neural Picture: Inferring Unrecorded Brain Area Dynamics from Multi-Animal Datasets
Ji Xia, Yizi Zhang, Shuqi Wang et al.
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
Yuanyou Xu, Zongxin Yang, Yi Yang
Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models
Jisung Hwang, Jaihoon Kim, Minhyuk Sung
Achieving $\tilde{\mathcal{O}}(1/N)$ Optimality Gap in Restless Bandits through Gaussian Approximation
Chen Yan, Weina Wang, Lei Ying
Reverse Engineering Human Preferences with Reinforcement Learning
Lisa Alazraki, Yi-Chern Tan, Jon Ander Campos et al.
Closest Neighbors are Harmful for Lightweight Masked Auto-encoders
Jian Meng, Ahmed Hasssan, Li Yang et al.
Large Language Diffusion Models
Shen Nie, Fengqi Zhu, Zebin You et al.
RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data
Wenchao Ma, Dario Kneubuehler, Maurice Chu et al.
ELDET: Early-Learning Distillation with Noisy Labels for Object Detection
Dongmin Choi, Sangbin Lee, EungGu Yun et al.
Safe and Stable Control via Lyapunov-Guided Diffusion Models
Xiaoyuan Cheng, Xiaohang Tang, Yiming Yang
OrdShap: Feature Position Importance for Sequential Black-Box Models
Davin Hill, Brian Hill, Aria Masoomi et al.
Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
Jikang Cheng, Zhiyuan Yan, Ying Zhang et al.
One Head to Rule Them All: Amplifying LVLM Safety through a Single Critical Attention Head
Junhao Xia, Haotian Zhu, Shuchao Pang et al.
ProteinConformers: Benchmark Dataset for Simulating Protein Conformational Landscape Diversity and Plausibility
Yihang Zhou, Chen Wei, Minghao Sun et al.
RDD: Robust Feature Detector and Descriptor using Deformable Transformer
Gonglin Chen, Tianwen Fu, Haiwei Chen et al.
Quadratic Coreset Selection: Certifying and Reconciling Sequence and Token Mining for Efficient Instruction Tuning
Ziliang Chen, Yongsen Zheng, Zhao-Rong Lai et al.
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering
Guofeng Feng, Siyan Chen, Rong Fu et al.
Distribution-Aware Tensor Decomposition for Compression of Convolutional Neural Networks
Alper Kalle, Théo Rudkiewicz, Mohamed Ouerfelli et al.
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Hasin Us Sami, Swapneel Sen, Amit K. Roy-Chowdhury et al.
From Indicators to Insights: Diversity-Optimized for Medical Series-Text Decoding via LLMs
Xiyuan Jin, Jing Wang, Ziwei Lin et al.
Pre-trained Large Language Models Learn to Predict Hidden Markov Models In-context
Yijia Dai, Zhaolin Gao, Yahya Sattar et al.
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos
Yuzheng Liu, Siyan Dong, Shuzhe Wang et al.
Conformal Online Learning of Deep Koopman Linear Embeddings
Ben Gao, Jordan Patracone, Stephane Chretien et al.
ShapeX: Shapelet-Driven Post Hoc Explanations for Time Series Classification Models
Bosong Huang, Ming Jin, Yuxuan Liang et al.
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
Leonie Bossemeyer, Samuel Heinrich, Grant Van Horn et al.
Isotropic Noise in Stochastic and Quantum Convex Optimization
Annie Marsden, Liam O'Carroll, Aaron Sidford et al.
TransPixeler: Advancing Text-to-Video Generation with Transparency
Luozhou Wang, Yijun Li, ZhiFei Chen et al.
Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation
Jiawei Fu, Zhang Tiantian, Kai Chen et al.
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects
Yue Fan, Ningjing Fan, Ivan Skorokhodov et al.
Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model
Yuxiang Mao, Zhenfeng Fan, Zhijie Zhang et al.
Improving Formal Reasoning of Transformer with State Stack
Kechi Zhang, Ge Li, Jia Li et al.
Distance-informed Neural Processes
Aishwarya Venkataramanan, Joachim Denzler
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
Yutao Tang, Yuxiang Guo, Deming Li et al.
Let's Chorus: Partner-aware Hybrid Song-Driven 3D Head Animation
Xiumei Xie, Zikai Huang, Wenhao Xu et al.
Dense-SfM: Structure from Motion with Dense Consistent Matching
JongMin Lee, Sungjoo Yoo
RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
Jingyi Yang, Shuai Shao, Dongrui Liu et al.
Model Merging in Pre-training of Large Language Models
Yunshui Li, Yiyuan Ma, Shen Yan et al.
Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling
Yanchen Luo, Zhiyuan Liu, Yi Zhao et al.
Angles Don’t Lie: Unlocking Training‑Efficient RL Through the Model’s Own Signals
Qinsi Wang, Jinghan Ke, Hancheng Ye et al.
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
Mateusz Pach, Shyamgopal Karthik, Quentin Bouniot et al.
Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error
Panagiotis Giannoulis, Yorgos Pantis, Christos Tzamos
LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
Daniel Etaat, Dvij Rajesh Kalaria, Nima Rahmanian et al.
Effortless Active Labeling for Long-Term Test-Time Adaptation
Guowei Wang, Changxing Ding
Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
Lin Guo, Xiaoqing Luo, Wei Xie et al.
SHF: Symmetrical Hierarchical Forest with Pretrained Vision Transformer Encoder for High-Resolution Medical Segmentation
Enzhi Zhang, Peng Chen, Rui Zhong et al.
T-norm Selection for Object Detection in Autonomous Driving with Logical Constraints
Thomas Eiter, Katsumi Inoue, Nelson Higuera et al.
Emerging Risks from Embodied AI Require Urgent Policy Action
Jared Perlo, Alexander Robey, Fazl Barez et al.
GVPO: Group Variance Policy Optimization for Large Language Model Post-Training
Kaichen Zhang, Yuzhong Hong, Junwei Bao et al.
Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World
Qinting Jiang, Chuyang Ye, Dongyan Wei et al.
Multi-order Orchestrated Curriculum Distillation for Model-Heterogeneous Federated Graph Learning
Guancheng Wan, Xu Cheng, Run Liu et al.
Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding
Zhaoran Zhao, Peng Lu, Anran Zhang et al.
FracFace: Breaking The Visual Clues—Fractal-Based Privacy-Preserving Face Recognition
Wanying Dai, Beibei Li, Naipeng Dong et al.
Knot So Simple: A Minimalistic Environment for Spatial Reasoning
Zizhao Chen, Yoav Artzi
Wide-Horizon Thinking and Simulation-Based Evaluation for Real-World LLM Planning with Multifaceted Constraints
Dongjie Yang, Chengqiang Lu, Qimeng Wang et al.
LIM: Large Interpolator Model for Dynamic Reconstruction
Remy Sabathier, Niloy J. Mitra, David Novotny
SciArena: An Open Evaluation Platform for Non-Verifiable Scientific Literature-Grounded Tasks
Yilun Zhao, Kaiyan Zhang, Tiansheng Hu et al.
OSTAR: Optimized Statistical Text-classifier with Adversarial Resistance
Yuhan Yao, Feifei Kou, Lei Shi et al.
FRBNet: Revisiting Low-Light Vision through Frequency-Domain Radial Basis Network
Fangtong Sun, Congyu Li, Ke Yang et al.
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Felix Wimbauer, Weirong Chen, Dominik Muhle et al.
Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions
Liuyuan Jiang, Quan Xiao, Lisha Chen et al.
SDBF: Steep-Decision-Boundary Fingerprinting for Hard-Label Tampering Detection of DNN Models
Xiaofan Bai, Shixin Li, Xiaojing Ma et al.
Optimal Best Arm Identification under Differential Privacy
Marc Jourdan, Achraf Azize
TIDMAD: Time Series Dataset for Discovering Dark Matter with AI Denoising
Jessica Fry, Xinyi Fu, Zhenghao Fu et al.
Zero-shot World Models via Search in Memory
Federico Malato, Ville Hautamäki
A Near-optimal, Scalable and Parallelizable Framework for Stochastic Bandits Robust to Adversarial Corruptions and Beyond
Zicheng Hu, Cheng Chen
Valid Inference with Imperfect Synthetic Data
Yewon Byun, Shantanu Gupta, Zachary Lipton et al.
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images
Junjin Xiao, Qing Zhang, Yongwei Nie et al.
Spectral Compressive Imaging via Chromaticity-Intensity Decomposition
Xiaodong Wang, Zijun He, Ping Wang et al.
VITRIX-CLIPIN: Enhancing Fine-Grained Visual Understanding in CLIP via Instruction-Editing Data and Long Captions
Ziteng Wang, Siqi Yang, Limeng Qiao et al.
A Standardized Benchmark for Multilabel Antimicrobial Peptide Classification
Sebastian Ojeda, Rafael Velasquez, Nicolás Aparicio et al.
IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images
Chih-Hao Lin, Jia-Bin Huang, Zhengqin Li et al.
Gromov–Wasserstein Problem with Cyclic Symmetry
Shoichiro Takeda, Yasunori Akagi
UMoE: Unifying Attention and FFN with Shared Experts
Yuanhang Yang, Chaozheng Wang, Jing Li
CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video
Zhaolin Wan, Han Qin, Zhiyang Li et al.
Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance
Meng Wang, Fan Wu, Ruihui Li et al.
Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models
Michael Plainer, Hao Wu, Leon Klein et al.
A Universal Scale-Adaptive Deformable Transformer for Image Restoration across Diverse Artifacts
Xuyi He, Yuhui Quan, Ruotao Xu et al.
Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance
Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Andrew Zhao, Yiran Wu, Yang Yue et al.
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Shenzhi Wang, Le Yu, Chang Gao et al.
Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs
Tengyun Ma, Jiaqi Yao, Daojing He et al.
Neural Inverse Rendering from Propagating Light
Anagh Malik, Benjamin Attal, Andrew Xie et al.