Oral Papers
1,594 papers found • Page 6 of 32
Deep Value Benchmark: Measuring Whether Models Generalize Deep values or Shallow Preferences
Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.
Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding
Xiaoyi Zhang, Zhaoyang Jia, Zongyu Guo et al.
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts
Tobias Braun, Mark Rothermel, Marcus Rohrbach et al.
DeFoG: Discrete Flow Matching for Graph Generation
Yiming Qin, Manuel Madeira, Dorina Thanou et al.
Delay-DSGN: A Dynamic Spiking Graph Neural Network with Delay Mechanisms for Evolving Graph
Zhiqiang Wang, Jianghao Wen, Jianqing Liang
DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method
Qingwen Zhang, Xiaomeng Zhu, Yushan Zhang et al.
Delving into Large Language Models for Effective Time-Series Anomaly Detection
JUN WOO PARK, Kyudan Jung, Dohyun Lee et al.
Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling
Dehao Zhang, Malu Zhang, Shuai Wang et al.
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
Ziyi Wu, Anil Kag, Ivan Skorokhodov et al.
Dense Video Object Captioning from Disjoint Supervision
Xingyi Zhou, Anurag Arnab, Chen Sun et al.
Depth Any Video with Scalable Synthetic Data
Honghui Yang, Di Huang, Wei Yin et al.
Depth-Bounds for Neural Networks via the Braid Arrangement
Moritz Grillo, Christoph Hertrich, Georg Loho
DGH: Dynamic Gaussian Hair
Junying Wang, Yuanlu Xu, Edith Tretschk et al.
DiffLiG: Diffusion-enhanced Liquid Graph with Attention Propagation for Grid-to-Station Precipitation Correction
Yuxiang Li, Yang Zhang, Li et al.
Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts
Kun Cheng, Xiao He, Lei Yu et al.
Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models
Zeyu Yang, Zijie Pan, Chun Gu et al.
Diffusion-Based Hierarchical Graph Neural Networks for Simulating Nonlinear Solid Mechanics
Tobias Würth, Niklas Freymuth, Gerhard Neumann et al.
Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data
Hengyu Fu, Zehao Dou, Jiawei Guo et al.
Diffusion Transformers as Open-World Spatiotemporal Foundation Models
Yuan Yuan, Chonghua Han, Jingtao Ding et al.
Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification
Zeqi Ye, Minshuo Chen
Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents
Hao Bai, Yifei Zhou, Li Li et al.
Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis
Hengyuan Cao, Yutong Feng, Biao Gong et al.
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
Gaoyue Zhou, Hengkai Pan, Yann LeCun et al.
Direct Motion Models for Assessing Generated Videos
Kelsey Allen, Carl Doersch, Guangyao Zhou et al.
DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response
Junjue Wang, Weihao Xuan, Heli Qi et al.
DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction
Rudy Morel, Jiequn Han, Edouard Oyallon
Discovering Latent Causal Graphs from Spatiotemporal Data
Kun Wang, Sumanth Varambally, Duncan Watson-Parris et al.
Discovering Opinion Intervals from Conflicts in Signed Graphs
Peter Blohm, Florian Chen, Aristides Gionis et al.
Discovering Temporally Compositional Neural Manifolds with Switching Infinite GPFA
Changmin Yu, Maneesh Sahani, Máté Lengyel
Disentangling 3D Animal Pose Dynamics with Scrubbed Conditional Latent Variables
Joshua Wu, Hari Koneru, James Ravenel et al.
Distil-E2D: Distilling Image-to-Depth Priors for Event-Based Monocular Depth Estimation
Jie Long Lee, Gim Hee Lee
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs
Jongwoo Ko, Tianyi Chen, Sungnyun Kim et al.
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Hanlin Yang, Jian Yao, Weiming Liu et al.
Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity
Zhenglin Wan, Xingrui Yu, David Bossens et al.
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning
Zican Hu, Wei Liu, Xiaoye Qu et al.
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis
Yinghao Li, Rithesh Kumar, Zeyu Jin
Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?
BOSHEN XU, Ziheng Wang, Yang Du et al.
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue, Zhiqi Chen, Rui Lu et al.
Does Stochastic Gradient really succeed for bandits?
Dorian Baudry, Emmeran Johnson, Simon Vary et al.
Don’t Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu guo, Zhi Zheng, Donghao Ying et al.
Doodle to Detect: A Goofy but Powerful Approach to Skeleton-based Hand Gesture Recognition
Sang Han, Seonho Lee, Hyeok Nam et al.
DOVTrack: Data-Efficient Open-Vocabulary Tracking
Zekun Qian, Ruize Han, Zhixiang Wang et al.
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.
Dual-Path Temporal Decoder for End-to-End Multi-Object Tracking
Hyunseop Kim, Juheon Jeong, Hanul Kim et al.
Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning
Ankan Deria, Adinath Dukre, feilong tang et al.
DyCAST: Learning Dynamic Causal Structure from Time Series
Yue Cheng, Bochen Lyu, Weiwei Xing et al.
DyMoDreamer: World Modeling with Dynamic Modulation
Boxuan Zhang, Runqing Wang, Wei Xiao et al.
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation
Zihan Wang, Seungjun Lee, Gim Hee Lee
Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks
Andrea Montanari, Pierfrancesco Urbani
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
Xingzhuo Guo, Yu Zhang, Baixu Chen et al.