Most Cited 2025 "microtransactions" Papers
Hierarchical Optimization via LLM-Guided Objective Evolution for Mobility-on-Demand Systems
Yi Zhang, Yushen Long, Yun Ni et al.
Language-Bias-Resilient Visual Question Answering via Adaptive Multi-Margin Collaborative Debiasing
Huanjia Zhu, Shuyuan Zheng, Yishu Liu et al.
Improved Representation Steering for Language Models
Zhengxuan Wu, Qinan Yu, Aryaman Arora et al.
When Does Curriculum Learning Help? A Theoretical Perspective
Raman Arora, Yunjuan Wang, Kaibo Zhang
Reframing Gaussian Splatting Densification with Complexity-Density Consistency of Primitives
Zhemeng Dong, Junjun Jiang, Youyu Chen et al.
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Xunzhi Zheng, Dan Xu
HyperMixup: Hypergraph-Augmented with Higher-order Information Mixup
Kaixuan Yao, Zhuo Li, Jianqing Liang et al.
Adversary Aware Optimization for Robust Defense
Daniel Wesego, Pedram Rooshenas
Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits
Yuxuan Han, Jose Blanchet, Zhengyuan Zhou
Towards Generalizable Multi-Policy Optimization with Self-Evolution for Job Scheduling
Inguk Choi, Woo-Jin Shin, Sang-Hyun Cho et al.
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Changan Chen, Juze Zhang, Shrinidhi Kowshika Lakshmikanth et al.
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection
Wenqiao Li, Yao Gu, Xintao Chen et al.
GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation
Sohyun Lee, Yeho Gwon, Lukas Hoyer et al.
R$^2$ec: Towards Large Recommender Models with Reasoning
Runyang You, Yongqi Li, Xinyu Lin et al.
Scalable Neural Network Geometric Robustness Validation via Hölder Optimisation
Yanghao Zhang, Panagiotis Kouvaros, Alessio Lomuscio
Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance
Ya-Wei Eileen Lin, Ronald Coifman, Gal Mishne et al.
SoftShadow: Leveraging Soft Masks for Penumbra-Aware Shadow Removal
Xinrui Wang, Lanqing Guo, Xiyu Wang et al.
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Anjie Liu, Jianhong Wang, Samuel Kaski et al.
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction
Yuan Wang, Yali Li, Lixiang Li et al.
Learned Prefix Caching for Efficient LLM Inference
Dongsheng Yang, Austin Li, Kai Li et al.
Just Dance with pi! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection
Snehashis Majhi, Giacomo D'Amicantonio, Antitza Dantcheva et al.
Toward Robust Neural Reconstruction from Sparse Point Sets
Amine Ouasfi, Shubhendu Jena, Eric Marchand et al.
Antidistillation Sampling
Yash Savani, Asher Trockman, Zhili Feng et al.
Coarse-to-fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning
Younggyo Seo, Pieter Abbeel
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals
Changhao Peng
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control
Basim Azam, Naveed Akhtar
Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Tianyu Hu, Zhen Tan, Song Wang et al.
Greedy Algorithms for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins, Yunzong Xu, Shiliang Zuo
Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations
Zican Dong, Han Peng, Peiyu Liu et al.
Small Resamples, Sharp Guarantees: Convergence Rates for Resampled Studentized Quantile Estimators
Imon Banerjee, Sayak Chakrabarty
OffsetOPT: Explicit Surface Reconstruction without Normals
Huan Lei
Event-based HDR Structured Light
Jiacheng Fu, Yue Li, Xin Dong et al.
Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation
Badi Li, Ren-Jie Lu, Yu Zhou et al.
HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions
Rafael Bischof, Michal Piovarci, Michael Kraus et al.
Online Task-Free Continual Learning via Dynamic Expansionable Memory Distribution
Fei Ye, Adrian Bors
Cancer Survival Analysis via Zero-shot Tumor Microenvironment Segmentation on Low-resolution Whole Slide Pathology Images
Jiao Tang, Wei Shao, Daoqiang Zhang
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
Riku Murai, Eric Dexheimer, Andrew J. Davison
MLEP: Multi-granularity Local Entropy Patterns for Generalized AI-generated Image Detection
Lin Yuan, Xiaowan Li, Yan Zhang et al.
Strategic Costs of Perceived Bias in Fair Selection
L. Elisa Celis, Lingxiao Huang, Milind Sohoni et al.
Skrull: Towards Efficient Long Context Fine-tuning through Dynamic Data Scheduling
Hongtao Xu, Wenting Shen, Yuanxin Wei et al.
Reconstruction and Secrecy under Approximate Distance Queries
Shay Moran, Elizaveta Nesterova
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
Xuewu Lin, Tianwei Lin, Alan Huang et al.
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
Haijie Li, Yanmin Wu, Jiarui Meng et al.
Cyclic Counterfactuals under Shift–Scale Interventions
Saptarshi Saha, Dhruv Rathore, Utpal Garain
TOMCAT: Test-time Comprehensive Knowledge Accumulation for Compositional Zero-Shot Learning
Xudong Yan, Songhe Feng
DiffE2E: Rethinking End-to-End Driving with a Hybrid Diffusion-Regression-Classification Policy
Rui Zhao, Yuze Fan, Ziguo Chen et al.
Image Token Matters: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing
Weixing Wang, Zifeng Ding, Jindong Gu et al.
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression
Lucas Relic, Roberto Azevedo, Yang Zhang et al.
Active Hyperspectral Imaging Using an Event Camera
Bohan Yu, Jinxiu Liang, Zhuofeng Wang et al.
Automated Proof of Polynomial Inequalities via Reinforcement Learning
Banglong Liu, Niuniu Qi, Xia Zeng et al.
Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models
Yue Wang, Qiuzhi Liu, Jiahao Xu et al.
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
Yudong Han, Qingpei Guo, Liyuan Pan et al.
Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions
Zhaoxian Wu, Quan Xiao, Tayfun Gokmen et al.
Easy-editable Image Vectorization with Multi-layer Multi-scale Distributed Visual Feature Embedding
Ye Chen, Zhangli Hu, Zhongyin Zhao et al.
How to Merge Your Multimodal Models Over Time?
Sebastian Dziadzio, Vishaal Udandarao, Karsten Roth et al.
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
WonJun Moon, MinSeok Jung, Gilhan Park et al.
Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression
Jie Liu, Tiexin Qin, Hui Liu et al.
Can Generative Video Models Help Pose Estimation?
Ruojin Cai, Jason Y. Zhang, Philipp Henzler et al.
Large Language Models Miss the Multi-agent Mark
Emanuele La Malfa, Gabriele La Malfa, Samuele Marro et al.
Pairwise Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda, Masanori Koyama, Jinzhe Zhang et al.
Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints
Yan Dai, Negin Golrezaei, Patrick Jaillet
Learning-Augmented Algorithms for $k$-median via Online Learning
Anish Hebbar, Rong Ge, Amit Kumar et al.
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou, Xinxin Zuo, Rui Ma et al.
AniGrad: Anisotropic Gradient-Adaptive Sampling for 3D Reconstruction From Monocular Video
Noah Stier, Alex Rich, Pradeep Sen et al.
Shadow Generation Using Diffusion Model with Geometry Prior
Haonan Zhao, Qingyang Liu, Xinhao Tao et al.
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
Guo Chen, Zhiqi Li, Shihao Wang et al.
Increasing the Utility of Synthetic Images through Chamfer Guidance
Nicola Dall'Asen, Xiaofeng Zhang, Reyhane Askari Hemmat et al.
Learning Relative Gene Expression Trends from Pathology Images in Spatial Transcriptomics
Kazuya Nishimura, Haruka Hirose, Ryoma Bise et al.
A Bayesian Approach to Contextual Dynamic Pricing using the Proportional Hazards Model with Discrete Price Data
Dongguen Kim, Young-Geun Choi, Minwoo Chae
Bridging the Gap Between Cross-Domain Theory and Practical Application: A Case Study on Molecular Dissolution
Sihan Wang, Wenjie Du, Qing Zhu et al.
Domain Adaptive Hashing Retrieval via VLM Assisted Pseudo-Labeling and Dual Space Adaptation
Jingyao Li, Zhanshan Li, Shuai Lü
VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning
Haoran Xu, Peixi Peng, Guang Tan et al.
Scaling Epidemic Inference on Contact Networks: Theory and Algorithms
Guanghui Min, Yinhan He, Chen Chen
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
Fernando Julio Cendra, Kai Han
MIX: A Multi-view Time-Frequency Interactive Explanation Framework for Time Series Classification
Viet-Hung Tran, Ngoc Phu Doan, Zichi Zhang et al.
CDFlow: Building Invertible Layers with Circulant and Diagonal Matrices
Xuchen Feng, Siyu Liao
Hamiltonian Neural PDE Solvers through Functional Approximation
Anthony Zhou, Amir Barati Farimani
Pay Attention to Small Weights
Chao Zhou, Tom Jacobs, Advait Gadhikar et al.
MiNT: Multi-Network Transfer Benchmark for Temporal Graph Learning
Kiarash Shamsi, Tran Gia Bao Ngo, Razieh Shirzadkhani et al.
The Rich and the Simple: On the Implicit Bias of Adam and SGD
Bhavya Vasudeva, Jung Lee, Vatsal Sharan et al.
Quartet: Native FP4 Training Can Be Optimal for Large Language Models
Roberto Castro, Andrei Panferov, Rush Tabesh et al.
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?
Yuechen Xie, Jie Song, Huiqiong Wang et al.
On Evaluating LLM Alignment by Evaluating LLMs as Judges
Yixin Liu, Pengfei Liu, Arman Cohan
Learned Image Compression with Dictionary-based Entropy Model
Jingbo Lu, Leheng Zhang, Xingyu Zhou et al.
DreamTrack: Dreaming the Future for Multimodal Visual Object Tracking
Mingzhe Guo, Weiping Tan, Wenyu Ran et al.
Agnostic Active Learning Is Always Better Than Passive Learning
Steve Hanneke
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR
Xugong Qin, Peng Zhang, Jun Jie Ou Yang et al.
TransferTraj: A Vehicle Trajectory Learning Model for Region and Task Transferability
Tonglong Wei, Yan Lin, Zeyu Zhou et al.
REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders
Savya Khosla, Sethuraman T V, Barnett Lee et al.
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Bingjie Gao, Xinyu Gao, Xiaoxue Wu et al.
Volume Tells: Dual Cycle-Consistent Diffusion for 3D Fluorescence Microscopy De-noising and Super-Resolution
Zelin Li, Chenwei Wang, Zhaoke Huang et al.
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining
Shanglin Liu, Jianming Lv, Jingdan Kang et al.
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning
Jeongryong Lee, Yejee Shin, Geonhui Son et al.
AF-UMC: An Alignment-Free Fusion Framework for Unaligned Multi-View Clustering
Bohang Sun, Yuena Lin, Tao Yang et al.
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning
Haolong Yan, Yeqing Shen, Xin Huang et al.
Semi-infinite Nonconvex Constrained Min-Max Optimization
Cody Melcher, Zeinab Alizadeh, Lindsey Hiett et al.
Structure Matters: Dynamic Policy Gradient
Sara Klein, Xiangyuan Zhang, Tamer Basar et al.
Two Heads are Better than One: Simulating Large Transformers with Small Ones
Hantao Yu, Josh Alman
3DPE-Gaze: Unlocking the Potential of 3D Facial Priors for Generalized Gaze Estimation
Yangshi Ge, Yiwei Bao, Feng Lu
FlowRefiner: A Robust Traffic Classification Framework against Label Noise
Mingwei Zhan, Ruijie Zhao, Xianwen Deng et al.
Collapsing Taylor Mode Automatic Differentiation
Felix Dangel, Tim Siebert, Marius Zeinhofer et al.
ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation
Zirun Guo, Tao Jin
Neural MJD: Neural Non-Stationary Merton Jump Diffusion for Time Series Prediction
Yuanpei Gao, Qi Yan, Yan Leng et al.
Discrete Neural Flow Samplers with Locally Equivariant Transformer
Zijing Ou, Ruixiang Zhang, Yingzhen Li
Training Robust Graph Neural Networks by Modeling Noise Dependencies
Yeonjun In, Kanghoon Yoon, Sukwon Yun et al.
Track3R: Joint Point Map and Trajectory Prior for Spatiotemporal 3D Understanding
Seong Hyeon Park, Jinwoo Shin
Ambient Proteins - Training Diffusion Models on Noisy Structures
Giannis Daras, Jeffrey Ouyang-Zhang, Krithika Ravishankar et al.
POCO: Scalable Neural Forecasting through Population Conditioning
Yu Duan, Hamza Chaudhry, Misha B Ahrens et al.
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset
Zhiheng Xi, Guanyu Li, Yutao Fan et al.
Information-Computation Tradeoffs for Noiseless Linear Regression with Oblivious Contamination
Ilias Diakonikolas, Chao Gao, Daniel Kane et al.
The Structure of Relation Decoding Linear Operators in Large Language Models
Miranda Anna Christ, Adrián Csiszárik, Gergely Becsó et al.
Overcoming Long Context Limitations of State Space Models via Context Dependent Sparse Attention
Zhihao Zhan, Jianan Zhao, Zhaocheng Zhu et al.
Once Upon an Input: Reasoning via Per-Instance Program Synthesis
Adam Stein, Neelay Velingker, Mayur Naik et al.
Non-rectangular Robust MDPs with Normed Uncertainty Sets
Navdeep Kumar, Adarsh Gupta, Maxence Mohamed Elfatihi et al.
Condensing Action Segmentation Datasets via Generative Network Inversion
Guodong Ding, Rongyu Chen, Angela Yao
Automaton Constrained Q-Learning
Anastasios Manganaris, Vittorio Giammarino, Ahmed Qureshi
T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving
Changsheng Lv, Mengshi Qi, Liang Liu et al.
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Xuweiyi Chen, Markus Marks, Zezhou Cheng
MESC-3D: Mining Effective Semantic Cues for 3D Reconstruction from a Single Image
Shaoming Li, Qing Cai, Songqi Kong et al.
IPFormer: Visual 3D Panoptic Scene Completion with Context-Adaptive Instance Proposals
Markus Gross, Aya Fahmy, Danit Niwattananan et al.
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Xiaozhong Ji, Xiaobin Hu, Zhihong Xu et al.
Bandit Guided Submodular Curriculum for Adaptive Subset Selection
Prateek Chanda, Prayas Agrawal, Saral Sureka et al.
RobSense: A Robust Multi-modal Foundation Model for Remote Sensing with Static, Temporal, and Incomplete Data Adaptability
Minh Kha Do, Kang Han, Phu Lai et al.
Towards Precise Scaling Laws for Video Diffusion Transformers
Yuanyang Yin, Yaqi Zhao, Mingwu Zheng et al.
Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the Role of Model Complexity
Mouïn Ben Ammar, David Brellmann, Arturo Mendoza et al.
OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects
Mark H. Huang, Lin Geng Foo, Christian Theobalt et al.
Strategic Classification with Non-Linear Classifiers
Benyamin Trachtenberg, Nir Rosenfeld
PCM: Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So, Jiwoong Shin, Chaeyeon Jang et al.
MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning
Seong-Hyeon Hwang, Soyoung Choi, Steven Whang
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways
Yi Liu, Hao Zhou, Benlei Cui et al.
Caption This, Reason That: VLMs Caught in the Middle
Zihan Weng, Lucas Gomez, Taylor Webb et al.
Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain
Trinity Chung, Yuchen Shen, Nathan Kong et al.
On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He, Xiang Li, Tianqi Shang et al.
Exploiting Deblurring Networks for Radiance Fields
Haeyun Choi, Heemin Yang, Janghyeok Han et al.
MIHC: Multi-View Interpretable Hypergraph Neural Networks with Information Bottleneck for Chip Congestion Prediction
Zeyue Zhang, Heng Ping, Peiyu Zhang et al.
Multi-Expert Distributionally Robust Optimization for Out-of-Distribution Generalization
Jinyong Jeong, Hyungu Kahng, Seoung Bum Kim
Meta-D2AG: Causal Graph Learning with Interventional Dynamic Data
Tian Gao, Songtao Lu, Junkyu Lee et al.
Glance2Gaze: Efficient Vision-Language Models from Glance Fusion to Gaze Compression
Juan Chen, Honglin Liu, Yingying Ao et al.
Program Synthesis via Test-Time Transduction
Kang-il Lee, Jahyun Koo, Seunghyun Yoon et al.
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Pei Yang, Hai Ci, Mike Zheng Shou
NoiseCtrl: A Sampling-Algorithm-Agnostic Conditional Generation Method for Diffusion Models
Longquan Dai, He Wang, Jinhui Tang
FIGRDock: Fast Interaction-Guided Regression for Flexible Docking
Shikun Feng, Bicheng Lin, Yuanhuan Mo et al.
Understanding and Improving Fast Adversarial Training against $l_0$ Bounded Perturbations
Xuyang Zhong, Yixiao Huang, Chen Liu
Spiking Transformer: Introducing Accurate Addition-Only Spiking Self-Attention for Transformer
Yufei Guo, Xiaode Liu, Yuanpei Chen et al.
Opinion Maximization in Social Networks by Modifying Internal Opinions
Gengyu Wang, Runze Zhang, Zhongzhi Zhang
Enhancing Deep Batch Active Learning for Regression with Imperfect Data Guided Selection
Yinjie Min, Furong Xu, Xinyao Li et al.
Who Reasons in the Large Language Models?
Jie Shao, Jianxin Wu
Enhancing Interpretability in Deep Reinforcement Learning through Semantic Clustering
Liang Zhang, Justin Lieffers, Adarsh Pyarelal
SINR: Sparsity Driven Compressed Implicit Neural Representations
Dhananjaya Jayasundara, Sudarshan Rajagopalan, Yasiru Ranasinghe et al.
Strategic Hypothesis Testing
Yatong Chen, Safwan Hossain, Yiling Chen
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin, Cheng-En Wu, Huanran Li et al.
CH3Depth: Efficient and Flexible Depth Foundation Model with Flow Matching
Jiaqi Li, Yiran Wang, Jinghong Zheng et al.
HetSyn: Versatile Timescale Integration in Spiking Neural Networks via Heterogeneous Synapses
Zhichao Deng, Zhikun Liu, Junxue Wang et al.
Towards General Continuous Memory for Vision-Language Models
Wenyi Wu, Zixuan Song, Kun Zhou et al.
Advancing Adversarial Robustness in GNeRFs: The IL2-NeRF Attack
Nicole Meng, Caleb Manicke, Ronak Sahu et al.
Metric Automata Theory: A Unifying Theory of RNNs
Adam Dankowiakowski, Alessandro Ronca
PINNs with Learnable Quadrature
Sourav Pal, Kamyar Azizzadenesheli, Vikas Singh
Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework
Hanrui Zhao, Niuniu Qi, Mengxin Ren et al.
Coreset for Robust Geometric Median: Eliminating Size Dependency on Outliers
Ziyi Fang, Lingxiao Huang, Runkai Yang
Selftok-Zero: Reinforcement Learning for Visual Generation via Discrete and Autoregressive Visual Tokens
Bohan Wang, Mingze Zhou, Zhongqi Yue et al.
Learning-Augmented Facility Location Mechanisms for the Envy Ratio Objective
Haris Aziz, Yuhang Guo, Alexander Lam et al.
Dataset Distillation of 3D Point Clouds via Distribution Matching
Jae-Young Yim, Dongwook Kim, Jae-Young Sim
CheXwhatsApp: A Dataset for Exploring Challenges in the Diagnosis of Chest X-rays through Mobile Devices
Mariamma Antony, Rajiv Porana, Sahil M. Lathiya et al.
Aligning Text-to-Image Diffusion Models to Human Preference by Classification
Longquan Dai, Xiaolu Wei, Wang He et al.
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction
Eduard Poesina, Adriana Valentina Costache, Adrian-Gabriel Chifu et al.
The Price of Sparsity: Sufficient Conditions for Sparse Recovery using Sparse and Sparsified Measurements
Youssef Chaabouni, David Gamarnik
CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D
Francis Ward, Teun van der Weij, Hanna Gábor et al.
PROFIT: A Specialized Optimizer for Deep Fine Tuning
Anirudh Chakravarthy, Shuai Zheng, Xin Huang et al.
Towards Autonomous Micromobility through Scalable Urban Simulation
Wayne Wu, Honglin He, Chaoyuan Zhang et al.
REINFORCE Converges to Optimal Policies with Any Learning Rate
Samuel Robertson, Thang Chu, Bo Dai et al.
DiskVPS: Vanishing Point Detector via Hough Transform in a Disk Region
Jianping Wu
Sea-ing in Low-light
Nisha Varghese, A. N. Rajagopalan
Energy Landscape-Aware Vision Transformers: Layerwise Dynamics and Adaptive Task-Specific Training via Hopfield States
Runze Xia, Richard Jiang
Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation
Weining Ren, Hongjun Wang, Xiao Tan et al.
LAL: Enhancing 3D Human Motion Prediction with Latency-aware Auxiliary Learning
Xiaoning Sun, Dong Wei, Huaijiang Sun et al.
Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding
Pei-Shuo Wang, Jian-Jia Chen, Chun-Che Yang et al.
EventPSR: Surface Normal and Reflectance Estimation from Photometric Stereo Using an Event Camera
Bohan Yu, Jin Han, Boxin Shi et al.
DroneAudioset: An Audio Dataset for Drone-based Search and Rescue
Chitralekha Gupta, Soundarya Ramesh, Praveen Sasikumar et al.
The Emergence of Abstract Thought in Large Language Models Beyond Any Language
Yuxin Chen, Yiran Zhao, Yang Zhang et al.
Structure-from-Motion with a Non-Parametric Camera Model
Yihan Wang, Linfei Pan, Marc Pollefeys et al.
Uncertainty Quantification for Deep Regression using Contextualised Normalizing Flows
Adriel Sosa Marco, John D. Kirwan, Alexia Toumpa et al.
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen, Xiangtai Li, Yining Li et al.
PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning
Xiaogang Jia, Qian Wang, Anrui Wang et al.
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning
Ren Wang, Haoliang Sun, Yuxiu Lin et al.
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Pengcheng Xu, Boyuan Jiang, Xiaobin Hu et al.
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning
Zijian Gao, Wangwang Jia, Xingxing Zhang et al.
Distilling Long-tailed Datasets
Zhenghao Zhao, Haoxuan Wang, Yuzhang Shang et al.
Towards Understanding Transformers in Learning Random Walks
Wei Shi, Yuan Cao
Causal Discovery over Clusters of Variables in Markovian Systems
Tara Anand, Adèle Ribeiro, Jin Tian et al.
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Dongliang Luo, Hanshen Zhu, Ziyang Zhang et al.
ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism
Zedong Liu, Shenggan Cheng, Guangming Tan et al.
Make Information Diffusion Explainable: LLM-based Causal Framework for Diffusion Prediction
Wenbo Shang, Zihan Feng, Yang Yajun et al.
Causal Differentiating Concepts: Interpreting LM Behavior via Causal Representation Learning
Navita Goyal, Hal Daumé III, Alexandre Drouin et al.
LBMKGC: Large Model-Driven Balanced Multimodal Knowledge Graph Completion
Yuan Guo, Qian Ma, Hui Li et al.
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Bingda Tang, Sayak Paul, Boyang Zheng et al.
Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs
Mianchu Wang, Giovanni Montana
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
Shaojie Zhang, Ruoceng Zhang, Pei Fu et al.
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
Yixiao Huang, Hanlin Zhu, Tianyu Guo et al.
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Kai Wang, Zekai Li, Zhi-Qi Cheng et al.
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
Xiangtao Zhang, Sheng Li, Ao Li et al.