Most Cited 2025 "6-dof manipulation" Papers
22,274 papers found • Page 110 of 112
Conference
CVPT: Cross Visual Prompt Tuning
Lingyun Huang, Jianxu Mao, Junfei YI et al.
DDB: Diffusion Driven Balancing to Address Spurious Correlations
Aryan Yazdan Parast, Basim Azam, Naveed Akhtar
Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories
Jingqiao Xiu, Yicong Li, Na Zhao et al.
AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation
Guanxing Lu, Tengbo Yu, Haoyuan Deng et al.
FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation
Wenbin Teng, Gonglin Chen, Haiwei Chen et al.
CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance
Zheng Ziqiang, Wong Kwan, Binh-Son Hua et al.
Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space
Yingping Liang, Yutao Hu, Wenqi Shao et al.
Diagnosing Pretrained Models for Out-of-distribution Detection
Haipeng Xiong, Kai Xu, Angela Yao
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
Jinhong Ni, Chang-Bin Zhang, Qiang Zhang et al.
Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering
Qing Li, Huifang Feng, Xun Gong et al.
TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks
Xiang Meng, Mehdi Makni, Rahul Mazumder
Bayesian-Inspired Space-Time Superpixels
Kent Gauen, Stanley Chan
INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception
yunjiang xu, Yupeng Ouyang, Lingzhi Li et al.
Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification
Mingyang Liu, Xinyang Chen, Yang Shu et al.
PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing
Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin et al.
Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts
Mingqi Fang, Ziguang Li, Lingyun Yu et al.
Information-Bottleneck Driven Binary Neural Network for Change Detection
Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.
Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment
Renye Yan, Jikang Cheng, Yaozhong Gan et al.
Time-Aware Auto White Balance in Mobile Photography
Mahmoud Afifi, Luxi Zhao, Abhijith Punnappurath et al.
Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation
Xueqing Deng, Linjie Yang, Qihang Yu et al.
ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition
Ronggang Huang, Haoxin Yang, Yan Cai et al.
Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer
Yuansheng Li, Yunhao Zou, Linwei Chen et al.
VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition
Shuting Dong, Mingzhi Chen, Feng Lu et al.
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors
Fan Nie, Lan Feng, Haotian Ye et al.
GUAVA: Generalizable Upper Body 3D Gaussian Avatar
Dongbin Zhang, Yunfei Liu, Lijian Lin et al.
HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation
Chenzhong Gao, Wei Li, Desheng Weng
GSOT3D: Towards Generic 3D Single Object Tracking in the Wild
Yifan Jiao, Yunhao Li, Junhua Ding et al.
Locally Optimal Private Sampling: Beyond the Global Minimax
Hrad Ghoukasian, Bonwoo Lee, Shahab Asoodeh
Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Yehao Lu, Minghe Weng, Zekang Xiao et al.
WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image
Jiwoo Park, Tae Choi, Youngjun Jun et al.
DEGauss: Defending Against Malicious 3D Editing for Gaussian Splatting
Lingzhuang Meng, Mingwen Shao, Yuanjian Qiao et al.
Scalable Signature Kernel Computations via Local Neumann Series Expansions
Matthew Tamayo-Rios, Alexander Schell, Rima Alaifari
Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables
Wontae Kim, Keuntek Lee, Nam Ik Cho
Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting
Kangjie Chen, Yingji Zhong, Zhihao Li et al.
Impact of LLM Alignment on Impression Formation in Social Interactions
Ala N. Tak, Anahita Bolourani, Daniel B. Shank et al.
Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues
Xu Cao, Takafumi Taketomi
Enhancing the Maximum Effective Window for Long-Term Time Series Forecasting
Jiahui Zhang, Zhengyang Zhou, Wenjie Du et al.
EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching
Pengjie Zhang, Lin Zhu, Xiao Wang et al.
Where Does It Exist from the Low-Altitude: Spatial Aerial Video Grounding
Yang Zhan, Yuan Yuan
NoWag: A Unified Framework for Shape Preserving Com- pression of Large Language Models
Lawrence Ray Liu, Inesh Chakrabarti, Yixiao Li et al.
DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration
Tianteng Gu, Bei Liu, Bo Xiao et al.
CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds
Feng Yang, Yichao Cao, Xiu Su et al.
KScope: A Framework for Characterizing the Knowledge Status of Language Models
Yuxin Xiao, Shan Chen, Jack Gallifant et al.
Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds
Weihong Pan, Xiaoyu Zhang, Hongjia Zhai et al.
Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Yunqi Miao, Zhiyu Qu, Mingqi Gao et al.
Beyond Node-Centric Modeling: Sketching Signed Networks with Simplicial Complexes
Wei Wu, Xuan Tan, Yan Peng et al.
Implicit Counterfactual Learning for Audio-Visual Segmentation
Mingfeng Zha, Tianyu Li, Guoqing Wang et al.
Train on Pins and Test on Obstacles for Rectilinear Steiner Minimum Tree
Xingbo Du, Ruizhe Zhong, Junchi Yan
STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints
Xiaohang Yang, Qing Wang, Jiahao Yang et al.
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities
Haoning Wu, Ziheng Zhao, Ya Zhang et al.
Towards Reliable LLM-based Robots Planning via Combined Uncertainty Estimation
Shiyuan Yin, Chenjia Bai, Zihao Zhang et al.
Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace
Dexin Duan, Rui Xu, Peilin Liu et al.
Rethink Sparse Signals for Pose-guided Text-to-image Generation
Wenjie Xuan, Jing Zhang, Juhua Liu et al.
Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?
Yihao Li, Saeed Salehi, Lyle Ungar et al.
Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras
Petr Hruby, Marc Pollefeys
Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching
Zhankai Li, Weiping Wang, jie li et al.
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild
Jiaying Ying, Heming Du, Kaihao Zhang et al.
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Ahmed Nassar, Matteo Omenetti, Maksym Lysak et al.
Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation
Fan Li, Xuanbin Wang, Xuan Wang et al.
ContextFace: Generating Facial Expressions from Emotional Contexts
minjung kim, Minsang Kim, Seung Jun Baek
SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout
Wen Yang, Guodong Liu, Di Ming
Spatial-Temporal Forgery Trace based Forgery Image Identification
Yilin Wang, Zunlei Feng, Jiachi Wang et al.
Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection
Xiaoxiao Wang, Chunxiao Li, Peng Sun et al.
Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter
JianHui Zhang, Shen Cheng, Qirui Sun et al.
Agreement aware and dissimilarity oriented GLOM
Ru Zeng, Yan Song, Yang ZHANG et al.
Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Yue Guan, Changming Yu, Shihan Fang et al.
MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans
Ran Zhao, Xinxin Dai, Pengpeng Hu et al.
DiMPLe - Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation
Umaima Rahman, Mohammad Yaqub, Dwarikanath Mahapatra
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan, Fabian Caba Heilbron, Bernard Ghanem et al.
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.
Randomized Autoregressive Visual Generation
Qihang Yu, Ju He, Xueqing Deng et al.
Unsupervised RGB-D Point Cloud Registration for Scenes with Low Overlap and Photometric Inconsistency
yejun Shou, Haocheng Wang, Lingfeng Shen et al.
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision
Ayush Gupta, Anirban Roy, Rama Chellappa et al.
Restricted Global-Aware Graph Filters Bridging GNNs and Transformer for Node Classification
Jingyuan Zhang, Xin Wang, Lei Yu et al.
A Beyond-Worst-Case Analysis of Greedy k-means++
Qingyun Chen, Sungjin Im, Ben Moseley et al.
Beyond Blanket Masking: Examining Granularity for Privacy Protection in Images Captured by Blind and Low Vision Users
Jeffri Murrugarra-Llerena, Haoran Niu, K. Suzanne Barber et al.
Training-free Geometric Image Editing on Diffusion Models
Hanshen Zhu, Zhen Zhu, Kaile Zhang et al.
BlurDM: A Blur Diffusion Model for Image Deblurring
Jin-Ting He, Fu-Jen Tsai, Yan-Tsung Peng et al.
Monocular Facial Appearance Capture in the Wild
Yingyan Xu, Kate Gadola, Prashanth Chandran et al.
Growing a Twig to Accelerate Large Vision-Language Models
Zhenwei Shao, Mingyang Wang, Zhou Yu et al.
SignRep: Enhancing Self-Supervised Sign Representations
Ryan Wong, Necati Cihan Camgoz, Richard Bowden
MixA: A Mixed Attention approach with Stable Lightweight Linear Attention to enhance Efficiency of Vision Transformers at the Edge
Sabbir Ahmed, Jingtao Li, Weiming Zhuang et al.
Nearly-Linear Time and Massively Parallel Algorithms for $k$-anonymity
Kevin Aydin, Honghao Lin, David Woodruff et al.
What do you know? Bayesian knowledge inference for navigating agents
Matthias Schultheis, Jana-Sophie Schönfeld, Constantin Rothkopf et al.
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
Junyuan Zhang, Qintong Zhang, Bin Wang et al.
Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion
Quanmin Liang, Qiang Li, Shuai Liu et al.
Head2Body: Body Pose Generation from Multi-sensory Head-mounted Inputs
Minh Tran, Hongda Mao, Qingshuang Chen et al.
Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models
Townim Chowdhury, Vu Phan, Kewen Liao et al.
FLSeg: Enhancing Privacy and Robustness in Federated Learning under Heterogeneous Data via Model Segmentation
Zichun Su, Zhi Lu, Yutong Wu et al.
Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction
Youming Deng, Wenqi Xian, Guandao Yang et al.
Gradient Decomposition and Alignment for Incremental Object Detection
Wenlong Luo, Shizhou Zhang, De Cheng et al.
MSQ: Memory-Efficient Bit Sparsification Quantization
Seokho Han, Seoyeon Yoon, Jinhee Kim et al.
ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling
Shuyuan Zhang, ChenHan Jiang, Zuoou Li et al.
Gate to the Vessel: Residual Experts Restore What SAM Overlooks
Weili Jiang, Jinrong Lv, Xun Gong et al.
When and Where do Data Poisons Attack Textual Inversion?
Jeremy Styborski, Mingzhi Lyu, Jiayou Lu et al.
TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model
Yichen Liu, Yan Lin, Shengnan Guo et al.
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement
Liwen Xiao, Zhiyu Pan, Zhicheng Wang et al.
Spike-RetinexFormer: Rethinking Low-light Image Enhancement with Spiking Neural Networks
Hongzhi Wang, Xiubo Liang, Jinxing Han et al.
Rethinking Few Shot CLIP Benchmarks: A Critical Analysis in the Inductive Setting
Alexey Kravets, Da Chen, Vinay Namboodiri
HiMoLE: Towards OOD-Robust LoRA via Hierarchical Mixture of Experts
Yinuo Jiang, Yan Xiaodong, Keyan Ding et al.
AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation
Hao Li, Ju Dai, Feng Zhou et al.
BokehDiff: Neural Lens Blur with One-Step Diffusion
Chengxuan Zhu, Qingnan Fan, Qi Zhang et al.
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Jiaming Han, Hao Chen, Yang Zhao et al.
Trial-Oriented Visual Rearrangement
Yuyi Liu, Xinhang Song, Tianliang Qi et al.
Debiased Teacher for Day-to-Night Domain Adaptive Object Detection
Yiming Cui, Liang Li, Haibing YIN et al.
SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility
Guobin Shen, Jindong Li, Tenglong Li et al.
Social Debiasing for Fair Multi-modal LLMs
Harry Cheng, Yangyang Guo, Qingpei Guo et al.
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval
Zhe Li, Lei Zhang, Zheren Fu et al.
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
Xi Chen, Kaituo Feng, Changsheng Li et al.
UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis
Zixiang Ai, Zhenyu Cui, Yuxin Peng et al.
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions
Aggelina Chatziagapi, Louis-Philippe Morency, Hongyu Gong et al.
Probabilistic Inertial Poser (ProbIP): Uncertainty-aware Human Motion Modeling from Sparse Inertial Sensors
Min Kim, Younho Jeon, Sungho Jo
Principled Model Routing for Unknown Mixtures of Source Domains
Christoph Dann, Yishay Mansour, Teodor Vanislavov Marinov et al.
SFUOD: Source-Free Unknown Object Detection
Keon-Hee Park, Seun-An Choe, Gyeong-Moon Park
Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal
Jinpei Guo, Zheng Chen, Wenbo Li et al.
ConstStyle: Robust Domain Generalization with Unified Style Transformation
Nam Duong Tran, Nam Nguyen Phuong, Hieu Pham et al.
Golden Noise for Diffusion Models: A Learning Framework
zikai zhou, Shitong Shao, Lichen Bai et al.
Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation
Yukuan Min, Muli Yang, Jinhao Zhang et al.
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM
Jinhong Wang, Shuo Tong, Jintai CHEN et al.
Unified Open-World Segmentation with Multi-Modal Prompts
Yang Liu, Yufei Yin, Chenchen Jing et al.
LayerAnimate: Layer-level Control for Animation
Yuxue Yang, Lue Fan, Zuzeng Lin et al.
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie, Xinyu Liu, Rohan Chandra et al.
Aligning by Misaligning: Boundary-aware Curriculum Learning for Multimodal Alignment
Hua Ye, Hang Ding, Siyuan Chen et al.
Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion
shengyuan zhang, An Zhao, Ling Yang et al.
SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM
Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger
FedAGC: Federated Continual Learning with Asymmetric Gradient Correction
Chengchao Zhang, Fanhua Shang, Hongying Liu et al.
Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation
Seunghyun Lee, Tae-Kyun Kim
Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization
Ashutosh Anshul, Shreyas Gopal, Deepu Rajan et al.
Purest Quantum State Identification
Yingqi Yu, Honglin Chen, Jun Wu et al.
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models
Ruikang Liu, Yuxuan Sun, Manyi Zhang et al.
MobileODE: An Extra Lightweight Network
Le Yu, Jun Wu, Bo Gou et al.
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval
Zelong Sun, Dong Jing, Zhiwu Lu
The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation
Ho Kei Cheng, Alex Schwing
OmniTry: Virtual Try-On Anything without Masks
Yutong Feng, Linlin Zhang, Hengyuan Cao et al.
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Yue-Jiang Dong, Wang Zhao, Jiale Xu et al.
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Wenjie Zhuo, Fan Ma, Hehe Fan
Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing
Yongxin Guo, Lin Wang, Xiaoying Tang et al.
Overfill: Two-Stage Models for Efficient Language Model Decoding
Woojeong Kim, Junxiong Wang, Jing Nathan Yan et al.
Instance-Level Video Depth in Groups Beyond Occlusions
Yuan Liang, Yang Zhou, Ziming Sun et al.
The Price of Opportunity Fairness in Matroid Allocation Problems
Rémi Castera, Felipe Garrido-Lucero, Patrick Loiseau et al.
Future-Aware Interaction Network For Motion Forecasting
Shijie Li, Chunyu Liu, Xun Xu et al.
DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization
Yukun Huang, Yanning Zhou, Jianan Wang et al.
Effects of Dropout on Performance in Long-range Graph Learning Tasks
Jasraj Singh, Keyue Jiang, Brooks Paige et al.
Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios
Chen-Liang Fan, Mingpei Cao, Chih-Chien Hung et al.
HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection
Fengzhe Zhou, Humphrey Shi
Controlled Visual Hallucination via Thalamus-Driven Decoupling Network for Domain Adaptation of Black-Box Predictors
Yuwu Lu, Chunzhi Liu
MMAD: Multi-label Micro-Action Detection in Videos
Kun Li, pengyu Liu, Dan Guo et al.
Localist Topographic Expert Routing: A Barrel Cortex-Inspired Modular Network for Sensorimotor Processing
Tianfang Zhu, Dongli Hu, Jiandong Zhou et al.
Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs
Wanyun Cui, Mingwei Xu
Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization
Wang Liu, Wei Gao
From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review
Yaohui Zhang, Haijing ZHANG, Wenlong Ji et al.
Auto-Regressive Transformation for Image Alignment
Kanggeon Lee, Soochahn Lee, Kyoung Mu Lee
Training-Free Industrial Defect Generation with Diffusion Models
Ruyi Xu, Yen-Tzu Chiu, Tai-I Chen et al.
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
Ruidong Chen, honglin guo, Lanjun Wang et al.
Explainably Safe Reinforcement Learning
Sabine Rieder, Stefan Pranger, Debraj Chakraborty et al.
CaMiT: A Time-Aware Car Model Dataset for Classification and Generation
Frédéric Lin, Biruk Abere Ambaw, Adrian Popescu et al.
SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly
Wei Zhu, Zhiwen Tang, Kun Yue
Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery
Shayan Shekarforoush, David Lindell, Marcus Brubaker et al.
Connectome-Based Modelling Reveals Orientation Maps in the Drosophila Optic Lobe
Jia Nuo Liew, Shenghan Lin, Bowen Chen et al.
Online Multi-Class Selection with Group Fairness Guarantee
Faraz Zargari, Hossein Jazi, Lyndon Hallett et al.
Majority of the Bests: Improving Best-of-N via Bootstrapping
Amin Rakhsha, Kanika Madan, Tianyu Zhang et al.
Orthogonal Contrastive Learning for Multi-Representation fMRI Analysis
Tony Yousefnezhad
No Object Is an Island: Enhancing 3D Semantic Segmentation Generalization with Diffusion Models
Fan Li, Xuan Wang, Xuanbin Wang et al.
PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts
Yiming Wang, Pei Zhang, Jialong Tang et al.
Learning Interestingness in Automated Mathematical Theory Formation
George Tsoukalas, Rahul Saha, Amitayush Thakur et al.
ChemX: A Collection of Chemistry Datasets for Benchmarking Automated Information Extraction
Anastasia Vepreva, Julia Razlivina, Mariia Eremeyeva et al.
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Ling Fu, Zhebin Kuang, Jiajun Song et al.
A Learning-Augmented Approach to Online Allocation Problems
Ilan Cohen, Debmalya Panigrahi
Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains
Dongzhe Zheng, Wenjie Mei
Dr. RAW: Towards General High-Level Vision from RAW with Efficient Task Conditioning
Wenjun Huang, Ziteng Cui, Yinqiang Zheng et al.
Cognitive Predictive Processing: A Human-inspired Framework for Adaptive Exploration in Open-World Reinforcement Learning
boheng liu, Ziyu Li, Chenghua Duan et al.
Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm
Yang Chen, Menglin Zou, Jiaqi Zhang et al.
RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation
Zixun Wang, Ben Dai
Adaptive Sigmoid Clipping for Balancing the Direction–Magnitude Mismatch Trade-off in Differentially Private Learning
Faeze Moradi Kalarde, Ali Bereyhi, Ben Liang et al.
MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection
shengtian yang, Yue Feng, Yingshi Liu et al.
THD-BAR: Topology Hierarchical Derived Brain Autoregressive Modeling for EEG Generic Representations
Wenchao Yang, Weidong Yan, Wenkang Liu et al.
F-Adapter: Frequency-Adaptive Parameter-Efficient Fine-Tuning in Scientific Machine Learning
Hangwei Zhang, Chun Kang, Yan Wang et al.
Unsupervised Federated Graph Learning
Lele Fu, Tianchi Liao, Sheng Huang et al.
A Closer Look at Graph Transformers: Cross-Aggregation and Beyond
Jiaming Zhuo, Ziyi Ma, Yintong Lu et al.
HypoBootstrap: A Bootstrapping Framework for Inductive Reasoning
Si Chen, Yifei Li, Richong Zhang
Enhancing Consistency of Flow-Based Image Editing through Kalman Control
Haozhe Chi, Zhicheng Sun, Yang Jin et al.
Storyboard-guided Alignment for Fine-grained Video Action Recognition
Enqi Liu, Liyuan Pan, Yan Yang et al.
Local Curvature Descent: Squeezing More Curvature out of Standard and Polyak Gradient Descent
Peter Richtarik, Simone Maria Giancola, Dymitr Lubczyk et al.
DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Hao LU, Tianshuo Xu, Wenzhao Zheng et al.
Shapley-Based Data Valuation for Weighted $k$-Nearest Neighbors
Guangyi Zhang, Qiyu Liu, Aristides Gionis
One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models
Viacheslav Surkov, Chris Wendler, Antonio Mari et al.
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
Taewon Yun, Jihwan Oh, Hyangsuk Min et al.
Diffusion-Guided Graph Data Augmentation
Maria Marrium, Arif Mahmood, Muhammad Haris Khan et al.
Constant Bit-size Transformers Are Turing Complete
Qian Li, Yuyi Wang
Navigating the MIL Trade-Off: Flexible Pooling for Whole Slide Image Classification
Hossein Jafarinia, Danial Hamdi, Amirhossein Alamdar et al.
Which Algorithms Have Tight Generalization Bounds?
Michael Gastpar, Ido Nachum, Jonathan Shafer et al.
RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes
Fang Li, Hao Zhang, Narendra Ahuja
Enhancing Privacy in Multimodal Federated Learning with Information Theory
Tianzhe Xiao, Yichen Li, Yining Qi et al.
Task Vectors in In-Context Learning: Emergence, Formation, and Benefits
Liu Yang, Ziqian Lin, Kangwook Lee et al.
Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization
Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
Junlang Huang, Chen Hao, Li Luo et al.
On the VC dimension of deep group convolutional neural networks
Anna Sepliarskaia, Sophie Langer, Johannes Schmidt-Hieber
SIGMA: Selective Gated Mamba for Sequential Recommendation
Ziwei Liu, Qidong Liu, Yejing Wang et al.
Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators
Albert Matveev, Sanmitra Ghosh, Aamal Hussain et al.
SEMPO: Lightweight Foundation Models for Time Series Forecasting
Hui He, Kun Yi, Yuanchi Ma et al.