Most Cited 2025 "forward matrix deduction" Papers
22,274 papers found • Page 59 of 112
Conference
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness
Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld
SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection
Jingjie Zhang, Hanqun Cao, Zijun Gao et al.
Analog Foundation Models
Julian Büchel, Iason Chalas, Giovanni Acampa et al.
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
Yutong Yin, Zhaoran Wang
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Jiajie Li, Brian Quaranto, Chenhui Xu et al.
EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation
Jiajian Xie, Shengyu Zhang, Mengze Li et al.
Uncertainty modeling for fine-tuned implicit functions
Anna Susmelj, Mael Macuglia, Natasa Tagasovska et al.
cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM
Gabriel Claude Jean Ducrocq, Lukas Grunewald, Sebastian Westenhoff et al.
Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs
Yusheng Zhao, Qixin Zhang, Xiao Luo et al.
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento, Chuan Sheng Foo, See-Kiong Ng
Decentralized Optimization with Coupled Constraints
Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.
The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models
Lijun Sheng, Jian Liang, Ran He et al.
Can a Large Language Model be a Gaslighter?
Wei Li, Luyao Zhu, Yang Song et al.
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Yifei Xing, Xiangyuan Lan, Ruiping Wang et al.
How Much is Unseen Depends Chiefly on Information About the Seen
Seongmin Lee, Marcel Boehme
Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding
Frank Zhengqing Wu, Berfin Simsek, François Ged
FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Tian-Hao Zhang, Jiawei Zhang, Jun Wang et al.
Preference Optimization by Estimating the Ratio of the Data Distribution
Yeongmin Kim, HeeSun Bae, Byeonghu Na et al.
Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions
Lingjie Yi, Michael Yao, Weimin Lyu et al.
The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures
Xiaoyi MAI, Zhenyu Liao
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning
Ruiqi Ni, zherong pan, Ahmed Hussain Qureshi
Adaptive Batch Size for Privately Finding Second-Order Stationary Points
Daogao Liu, Kunal Talwar
VideoLucy: Deep Memory Backtracking for Long Video Understanding
Jialong Zuo, Yongtai Deng, Lingdong Kong et al.
Measuring And Improving Engagement of Text-to-Image Generation Models
Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.
FACTS: A Factored State-Space Framework for World Modelling
Li Nanbo, Firas Laakom, Yucheng XU et al.
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
Yuhao Zhou, Jintao Xu, Bingrui Li et al.
Learning under Temporal Label Noise
Sujay Nagaraj, Walter Gerych, Sana Tonekaboni et al.
$q$-exponential family for policy optimization
Lingwei Zhu, Haseeb Shah, Han Wang et al.
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Zhizheng Liu, Joe Lin, Wayne Wu et al.
Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM
Yongqiang Yao, Jingru Tan, Kaihuan Liang et al.
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh et al.
ProtoSnap: Prototype Alignment For Cuneiform Signs
Rachel Mikulinsky, Morris Alper, Shai Gordin et al.
Equivariant Masked Position Prediction for Efficient Molecular Representation
Junyi An, Chao Qu, Yun-Fei Shi et al.
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li, Peizhong Ju, Ness Shroff
Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning
Bo Yue, Shufan Wang, Ashish Gaurav et al.
Towards a learning theory of representation alignment
Francesco Maria Gabriele Insulla, Shuo Huang, Lorenzo Rosasco
Instance-Level Composed Image Retrieval
Bill Psomas, George Retsinas, Nikos Efthymiadis et al.
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia, Daniel Bramblett, Daksh Dobhal et al.
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob, Lorenzo Sani, Meghdad Kurmanji et al.
ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs
Hao Di, Tong He, Haishan Ye et al.
AdaptGrad: Adaptive Sampling to Reduce Noise
Linjiang Zhou, Chao Ma, Zepeng Wang et al.
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
Yanggan Gu, Yuanyi Wang, Zhaoyi Yan et al.
TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Haowei Lin, Shanda Li, Haotian Ye et al.
The adaptive complexity of parallelized log-concave sampling
Huanjian Zhou, Baoxiang Wang, Masashi Sugiyama
Progressive Parameter Efficient Transfer Learning for Semantic Segmentation
Nan Zhou, Huiqun Wang, Yaoyan Zheng et al.
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Xiangxin Zhou, Yi Xiao, Haowei Lin et al.
Group Ligands Docking to Protein Pockets
Jiaqi Guan, Jiahan Li, Xiangxin Zhou et al.
Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling
Yihong Dong, Ge Li, Xue Jiang et al.
PocketSR: The Super-Resolution Expert in Your Pocket Mobiles
Haoze Sun, Linfeng Jiang, Fan Li et al.
On Quantizing Neural Representation for Variable-Rate Video Coding
Junqi Shi, Zhujia Chen, Hanfei Li et al.
Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization
Tianxu Li, Kun Zhu
DynFrs: An Efficient Framework for Machine Unlearning in Random Forest
Shurong Wang, Zhuoyang Shen, Xinbao Qiao et al.
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark
Bing Cao, Quanhao Lu, Jiekang Feng et al.
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
Xiangru Zhu, Penglei Sun, Yaoxian Song et al.
INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph
Ningyuan Li, Haihong E, Tianyu Yao et al.
InstaRevive: One-Step Image Enhancement via Dynamic Score Matching
Yixuan Zhu, Haolin Wang, Ao Li et al.
On Designing General and Expressive Quantum Graph Neural Networks with Applications to MILP Instance Representation
Xinyu Ye, Hao Xiong, Jianhao Huang et al.
OMG: Opacity Matters in Material Modeling with Gaussian Splatting
Silong Yong, Venkata Nagarjun Pudureddiyur Manivannan, Bernhard Kerbl et al.
ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning
Zihan Ye, Shreyank Gowda, Shiming Chen et al.
GenSpace: Benchmarking Spatially-Aware Image Generation
Zehan Wang, Jiayang Xu, Ziang Zhang et al.
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
Ziqing Fan, Siyuan Du, Shengchao Hu et al.
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang, Xin Xu, Yu-Xiong Wang
Query-based Knowledge Transfer for Heterogeneous Learning Environments
Norah Alballa, Wenxuan Zhang, Ziquan Liu et al.
SEBRA : Debiasing through Self-Guided Bias Ranking
Adarsh Kappiyath, Abhra Chaudhuri, AJAY JAISWAL et al.
A Statistical Approach for Controlled Training Data Detection
Zirui Hu, Yingjie Wang, Zheng Zhang et al.
Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
Youliang Yuan, Wenxiang Jiao, Yuejin Xie et al.
MedChain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence
Jie Liu, Wenxuan Wang, Zizhan Ma et al.
MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives
Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu
Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation
Linda He, Jue Wang, Maurice Weber et al.
Graph Transformers Dream of Electric Flow
Xiang Cheng, Lawrence Carin, Suvrit Sra
Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning
Yan Scholten, Stephan Günnemann
CBMA: Improving Conformal Prediction through Bayesian Model Averaging
Pankaj Bhagwat, Linglong Kong, Bei Jiang
Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games
Boning Li, Longbo Huang
Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions
Yoshiaki Kitazawa
Generalizable Motion Planning via Operator Learning
Sharath Matada, Luke Bhan, Yuanyuan Shi et al.
Exact Community Recovery under Side Information: Optimality of Spectral Algorithms
Julia Gaudio, Nirmit Joshi
Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions
Sagar Shrestha, Xiao Fu
Towards a Unified and Verified Understanding of Group-Operation Networks
Wilson Wu, Louis Jaburi, jacob drori et al.
GS-ProCams: Gaussian Splatting-Based Projector-Camera Systems
Qingyue Deng, Jijiang Li, Haibin Ling et al.
Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers
Runyi Zhao, Sheng Xu, Bo Yue et al.
Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research
Ke Li, Mana Masuda, Susanne Schmidt et al.
Learning mirror maps in policy mirror descent
Carlo Alfano, Sebastian Towers, Silvia Sapora et al.
Elucidating the Preconditioning in Consistency Distillation
Kaiwen Zheng, Guande He, Jianfei Chen et al.
InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation
Zhixin Lai, Keqiang Sun, Fu-Yun Wang et al.
Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression
Juno Kim, Dimitri Meunier, Arthur Gretton et al.
Probabilistic Verification of Cybersickness in Virtual Reality Through Bayesian Networks
Peng Wu, Nasim Ahmed, Abhiram Sarma et al.
JPEG Inspired Deep Learning
Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.
Exploring Organizational Strategies in Immersive Computational Notebooks
Sungwon In, Ayush Roy, Eric Krokos et al.
Revisiting Performance Models of Distal Pointing Tasks in Virtual Reality
Logan Lane, Feiyu Lu, Shakiba Davari et al.
Can People's Brains Synchronize during Remote AR Collaboration?
Jaehwan You, Myeongul Jung, Kwanguk Kim
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter, Xuanli He, Pasquale Minervini et al.
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment
Jinwoo Choi, Seung-Woo Seo
Exploring and Modeling the Effects of Eye-Tracking Accuracy and Precision on Gaze-Based Steering in Virtual Environments
Xuning Hu, Yichuan Zhang, Yushi Wei et al.
Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference
Frank Shih, Zhenghao Jiang, Faming Liang
Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping
Tianhao Wu, Jing Yang, Zhilin Guo et al.
Exploring the Camera Bias of Person Re-identification
Myungseo Song, Jin-Woo Park, Jong-Seok Lee
Birds of a Feather Augment Together: Exploring Sonic Links Between Real and Virtual Worlds in Audio Augmented Reality
Jacob Bhattacharyya, Alessandro Vinciarelli, Stephen Anthony Brewster
EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses
Akshay Paruchuri, Sinan Hersek, Lavisha Aggarwal et al.
Wavelet-based Positional Representation for Long Context
Yui Oka, Taku Hasegawa, Kyosuke Nishida et al.
FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
Shuqiao Liang, Jian Liu, Chen Renzhang et al.
ADAM Optimization with Adaptive Batch Selection
Gyu Yeol Kim, Min-hwan Oh
PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows
Huaguan Chen, Yang Liu, Hao Sun
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
Bryan Bo Cao, Lawrence OGorman, Michael Coss et al.
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective
Yushun Dong, Patrick Soga, Yinhan He et al.
ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese
Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren, Zishi Zhang, Jinyang Jiang et al.
IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation
Runxin Liu, Tian Xie, Jiaming Li et al.
Neural Functions for Learning Periodic Signal
Woojin Cho, Minju Jo, Kookjin Lee et al.
Teaching Human Behavior Improves Content Understanding Abilities Of VLMs
SOMESH SINGH, Harini S I, Yaman Singla et al.
AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations
Pei Zhou, Ruizhe Liu, Qian Luo et al.
Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity
Cedar Site Bai, Brian Bullins
RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection
Xinquan Yu, Ziqi Sheng, Wei Lu et al.
KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting
Ronghua Zheng, Hanru Bai, Weiyang Ding
E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS
Ziyang Zhang, Yang Zhao, Ming-Ching Chang et al.
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment
Qing Chang, Yao-Xiang Ding, Kun Zhou
Weakly Supervised Video Scene Graph Generation via Natural Language Supervision
Kibum Kim, Kanghoon Yoon, Yeonjun In et al.
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics
Xiaochuan Liu, Xin Cheng, Yuchong Sun et al.
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores
Ryo Masumura, Shota Orihashi, Mana Ihori et al.
Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
Hao-Chien Hsueh, Wen-Hsiao Peng, Ching-Chun Huang
The Master Key Filters Hypothesis: Deep Filters Are General
Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
Oren Barkan, Yehonatan Elisha, Jonathan Weill et al.
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
Lingling Cai, Kang Zhao, Hangjie Yuan et al.
ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects
Qihang Cao, Huangxun Chen
Sharpening Neural Implicit Functions with Frequency Consolidation Priors
Chao Chen, Yu-Shen Liu, Zhizhong Han
Rethinking High-speed Image Reconstruction Framework with Spike Camera
Kang Chen, Yajing Zheng, Tiejun Huang et al.
GLOMA: Global Video Text Spotting with Morphological Association
Han Wang, Yanjie Wang, Yang Li et al.
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
Yitong Chen, Wenhao Yao, Lingchen Meng et al.
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
Zikang Chen, Tao Jiang, Xiaowan Hu et al.
Elevating Flow-Guided Video Inpainting with Reference Generation
Suhwan Cho, Seoung Wug Oh, Sangyoun Lee et al.
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
Yongjin Choi, Chanhun Park, Seung Jun Baek
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization
Mingtao Feng, Fenghao Tian, Jianqiao Luo et al.
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.
You Should Learn to Stop Denoising on Point Clouds in Advance
Chuchen Guo, Weijie Zhou, Zheng Liu et al.
AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
Zihao Han, Baoquan Zhang, Lisai Zhang et al.
Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement
Gang He, Guancheng Quan, Chang Wu et al.
Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement
Yulin He, Wei Chen, Siqi Wang et al.
Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation
Miran Heo, Seoung Wug Oh, Seon Joo Kim et al.
Generalized Class Discovery in Instance Segmentation
Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
Wenjin Hou, Dingjie Fu, Kun Li et al.
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation
Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.
Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation
Xiao Hu, Libo Long, Jochen Lang
LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation
Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang et al.
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration
Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
Elkhan Ismayilzada, MD Khalequzzaman Chowdhury Sayem, Yihalem Yimolal Tiruneh et al.
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection
Mingda Jia, Liming Zhao, Ge Li et al.
SparsyFed: Sparse Adaptive Federated Learning
Adriano Guastella, Lorenzo Sani, Alex Iacob et al.
Deep Tree Tensor Networks
Chang Nie
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions
Sijia Jiang, Tong Wu, Jing Hua et al.
Constructing Fair Latent Space for Intersection of Fairness and Explainability
Hyungjun Joo, Hyeonggeun Han, Sehwan Kim et al.
Discrete Distribution Networks
Lei Yang
Multispectral Pedestrian Detection with Sparsely Annotated Label
Chan Lee, Seungho Shin, Gyeong-Moon Park et al.
Diverse Rare Sample Generation with Pretrained GANs
Subeen Lee, Jiyeon Han, Soyeon Kim et al.
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
Yicheng Leng, Chaowei Fang, Junye Chen et al.
M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection
Jiafeng Li, Ying Wen, Lianghua He
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval
Jiaxing Li, Lin Jiang, Zeqi Ma et al.
Multi-View 3D Human Pose Estimation with Weakly Synchronized Images
Ling Li, Ruiwen Gu, Chongyang Wang et al.
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
Ruihuang Li, Liyi Chen, Zhengqiang Zhang et al.
Endowing Visual Reprogramming with Adversarial Robustness
Shengjie Zhou, Xin Cheng, Haiyang Xu et al.
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos
Baoyu Liang, Qile Su, Shoutai Zhu et al.
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection
HaoMiao Liu, Hao Xu, Chuhuai Yue et al.
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
Jingyu Liu, Minquan Wang, Ye Ma et al.
Efficient Deformable Convolutional Prompt for Continual Test-Time Adaptation in Medical Image Segmentation
Shiyu Liu, Daoqiang Zhang, Xiaoke Hao
MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder
Yaqi Liu, Shuhuan Chen, Haichao Shi et al.
Enhancing Low-Light Images: A Synthetic Data Perspective on Practical and Generalizable Solutions
Yu Long, Qinghua Lin, Zhihua Wang et al.
Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval
Dezhao Luo, Shaogang Gong, Jiabo Huang et al.
Does VLM Classification Benefit from LLM Description Semantics?
Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.
Novel View Synthesis Under Large-Deviation Viewpoint for Autonomous Driving
Xin Ma, Jiguang Zhang, Peng Lu et al.
Relaxed Class-consensus Consistency for Semi-supervised Semantic Segmentation
Huayu Mai, Rui Sun, Feng Wu
Data-centric Prediction Explanation via Kernelized Stein Discrepancy
Mahtab Sarvmaili, Hassan Sajjad, Ga Wu
Fast Data Attribution for Text-to-Image Models
Sheng-Yu Wang, Aaron Hertzmann, Alexei Efros et al.
Extract Free Dense Misalignment from CLIP
JeongYeon Nam, Jinbae Im, Wonjae Kim et al.
Global Convergence of Policy Gradient in Average Reward MDPs
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro et al.
S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging
Yimu Pan, Sitao Zhang, Alison D. Gernand et al.
Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization
Zixuan Gong, Xiaolin Hu, Huayi Tang et al.
PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation
Shoumeng Qiu, Xinrun Li, Xiangyang Xue et al.
GHOST: Gaussian Hypothesis Open-Set Technique
Ryan Rabinowitz, Steve Cruz, Manuel Günther et al.
Click2Mask: Local Editing with Dynamic Mask Generation
Omer Regev, Omri Avrahami, Dani Lischinski
ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency
Yang Ren, Hai Jiang, Menglong Yang et al.
Video Summarization Using Denoising Diffusion Probabilistic Model
Zirui Shang, Yubo Zhu, Hongxi Li et al.
Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors
Xuelin Shen, Yitong Wang, Silin Zheng et al.
Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera
Haixin Shi, Yinlin Hu, Daniel Koguciuk et al.
Pixel Is Not a Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models
Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao et al.
SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Arkaprava Sinha, Dominick Reilly, Francois Bremond et al.
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
Taein Son, Soo Won Seo, Jisong Kim et al.
Enhancing Noise-Robust Losses for Large-Scale Noisy Data Learning
Max Staats, Matthias Thamm, Bernd Rosenow
EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution
Xi Su, Xiangfei Shen, Mingyang Wan et al.
Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation
Yifei Su, Dong An, Kehan Chen et al.
Explicit Relational Reasoning Network for Scene Text Detection
Yuchen Su, Zhineng Chen, Yongkun Du et al.
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives
Zeliang Zhang, Susan Liang, Daiki Shimada et al.
Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis
Kunming Tang, Zhiguo Jiang, Jun Shi et al.
Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
Xin Tong, Shi Peng, Baojie Tian et al.
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
Tze Ho Elden Tse, Runyang Feng, Linfang Zheng et al.
Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach
Aowen Wang, Zhiwang Zhang, Dongang Wang et al.
SSC-VAE: Structured Sparse Coding Based Variational Autoencoder for Detail Preserved Image Reconstruction
Hao Wang, Lu Wang, Zhongyu Wang et al.
Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-Light RAW Images
Min Wang, Xin Huang, Guoqing Zhou et al.
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
Xiaolong Wang, Lei Yu, Yingying Zhang et al.
Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning
Xinsong Feng, Zihan Yu, Yanhai Xiong et al.
Aligning Composed Query with Image via Discriminative Perception from Negative Correspondences
Yifan Wang, Wuliang Huang, Chun Yuan