Most Cited 2025 "top-p sampling" Papers
On the Almost Sure Convergence of the Stochastic Three Points Algorithm
Taha EL BAKKALI EL KADI, Omar Saadi
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition
Jie Wang, Tingfa Xu, Lihe Ding et al.
RFMamba: Frequency-Aware State Space Model for RF-Based Human-Centric Perception
Rui Zhang, Ruixu Geng, Yadong Li et al.
Demystifying Topological Message-Passing with Relational Structures: A Case Study on Oversquashing in Simplicial Message-Passing
Diaaeldin Taha, James Chapman, Marzieh Eidi et al.
Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions
Piotr Indyk, Michael Kapralov, Kshiteej Jitesh Sheth et al.
Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design
Melis Ilayda Bal, Pier Giuseppe Sessa, Mojmir Mutny et al.
Physiome-ODE: A Benchmark for Irregularly Sampled Multivariate Time-Series Forecasting Based on Biological ODEs
Christian Klötergens, Vijaya Krishna Yalavarthi, Randolf Scholz et al.
IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
Shitong Shao, Zikai Zhou, Lichen Bai et al.
LLM-based Typed Hyperresolution for Commonsense Reasoning with Knowledge Bases
Armin Toroghi, Ali Pesaranghader, Tanmana Sadhu et al.
Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis
Qi Chen, Jierui Zhu, Florian Shkurti
REBIND: Enhancing Ground-state Molecular Conformation Prediction via Force-Based Graph Rewiring
Taewon Kim, Hyunjin Seo, Sungsoo Ahn et al.
TopoGaussian: Inferring Internal Topology Structures from Visual Clues
Xiaoyu Xiong, Changyu Hu, Chunru Lin et al.
Differentiable Rule Induction from Raw Sequence Inputs
Kun Gao, Katsumi Inoue, Yongzhi Cao et al.
Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization
Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.
Adaptive Shrinkage Estimation for Personalized Deep Kernel Regression in Modeling Brain Trajectories
Vasiliki Tassopoulou, Haochang Shou, Christos Davatzikos
Computational Explorations of Total Variation Distance
Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel et al.
Convex Formulations for Training Two-Layer ReLU Neural Networks
Karthik Prakhya, Tolga Birdal, Alp Yurtsever
Restyling Unsupervised Concept Based Interpretable Networks with Generative Models
Jayneel Parekh, Quentin Bouniot, Pavlo Mozharovskyi et al.
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch et al.
Zero-Shot Natural Language Explanations
Fawaz Sammani, Nikos Deligiannis
Learning to Help in Multi-Class Settings
Yu Wu, Yansong Li, Zeyu Dong et al.
The Directionality of Optimization Trajectories in Neural Networks
Sidak Pal Singh, Bobby He, Thomas Hofmann et al.
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance
Raphael Husistein, Markus Reiher, Marco Eckhoff
Towards Certification of Uncertainty Calibration under Adversarial Attacks
Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz et al.
Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing
Elad Romanov, Fangzhao Zhang, Mert Pilanci
PharmacoMatch: Efficient 3D Pharmacophore Screening via Neural Subgraph Matching
Daniel Rose, Oliver Wieder, Thomas Seidel et al.
Towards Marginal Fairness Sliced Wasserstein Barycenter
Khai Nguyen, Hai Nguyen, Nhat Ho
CoMotion: Concurrent Multi-person 3D Motion
Alejandro Newell, Peiyun Hu, Lahav Lipson et al.
Salvage: Shapley-distribution Approximation Learning Via Attribution Guided Exploration for Explainable Image Classification
Mehdi Naouar, Hanne Raum, Jens Rahnfeld et al.
Adversarial Training for Defense Against Label Poisoning Attacks
Melis Ilayda Bal, Volkan Cevher, Michael Muehlebach
GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation
Ziwei Yang, Zheng Chen, Xin Liu et al.
The Computational Complexity of Positive Non-Clashing Teaching in Graphs
Robert Ganian, Liana Khazaliya, Fionn Mc Inerney et al.
PaLD: Detection of Text Partially Written by Large Language Models
Eric Lei, Hsiang Hsu, Chun-Fu Chen
RuAG: Learned-rule-augmented Generation for Large Language Models
Yudi Zhang, Pei Xiao, Lu Wang et al.
Easing Training Process of Rectified Flow Models Via Lengthening Inter-Path Distance
Shifeng Xu, Yanzhu Liu, Adams Kong
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation
Tao Zhang, Jinyong Wen, Zhen Chen et al.
CtD: Composition through Decomposition in Emergent Communication
Boaz Carmeli, Ron Meir, Yonatan Belinkov
The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model
Jiawei Chen, Wentao Chen, Jing Su et al.
Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning
Ruikun Li, Huandong Wang, Qingmin Liao et al.
Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data
Xinran Liu, Yikun Bai, Rocio Diaz Martin et al.
Supervised and Semi-Supervised Diffusion Maps with Label-Driven Diffusion
Harel Mendelman, Ronen Talmon
For Better or For Worse? Learning Minimum Variance Features With Label Augmentation
Muthu Chidambaram, Rong Ge
Self-Attention-Based Contextual Modulation Improves Neural System Identification
Isaac Lin, Tianye Wang, Shang Gao et al.
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness
Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld
SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection
Jingjie Zhang, Hanqun Cao, Zijun Gao et al.
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
Yutong Yin, Zhaoran Wang
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Jiajie Li, Brian Quaranto, Chenhui Xu et al.
EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation
Jiajian Xie, Shengyu Zhang, Mengze Li et al.
Uncertainty modeling for fine-tuned implicit functions
Anna Susmelj, Mael Macuglia, Natasa Tagasovska et al.
cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM
Gabriel Claude Jean Ducrocq, Lukas Grunewald, Sebastian Westenhoff et al.
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento, Chuan Sheng Foo, See-Kiong Ng
Decentralized Optimization with Coupled Constraints
Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.
Can a Large Language Model be a Gaslighter?
Wei Li, Luyao Zhu, Yang Song et al.
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
Yifei Xing, Xiangyuan Lan, Ruiping Wang et al.
How Much is Unseen Depends Chiefly on Information About the Seen
Seongmin Lee, Marcel Boehme
Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding
Frank Zhengqing Wu, Berfin Simsek, François Ged
TeaserGen: Generating Teasers for Long Documentaries
Weihan Xu, Paul Pu Liang, Haven Kim et al.
Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions
Lingjie Yi, Michael Yao, Weimin Lyu et al.
The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures
Xiaoyi Mai, Zhenyu Liao
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning
Ruiqi Ni, Zherong Pan, Ahmed Hussain Qureshi
Adaptive Batch Size for Privately Finding Second-Order Stationary Points
Daogao Liu, Kunal Talwar
Measuring And Improving Engagement of Text-to-Image Generation Models
Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.
FACTS: A Factored State-Space Framework for World Modelling
Li Nanbo, Firas Laakom, Yucheng Xu et al.
Learning under Temporal Label Noise
Sujay Nagaraj, Walter Gerych, Sana Tonekaboni et al.
$q$-exponential family for policy optimization
Lingwei Zhu, Haseeb Shah, Han Wang et al.
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Zhizheng Liu, Joe Lin, Wayne Wu et al.
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh et al.
ProtoSnap: Prototype Alignment For Cuneiform Signs
Rachel Mikulinsky, Morris Alper, Shai Gordin et al.
Equivariant Masked Position Prediction for Efficient Molecular Representation
Junyi An, Chao Qu, Yun-Fei Shi et al.
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li, Peizhong Ju, Ness Shroff
Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning
Bo Yue, Shufan Wang, Ashish Gaurav et al.
Towards a learning theory of representation alignment
Francesco Maria Gabriele Insulla, Shuo Huang, Lorenzo Rosasco
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia, Daniel Bramblett, Daksh Dobhal et al.
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob, Lorenzo Sani, Meghdad Kurmanji et al.
ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs
Hao Di, Tong He, Haishan Ye et al.
TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Haowei Lin, Shanda Li, Haotian Ye et al.
The adaptive complexity of parallelized log-concave sampling
Huanjian Zhou, Baoxiang Wang, Masashi Sugiyama
Progressive Parameter Efficient Transfer Learning for Semantic Segmentation
Nan Zhou, Huiqun Wang, Yaoyan Zheng et al.
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Xiangxin Zhou, Yi Xiao, Haowei Lin et al.
Group Ligands Docking to Protein Pockets
Jiaqi Guan, Jiahan Li, Xiangxin Zhou et al.
On Quantizing Neural Representation for Variable-Rate Video Coding
Junqi Shi, Zhujia Chen, Hanfei Li et al.
Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization
Tianxu Li, Kun Zhu
DynFrs: An Efficient Framework for Machine Unlearning in Random Forest
Shurong Wang, Zhuoyang Shen, Xinbao Qiao et al.
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark
Bing Cao, Quanhao Lu, Jiekang Feng et al.
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
Xiangru Zhu, Penglei Sun, Yaoxian Song et al.
INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph
Ningyuan Li, Haihong E, Tianyu Yao et al.
InstaRevive: One-Step Image Enhancement via Dynamic Score Matching
Yixuan Zhu, Haolin Wang, Ao Li et al.
On Designing General and Expressive Quantum Graph Neural Networks with Applications to MILP Instance Representation
Xinyu Ye, Hao Xiong, Jianhao Huang et al.
OMG: Opacity Matters in Material Modeling with Gaussian Splatting
Silong Yong, Venkata Nagarjun Pudureddiyur Manivannan, Bernhard Kerbl et al.
ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning
Zihan Ye, Shreyank Gowda, Shiming Chen et al.
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
Ziqing Fan, Siyuan Du, Shengchao Hu et al.
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang, Xin Xu, Yu-Xiong Wang
Query-based Knowledge Transfer for Heterogeneous Learning Environments
Norah Alballa, Wenxuan Zhang, Ziquan Liu et al.
SEBRA: Debiasing through Self-Guided Bias Ranking
Adarsh Kappiyath, Abhra Chaudhuri, Ajay Jaiswal et al.
A Statistical Approach for Controlled Training Data Detection
Zirui Hu, Yingjie Wang, Zheng Zhang et al.
Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation
Linda He, Jue Wang, Maurice Weber et al.
Graph Transformers Dream of Electric Flow
Xiang Cheng, Lawrence Carin, Suvrit Sra
Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning
Yan Scholten, Stephan Günnemann
CBMA: Improving Conformal Prediction through Bayesian Model Averaging
Pankaj Bhagwat, Linglong Kong, Bei Jiang
Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games
Boning Li, Longbo Huang
Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions
Yoshiaki Kitazawa
Generalizable Motion Planning via Operator Learning
Sharath Matada, Luke Bhan, Yuanyuan Shi et al.
Exact Community Recovery under Side Information: Optimality of Spectral Algorithms
Julia Gaudio, Nirmit Joshi
Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions
Sagar Shrestha, Xiao Fu
GS-ProCams: Gaussian Splatting-Based Projector-Camera Systems
Qingyue Deng, Jijiang Li, Haibin Ling et al.
Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research
Ke Li, Mana Masuda, Susanne Schmidt et al.
Probabilistic Verification of Cybersickness in Virtual Reality Through Bayesian Networks
Peng Wu, Nasim Ahmed, Abhiram Sarma et al.
Exploring Organizational Strategies in Immersive Computational Notebooks
Sungwon In, Ayush Roy, Eric Krokos et al.
Revisiting Performance Models of Distal Pointing Tasks in Virtual Reality
Logan Lane, Feiyu Lu, Shakiba Davari et al.
Can People's Brains Synchronize during Remote AR Collaboration?
Jaehwan You, Myeongul Jung, Kwanguk Kim
Exploring and Modeling the Effects of Eye-Tracking Accuracy and Precision on Gaze-Based Steering in Virtual Environments
Xuning Hu, Yichuan Zhang, Yushi Wei et al.
Birds of a Feather Augment Together: Exploring Sonic Links Between Real and Virtual Worlds in Audio Augmented Reality
Jacob Bhattacharyya, Alessandro Vinciarelli, Stephen Anthony Brewster
EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses
Akshay Paruchuri, Sinan Hersek, Lavisha Aggarwal et al.
ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese
Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.
IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation
Runxin Liu, Tian Xie, Jiaming Li et al.
RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection
Xinquan Yu, Ziqi Sheng, Wei Lu et al.
E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS
Ziyang Zhang, Yang Zhao, Ming-Ching Chang et al.
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment
Qing Chang, Yao-Xiang Ding, Kun Zhou
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics
Xiaochuan Liu, Xin Cheng, Yuchong Sun et al.
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores
Ryo Masumura, Shota Orihashi, Mana Ihori et al.
The Master Key Filters Hypothesis: Deep Filters Are General
Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
Oren Barkan, Yehonatan Elisha, Jonathan Weill et al.
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
Lingling Cai, Kang Zhao, Hangjie Yuan et al.
ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects
Qihang Cao, Huangxun Chen
Sharpening Neural Implicit Functions with Frequency Consolidation Priors
Chao Chen, Yu-Shen Liu, Zhizhong Han
Rethinking High-speed Image Reconstruction Framework with Spike Camera
Kang Chen, Yajing Zheng, Tiejun Huang et al.
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
Yitong Chen, Wenhao Yao, Lingchen Meng et al.
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
Zikang Chen, Tao Jiang, Xiaowan Hu et al.
Elevating Flow-Guided Video Inpainting with Reference Generation
Suhwan Cho, Seoung Wug Oh, Sangyoun Lee et al.
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
Yongjin Choi, Chanhun Park, Seung Jun Baek
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization
Mingtao Feng, Fenghao Tian, Jianqiao Luo et al.
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.
You Should Learn to Stop Denoising on Point Clouds in Advance
Chuchen Guo, Weijie Zhou, Zheng Liu et al.
AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
Zihao Han, Baoquan Zhang, Lisai Zhang et al.
Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement
Gang He, Guancheng Quan, Chang Wu et al.
Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement
Yulin He, Wei Chen, Siqi Wang et al.
Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation
Miran Heo, Seoung Wug Oh, Seon Joo Kim et al.
Generalized Class Discovery in Instance Segmentation
Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
Wenjin Hou, Dingjie Fu, Kun Li et al.
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation
Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.
Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation
Xiao Hu, Libo Long, Jochen Lang
LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation
Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang et al.
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration
Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
Elkhan Ismayilzada, MD Khalequzzaman Chowdhury Sayem, Yihalem Yimolal Tiruneh et al.
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection
Mingda Jia, Liming Zhao, Ge Li et al.
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions
Sijia Jiang, Tong Wu, Jing Hua et al.
Constructing Fair Latent Space for Intersection of Fairness and Explainability
Hyungjun Joo, Hyeonggeun Han, Sehwan Kim et al.
Multispectral Pedestrian Detection with Sparsely Annotated Label
Chan Lee, Seungho Shin, Gyeong-Moon Park et al.
Diverse Rare Sample Generation with Pretrained GANs
Subeen Lee, Jiyeon Han, Soyeon Kim et al.
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
Yicheng Leng, Chaowei Fang, Junye Chen et al.
M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection
Jiafeng Li, Ying Wen, Lianghua He
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval
Jiaxing Li, Lin Jiang, Zeqi Ma et al.
Multi-View 3D Human Pose Estimation with Weakly Synchronized Images
Ling Li, Ruiwen Gu, Chongyang Wang et al.
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
Ruihuang Li, Liyi Chen, Zhengqiang Zhang et al.
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos
Baoyu Liang, Qile Su, Shoutai Zhu et al.
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection
HaoMiao Liu, Hao Xu, Chuhuai Yue et al.
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
Jingyu Liu, Minquan Wang, Ye Ma et al.
Efficient Deformable Convolutional Prompt for Continual Test-Time Adaptation in Medical Image Segmentation
Shiyu Liu, Daoqiang Zhang, Xiaoke Hao
MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder
Yaqi Liu, Shuhuan Chen, Haichao Shi et al.
Enhancing Low-Light Images: A Synthetic Data Perspective on Practical and Generalizable Solutions
Yu Long, Qinghua Lin, Zhihua Wang et al.
Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval
Dezhao Luo, Shaogang Gong, Jiabo Huang et al.
Does VLM Classification Benefit from LLM Description Semantics?
Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.
Novel View Synthesis Under Large-Deviation Viewpoint for Autonomous Driving
Xin Ma, Jiguang Zhang, Peng Lu et al.
Relaxed Class-consensus Consistency for Semi-supervised Semantic Segmentation
Huayu Mai, Rui Sun, Feng Wu
Extract Free Dense Misalignment from CLIP
JeongYeon Nam, Jinbae Im, Wonjae Kim et al.
S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging
Yimu Pan, Sitao Zhang, Alison D. Gernand et al.
PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation
Shoumeng Qiu, Xinrun Li, Xiangyang Xue et al.
GHOST: Gaussian Hypothesis Open-Set Technique
Ryan Rabinowitz, Steve Cruz, Manuel Günther et al.
Click2Mask: Local Editing with Dynamic Mask Generation
Omer Regev, Omri Avrahami, Dani Lischinski
ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency
Yang Ren, Hai Jiang, Menglong Yang et al.
Video Summarization Using Denoising Diffusion Probabilistic Model
Zirui Shang, Yubo Zhu, Hongxi Li et al.
Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors
Xuelin Shen, Yitong Wang, Silin Zheng et al.
Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera
Haixin Shi, Yinlin Hu, Daniel Koguciuk et al.
Pixel Is Not a Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models
Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao et al.
SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Arkaprava Sinha, Dominick Reilly, Francois Bremond et al.
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
Taein Son, Soo Won Seo, Jisong Kim et al.
Enhancing Noise-Robust Losses for Large-Scale Noisy Data Learning
Max Staats, Matthias Thamm, Bernd Rosenow
EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution
Xi Su, Xiangfei Shen, Mingyang Wan et al.
Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation
Yifei Su, Dong An, Kehan Chen et al.
Explicit Relational Reasoning Network for Scene Text Detection
Yuchen Su, Zhineng Chen, Yongkun Du et al.
Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis
Kunming Tang, Zhiguo Jiang, Jun Shi et al.
Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
Xin Tong, Shi Peng, Baojie Tian et al.
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
Tze Ho Elden Tse, Runyang Feng, Linfang Zheng et al.
Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach
Aowen Wang, Zhiwang Zhang, Dongang Wang et al.
SSC-VAE: Structured Sparse Coding Based Variational Autoencoder for Detail Preserved Image Reconstruction
Hao Wang, Lu Wang, Zhongyu Wang et al.
Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-Light RAW Images
Min Wang, Xin Huang, Guoqing Zhou et al.
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
Xiaolong Wang, Lei Yu, Yingying Zhang et al.
Aligning Composed Query with Image via Discriminative Perception from Negative Correspondences
Yifan Wang, Wuliang Huang, Chun Yuan
AnyTalk: Multi-modal Driven Multi-domain Talking Head Generation
Yu Wang, Yunfei Liu, Fa-Ting Hong et al.
TdAttenMix: Top-Down Attention Guided Mixup
Zhiming Wang, Lin Gu, Feng Lu
Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation
Dongyue Wu, Zilin Guo, Li Yu et al.
LVPTrack: High Performance Domain Adaptive UAV Tracking with Label Aligned Visual Prompt Tuning
Hongjing Wu, Siyuan Yao, Feng Huang et al.
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos
Zhengqian Wu, Ruizhe Li, Zijun Xu et al.
Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis
Zhenhua Wu, Linxuan Jiang, Xiang Li et al.
Relaxed Rotational Equivariance via G-Biases in Vision
Zhiqiang Wu, Yingjie Liu, Licheng Sun et al.
Exploiting Continuous Motion Clues for Vision-Based Occupancy Prediction
Haoran Xu, Peixi Peng, Xinyi Zhang et al.
Physical-aware Neural Radiance Fields for Efficient Exposure Correction
Kai Xu, Mingwen Shao, Yuanjian Qiao et al.
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
Wenting Xu, Viorela Ila, Luping Zhou et al.