Most Cited 2024 "sub-exponential noise" Papers
12,324 papers found • Page 16 of 62
Conference
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.
DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions
Yunxiao Shi, Manish Singh, Hong Cai et al.
A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning
Xiaoming Hu, Zilei Wang
Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement
Jinyoung Jun, Jae-Han Lee, Chang-Su Kim
Prompting Future Driven Diffusion Model for Hand Motion Prediction
Bowen Tang, Kaihao Zhang, Wenhan Luo et al.
Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search
Haosen SUN, Lujun Li, Peijie Dong et al.
QPEN: Quantum Projection and Quantum Entanglement Enhanced Network for Cross-Lingual Aspect-Based Sentiment Analysis
Xingqiang Zhao, Hai Wan, Kunxun Qi
SLIM: Spuriousness Mitigation with Minimal Human Annotations
Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.
RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
Tatiana Gaintseva, Martin Benning, Greg Slabaugh
D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On
Zhaotong Yang, Zicheng Jiang, Xinzhe Li et al.
Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection
Suyeon Kim, Dongha Lee, SeongKu Kang et al.
Learning Diverse Risk Preferences in Population-Based Self-Play
Yuhua Jiang, Qihan Liu, Xiaoteng Ma et al.
Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer
Xueyi Liu, Kangbo Lyu, jieqiong zhang et al.
Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation
Dong Zhao, Shuang Wang, Qi Zang et al.
Relational Matching for Weakly Semi-Supervised Oriented Object Detection
Wenhao Wu, Hau San Wong, Si Wu et al.
BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling
Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham et al.
IPRemover: A Generative Model Inversion Attack against Deep Neural Network Fingerprinting and Watermarking
Wei Zong, Yang-Wai Chow, Willy Susilo et al.
In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing
Yiran Xu, Zhixin Shu, Cameron Smith et al.
Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence Representation
Zhengyi Li, Menglu Li, Lida Zhu et al.
GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
Linfang Zheng, Tze Ho Elden Tse, Chen Wang et al.
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong, Min Zhao, Zebin You et al.
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy Tho Le, Chenhui Gou, Stavya Datta et al.
Task-Aware Encoder Control for Deep Video Compression
Xingtong Ge, Jixiang Luo, XINJIE ZHANG et al.
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain
Xiang He, Dongcheng Zhao, Yang Li et al.
Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation
Xiaoyu Liu, Miaomiao Cai, Yinda Chen et al.
Clustering for Protein Representation Learning
Ruijie Quan, Wenguan Wang, Fan Ma et al.
DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation
Haonan Lin
Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach
Aoqi Zuo, yiqing li, Susan Wei et al.
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering
Charig Yang, Weidi Xie, Andrew ZISSERMAN
Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
Jie Ruan, Xiao Pu, Mingqi Gao et al.
Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients
Xueyang Tang, Song Guo, Jie ZHANG et al.
Neural Reasoning about Agents’ Goals, Preferences, and Actions
Matteo Bortoletto, Lei Shi, Andreas Bulling
Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process
Mingjie Xu, Feng Lu
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile
Seokjun Lee, Seung-Won Jung, Hyunseok Seo
Automatic Controllable Colorization via Imagination
Xiaoyan Cong, Yue Wu, Qifeng Chen et al.
Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding
Depeng Li, Tianqi Wang, Junwei Chen et al.
Regret Analysis of Repeated Delegated Choice
Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
Lebin Yu, Yunbo Qiu, Quanming Yao et al.
RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
Ming Chang, Xishan Zhang, Rui Zhang et al.
Improved Anonymous Multi Agent Path Finding Algorithm
Zain Alabedeen Ali, Konstantin Yakovlev
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen, Ning Liu, Yichen Zhu et al.
Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning
Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.
Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning
Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.
FFF: Fixing Flawed Foundations in Contrastive Pre-Training Results in Very Strong Vision-Language Models
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction
Guillaume Bono, Leonid Antsfeld, Assem Sadek et al.
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Yusheng Dai, HangChen, Jun Du et al.
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob Hollenstein, Georg Martius, Justus Piater
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang, Ziluo Ding, Zongqing Lu
Hierarchical Aligned Multimodal Learning for NER on Tweet Posts
Peipei Liu, Hong Li, Yimo Ren et al.
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao, Haolin Wang, Jie Zhou et al.
Residual Hyperbolic Graph Convolution Networks
Yangkai Xue, Jindou Dai, Zhipeng Lu et al.
Improved Metric Distortion via Threshold Approvals
Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation
Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.
Robust Policy Learning via Offline Skill Diffusion
Woo Kyung Kim, Minjong Yoo, Honguk Woo
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation
YUE XU, Yong-Lu Li, Kaitong Cui et al.
IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers
Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.
RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency
Ziming Sun, Yuan Liang, Zejun Ma et al.
Graph Context Transformation Learning for Progressive Correspondence Pruning
Junwen Guo, Guobao Xiao, Shiping Wang et al.
Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.
Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution
Qing Cai, Mu Li, Dongwei Ren et al.
Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing
SI-QI LIU, Qirui Wang, Pong Chi Yuen
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
Jian Ma, Chen Chen, Qingsong Xie et al.
An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought
Chunhao LU, Qiang Lu, Jake Luo
1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations
Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.
DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
Vint Lee, Pieter Abbeel, Youngwoon Lee
MedBN: Robust Test-Time Adaptation against Malicious Test Samples
Hyejin Park, Jeongyeon Hwang, Sunung Mun et al.
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
Samuel Pegg, Kai Li, Xiaolin Hu
Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations
Junpeng Fang, Gongduo Zhang, Qing Cui et al.
Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning
Joseph Giovanelli, Alexander Tornede, Tanja Tornede et al.
Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification
Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.
Shape from Heat Conduction
Sriram Narayanan, Mani Ramanagopal, Mark Sheinin et al.
WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation
Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.
PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis
Jason Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani et al.
CatmullRom Splines-Based Regression for Image Forgery Localization
Li Zhang, Mingliang Xu, Dong Li et al.
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou, Ayush Sekhari, Yuda Song et al.
TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts
Youssef Mansour, Xuyang Zhong, Serdar Caglar et al.
Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos
Ekta Prashnani, Koki Nagano, Shalini De Mello et al.
Colorizing Monochromatic Radiance Fields
Yean Cheng, Renjie Wan, Shuchen Weng et al.
Insect Identification in the Wild: The AMI Dataset
Aditya Jain, Fagner Cunha, Michael J Bunsen et al.
Robust 3D Tracking with Quality-Aware Shape Completion
Jingwen Zhang, Zikun Zhou, Guangming Lu et al.
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
Jiedong Zhuang, Jiaqi Hu, Lianrui Mu et al.
UniCal: Unified Neural Sensor Calibration
Ze Yang, George G Chen, Haowei Zhang et al.
Knowledge Enhanced Representation Learning for Drug Discovery
Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.
Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis
Zipeng Qi, Guoxi Huang, Chenyang Liu et al.
Deep Imbalanced Regression via Hierarchical Classification Adjustment
Haipeng Xiong, Angela Yao
ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu, Ryan Teehan, Mengye Ren
Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
YongJin Yang, Taehyeon Kim, Se-Young Yun
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.
Querying Easily Flip-flopped Samples for Deep Active Learning
Seong Jin Cho, Gwangsu Kim, Junghyun Lee et al.
Object-Centric Learning with Slot Mixture Module
Daniil Kirilenko, Vitaliy Vorobyov, Aleksey Kovalev et al.
Reachability of Fair Allocations via Sequential Exchanges
Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.
Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation
Ziyun Wang, Jinyuan Guo, Kostas Daniilidis
High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior
Shen Jianbing, Wencheng Han
ARoFace: Alignment Robustness to Improve Low-quality Face Recognition
Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.
SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly
Liuwan Zhu, Rui Ning, Jiang Li et al.
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Lee Eungbean, Somi Jeong, Kwanghoon Sohn
Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need
Jiaxuan Chen, Yu Qi, Yueming Wang et al.
Text2City: One-Stage Text-Driven Urban Layout Regeneration
Yiming Qin, Nanxuan Zhao, Bin Sheng et al.
fairret: a Framework for Differentiable Fairness Regularization Terms
Maarten Buyl, MaryBeth Defrance, Tijl De Bie
Trainable Highly-expressive Activation Functions
Irit Chelly, Shahaf Finder, Shira Ifergane et al.
Expressive Forecasting of 3D Whole-Body Human Motions
Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.
Anytime Continual Learning for Open Vocabulary Classification
Zhen Zhu, Yiming Gong, Derek Hoiem
FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models
Zhikai Zhang, Yitang Li, Haofeng Huang et al.
On Harmonizing Implicit Subpopulations
Feng Hong, Jiangchao Yao, YUEMING LYU et al.
Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model
Zhening Liu, XINJIE ZHANG, Jiawei Shao et al.
Relightable Neural Actor with Intrinsic Decomposition and Pose Control
Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting
Junwu Zhang, Zhenyu Tang, Yatian Pang et al.
Improving Knowledge Distillation via Regularizing Feature Direction and Norm
Yuzhu Wang, Lechao Cheng, Manni Duan et al.
Self-supervised co-salient object detection via feature correspondences at multiple scales
Souradeep Chakraborty, Dimitris Samaras
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Baochang Zhang, Zhi Qiao, Runkun Liu et al.
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen et al.
Learning Neural Volumetric Pose Features for Camera Localization
Jingyu Lin, Jiaqi Gu, Bojian Wu et al.
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin, Shuhuai Li, Tong Li et al.
Adversarially Robust Few-shot Learning via Parameter Co-distillation of Similarity and Class Concept Learners
Junhao Dong, Piotr Koniusz, Junxi Chen et al.
Large-Scale Multi-Robot Coverage Path Planning via Local Search
Jingtao Tang, Hang Ma
SIG: Speaker Identification in Literature via Prompt-Based Generation
Zhenlin Su, Liyan Xu, Jin Xu et al.
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan, Artur Shatveryan, David Kocharian et al.
On the Limitations of Temperature Scaling for Distributions with Overlaps
Muthu Chidambaram, Rong Ge
CoRe-GD: A Hierarchical Framework for Scalable Graph Visualization with GNNs
Florian Grötschla, Joël Mathys, Róbert Veres et al.
AWOL: Analysis WithOut synthesis using Language
Silvia Zuffi, Michael J. Black
GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths
Xianyu Chen, Ming Jiang, Qi Zhao
Bidirectional Progressive Transformer for Interaction Intention Anticipation
Zichen Zhang, Hongchen Luo, Wei Zhai et al.
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking
Siyuan Li, Lei Ke, Yung-Hsu Yang et al.
Diversity-aware Channel Pruning for StyleGAN Compression
Jiwoo Chung, Sangeek Hyun, Sang-Heon Shim et al.
An N-Point Linear Solver for Line and Motion Estimation with Event Cameras
Ling Gao, Daniel Gehrig, Hang Su et al.
DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors
Xiaze Zhang, Ziheng Ding, Qi Jing et al.
Audio-visual Generalized Zero-shot Learning the Easy Way
Shentong Mo, Pedro Morgado
Learning to Complement and to Defer to Multiple Users
Zheng Zhang, Wenjie Ai, Kevin Wells et al.
Identifying Policy Gradient Subspaces
Jan Schneider, Pierre Schumacher, Simon Guist et al.
Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids
Wontae Kim, Nam Ik Cho
Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation
Wenbo Qi, Jiafei Wu, S. C. Chan
Shedding More Light on Robust Classifiers under the lens of Energy-based Models
Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.
Multiscale Vision Transformers Meet Bipartite Matching for Efficient Single-stage Action Localization
Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos
Personalized Video Relighting With an At-Home Light Stage
Jun Myeong Choi, Max Christman, Roni Sengupta
Zero-Shot Multi-Object Scene Completion
Shun Iwase, Katherine Liu, Vitor Guizilini et al.
Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images
Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.
KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation
Fengyuan Yang, Kerui Gu, Angela Yao
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics
Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.
Querying as Prompt: Parameter-Efficient Learning for Multimodal Language Model
Tian Liang, Jing Huang, Ming Kong et al.
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Mu Cai, Haotian Liu, Yuheng Li et al.
Fun with Flags: Robust Principal Directions via Flag Manifolds
Tolga Birdal, Nathan Mankovich
The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation
Muyang Qiu, Jian Zhang, Lei Qi et al.
High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
YISHENG HE, Weihao Yuan, Siyu Zhu et al.
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation
Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu, Chaohui Yu, Chenjie Cao et al.
Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems
Sojin Lee, Dogyun Park, Inho Kong et al.
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Lirui Zhao, Yue Yang, Kaipeng Zhang et al.
AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos
Feichi Lu, Zijian Dong, Jie Song et al.
OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers
Qitai Wang, Jiawei He, Yuntao Chen et al.
Language Model Guided Interpretable Video Action Reasoning
Ning Wang, Guangming Zhu, Hongsheng Li et al.
BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image
Minje Kim, Tae-Kyun Kim
Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models
Takami Sato, Justin Yue, Nanze Chen et al.
VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
Hanjung Kim, Jaehyun Kang, Miran Heo et al.
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee, Sehyun Hwang, Suha Kwak
Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.
Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning
Haowen Wang, Tao Sun, Congyun Jin et al.
Concise Plane Arrangements for Low-Poly Surface and Volume Modelling
Raphael Sulzer, Florent Lafarge
Text-Guided Video Masked Autoencoder
David Fan, Jue Wang, Shuai Liao et al.
CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.
Long Li, Nian Liu, Dingwen Zhang et al.
Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation
Xinru Cui, Qiming Liu, Zhe Liu et al.
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok et al.
NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration
Lin Tian, Thomas H Greer, Raul San Jose Estepar et al.
Robust Training of Federated Models with Extremely Label Deficiency
Yonggang Zhang, Zhiqin Yang, Xinmei Tian et al.
FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving
Xingtai Gui, Tengteng Huang, Haonan Shao et al.
DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction
MOZHGAN POURKESHAVARZ, Arielle Zhang, Amir Rasouli
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Mathilde Caron, Ahmet Iscen, Alireza Fathi et al.
Weight Conditioning for Smooth Optimization of Neural Networks
Hemanth Saratchandran, Thomas X Wang, Simon Lucey
Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection
Zihan Zhang, Zhuo Xu, Xiang Xiang
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Shentong Mo, Enze Xie, Yue Wu et al.
A high-quality robust diffusion framework for corrupted dataset
Quan Dao, Binh Ta, Tung Pham et al.
Understanding Physical Dynamics with Counterfactual World Modeling
Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.
Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment
Alvi Md Ishmam, Chris Thomas
Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation
Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.
Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training
Yuanqi Yao, Gang Wu, Kui Jiang et al.
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li, Qiang Nie, Weifu Fu et al.
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning
Kun Ding, Haojian Zhang, Qiang Yu et al.
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
Yuze Hao, Jianrong Zhang, Tao Zhuo et al.
BodyMAP - Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed
Abhishek Tandon, Anujraaj Goyal, Henry M. Clever et al.
DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction
Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel
Anchoring Path for Inductive Relation Prediction in Knowledge Graphs
Zhixiang Su, Di Wang, Chunyan Miao et al.
Two-timescale Extragradient for Finding Local Minimax Points
Jiseok Chae, Kyuwon Kim, Donghwan Kim
Direct Distillation between Different Domains
Jialiang Tang, Shuo Chen, Gang Niu et al.
Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures
Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.
Self-Supervised Audio-Visual Soundscape Stylization
Tingle Li, Renhao Wang, Po-Yao Huang et al.
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
Zeyang Liu, Lipeng Wan, Xinrui Yang et al.
Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression
Xuanlong Yu, Gianni Franchi, Jindong Gu et al.
Camera-LiDAR Cross-modality Gait Recognition
Wenxuan Guo, Yingping Liang, Zhiyu Pan et al.
On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles
Marcus Gozon, Jingjin Yu
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
Rui Pan, Yuxing Liu, Xiaoyu Wang et al.
Artist-Friendly Relightable and Animatable Neural Heads
Yingyan Xu, Prashanth Chandran, Sebastian Weiss et al.
Single View Refractive Index Tomography with Neural Fields
Brandon Zhao, Aviad Levis, Liam Connor et al.
Neural Lineage
Runpeng Yu, Xinchao Wang
Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM
Baicheng Li, Zike Yan, Dong Wu et al.
Spear and Shield: Adversarial Attacks and Defense Methods for Model-Based Link Prediction on Continuous-Time Dynamic Graphs
Dongjin Lee, Juho Lee, Kijung Shin
Generating 3D House Wireframes with Semantics
Xueqi Ma, Yilin Liu, Wenjun Zhou et al.
Detours for Navigating Instructional Videos
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan et al.
Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection
Youheng Sun, Shengming Yuan, Xuanhan Wang et al.
Spatial-Semantic Collaborative Cropping for User Generated Content
Yukun Su, Yiwen Cao, Jingliang Deng et al.
Semantic Human Mesh Reconstruction with Textures
xiaoyu zhan, Jianxin Yang, Yuanqi Li et al.
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations
KILICHBEK HAYDAROV, Xiaoqian Shen, Avinash Madasu et al.