Most Cited 2024 "time-dependent attention" Papers
12,324 papers found • Page 17 of 62
Conference
PPIDSG: A Privacy-Preserving Image Distribution Sharing Scheme with GAN in Federated Learning
Yuting Ma, Yuanzhi Yao, Xiaohua Xu
Inverse Weight-Balancing for Deep Long-Tailed Learning
Wenqi Dang, Zhou Yang, Weisheng Dong et al.
DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting
Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.
Detours for Navigating Instructional Videos
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan et al.
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model
Amrin Kareem, Jean Lahoud, Hisham Cholakkal
LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate
Tao Wu, Tie Luo, D. C. Wunsch
WaveMo: Learning Wavefront Modulations to See Through Scattering
Mingyang Xie, Haiyun Guo, Brandon Y. Feng et al.
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.
Efficient Stitchable Task Adaptation
Haoyu He, Zizheng Pan, Jing Liu et al.
Physics-Aware Hand-Object Interaction Denoising
Haowen Luo, Yunze Liu, Li Yi
Complete Neural Networks for Complete Euclidean Graphs
Snir Hordan, Tal Amir, Nadav Dym et al.
Improving Zero-Shot Generalization for CLIP with Variational Adapter
Ziqian Lu, Fengli Shen, Mushui Liu et al.
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images
Guanlin Shen, Jingwei Huang, Zhihua Hu et al.
Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment
Alvi Md Ishmam, Chris Thomas
Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee, Minsoo Kang, Bohyung Han
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.
SemReg: Semantics Constrained Point Cloud Registration
Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.
Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals
Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.
Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN
Minsoo Kang, Minkoo Kang, Suhyun Kim
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.
Semantic Human Mesh Reconstruction with Textures
xiaoyu zhan, Jianxin Yang, Yuanqi Li et al.
Probabilistic Neural Circuits
Pedro Zuidberg Dos Martires
DAG-Aware Variational Autoencoder for Social Propagation Graph Generation
Dongpeng Hou, Chao Gao, Xuelong Li et al.
Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants
Wei Chen, Zhiyi Huang, Ruichu Cai et al.
FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation
Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.
Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise
Augusto Santos, Diogo Rente, Rui Seabra et al.
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.
BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events
Yijin Li, Yichen Shen, Zhaoyang Huang et al.
Implicit Motion Function
Yue Gao, Jiahao Li, Lei Chu et al.
Unmasking Bias in Diffusion Model Training
Hu Yu, Li Shen, Jie Huang et al.
Accelerating the Global Aggregation of Local Explanations
Alon Mor, Yonatan Belinkov, Benny Kimelfeld
External Knowledge Enhanced 3D Scene Generation from Sketch
Zijie Wu, Mingtao Feng, Yaonan Wang et al.
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong, Haiyang Mei, Ziqi Wei et al.
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li, Qiang Nie, Weifu Fu et al.
On the Robustness of Neural-Enhanced Video Streaming against Adversarial Attacks
Qihua Zhou, Jingcai Guo, Song Guo et al.
Fact-Driven Logical Reasoning for Machine Reading Comprehension
Siru Ouyang, Zhuosheng Zhang, Hai Zhao
SuperPrimitive: Scene Reconstruction at a Primitive Level
Kirill Mazur, Gwangbin Bae, Andrew J. Davison
Dependency Structure-Enhanced Graph Attention Networks for Event Detection
Qizhi Wan, Changxuan wan, Keli Xiao et al.
Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders
Lucas Stoffl, Andy Bonnetto, Stéphane D'Ascoli et al.
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
Hyunjune Shin, Dong-Wan Choi
UNR-Explainer: Counterfactual Explanations for Unsupervised Node Representation Learning Models
Hyunju Kang, Geonhee Han, Hogun Park
Color Event Enhanced Single-Exposure HDR Imaging
Mengyao Cui, Zhigang Wang, Dong Wang et al.
Weakly Supervised Few-Shot Object Detection with DETR
Chenbo Zhang, Yinglu Zhang, Lu Zhang et al.
Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting
Yu Liu, Fatimah binti Khalid, Lei Wang et al.
Steganographic Passport: An Owner and User Verifiable Credential for Deep Model IP Protection Without Retraining
Qi Cui, Ruohan Meng, Chaohui Xu et al.
Generating Illustrated Instructions
Sachit Menon, Ishan Misra, Rohit Girdhar
Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration
Huangxing Lin, Yuhang Dong, Xinghao Ding et al.
Monocular Identity-Conditioned Facial Reflectance Reconstruction
Xingyu Ren, Jiankang Deng, Yuhao Cheng et al.
Causally Aligned Curriculum Learning
Mingxuan Li, Junzhe Zhang, Elias Bareinboim
Your Career Path Matters in Person-Job Fit
Zhuocheng Gong, Yang Song, Tao Zhang et al.
Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning
Wenke Huang, Mang Ye, zekun shi et al.
Towards the Disappearing Truth: Fine-Grained Joint Causal Influences Learning with Hidden Variable-Driven Causal Hypergraphs
Kun Zhu, Chunhui Zhao
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphael Avalos, Florent Delgrange, Ann Nowe et al.
HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs
Ziwei Yao, Ruiping Wang, Xilin CHEN
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
Li Mi, Syrielle Montariol, Javiera Castillo Navarro et al.
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks.
Aaron Spieler, Nasim Rahaman, Georg Martius et al.
Open-Set Biometrics: Beyond Good Closed-Set Models
Yiyang Su, Minchul Kim, Feng Liu et al.
H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer
Yanru Wu, Jianning Wang, Weida Wang et al.
MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music
Shuai Yu
Hard Regularization to Prevent Deep Online Clustering Collapse without Data Augmentation
Louis Mahon, Thomas Lukasiewicz
In-Hand 3D Object Reconstruction from a Monocular RGB Video
Shijian Jiang, Qi Ye, Rengan Xie et al.
Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis
James Kirk, Robert Wray, Peter Lindes et al.
Learning Small Decision Trees with Few Outliers: A Parameterized Perspective
Harmender Gahlawat, Meirav Zehavi
SeTformer Is What You Need for Vision and Language
Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger et al.
Knowledge-Enhanced Historical Document Segmentation and Recognition
En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu et al.
Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs
Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi et al.
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding
Sahil Santosh Khose, Anisha Pal, Aayushi Agarwal et al.
Using Stratified Sampling to Improve LIME Image Explanations
Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari et al.
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
Zijin Yin, Kongming Liang, Bing Li et al.
Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model
Donggeun Yoon, Minseok Seo, Doyi Kim et al.
Learning Cross-hand Policies of High-DOF Reaching and Grasping
Qijin She, Shishun Zhang, Yunfan Ye et al.
Synergy of Sight and Semantics: Visual Intention Understanding with CLIP
Qu Yang, Mang Ye, Dacheng Tao
EA-VTR: Event-Aware Video-Text Retrieval
Zongyang Ma, Ziqi Zhang, Yuxin Chen et al.
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
Tongzhou Mu, Minghua Liu, Hao Su
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation
Anqi Zhang, Guangyu Gao
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference
Tanvir Mahmud, Burhaneddin Yaman, Chun-Hao Liu et al.
HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation
Gian Toan D., Tien Dac Lai, Thien Van Luong et al.
Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model
Shoma Iwai, Atsuki Osanai, Shunsuke Kitada et al.
Instance-based Max-margin for Practical Few-shot Recognition
Minghao Fu, Ke Zhu
Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception
Xiao Wang, Wentao Wu, Chenglong Li et al.
PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments
rixin zhou, Ding Xia, YI ZHANG et al.
Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection
Yunfeng Fan, Wenchao Xu, Haozhao Wang et al.
Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition
Zhongxi Chen, Shen Chen, Taiping Yao et al.
Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model
Guanren Qiao, Guiliang Liu, Guorui Quan et al.
G3DR: Generative 3D Reconstruction in ImageNet
Pradyumna Reddy, Ismail Elezi, Jiankang Deng
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
Fan Yang, Wei Li, Menglong Yang et al.
Click Prompt Learning with Optimal Transport for Interactive Segmentation
Jie Liu, haochen wang, Wenzhe Yin et al.
Lyapunov-Stable Deep Equilibrium Models
Haoyu Chu, Shikui Wei, Ting Liu et al.
Neural Amortized Inference for Nested Multi-Agent Reasoning
Kunal Jha, Tuan Anh Le, Chuanyang Jin et al.
HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation
Tianpei Zou, Sanqing Qu, Zhijun Li et al.
Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization
Weihang Liu, Xue Xian Zheng, Jingyi Yu et al.
From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition
Maan Qraitem, Kate Saenko, Bryan Plummer
L-MAGIC: Language Model Assisted Generation of Images with Coherence
zhipeng cai, Matthias Mueller, Reiner Birkl et al.
Temporal Residual Jacobians for Rig-free Motion Transfer
Sanjeev Muralikrishnan, Niladri Shekhar Dutt, Siddhartha Chaudhuri et al.
Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM
Jia Wan, qiangqiang wu, Wei Lin et al.
Interactive Visual Task Learning for Robots
Weiwei Gu, Anant Sah, N. Gopalan
Agglomerative Token Clustering
Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.
Learning to Drive via Asymmetric Self-Play
Chris Zhang, Sourav Biswas, Kelvin Wong et al.
Fast Encoding and Decoding for Implicit Video Representation
Hao Chen, Saining Xie, Ser-Nam Lim et al.
Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction
Thanh-Tung Le, Khai Nguyen, shanlin sun et al.
Unsupervised Multi-modal Medical Image Registration via Invertible Translation
Mengjie Guo
Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception
Congzhang Shao, Guiyang Luo, Quan Yuan et al.
Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection
Jian Shi, Pengyi Zhang, Ni Zhang et al.
Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
Shibo Jie, Yehui Tang, Jianyuan Guo et al.
CFEVER: A Chinese Fact Extraction and VERification Dataset
Ying-Jia Lin, ChunYi Lin, Chia-Jen Yeh et al.
When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach
TAO MA, Bing Bai, Haozhe Lin et al.
Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models
Andrew Engel, Zhichao Wang, Natalie Frank et al.
RCL: Reliable Continual Learning for Unified Failure Detection
Fei Zhu, Zhen Cheng, Xu-Yao Zhang et al.
SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images
josh myers-dean, Jarek T Reynolds, Brian Price et al.
Efficient Multitask Dense Predictor via Binarization
Yuzhang Shang, Dan Xu, Gaowen Liu et al.
Dual-Enhanced Coreset Selection with Class-wise Collaboration for Online Blurry Class Incremental Learning
Yutian Luo, Shiqi Zhao, Haoran Wu et al.
De-confounded Gaze Estimation
Ziyang Liang, Yiwei Bao, Feng Lu
MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps
Jianhao Zheng, Daniel Barath, Marc Pollefeys et al.
SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments
Niklas Gard, Anna Hilsmann, Peter Eisert
Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations
Yujee Song, Donghyun LEE, Rui Meng et al.
Delving Deep into Engagement Prediction of Short Videos
dasong Li, Wenjie Li, Baili Lu et al.
DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement
Qimin Chen, Zhiqin Chen, Vladimir Kim et al.
Region-centric Image-Language Pretraining for Open-Vocabulary Detection
Dahun Kim, Anelia Angelova, Weicheng Kuo
Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models
MENGYU ZHENG, Yehui Tang, Zhiwei Hao et al.
Semantically Guided Representation Learning For Action Anticipation
Anxhelo Diko, Danilo Avola, Bardh Prenkaj et al.
The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing
Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.
Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds
Zhimin Yuan, Wankang Zeng, Yanfei Su et al.
STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning
Hao CHENG, SIYUAN YANG, Chong Wang et al.
Event Trojan: Asynchronous Event-based Backdoor Attacks
Ruofei Wang, Qing Guo, Haoliang Li et al.
UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework
Tarun Kalluri, Sreyas Ravichandran, Manmohan Chandraker
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh, Jonghyun Lee, Jooyoung Choi et al.
ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion
Sungmin Woo, Wonjoon Lee, Woo Jin Kim et al.
Unveiling the Unknown: Unleashing the Power of Unknown to Known in Open-Set Source-Free Domain Adaptation
Fuli Wan, Han Zhao, Xu Yang et al.
DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception
Kai Jiang, Jiaxing Huang, Weiying Xie et al.
Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory
Jonas Kälble, Sascha Wirges, Maxim Tatarchenko et al.
Better Regression Makes Better Test-time Adaptive 3D Object Detection
Jiakang Yuan, Bo Zhang, Kaixiong Gong et al.
VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition
Ahmad Khaliq, Ming Xu, Stephen Hausler et al.
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
Yuzhou Liu, Lingjie Zhu, Xiaodong Ma et al.
Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution
Mridul Khurana, Arka Daw, M. Maruf et al.
Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization
Lahav Lipson, Jia Deng
Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching
Matteo Bastico, Etienne Decencière, Laurent Corté et al.
AFreeCA: Annotation-Free Counting for All
Adriano DAlessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh
Prompt Augmentation for Self-supervised Text-guided Image Manipulation
Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim
Twice Class Bias Correction for Imbalanced Semi
supervised Learning
Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Tianqi Liu, Xinyi Ye, Min Shi et al.
Differentiable Product Quantization for Memory Efficient Camera Relocalization
Zakaria Laskar, Iaroslav Melekhov, Assia Benbihi et al.
Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation
Yan Wang, Chuan-Xian Ren, Yi-Ming Zhai et al.
Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving
Yixuan Fan, Ya-Li Li, Shengjin Wang
PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers
Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.
Multi-Level Cross-Modal Alignment for Image Clustering
Liping Qiu, Qin Zhang, Xiaojun Chen et al.
FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning
Jian Li, Yong Liu, Wei Wang et al.
Revisiting Sampson Approximations for Geometric Estimation Problems
Felix Rydell, Angelica Torres, Viktor Larsson
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts
Jianhao Li, Tianyu Sun, Zhongdao Wang et al.
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor
Han Liu, Siyang Zhao, Xiaotong Zhang et al.
Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance
Yuto Enyo, Ko Nishino
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Yang Luo, Zhineng Chen, Peng Zhou et al.
Fully Geometric Panoramic Localization
Junho Kim, Jiwon Jeong, Young Min Kim
LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Ye Yu, Fengxin Chen, Jun Yu et al.
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin et al.
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla et al.
OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Yuchen Pan, Junjun Jiang, Kui Jiang et al.
CMA: A Chromaticity Map Adapter for Robust Detection of Screen-Recapture Document Images
Changsheng Chen, Liangwei Lin, Yongqi Chen et al.
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions
Jiong WANG, Fengyu Yang, Bingliang Li et al.
3D-Aware Face Editing via Warping-Guided Latent Direction Learning
Yuhao Cheng, Zhuo Chen, Xingyu Ren et al.
SENCR: A Span Enhanced Two-Stage Network with Counterfactual Rethinking for Chinese NER
Hang Zheng, Qingsong Li, Shen Chen et al.
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Yuxiao Chen, Kai Li, Wentao Bao et al.
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sanghyun Jo, Fei Pan, In-Jae Yu et al.
Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis
Atefeh Khoshkhahtinat, Ali Zafari, Piyush Mehta et al.
Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images
Zhan Lu, Qian Zheng, Boxin Shi et al.
Self-Prompt Mechanism for Few-Shot Image Recognition
Mingchen Song, Huiqiang Wang, Guoqiang Zhong
Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments
Yinhua Piao, Sangseon Lee, Yijingxiu Lu et al.
Geometry Fidelity for Spherical Images
Anders Christensen, Nooshin Mojab, Khushman Patel et al.
Flexible Depth Completion for Sparse and Varying Point Densities
Jinhyung Park, Yu-Jhe Li, Kris Kitani
Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort
Jeeyung Kim, Ze Wang, Qiang Qiu
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli et al.
CNC-Net: Self-Supervised Learning for CNC Machining Operations
Mohsen Yavartanoo, Sangmin Hong, Reyhaneh Neshatavar et al.
Adaptive Multi-task Learning for Few-shot Object Detection
Yan Ren, Yanling Li, Wai-Kin Adams Kong
FairWASP: Fast and Optimal Fair Wasserstein Pre-processing
Zikai Xiong, Niccolo Dalmasso, Alan Mishler et al.
Operational Open-Set Recognition and PostMax Refinement
Steve Cruz, Ryan Rabinowitz, Manuel Günther et al.
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
Artemis Panagopoulou, Le Xue, Ning Yu et al.
CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering
Shaowei Wang, Lingling Zhang, Longji Zhu et al.
Epistemic Uncertainty Quantification For Pre-Trained Neural Networks
Hanjing Wang, Qiang Ji
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang, Ke Yu, Siqi Wu et al.
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
Hyunsoo Cha, Byungjun Kim, Hanbyul Joo
Towards Making Learnware Specification and Market Evolvable
Jian-Dong Liu, Zhi-Hao Tan, Zhi-Hua Zhou
Cross-view and Cross-pose Completion for 3D Human Understanding
Matthieu Armando, Salma Galaaoui, Fabien Baradel et al.
LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang
Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes
Gabriele Berton, Lorenz Junglas, Riccardo Zaccone et al.
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery
Zhengrui Chen, Liying Lu, Ziyang Yuan et al.
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
YoungJoon Yoo, Jongwon Choi
Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
Shengqi Xu, Run Sun, Yi Chang et al.
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
Yujun Chen, Xin Tan, Zhizhong Zhang et al.
In-Context Matting
He Guo, Zixuan Ye, Zhiguo Cao et al.
Mind Artist: Creating Artistic Snapshots with Human Thought
Jiaxuan Chen, Yu Qi, Yueming Wang et al.
Analyzing and Improving Optimal-Transport-based Adversarial Networks
Jaemoo Choi, Jaewoong Choi, Myungjoo Kang
Boosting Residual Networks with Group Knowledge
Shengji Tang, Peng Ye, Baopu Li et al.
Scores for Learning Discrete Causal Graphs with Unobserved Confounders
Alexis Bellot, Junzhe Zhang, Elias Bareinboim
Partial Label Learning with a Partner
Chongjie Si, Zekun Jiang, Xuehui Wang et al.
Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains
Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim et al.
Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition
Kyle Buettner, Sina Malakouti, Xiang Li et al.
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan, Chen Song, Youkang Kong et al.
Spatial Voting with Incomplete Voter Information
Aviram Imber, Jonas Israel, Markus Brill et al.
DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
Victor Quetu, Enzo Tartaglione
Alice Benchmarks: Connecting Real World Re-Identification with the Synthetic
Xiaoxiao Sun, Yue Yao, Shengjin Wang et al.
Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
Seongsu Ha, Chaeyun Kim, Donghwa Kim et al.