Most Cited AAAI "semantic mismatch" Papers
B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation
Hao Wang, Qiang Song, Ruofeng Yin et al.
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans
Heng Guo, Jianfeng Zhang, Jiaxing Huang et al.
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho
Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia et al.
Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding
Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
Kewei Wang, Yizheng Wu, Zhiyu Pan et al.
Cross-Modal Match for Language Conditioned 3D Object Grounding
Yachao Zhang, Runze Hu, Ronghui Li et al.
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He, Kai Li, Yifan Zang et al.
Multi-Domain Recommendation to Attract Users via Domain Preference Modeling
Hyunjun Ju, SeongKu Kang, Dongha Lee et al.
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
Colour Passing Revisited: Lifted Model Construction with Commutative Factors
Malte Luttermann, Tanya Braun, Ralf Möller et al.
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge
Seong-Tae Kim, Hyungil Kim, Y. Ro
Delivering Inflated Explanations
Yacine Izza, Alexey Ignatiev, Peter Stuckey et al.
Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective
Kaifang Long, Guoyang Xie, Lianbo Ma et al.
Domain Generalization with Vital Phase Augmentation
Ingyun Lee, WooJu Lee, Hyun Myung
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
Haoran Ye, Yuhang Xie, Yuanyi Ren et al.
Symmetric Self-Paced Learning for Domain Generalization
Di Zhao, Yun Sing Koh, Gillian Dobbie et al.
Multi-View Dynamic Reflection Prior for Video Glass Surface Detection
Fang Liu, Yuhao Liu, Jiaying Lin et al.
Completing Priceable Committees: Utilitarian and Representation Guarantees for Proportional Multiwinner Voting
Markus Brill, Jannik Peters
Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks
Fuzhi Wu, Jiasong Wu, Youyong Kong et al.
DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval
Yating Liu, Zimo Liu, Xiangyuan Lan et al.
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
Causal Inference over Visual-Semantic-Aligned Graph for Image Classification
Lei Meng, Xiangxian Li, Xiaoshuo Yan et al.
Towards Optimal Subsidy Bounds for Envy-Freeable Allocations
Yasushi Kawase, Kazuhisa Makino, Hanna Sumita et al.
Unraveling Batch Normalization for Realistic Test-Time Adaptation
Zixian Su, Jingwei Guo, Kai Yao et al.
SlerpFace: Face Template Protection via Spherical Linear Interpolation
Zhizhou Zhong, Yuxi Mi, Yuge Huang et al.
Exact ASP Counting with Compact Encodings
Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel
Label-Free Backdoor Attacks in Vertical Federated Learning
Wei Shen, Wenke Huang, Guancheng Wan et al.
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
Huatian Zhang, Lei Zhang, Kun Zhang et al.
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation
Ling-An Zeng, Guohong Huang, Gaojie Wu et al.
Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility
Fang Kong, Shuai Li
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning
Hang Du, Xuejun Yan, Jingjing Wang et al.
GRPose: Learning Graph Relations for Human Image Generation with Pose Priors
Xiangchen Yin, Donglin Di, Lei Fan et al.
How to Use the Metropolis Algorithm for Multi-Objective Optimization?
Weijie Zheng, Mingfeng Li, Renzhong Deng et al.
CatFormer: Category-Level 6D Object Pose Estimation with Transformer
Sheng Yu, Dihua Zhai, Yuanqing Xia
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo
Hongjie Li, Yao Guo, Xianwei Zheng et al.
Confidence Estimation for Error Detection in Text-to-SQL Systems
Oleg Somov, Elena Tutubalina
Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought
Li Zheng, Hao Fei, Fei Li et al.
VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression
Won Jo, Geuntaek Lim, Gwangjin Lee et al.
Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization
Kaan Gokcesu, Hakan Gökcesu
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
Fan Yang, Hui Chen, Yuwei He et al.
Live and Learn: Continual Action Clustering with Incremental Views
Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.
Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie et al.
High-Fidelity Diffusion-Based Image Editing
Chen Hou, Guoqiang Wei, Zhibo Chen
Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis
Francesco Kriegel
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Wenze Chen, Shiyu Huang, Yuan Chiang et al.
GENTEEL-NEGOTIATOR: LLM-Enhanced Mixture-of-Expert-Based Reinforcement Learning Approach for Polite Negotiation Dialogue
Priyanshu Priya, Rishikant Chigrupaatii, Mauajama Firdaus et al.
Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior
Youngjae Cho, HeeSun Bae, Seungjae Shin et al.
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them
Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi et al.
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning
Hai-Ming Xu, Qi Chen, Lei Wang et al.
KPL: Training-Free Medical Knowledge Mining of Vision-Language Models
Jiaxiang Liu, Tianxiang Hu, Jiawei Du et al.
Semi-Supervised Multi-View Multi-Label Learning with View-Specific Transformer and Enhanced Pseudo-Label
Quanjiang Li, Tingjin Luo, Mingdie Jiang et al.
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Mengyang Wu, Yuzhi Zhao, Jialun Cao et al.
REGLO: Provable Neural Network Repair for Global Robustness Properties
Feisi Fu, Zhilu Wang, Weichao Zhou et al.
Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images
Ruohua Shi, Lingyu Duan, Tiejun Huang et al.
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos
Shankhanil Mitra, Rajiv Soundararajan
Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
Mohammad Rafid Ul Islam, Prasad Tadepalli, Alan Fern
LIBA: Language Instructed Multi-granularity Bridge Assistant for 3D Visual Grounding
Yuan Wang, Ya-Li Li, Eastman Z. Y. Wu et al.
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang, Junliang He, Pengyu Wang et al.
Pre-Training Graph Neural Networks on Molecules by Using Subgraph-Conditioned Graph Information Bottleneck
Van Thuy Hoang, O-Joun Lee
Learning Robust Rationales for Model Explainability: A Guidance-Based Approach
Shuaibo Hu, Kui Yu
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang, Caroline Wang, Xuesu Xiao et al.
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Han He, Qianchu Liu, Lei Xu et al.
A Primal-Dual Algorithm for Hybrid Federated Learning
Tom Overman, Garrett Blum, Diego Klabjan
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang, Pengfei Liu, Min Zhou et al.
Axiomatic Aggregations of Abductive Explanations
Gagan Biradar, Yacine Izza, Elita Lobo et al.
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
Shiqi Huang, Shuting He, Bihan Wen
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB
Shengheng Liu, Xingkang Li, Zihuan Mao et al.
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
Yujie Chen, Jiangyan Yi, Cunhang Fan et al.
Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from Neighbors
Xiao Teng, Long Lan, Dingyao Chen et al.
TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification
Rui Song, Fausto Giunchiglia, Yingji Li et al.
Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views
Shuai Guo, Qiuwen Wang, Yijie Gao et al.
A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis
Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee
WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection
Yan Hong, Jianming Feng, Haoxing Chen et al.
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
Bang Yang, Yong Dai, Xuxin Cheng et al.
DIUSum: Dynamic Image Utilization for Multimodal Summarization
Min Xiao, Junnan Zhu, Feifei Zhai et al.
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
Xiankang He, Guangkai Xu, Bo Zhang et al.
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera
Chengxu Liu, Xuan Wang, Yuanting Fan et al.
Learning Subject-Aware Cropping by Outpainting Professional Photos
James Hong, Lu Yuan, Michaël Gharbi et al.
Context Enhanced Transformer for Single Image Object Detection in Video Data
Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.
Multi-Focus Image Fusion via Explicit Defocus Blur Modelling
Yuhui Quan, Xi Wan, Zitao Tang et al.
Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction
Kangkang Lu, Yanhua Yu, Hao Fei et al.
Link Prediction in Multilayer Networks via Cross-Network Embedding
Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.
Constrained Fair and Efficient Allocations
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation
Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
Jiaxiang Gou, Luping Ji, Pei Liu et al.
Accurate and Regret-Aware Numerical Problem Solver for Tabular Question Answering
Yuxiang Wang, Jianzhong Qi, Junhao Gan
ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision
Ke Xu, Tsun Wai Siu, Rynson W.H. Lau
Intrinsic Phase-Preserving Networks for Depth Super Resolution
Xuanhong Chen, Hang Wang, Jinfan Liu et al.
DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models
Yitian Liu, Zhouhui Lian
Comprehensive View Embedding Learning for Single-Cell Multimodal Integration
Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network
Xiang Fang, Wanlong Fang, Changshuo Wang et al.
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.
Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
Ancong Wu, Wei-shi Zheng
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA
Chengen Lai, Shengli Song, Shiqi Meng et al.
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
Hantao Yang, Xutong Liu, Zhiyong Wang et al.
Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study
Qiyu Kang, Kai Zhao, Yang Song et al.
All-in-One: Transferring Vision Foundation Models into Stereo Matching
Jingyi Zhou, Haoyu Zhang, Jiakang Yuan et al.
Learning Visual Abstract Reasoning through Dual-Stream Networks
Kai Zhao, Chang Xu, Bailu Si
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu, Zhaoheng Huang, Zhicheng Dou et al.
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
Yiyuan Liang, Zhiying Yan, Liqun Chen et al.
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu, Di Huang, Wenxuan Shi et al.
GLDL: Graph Label Distribution Learning
Yufei Jin, Richard Gao, Yi He et al.
Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning
Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.
PowerMLP: An Efficient Version of KAN
Ruichen Qiu, Yibo Miao, Shiwen Wang et al.
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang, Jiaze Wang, Jinpeng Li et al.
Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets
Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception
Bohan Li, Yasheng Sun, Jingxin Dong et al.
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black, Jing Shi, Yifei Fan et al.
MegActor-Sigma: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer
Shurong Yang, Huadong Li, Juhao Wu et al.
Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization
Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Yirui Chen, Xudong Huang, Quan Zhang et al.
MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition
Yang Yang, Xunde Dong, Yupeng Qiang
TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings
Alexander Shabalin, Viacheslav Meshchaninov, Egor Chimbulatov et al.
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
Ji Soo Lee, Jongha Kim, Jeehye Na et al.
Motion-adaptive Transformer for Event-based Image Deblurring
Senyan Xu, Zhijing Sun, Mingchen Zhong et al.
SIG: Speaker Identification in Literature via Prompt-Based Generation
Zhenlin Su, Liyan Xu, Jin Xu et al.
GNS: Solving Plane Geometry Problems by Neural-Symbolic Reasoning with Multi-Modal LLMs
Maizhen Ning, Zihao Zhou, Qiufeng Wang et al.
Regret Analysis of Repeated Delegated Choice
Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.
Learning Diverse Risk Preferences in Population-Based Self-Play
Yuhua Jiang, Qihan Liu, Xiaoteng Ma et al.
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
Zongkai Liu, Qian Lin, Chao Yu et al.
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance
Muhammad Reza Qorib, Qisheng Hu, Hwee Tou Ng
TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series Analysis
Jiexi Liu, Meng Cao, Songcan Chen
BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling
Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham et al.
Large-Scale Multi-Robot Coverage Path Planning via Local Search
Jingtao Tang, Hang Ma
Deep Evidential Hashing for Trustworthy Cross-Modal Retrieval
Yuan Li, Liangli Zhen, Yuan Sun et al.
Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations
Junpeng Fang, Gongduo Zhang, Qing Cui et al.
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
Lebin Yu, Yunbo Qiu, Quanming Yao et al.
CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning
Qiwei Li, Jiahuan Zhou
Adaptive Draft-Verification for Efficient Large Language Model Decoding
Xukun Liu, Bowen Lei, Ruqi Zhang et al.
Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification
Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.
Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
Haoran Lian, Yizhe Xiong, Jianwei Niu et al.
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Jiahao Chen, Zhou Feng, Rui Zeng et al.
Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence Representation
Zhengyi Li, Menglu Li, Lida Zhu et al.
IPRemover: A Generative Model Inversion Attack against Deep Neural Network Fingerprinting and Watermarking
Wei Zong, Yang-Wai Chow, Willy Susilo et al.
Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification
Shichen Li, Zhongqing Wang, Zheyu Zhao et al.
CatmullRom Splines-Based Regression for Image Forgery Localization
Li Zhang, Mingliang Xu, Dong Li et al.
Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning
Joseph Giovanelli, Alexander Tornede, Tanja Tornede et al.
Dehaze-RetinexGAN: Real-World Image Dehazing via Retinex-based Generative Adversarial Network
Xinran Wang, Guang Yang, Tian Ye et al.
An Item Is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng, Weikang Qiu, Jinbin Bai et al.
OpenViewer: Openness-Aware Multi-View Learning
Shide Du, Zihan Fang, Yanchao Tan et al.
Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments
Marharyta Domnich, Julius Välja, Rasmus Moorits Veski et al.
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen, Ning Liu, Yichen Zhu et al.
1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations
Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.
Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection
Chenxu Wang, Chunyan Xu, Xiang Li et al.
RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images
Benzhi Wang, Jingkai Zhou, Jingqi Bai et al.
Improved Anonymous Multi Agent Path Finding Algorithm
Zain Alabedeen Ali, Konstantin Yakovlev
Text2City: One-Stage Text-Driven Urban Layout Regeneration
Yiming Qin, Nanxuan Zhao, Bin Sheng et al.
Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning
Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.
Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning
Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.
Unsupervised Audio-Visual Segmentation with Modality Alignment
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia et al.
Colorizing Monochromatic Radiance Fields
Yean Cheng, Renjie Wan, Shuchen Weng et al.
Robust 3D Tracking with Quality-Aware Shape Completion
Jingwen Zhang, Zikun Zhou, Guangming Lu et al.
EWMoE: An Effective Model for Global Weather Forecasting with Mixture-of-Experts
Lihao Gan, Xin Man, Chenghong Zhang et al.
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob Hollenstein, Georg Martius, Justus Piater
Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding
Hongzhi Zang, Yulun Zhang, He Jiang et al.
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain
Xiang He, Dongcheng Zhao, Yang Li et al.
Neural Reasoning about Agents’ Goals, Preferences, and Actions
Matteo Bortoletto, Lei Shi, Andreas Bulling
Hierarchical Aligned Multimodal Learning for NER on Tweet Posts
Peipei Liu, Hong Li, Yimo Ren et al.
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process
Mingjie Xu, Feng Lu
(Almost Full) EFX for Three (and More) Types of Agents
Pratik Ghosal, Vishwa Prakash HV, Prajakta Nimbhorkar et al.
Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding
Depeng Li, Tianqi Wang, Junwei Chen et al.
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile
Seokjun Lee, Seung-Won Jung, Hyunseok Seo
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
Geigh Zollicoffer, Minh N. Vu, Ben Nebgen et al.
Asynchronous Federated Clustering with Unknown Number of Clusters
Yunfan Zhang, Yiqun Zhang, Yang Lu et al.
Knowledge Enhanced Representation Learning for Drug Discovery
Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang, Yingpeng Du, Zhu Sun et al.
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach
Kangli Wang, Wei Gao
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
Honglei Miao, Fan Ma, Ruijie Quan et al.
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang, Ziluo Ding, Zongqing Lu
MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context
Shuai Lyu, Rongchen Zhang, Zeqi Ma et al.
Improved Metric Distortion via Threshold Approvals
Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.
Residual Hyperbolic Graph Convolution Networks
Yangkai Xue, Jindou Dai, Zhipeng Lu et al.
Robust Policy Learning via Offline Skill Diffusion
Woo Kyung Kim, Minjong Yoo, Honguk Woo
Multi-Granular Multimodal Clue Fusion for Meme Understanding
Li Zheng, Hao Fei, Ting Dai et al.
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Junkai Fan, Kun Wang, Zhiqiang Yan et al.
HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation
Tengfei Liu, Jiapu Wang, Yongli Hu et al.
Reachability of Fair Allocations via Sequential Exchanges
Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.
Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
YongJin Yang, Taehyeon Kim, Se-Young Yun
Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification
Wenbo Dai, Lijing Lu, Zhihang Li
SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly
Liuwan Zhu, Rui Ning, Jiang Li et al.
Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need
Jiaxuan Chen, Yu Qi, Yueming Wang et al.
VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things
Yaoyao Zhong, Mengshi Qi, Rui Wang et al.
MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification
Jimin Park, AHyun Ji, Minji Park et al.
Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems
Junyi Ye, Jingyi Gu, Xinyun Zhao et al.
A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning
Xiaoming Hu, Zilei Wang
Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis
Chengzhi Liu, Zile Huang, Zhe Chen et al.
DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors
Xiaze Zhang, Ziheng Ding, Qi Jing et al.
IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers
Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.
QPEN: Quantum Projection and Quantum Entanglement Enhanced Network for Cross-Lingual Aspect-Based Sentiment Analysis
Xingqiang Zhao, Hai Wan, Kunxun Qi
Offline-to-Online Hyperparameter Transfer for Stochastic Bandits
Dravyansh Sharma, Arun Suggala
LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba
Yubo Cui, Zhiheng Li, Jiaqiang Wang et al.
Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution
Qing Cai, Mu Li, Dongwei Ren et al.
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.
Expressive Forecasting of 3D Whole-Body Human Motions
Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen
Mitigating Social Bias in Large Language Models: A Multi-Objective Approach Within a Multi-Agent Framework
Zhenjie Xu, Wenqing Chen, Yi Tang et al.
Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images
Yihui Li, Chengxin Lv, Hongyu Yang et al.
Feature Denoising Diffusion Model for Blind Image Quality Assessment
Xudong Li, Yan Zhang, Yunhang Shen et al.