Most Cited AAAI "sam segmentation masks" Papers
5,317 papers found • Page 7 of 27
Conference
DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State Space Models
Haonan Yuan, Qingyun Sun, Zhaonan Wang et al.
Zero-Shot Low-Light Image Enhancement via Latent Diffusion Models
Yan Huang, Xiaoshan Liao, Jinxiu Liang et al.
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation
Yuntian Bo, Yazhou Zhu, Lunbo Li et al.
UFDA: Universal Federated Domain Adaptation with Practical Assumptions
Xinhui Liu, Zhenghao Chen, Luping Zhou et al.
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
Xiuli Bi, Jian Lu, Bo Liu et al.
Poincaré Differential Privacy for Hierarchy-Aware Graph Embedding
Yuecen Wei, Haonan Yuan, Xingcheng Fu et al.
Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Zirun Guo, Tao Jin, Wenlong Xu et al.
Temporal Correlation Vision Transformer for Video Person Re-Identification
Pengfei Wu, Le Wang, Sanping Zhou et al.
Selective Visual Prompting in Vision Mamba
Yifeng Yao, Zichen Liu, Zhenyu Cui et al.
KnowPO: Knowledge-Aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models
Ruizhe Zhang, Yongxin Xu, Yuzhen Xiao et al.
Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery
Pengwei Yan, Kaisong Song, Zhuoren Jiang et al.
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents
Congchi Yin, Feng Li, Shu Zhang et al.
I-rebalance: Personalized Vehicle Repositioning for Supply Demand Balance
Haoyang Chen, Peiyan Sun, Qiyuan Song et al.
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
Jiacheng Ruan, Wenzhen Yuan, Zehao Lin et al.
MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation
Nhat Hoang, Kehong Gong, Chuan Guo et al.
Relational Programming with Foundational Models
Ziyang Li, Jiani Huang, Jason Liu et al.
CR-SAM: Curvature Regularized Sharpness-Aware Minimization
Tao Wu, Tie Luo, Donald Wunsch
Stability in Online Coalition Formation
Authors: Martin Bullinger, René Romen
Automated Design of Affine Maximizer Mechanisms in Dynamic Settings
Michael Curry, Vinzenz Thoma, Darshan Chakrabarti et al.
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
Juncan Deng, Shuaiting Li, Zeyu Wang et al.
Dynamic Sub-graph Distillation for Robust Semi-supervised Continual Learning
Yan Fan, Yu Wang, Pengfei Zhu et al.
Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models
Zhiyao Ren, Yibing Zhan, Liang Ding et al.
ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Models
Yeji Park, Deokyeong Lee, Junsuk Choe et al.
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Chenglong Wang, Yang Gan, Yifu Huo et al.
DTMFormer: Dynamic Token Merging for Boosting Transformer-Based Medical Image Segmentation
Zhehao Wang, Xian Lin, Nannan Wu et al.
CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework
Rui Li, LiYang He, Qi Liu et al.
Segment beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation
Renjie Wu, Hu Wang, Feras Dayoub et al.
Iterative Token Evaluation and Refinement for Real-World Super-resolution
Authors: Chaofeng Chen, Shangchen Zhou, Liang Liao et al.
Tuning-Free Accountable Intervention for LLM Deployment – a Metacognitive Approach
Zhen Tan, Jie Peng, Song Wang et al.
Improving Generalization for AI-Synthesized Voice Detection
Hainan Ren, Li Lin, Chun-Hao Liu et al.
DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition
Sijie Wang, Rui She, Qiyu Kang et al.
MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition
Philippe Pasquier, Jeff Ens, Nathan Fradet et al.
FD3D: Exploiting Foreground Depth Map for Feature-Supervised Monocular 3D Object Detection
Zizhang Wu, Yuanzhu Gan, Yunzhe Wu et al.
Privacy-Preserving Low-Rank Adaptation Against Membership Inference Attacks for Latent Diffusion Models
Zihao Luo, Xilie Xu, Feng Liu et al.
Learning Spatially Collaged Fourier Bases for Implicit Neural Representation
Jason Chun Lok Li, Chang Liu, Binxiao Huang et al.
Diff-Shadow: Global-guided Diffusion Model for Shadow Removal
Jinting Luo, Ru Li, Chengzhi Jiang et al.
Understanding Emotional Body Expressions via Large Language Models
Haifeng Lu, Jiuyi Chen, Feng Liang et al.
Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold
Alireza Ganjdanesh, Shangqian Gao, Hirad Alipanah et al.
FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing
Yongzhe Jia, Xuyun Zhang, Amin Beheshti et al.
The Complexity of Fair Division of Indivisible Items with Externalities
Argyrios Deligkas, Eduard Eiben, Viktoriia Korchemna et al.
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation
Haotian Peng, Jiawei Liu, Jinsong Du et al.
ViSTec: Video Modeling for Sports Technique Recognition and Tactical Analysis
Yuchen He, Zeqing Yuan, Yihong Wu et al.
LINGO-Space: Language-Conditioned Incremental Grounding for Space
Dohyun Kim, Nayoung Oh, Deokmin Hwang et al.
Global Graph Propagation with Hierarchical Information Transfer for Incomplete Contrastive Multi-view Clustering
Guoqing Chao, Kaixin Xu, Xijiong Xie et al.
MPTSNet: Integrating Multiscale Periodic Local Patterns and Global Dependencies for Multivariate Time Series Classification
Yang Mu, Muhammad Shahzad, Xiao Xiang Zhu
A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling
Ye Wang, Huazheng Pan, Tao Zhang et al.
Non-parametric Representation Learning with Kernels
Hebaixu Wang, Meiqi Gong, Xiaoguang Mei et al.
Let There Be Sound: Reconstructing High Quality Speech from Silent Videos
Ji-Hoon Kim, Jaehun Kim, Joon Son Chung
Knowledge Graph Error Detection with Contrastive Confidence Adaption
Xiangyu Liu, Yang Liu, Wei Hu
Importance Weighting Can Help Large Language Models Self-Improve
Chunyang Jiang, Chi-Min Chan, Wei Xue et al.
FedLF: Layer-Wise Fair Federated Learning
Zibin Pan, Chi Li, Fangchen Yu et al.
Conformalized Interval Arithmetic with Symmetric Calibration
Rui Luo, Zhixin Zhou
Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective
Minh Le, Tien Ngoc Luu, An Nguyen The et al.
Enhancing Non-English Capabilities of English-Centric Large Language Models Through Deep Supervision Fine-Tuning
Wenshuai Huo, Xiaocheng Feng, Yichong Huang et al.
Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Shirley Anugrah Hayati, Taehee Jung, Tristan Bodding-Long et al.
Temporal Fair Division
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
Cycle Self-Refinement for Multi-Source Domain Adaptation
Chaoyang Zhou, Zengmao Wang, Bo Du et al.
Detect Any Keypoints: An Efficient Light-Weight Few-Shot Keypoint Detector
Changsheng Lu, Piotr Koniusz
Unraveling Batch Normalization for Realistic Test-Time Adaptation
Zixian Su, Jingwei Guo, Kai Yao et al.
Union Subgraph Neural Networks
Jiaxing Xu, Aihu Zhang, Qingtian Bian et al.
1497 Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval
Long Tang, Dengpan Ye, Yunna Lv et al.
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling, Zhihai Wang, Jie Wang
UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution
Gengrui Zhang, Xiaoshuang Chen, Yao WANG et al.
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
Chenglong Wang, Hang Zhou, Yimin Hu et al.
Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images
Bao Li, Zhenyu Liu, Lizhi Shao et al.
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
Knowledge-Aware Parameter Coaching for Personalized Federated Learning
Mingjian Zhi, Yuanguo Bi, Wenchao Xu et al.
ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context
Chenxiao Wu, Ke Wenjun, Peng Wang et al.
From Words to Worth: Newborn Article Impact Prediction with LLM
Penghai Zhao, Qinghua Xing, Kairan Dou et al.
Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention
Xin Yang, Wending Yan, Yuan Yuan et al.
Accurate and Regret-Aware Numerical Problem Solver for Tabular Question Answering
Yuxiang Wang, Jianzhong Qi, Junhao Gan
Open-Set Facial Expression Recognition
Yuhang Zhang, Yue Yao, Xuannan Liu et al.
One Node One Model: Featuring the Missing-Half for Graph Clustering
Xuanting Xie, Bingheng Li, Erlin Pan et al.
Hierarchical Mixture of Experts: Generalizable Learning for High-Level Synthesis
Weikai Li, Ding Wang, Zijian Ding et al.
Test-Time Personalization with Meta Prompt for Gaze Estimation
Huan Liu, Julia Qi, Zhenhao Li et al.
StyO: Stylize Your Face in Only One-Shot
Bonan Li, Zicheng Zhang, Xuecheng Nie et al.
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control
Bingliang Li, Fengyu Yang, Yuxin Mao et al.
SMamba: Sparse Mamba for Event-based Object Detection
Nan Yang, Yang Wang, Zhanwen Liu et al.
SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on Twitter
Ying-Ying Chang, Wei-Yao Wang, Wen-Chih Peng
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer
Linglin Jing, Ying Xue, Xu Yan et al.
Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study
Qiyu Kang, Kai Zhao, Yang Song et al.
Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization
Kaan Gokcesu, Hakan Gökcesu
Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility
Fang Kong, Shuai Li
Identifying Macro Conditional Independencies and Macro Total Effects in Summary Causal Graphs with Latent Confounding
Simon Ferreira, Charles K. Assaad
BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining
Minjun Kim, SeungWoo Song, Youhan Lee et al.
Empowering CAM-Based Methods with Capability to Generate Fine-Grained and High-Faithfulness Explanations
Changqing Qiu, Fusheng Jin, Yining Zhang
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
Geigh Zollicoffer, Minh N. Vu, Ben Nebgen et al.
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo
Hongjie Li, Yao Guo, Xianwei Zheng et al.
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
Shunxin Chen, Ajian Liu, Junze Zheng et al.
Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought
Li Zheng, Hao Fei, Fei Li et al.
A Sequentially Fair Mechanism for Multiple Sensitive Attributes
Francois HU, Philipp Ratz, Arthur Charpentier
PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus
Florian Kluger, Bodo Rosenhahn
Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling
Hanyang Kong, Xingyi Yang, Xinchao Wang
Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron Classification
Minghui Liao, Guojia Wan, Bo Du
Radiology Report Generation via Multi-objective Preference Optimization
Ting Xiao, Lei Shi, Peng Liu et al.
Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning
Yang Jiao, Zequn Jie, Shaoxiang Chen et al.
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder
Jungho Kim, Changwon Kang, Dongyoung Lee et al.
Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model
Haozhen Zhang, Haodong Yue, Xi Xiao et al.
How to Use the Metropolis Algorithm for Multi-Objective Optimization?
Weijie Zheng, Mingfeng Li, Renzhong Deng et al.
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Mengyang Wu, Yuzhi Zhao, Jialun Cao et al.
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion
Jingyuan Chen, Fuchen Long, Jie An et al.
FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning
Zhonghua Jiang, Jimin Xu, Shengyu Zhang et al.
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang, Tianheng Cheng, Shusheng Yang et al.
GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians
Xiaobao Wei, Peng Chen, Ming Lu et al.
Domain Generalization with Vital Phase Augmentation
Ingyun Lee, WooJu Lee, Hyun Myung
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views
Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.
Incomplete Multi-view Clustering via Diffusion Contrastive Generation
Yuanyang Zhang, Yijie Lin, Weiqing Yan et al.
Adversarial Purification with the Manifold Hypothesis
Zhaoyuan Yang, Zhiwei Xu, Jing Zhang et al.
Contributing Dimension Structure of Deep Feature for Coreset Selection
Zhijing Wan, Zhixiang Wang, Yuran Wang et al.
Scalable Surrogate Verification of Image-Based Neural Network Control Systems Using Composition and Unrolling
Feiyang Cai, Chuchu Fan, Stanley Bak
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian, Chenxu Zhao, Yangyi Li et al.
FlexiTex: Enhancing Texture Generation via Visual Guidance
Dadong Jiang, Xianghui Yang, Zibo Zhao et al.
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain, Vaibhav Unhelkar
UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning
Meiqi Sun, Zhonghan Zhao, Wenhao Chai et al.
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.
M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images
Hongyi Wang, Xiuju Du, Jing Liu et al.
ADBA: Approximation Decision Boundary Approach for Black-Box Adversarial Attacks
Feiyang Wang, Xingquan Zuo, Hai Huang et al.
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement
Renyuan Peng, Xinyue Cai, Hang Xu et al.
Modeling Inter-Intra Heterogeneity for Graph Federated Learning
Wentao Yu, Shuo Chen, Yongxin Tong et al.
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang, Pengfei Liu, Min Zhou et al.
Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation
Changshuo Wang, Shuting He, Xiang Fang et al.
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
Kewei Wang, Yizheng Wu, Zhiyu Pan et al.
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
Shiqi Huang, Shuting He, Bihan Wen
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance
Taicai Chen, Yue Duan, Dong Li et al.
Cross-Modal Match for Language Conditioned 3D Object Grounding
Yachao Zhang, Runze Hu, Ronghui Li et al.
Beyond Federated Prototype Learning: Learnable Semantic Anchors with Hyperspherical Contrast for Domain-Skewed Data
Lele Fu, Sheng Huang, Yanyi Lai et al.
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
Enhancing Low-Resource Relation Representations through Multi-View Decoupling
LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies
Ameer Hamza, Abdullah, Yong Hyun Ahn et al.
Simplifying Complex Observation Models in Continuous POMDP Planning with Probabilistic Guarantees and Practice
Idan Lev-Yehudi, Moran Barenboim, Vadim Indelman
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Hao Liu, Xin Li, Mingming Gong et al.
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans
Heng Guo, Jianfeng Zhang, Jiaxing Huang et al.
Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation
Chenyang Wang, Junjun Jiang, Kui Jiang et al.
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver
Diandian Guo, Weixin Si, Zhixi Li et al.
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong, Lujun Li, Zhenheng Tang et al.
Multi-View Dynamic Reflection Prior for Video Glass Surface Detection
Fang Liu, Yuhao Liu, Jiaying Lin et al.
Label-Free Backdoor Attacks in Vertical Federated Learning
Wei Shen, Wenke Huang, Guancheng Wan et al.
FACL-Attack: Frequency-Aware Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon
Federated Foundation Models on Heterogeneous Time Series
Shengchao Chen, Guodong Long, Jing Jiang et al.
patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds
Authors: Zirui Pan, Mengbai Xiao, Xu Han et al.
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB
Shengheng Liu, Xingkang Li, Zihuan Mao et al.
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning
Hang Du, Xuejun Yan, Jingjing Wang et al.
B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation
Hao Wang, Qiang Song, Ruofeng Yin et al.
Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference
Hongda Sun, Hongzhan Lin, Rui Yan
Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective
Kaifang Long, Guoyang Xie, Lianbo Ma et al.
Confidence Estimation for Error Detection in Text-to-SQL Systems
Oleg Somov, Elena Tutubalina
Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning
Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.
Few-Shot Neural Radiance Fields under Unconstrained Illumination
SeokYeong Lee, JunYong Choi, Seungryong Kim et al.
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang, Junliang He, Pengyu Wang et al.
Self-Explainable Graph Transformer for Link Sign Prediction
Lu Li, Jiale Liu, Xingyu Ji et al.
PowerMLP: An Efficient Version of KAN
Ruichen Qiu, Yibo Miao, Shiwen Wang et al.
Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia et al.
VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression
Won Jo, Geuntaek Lim, Gwangjin Lee et al.
Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion
Yan Rong, Li Liu
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
Chun-Mei Feng, Yang Bai, Tao Luo et al.
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
Huatian Zhang, Lei Zhang, Kun Zhang et al.
RG-GAN: Dynamic Regenerative Pruning for Data-Efficient Generative Adversarial Networks
Divya Saxena, Jiannong Cao, Jiahao Xu et al.
Exact ASP Counting with Compact Encodings
Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel
Knowledge Graph Completion with Relation-Aware Anchor Enhancement
Duanyang Yuan, Sihang Zhou, Xiaoshu Chen et al.
Debiased Novel Category Discovering and Localization
Juexiao Feng, Yuhong Yang, Yanchun Xie et al.
Detection and Defense of Unlearnable Examples
Yifan Zhu, lijia Yu, Xiao-Shan Gao
CatFormer: Category-Level 6D Object Pose Estimation with Transformer
Sheng Yu, Dihua Zhai, Yuanqing Xia
DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval
Yating Liu, Zimo Liu, Xiangyuan Lan et al.
Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency
Fuhao Li, Ziyang Gong, Yupeng Deng et al.
Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization:
Peijun Bao, Zihao Shao, Wenhan Yang et al.
Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
Haoran Lian, Yizhe Xiong, Jianwei Niu et al.
OctOcc: High-Resolution 3D Occupancy Prediction with Octree
Wenzhe Ouyang, Xiaolin Song, Bailan Feng et al.
Symmetric Self-Paced Learning for Domain Generalization
Di Zhao, Yun Sing Koh, Gillian Dobbie et al.
Human and AI Perceptual Differences in Image Classification Errors
Minghao Liu, Jiaheng Wei, Yang Liu et al.
Curved Representation Space of Vision Transformers
Juyeop Kim, Junha Park, Songkuk Kim et al.
SlerpFace: Face Template Protection via Spherical Linear Interpolation
Zhizhou Zhong, Yuxi Mi, Yuge Huang et al.
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge
Seong-Tae Kim, Hyungil Kim, Y. Ro
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Han He, Qianchu Liu, Lei Xu et al.
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
Fan Yang, Hui Chen, Yuwei He et al.
Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks
Fuzhi Wu, Jiasong Wu, Youyong Kong et al.
Causal Inference over Visual-Semantic-Aligned Graph for Image Classification
Lei Meng, Xiangxian Li, Xiaoshuo Yan et al.
Early Concept Drift Detection via Prediction Uncertainty
Pengqian Lu, Jie Lu, Anjin Liu et al.
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.
Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie et al.
Combinatorial Stochastic-Greedy Bandit
Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini et al.
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera
Chengxu Liu, Xuan Wang, Yuanting Fan et al.
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.
Maximizing Nash Social Welfare under Two-Sided Preferences
Pallavi Jain, Rohit Vaish
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA
Chengen Lai, Shengli Song, Shiqi Meng et al.
Multi-Domain Recommendation to Attract Users via Domain Preference Modeling
Hyunjun Ju, SeongKu Kang, Dongha Lee et al.
Colour Passing Revisited: Lifted Model Construction with Commutative Factors
Malte Luttermann, Tanya Braun, Ralf Möller et al.
FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting
Yulong Wang, Yushuo Liu, Xiaoyi Duan et al.
Self-Training Based Few-Shot Node Classification by Knowledge Distillation
Zongqian Wu, Yujie Mo, Peng Zhou et al.
De-biased Attention Supervision for Text Classification with Causality
Yiquan Wu, Yifei Liu, Ziyu Zhao et al.
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
Yujie Chen, Jiangyan Yi, Cunhang Fan et al.
Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Jiaqi Chen, Xiaoye Zhu, Tianyang Liu et al.
Federated Graph Condensation with Information Bottleneck Principles
Bo Yan, Sihao He, Cheng Yang et al.
2043 Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding
Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.
Towards Optimal Subsidy Bounds for Envy-Freeable Allocations
Yasushi Kawase, Kazuhisa Makino, Hanna Sumita et al.
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval
Zexuan Qiu, Jiahong Liu, Yankai Chen et al.
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach
Kangli Wang, Wei Gao