Most Cited AAAI "self-instruction generation" Papers
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen, Behzad Shayegh, Chenyang Huang et al.
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach
Kangli Wang, Wei Gao
MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models
Yujing Wang, Hainan Zhang, Liang Pang et al.
A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning
Xiaoming Hu, Zilei Wang
Knowledge Enhanced Representation Learning for Drug Discovery
Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection
Zining Chen, Xingshuang Luo, Weiqiu Wang et al.
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
Lei Tang, Jinghui Qin, Wenxuan Ye et al.
MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities
Kunxi Li, Tianyu Zhan, Kairui Fu et al.
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang, Yingpeng Du, Zhu Sun et al.
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance
Muhammad Reza Qorib, Qisheng Hu, Hwee Tou Ng
QPEN: Quantum Projection and Quantum Entanglement Enhanced Network for Cross-Lingual Aspect-Based Sentiment Analysis
Xingqiang Zhao, Hai Wan, Kunxun Qi
Alleviate and Mining: Rethinking Unsupervised Domain Adaptation for Mitochondria Segmentation from Pseudo-Label Perspective
Yujia Chen, Rui Sun, Wangkai Li et al.
Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification
Wenbo Dai, Lijing Lu, Zhihang Li
Deep Evidential Hashing for Trustworthy Cross-Modal Retrieval
Yuan Li, Liangli Zhen, Yuan Sun et al.
GNS: Solving Plane Geometry Problems by Neural-Symbolic Reasoning with Multi-Modal LLMs
Maizhen Ning, Zihao Zhou, Qiufeng Wang et al.
Regret Analysis of Repeated Delegated Choice
Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.
On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles
Marcus Gozon, Jingjin Yu
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph
Xujian Liang, Zhaoquan Gu
SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering
Xiaopeng Li, Shasha Li, Shezheng Song et al.
DanceFix: An Exploration in Group Dance Neatness Assessment Through Fixing Abnormal Challenges of Human Pose
Huangbiao Xu, Xiao Ke, Huanqi Wu et al.
DAG-Aware Variational Autoencoder for Social Propagation Graph Generation
Dongpeng Hou, Chao Gao, Xuelong Li et al.
COLUMBUS: Evaluating COgnitive Lateral Understanding Through Multiple-Choice reBUSes
Koen Kraaijveld, Yifan Jiang, Kaixin Ma et al.
Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction
Cheng Xu, Fei Hou, Wencheng Wang et al.
Neural Amortized Inference for Nested Multi-Agent Reasoning
Kunal Jha, Tuan Anh Le, Chuanyang Jin et al.
Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks
Yiyi Chen, Russa Biswas, Heather Lent et al.
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
Li Mi, Syrielle Montariol, Javiera Castillo Navarro et al.
Lyapunov-Stable Deep Equilibrium Models
Haoyu Chu, Shikui Wei, Ting Liu et al.
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj et al.
Color Event Enhanced Single-Exposure HDR Imaging
Mengyao Cui, Zhigang Wang, Dong Wang et al.
Factor Augmented Tensor-on-Tensor Neural Networks
Guanhao Zhou, Yuefeng Han, Xiufan Yu
CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception
Senkang Hu, Yihang Tao, Guowen Xu et al.
Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components
Tengxue Zhang, Yang Shu, Xinyang Chen et al.
H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer
Yanru Wu, Jianning Wang, Weida Wang et al.
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation
Yuxuan Wang, Yijun Liu, Fei Yu et al.
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
Fan Yang, Wei Li, Menglong Yang et al.
Towards the Disappearing Truth: Fine-Grained Joint Causal Influences Learning with Hidden Variable-Driven Causal Hypergraphs
Kun Zhu, Chunhui Zhao
Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression
Xuanlong Yu, Gianni Franchi, Jindong Gu et al.
MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music
Shuai Yu
On the Expressiveness and Length Generalization of Selective State Space Models on Regular Languages
Aleksandar Terzic, Michael Hersche, Giacomo Camposampiero et al.
Hard Regularization to Prevent Deep Online Clustering Collapse without Data Augmentation
Louis Mahon, Thomas Lukasiewicz
In-Hand 3D Object Reconstruction from a Monocular RGB Video
Shijian Jiang, Qi Ye, Rengan Xie et al.
Doubly Contrastive Learning for Source-Free Domain Adaptive Person Search
Yizhen Jia, Rong Quan, Yue Feng et al.
Alignment-Free RGB-T Salient Object Detection: A Large-Scale Dataset and Progressive Correlation Network
Kunpeng Wang, Keke Chen, Chenglong Li et al.
Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification
Robert Reed, Luca Laurenti, Morteza Lahijanian
Spear and Shield: Adversarial Attacks and Defense Methods for Model-Based Link Prediction on Continuous-Time Dynamic Graphs
Dongjin Lee, Juho Lee, Kijung Shin
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
Di Xiong, Shuoyuan Wang, Lei Zhang et al.
Interactive Visual Task Learning for Robots
Weiwei Gu, Anant Sah, N. Gopalan
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Xie Tianyidan, Rui Ma, Qian Wang et al.
Continual Learning Using a Kernel-Based Method Over Foundation Models
Saleh Momeni, Sahisnu Mazumder, Bing Liu
Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution
Wentao Tan, Qiong Cao, Yibing Zhan et al.
Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis
James Kirk, Robert Wray, Peter Lindes et al.
Accelerating the Global Aggregation of Local Explanations
Alon Mor, Yonatan Belinkov, Benny Kimelfeld
Spatial-Semantic Collaborative Cropping for User Generated Content
Yukun Su, Yiwen Cao, Jingliang Deng et al.
CFEVER: A Chinese Fact Extraction and VERification Dataset
Ying-Jia Lin, ChunYi Lin, Chia-Jen Yeh et al.
PPIDSG: A Privacy-Preserving Image Distribution Sharing Scheme with GAN in Federated Learning
Yuting Ma, Yuanzhi Yao, Xiaohua Xu
Learning Complex Heterogeneous Multimodal Fake News via Social Latent Network Inference
Mingxin Li, Yuchen Zhang, Haowei Xu et al.
On the Robustness of Neural-Enhanced Video Streaming against Adversarial Attacks
Qihua Zhou, Jingcai Guo, Song Guo et al.
Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment
Yaling Shen, Zhixiong Zhuang, Kun Yuan et al.
Inverse Weight-Balancing for Deep Long-Tailed Learning
Wenqi Dang, Zhou Yang, Weisheng Dong et al.
Destroy and Repair Using Hyper-Graphs for Routing
Ke Li, Fei Liu, Zhenkun Wang et al.
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving
Lei Gong, Yu Zhang, Yingqing Xia et al.
DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Feng Han, Kai Chen, Chao Gong et al.
Knowledge-Enhanced Historical Document Segmentation and Recognition
En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu et al.
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks
Ziqing Wang, Yuetong Fang, Jiahang Cao et al.
Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models
Bingdong Li, Zixiang Di, Yongfan Lu et al.
Auto-Regressive Moving Diffusion Models for Time Series Forecasting
Jiaxin Gao, Qinglong Cao, Yuntian Chen
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong, Haiyang Mei, Ziqi Wei et al.
Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Thong Thanh Nguyen, Xiaobao Wu, Yi Bin et al.
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models
Hung Nguyen, Quang Qui-Vinh Nguyen, Khoi Nguyen et al.
LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate
Tao Wu, Tie Luo, D. C. Wunsch
Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial Learning
Qingqing Fang, Qinliang Su, Wenxi Lv et al.
Capture Global Feature Statistics for One-Shot Federated Learning
Zenghao Guan, Yucan Zhou, Xiaoyan Gu
Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback
Riccardo Della Vecchia, Debabrota Basu
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.
Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance
Jiahao Lyu, Wei Wang, Dongbao Yang et al.
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Subhabrata Dutta, Ishan Pandey, Joykirat Singh et al.
Fact-Driven Logical Reasoning for Machine Reading Comprehension
Siru Ouyang, Zhuosheng Zhang, Hai Zhao
GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation
Jiawei Lu, YingPeng Zhang, Zengjun Zhao et al.
Solving Robust Markov Decision Processes: Generic, Reliable, Efficient
Tobias Meggendorfer, Maximilian Weininger, Patrick Wienhöft
SeTformer Is What You Need for Vision and Language
Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger et al.
Out of Length Text Recognition with Sub-String Matching
Yongkun Du, Zhineng Chen, Caiyan Jia et al.
Dependency Structure-Enhanced Graph Attention Networks for Event Detection
Qizhi Wan, Changxuan Wan, Keli Xiao et al.
Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation
Fangyuan Wang, Shipeng Lyu, Peng Zhou et al.
Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning
Xiaolei Chen, Junchi Yan, Wenlong Liao et al.
Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning
Siyuan Li, Feifan Liu, Lingfei Cui et al.
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan, Chenyou Fan, Shuang Qiu et al.
Long-Term EEG Partitioning for Seizure Onset Detection
Zheng Chen, Yasuko Matsubara, Yasushi Sakurai et al.
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context
Sixiao Zheng, Yanwei Fu
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning
Kun Ding, Haojian Zhang, Qiang Yu et al.
Probabilistic Neural Circuits
Pedro Zuidberg Dos Martires
Using Stratified Sampling to Improve LIME Image Explanations
Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari et al.
Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals
Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
Hyunjune Shin, Dong-Wan Choi
Complete Neural Networks for Complete Euclidean Graphs
Snir Hordan, Tal Amir, Nadav Dym et al.
Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN
Minsoo Kang, Minkoo Kang, Suhyun Kim
DiffGrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model
Yonghao Zhang, Qiang He, Yanguang Wan et al.
Weakly Supervised Few-Shot Object Detection with DETR
Chenbo Zhang, Yinglu Zhang, Lu Zhang et al.
Erase Then Rectify: A Training-Free Parameter Editing Approach for Cost-Effective Graph Unlearning
Zhe-Rui Yang, Jindong Han, Chang-Dong Wang et al.
Revisiting Graph Contrastive Learning on Anomaly Detection: A Structural Imbalance Perspective
Yiming Xu, Zhen Peng, Bin Shi et al.
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Quan Zhang, Yuxin Qi, Xi Tang et al.
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
Yuze Hao, Jianrong Zhang, Tao Zhuo et al.
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.
Distilling Structured Rationale from Large Language Models to Small Language Models for Abstractive Summarization
Linyong Wang, Lianwei Wu, Shaoqi Song et al.
DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction
Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Haozhuo Zhang, Bin Zhu, Yu Cao et al.
GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation
Shengyin Sun, Wenhao Yu, Yuxiang Ren et al.
Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration
Huangxing Lin, Yuhang Dong, Xinghao Ding et al.
Anchoring Path for Inductive Relation Prediction in Knowledge Graphs
Zhixiang Su, Di Wang, Chunyan Miao et al.
SSL-STMFormer Self-Supervised Learning Spatio-Temporal Entanglement Transformer for Traffic Flow Prediction
Zetao Li, Zheng Hu, Peng Han et al.
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
Xu Yuan, Li Zhou, Zenghui Sun et al.
Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception
Xiao Wang, Wentao Wu, Chenglong Li et al.
A Comprehensive Evaluation on Event Reasoning of Large Language Models
Zhengwei Tao, Zhi Jin, Yifan Zhang et al.
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Zehao Chen, Rong Pan
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
Zeyang Liu, Lipeng Wan, Xinrui Yang et al.
Dynamic-Width Speculative Beam Decoding for LLM Inference
Zongyue Qin, Zifan He, Neha Prakriya et al.
Your Career Path Matters in Person-Job Fit
Zhuocheng Gong, Yang Song, Tao Zhang et al.
Learning Small Decision Trees with Few Outliers: A Parameterized Perspective
Harmender Gahlawat, Meirav Zehavi
Predicting the Original Appearance of Damaged Historical Documents
Zhenhua Yang, Dezhi Peng, Yongxin Shi et al.
Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants
Wei Chen, Zhiyi Huang, Ruichu Cai et al.
Noisy Label Calibration for Multi-View Classification
Shilin Xu, Yuan Sun, Xingfeng Li et al.
TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Jianhua Zhu, Wenqi Zhao, Yu Li et al.
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.
Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning
Yonghao Liu, Mengyu Li, Wei Pang et al.
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Guangyuan Ma, Yongliang Ma, Xing Wu et al.
Procedural Level Generation with Diffusion Models from a Single Example
Shiqi Dai, Xuanyu Zhu, Naiqi Li et al.
Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models
Rongchao Zhang, Yu Huang, Yiwei Lou et al.
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
Chaoya Jiang, Wei Ye, Haiyang Xu et al.
Optimizing ADMM and Over-Relaxed ADMM Parameters for Linear Quadratic Problems
Song Jintao, Wenqi Lu, Yunwen Lei et al.
Dealing with Numeric and Metric Time Constraints in PDDL3 via Compilation to Numeric Planning
Luigi Bonassi, Alfonso Emilio Gerevini, Enrico Scala
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Siran Chen, Yuxiao Luo, Yue Ma et al.
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling
Xinyue Fang, Zhen Huang, Zhiliang Tian et al.
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang, Yunfei Gong, Daijie Chen et al.
Lost Domain Generalization Is a Natural Consequence of Lack of Training Domains
Yimu Wang, Yihan Wu, Hongyang Zhang
Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Kedi Chen, Qin Chen, Jie Zhou et al.
Quantum Interference Model for Semantic Biases of Glosses in Word Sense Disambiguation
Junwei Zhang, Ruifang He, Fengyu Guo et al.
Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems
Zhuohui Zhang, Bin He, Bin Cheng et al.
Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data
Shuvendu Roy, Ali Etemad
Multi-Agent Motion Planning for Differential Drive Robots Through Stationary State Search
Jingtian Yan, Jiaoyang Li
REVECA: Adaptive Planning and Trajectory-Based Validation in Cooperative Language Agents Using Information Relevance and Relative Proximity
SeungWon Seo, SeongRae Noh, Junhyeok Lee et al.
Prediction-Feedback DETR for Temporal Action Detection
Jihwan Kim, Miso Lee, Cheol-Ho Cho et al.
Racing Control Variable Genetic Programming for Symbolic Regression
Nan Jiang, Yexiang Xue
FairWASP: Fast and Optimal Fair Wasserstein Pre-processing
Zikai Xiong, Niccolo Dalmasso, Alan Mishler et al.
Neural Time-Reversed Generalized Riccati Equation
Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration
Zhihao Wang, Yulin Zhou, Ningyu Zhang et al.
Event-Enhanced Blurry Video Super-Resolution
Dachun Kai, Yueyi Zhang, Jin Wang et al.
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations
Yejin Jeon, Yunsu Kim, Gary Geunbae Lee
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation
Changsong Pang, Xieyuanli Chen, Yimin Liu et al.
WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration
Laibin Chang, Yunke Wang, Longxiang Deng et al.
Regulating AI: Applying Insights from Behavioural Economics and Psychology to the Application of Article 5 of the EU AI Act
Huixin Zhong, Eamonn O'Neill, Janina Hoffmann
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang, Na Zhao, Zhiyuan Han et al.
MLC-NC: Long-Tailed Multi-Label Image Classification Through the Lens of Neural Collapse
Zijian Tao, Shao-Yuan Li, Wenhai Wan et al.
Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise
Augusto Santos, Diogo Rente, Rui Seabra et al.
ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression
Kai Yao, Zhaorui Tan, Tiandi Ye et al.
UVAGaze: Unsupervised 1-to-2 Views Adaptation for Gaze Estimation
Ruicong Liu, Feng Lu
DOGE-Train: Discrete Optimization on GPU with End-to-End Training
Ahmed Abbas, P. Swoboda
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Jiahao Wang, Caixia Yan, Weizhan Zhang et al.
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
Hongsong Wang, Andi Xu, Pinle Ding et al.
QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition
Chengpeng Wang, Li Chen, Lili Wang et al.
DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation
Songsong Yu, Yifan Wang, Yunzhi Zhuge et al.
Towards Making Learnware Specification and Market Evolvable
Jian-Dong Liu, Zhi-Hao Tan, Zhi-Hua Zhou
Constrained Generative Modeling with Manually Bridged Diffusion Models
Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg et al.
Generative Model-Based Feature Knowledge Distillation for Action Recognition
Guiqin Wang, Peng Zhao, Yanjiang Shi et al.
ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation
Hamed Ayoobi, Nico Potyka, Francesca Toni
AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks
Shibing Mo, Kai Wu, Qixuan Gao et al.
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection
Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
Haopeng Sun, Yingwei Zhang, Lumin Xu et al.
Component Fourier Neural Operator for Singularly Perturbed Differential Equations
Ye Li, Ting Du, Yiwen Pang et al.
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
Yufan Shen, Chuwei Luo, Zhaoqing Zhu et al.
Self-Prompting Analogical Reasoning for UAV Object Detection
Nianxin Li, Mao Ye, Lihua Zhou et al.
LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining
Huawen Shen, Gengluo Li, Jinwen Zhong et al.
Real-Time Recurrent Reinforcement Learning
Julian Lemmel, Radu Grosu
Advancing Video Synchronization with Fractional Frame Analysis: Introducing a Novel Dataset and Model
Yuxuan Liu, Haizhou Ai, Junliang Xing et al.
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery
Zhengrui Chen, Liying Lu, Ziyang Yuan et al.
GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images
Chenglong Liu, Haoran Wei, Jinze Yang et al.
CSformer: Combining Channel Independence and Mixing for Robust Multivariate Time Series Forecasting
Haoxin Wang, Yipeng Mo, Kunlan Xiang et al.
Are Expressive Models Truly Necessary for Offline RL?
Guan Wang, Haoyi Niu, Jianxiong Li et al.
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
YoungJoon Yoo, Jongwon Choi
Population Aware Diffusion for Time Series Generation
Yang Li, Han Meng, Zhenyu Bi et al.
DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype
Qiang Wang, Yuhang He, Songlin Dong et al.
Twice Class Bias Correction for Imbalanced Semi-supervised Learning
Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation
Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
Zichen Geng, Zeeshan Hayder, Wei Liu et al.
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation
Hongwei Niu, Jie Hu, Jianghang Lin et al.
BBScore: A Brownian Bridge Based Metric for Assessing Text Coherence
Zhecheng Sheng, Tianhao Zhang, Chen Jiang et al.
Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation
Yan Wang, Chuan-Xian Ren, Yi-Ming Zhai et al.
Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck
Xingcheng Fu, Yisen Gao, Beining Yang et al.
FedCompetitors: Harmonious Collaboration in Federated Learning with Competing Participants
Shanli Tan, Hao Cheng, Xiaohu Wu et al.
Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification
Yucong Meng, Zhiwei Yang, Yonghong Shi et al.
Runtime Analysis for Multi-Objective Evolutionary Algorithms in Unbounded Integer Spaces
Benjamin Doerr, Martin S. Krejca, Günter Rudolph
FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning
Jian Li, Yong Liu, Wei Wang et al.
Multi-Level Cross-Modal Alignment for Image Clustering
Liping Qiu, Qin Zhang, Xiaojun Chen et al.
TransGOP: Transformer-Based Gaze Object Prediction
Binglu Wang, Chenxi Guo, Yang Jin et al.
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
Yujun Chen, Xin Tan, Zhizhong Zhang et al.
Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance
Ranthony A. Clark, Tom Needham, Thomas Weighill
Locally Convex Global Loss Network for Decision-Focused Learning
Haeun Jeon, Hyunglip Bae, Minsu Park et al.
ZeroHAR: Sensor Context Augments Zero-Shot Wearable Action Recognition
Ranak Roy Chowdhury, Ritvik Kapila, Ameya Panse et al.
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection
Gyusam Chang, Wonseok Roh, Sujin Jang et al.
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor
Han Liu, Siyang Zhao, Xiaotong Zhang et al.
SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints
Ziqi Sheng, Wei Lu, Xiangyang Luo et al.