Most Cited AAAI "graph alignment framework" Papers
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang, Yingpeng Du, Zhu Sun et al.
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen, Ning Liu, Yichen Zhu et al.
Feature Denoising Diffusion Model for Blind Image Quality Assessment
Xudong Li, Yan Zhang, Yunhang Shen et al.
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach
Kangli Wang, Wei Gao
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance
Muhammad Reza Qorib, Qisheng Hu, Hwee Tou Ng
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.
Improved Anonymous Multi Agent Path Finding Algorithm
Zain Alabedeen Ali, Konstantin Yakovlev
Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
YongJin Yang, Taehyeon Kim, Se-Young Yun
MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models
Yujing Wang, Hainan Zhang, Liang Pang et al.
QPEN: Quantum Projection and Quantum Entanglement Enhanced Network for Cross-Lingual Aspect-Based Sentiment Analysis
Xingqiang Zhao, Hai Wan, Kunxun Qi
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection
Zining Chen, Xingshuang Luo, Weiqiu Wang et al.
GNS: Solving Plane Geometry Problems by Neural-Symbolic Reasoning with Multi-Modal LLMs
Maizhen Ning, Zihao Zhou, Qiufeng Wang et al.
Reachability of Fair Allocations via Sequential Exchanges
Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.
MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities
Kunxi Li, Tianyu Zhan, Kairui Fu et al.
Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning
Joseph Giovanelli, Alexander Tornede, Tanja Tornede et al.
SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly
Liuwan Zhu, Rui Ning, Jiang Li et al.
Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need
Jiaxuan Chen, Yu Qi, Yueming Wang et al.
Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations
Junpeng Fang, Gongduo Zhang, Qing Cui et al.
Adaptive Draft-Verification for Efficient Large Language Model Decoding
Xukun Liu, Bowen Lei, Ruqi Zhang et al.
Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
Haoran Lian, Yizhe Xiong, Jianwei Niu et al.
Learning Diverse Risk Preferences in Population-Based Self-Play
Yuhua Jiang, Qihan Liu, Xiaoteng Ma et al.
Alleviate and Mining: Rethinking Unsupervised Domain Adaptation for Mitochondria Segmentation from Pseudo-Label Perspective
Yujia Chen, Rui Sun, Wangkai Li et al.
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile
Seokjun Lee, Seung-Won Jung, Hyunseok Seo
Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification
Wenbo Dai, Lijing Lu, Zhihang Li
Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification
Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.
Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification
Shichen Li, Zhongqing Wang, Zheyu Zhao et al.
Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning
Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.
H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer
Yanru Wu, Jianning Wang, Weida Wang et al.
DanceFix: An Exploration in Group Dance Neatness Assessment Through Fixing Abnormal Challenges of Human Pose
Huangbiao Xu, Xiao Ke, Huanqi Wu et al.
Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks
Yiyi Chen, Russa Biswas, Heather Lent et al.
COLUMBUS: Evaluating COgnitive Lateral Understanding Through Multiple-Choice reBUSes
Koen Kraaijveld, Yifan Jiang, Kaixin Ma et al.
Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction
Cheng Xu, Fei Hou, Wencheng Wang et al.
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj et al.
Factor Augmented Tensor-on-Tensor Neural Networks
Guanhao Zhou, Yuefeng Han, Xiufan Yu
MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music
Shuai Yu
Hard Regularization to Prevent Deep Online Clustering Collapse without Data Augmentation
Louis Mahon, Thomas Lukasiewicz
In-Hand 3D Object Reconstruction from a Monocular RGB Video
Shijian Jiang, Qi Ye, Rengan Xie et al.
Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components
Tengxue Zhang, Yang Shu, Xinyang Chen et al.
Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise
Augusto Santos, Diogo Rente, Rui Seabra et al.
Spear and Shield: Adversarial Attacks and Defense Methods for Model-Based Link Prediction on Continuous-Time Dynamic Graphs
Dongjin Lee, Juho Lee, Kijung Shin
Interactive Visual Task Learning for Robots
Weiwei Gu, Anant Sah, N. Gopalan
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation
Yuxuan Wang, Yijun Liu, Fei Yu et al.
On the Expressiveness and Length Generalization of Selective State Space Models on Regular Languages
Aleksandar Terzic, Michael Hersche, Giacomo Camposampiero et al.
Doubly Contrastive Learning for Source-Free Domain Adaptive Person Search
Yizhen Jia, Rong Quan, Yue Feng et al.
Accelerating the Global Aggregation of Local Explanations
Alon Mor, Yonatan Belinkov, Benny Kimelfeld
Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification
Robert Reed, Luca Laurenti, Morteza Lahijanian
Alignment-Free RGB-T Salient Object Detection: A Large-Scale Dataset and Progressive Correlation Network
Kunpeng Wang, Keke Chen, Chenglong Li et al.
CFEVER: A Chinese Fact Extraction and VERification Dataset
Ying-Jia Lin, ChunYi Lin, Chia-Jen Yeh et al.
Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis
James Kirk, Robert Wray, Peter Lindes et al.
Noisy Label Calibration for Multi-View Classification
Shilin Xu, Yuan Sun, Xingfeng Li et al.
Spatial-Semantic Collaborative Cropping for User Generated Content
Yukun Su, Yiwen Cao, Jingliang Deng et al.
SeTformer Is What You Need for Vision and Language
Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger et al.
Learning Small Decision Trees with Few Outliers: A Parameterized Perspective
Harmender Gahlawat, Meirav Zehavi
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
Di Xiong, Shuoyuan Wang, Lei Zhang et al.
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Xie Tianyidan, Rui Ma, Qian Wang et al.
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving
Lei Gong, Yu Zhang, Yingqing Xia et al.
Training Consistent Mixture-of-Experts-Based Prompt Generator for Continual Learning
Yue Lu, Shizhou Zhang, De Cheng et al.
Inverse Weight-Balancing for Deep Long-Tailed Learning
Wenqi Dang, Zhou Yang, Weisheng Dong et al.
PPIDSG: A Privacy-Preserving Image Distribution Sharing Scheme with GAN in Federated Learning
Yuting Ma, Yuanzhi Yao, Xiaohua Xu
Continual Learning Using a Kernel-Based Method Over Foundation Models
Saleh Momeni, Sahisnu Mazumder, Bing Liu
Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution
Wentao Tan, Qiong Cao, Yibing Zhan et al.
Knowledge-Enhanced Historical Document Segmentation and Recognition
En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu et al.
Learning Complex Heterogeneous Multimodal Fake News via Social Latent Network Inference
Mingxin Li, Yuchen Zhang, Haowei Xu et al.
Destroy and Repair Using Hyper-Graphs for Routing
Ke Li, Fei Liu, Zhenkun Wang et al.
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong, Haiyang Mei, Ziqi Wei et al.
LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate
Tao Wu, Tie Luo, D. C. Wunsch
Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment
Yaling Shen, Zhixiong Zhuang, Kun Yuan et al.
DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Feng Han, Kai Chen, Chao Gong et al.
Fact-Driven Logical Reasoning for Machine Reading Comprehension
Siru Ouyang, Zhuosheng Zhang, Hai Zhao
Using Stratified Sampling to Improve LIME Image Explanations
Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari et al.
Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models
Bingdong Li, Zixiang Di, Yongfan Lu et al.
Capture Global Feature Statistics for One-Shot Federated Learning
Zenghao Guan, Yucan Zhou, Xiaoyan Gu
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks
Ziqing Wang, Yuetong Fang, Jiahang Cao et al.
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Subhabrata Dutta, Ishan Pandey, Joykirat Singh et al.
Auto-Regressive Moving Diffusion Models for Time Series Forecasting
Jiaxin Gao, Qinglong Cao, Yuntian Chen
Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
Thong Thanh Nguyen, Xiaobao Wu, Yi Bin et al.
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models
Hung Nguyen, Quang Qui-Vinh Nguyen, Khoi Nguyen et al.
Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial Learning
Qingqing Fang, Qinliang Su, Wenxi Lv et al.
Solving Robust Markov Decision Processes: Generic, Reliable, Efficient
Tobias Meggendorfer, Maximilian Weininger, Patrick Wienhöft
Dependency Structure-Enhanced Graph Attention Networks for Event Detection
Qizhi Wan, Changxuan Wan, Keli Xiao et al.
Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback
Riccardo Della Vecchia, Debabrota Basu
Inconsistency-Based Data-Centric Active Open-Set Annotation
Ruiyu Mao, Ouyang Xu, Yunhui Guo
Complete Neural Networks for Complete Euclidean Graphs
Snir Hordan, Tal Amir, Nadav Dym et al.
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.
Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance
Jiahao Lyu, Wei Wang, Dongbao Yang et al.
GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation
Jiawei Lu, YingPeng Zhang, Zengjun Zhao et al.
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning
Kun Ding, Haojian Zhang, Qiang Yu et al.
Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals
Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.
Out of Length Text Recognition with Sub-String Matching
Yongkun Du, Zhineng Chen, Caiyan Jia et al.
Probabilistic Neural Circuits
Pedro Zuidberg Dos Martires
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
Hyunjune Shin, Dong-Wan Choi
Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants
Wei Chen, Zhiyi Huang, Ruichu Cai et al.
Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning
Siyuan Li, Feifan Liu, Lingfei Cui et al.
Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning
Xiaolei Chen, Junchi Yan, Wenlong Liao et al.
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan, Chenyou Fan, Shuang Qiu et al.
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
Yuze Hao, Jianrong Zhang, Tao Zhuo et al.
Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN
Minsoo Kang, Minkoo Kang, Suhyun Kim
Weakly Supervised Few-Shot Object Detection with DETR
Chenbo Zhang, Yinglu Zhang, Lu Zhang et al.
Long-Term EEG Partitioning for Seizure Onset Detection
Zheng Chen, Yasuko Matsubara, Yasushi Sakurai et al.
DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction
Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context
Sixiao Zheng, Yanwei Fu
Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration
Huangxing Lin, Yuhang Dong, Xinghao Ding et al.
Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation
Fangyuan Wang, Shipeng Lyu, Peng Zhou et al.
DiffGrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model
Yonghao Zhang, Qiang He, Yanguang Wan et al.
Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception
Xiao Wang, Wentao Wu, Chenglong Li et al.
A Comprehensive Evaluation on Event Reasoning of Large Language Models
Zhengwei Tao, Zhi Jin, Yifan Zhang et al.
Erase Then Rectify: A Training-Free Parameter Editing Approach for Cost-Effective Graph Unlearning
Zhe-Rui Yang, Jindong Han, Chang-Dong Wang et al.
Revisiting Graph Contrastive Learning on Anomaly Detection: A Structural Imbalance Perspective
Yiming Xu, Zhen Peng, Bin Shi et al.
Distilling Structured Rationale from Large Language Models to Small Language Models for Abstractive Summarization
Linyong Wang, Lianwei Wu, Shaoqi Song et al.
Your Career Path Matters in Person-Job Fit
Zhuocheng Gong, Yang Song, Tao Zhang et al.
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Quan Zhang, Yuxin Qi, Xi Tang et al.
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
Fan Yang, Wei Li, Menglong Yang et al.
Dynamic-Width Speculative Beam Decoding for LLM Inference
Zongyue Qin, Zifan He, Neha Prakriya et al.
GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation
Shengyin Sun, Wenhao Yu, Yuxiang Ren et al.
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Haozhuo Zhang, Bin Zhu, Yu Cao et al.
Towards the Disappearing Truth: Fine-Grained Joint Causal Influences Learning with Hidden Variable-Driven Causal Hypergraphs
Kun Zhu, Chunhui Zhao
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.
SSL-STMFormer: Self-Supervised Learning Spatio-Temporal Entanglement Transformer for Traffic Flow Prediction
Zetao Li, Zheng Hu, Peng Han et al.
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
Xu Yuan, Li Zhou, Zenghui Sun et al.
Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression
Xuanlong Yu, Gianni Franchi, Jindong Gu et al.
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
Zeyang Liu, Lipeng Wan, Xinrui Yang et al.
Lyapunov-Stable Deep Equilibrium Models
Haoyu Chu, Shikui Wei, Ting Liu et al.
On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles
Marcus Gozon, Jingjin Yu
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
Li Mi, Syrielle Montariol, Javiera Castillo Navarro et al.
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Zehao Chen, Rong Pan
Color Event Enhanced Single-Exposure HDR Imaging
Mengyao Cui, Zhigang Wang, Dong Wang et al.
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph
Xujian Liang, Zhaoquan Gu
Predicting the Original Appearance of Damaged Historical Documents
Zhenhua Yang, Dezhi Peng, Yongxin Shi et al.
SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering
Xiaopeng Li, Shasha Li, Shezheng Song et al.
DAG-Aware Variational Autoencoder for Social Propagation Graph Generation
Dongpeng Hou, Chao Gao, Xuelong Li et al.
On the Robustness of Neural-Enhanced Video Streaming against Adversarial Attacks
Qihua Zhou, Jingcai Guo, Song Guo et al.
Anchoring Path for Inductive Relation Prediction in Knowledge Graphs
Zhixiang Su, Di Wang, Chunyan Miao et al.
TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Jianhua Zhu, Wenqi Zhao, Yu Li et al.
Neural Amortized Inference for Nested Multi-Agent Reasoning
Kunal Jha, Tuan Anh Le, Chuanyang Jin et al.
FairWASP: Fast and Optimal Fair Wasserstein Pre-processing
Zikai Xiong, Niccolo Dalmasso, Alan Mishler et al.
Neural Time-Reversed Generalized Riccati Equation
Racing Control Variable Genetic Programming for Symbolic Regression
Nan Jiang, Yexiang Xue
Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models
Rongchao Zhang, Yu Huang, Yiwei Lou et al.
Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration
Zhihao Wang, Yulin Zhou, Ningyu Zhang et al.
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling
Xinyue Fang, Zhen Huang, Zhiliang Tian et al.
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations
Yejin Jeon, Yunsu Kim, Gary Geunbae Lee
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Siran Chen, Yuxiao Luo, Yue Ma et al.
Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Kedi Chen, Qin Chen, Jie Zhou et al.
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation
Changsong Pang, Xieyuanli Chen, Yimin Liu et al.
Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems
Zhuohui Zhang, Bin He, Bin Cheng et al.
Multi-Agent Motion Planning for Differential Drive Robots Through Stationary State Search
Jingtian Yan, Jiaoyang Li
REVECA: Adaptive Planning and Trajectory-Based Validation in Cooperative Language Agents Using Information Relevance and Relative Proximity
SeungWon Seo, SeongRae Noh, Junhyeok Lee et al.
Regulating AI: Applying Insights from Behavioural Economics and Psychology to the Application of Article 5 of the EU AI Act
Huixin Zhong, Eamonn O'Neill, Janina Hoffmann
UVAGaze: Unsupervised 1-to-2 Views Adaptation for Gaze Estimation
Ruicong Liu, Feng Lu
Prediction-Feedback DETR for Temporal Action Detection
Jihwan Kim, Miso Lee, Cheol-Ho Cho et al.
Event-Enhanced Blurry Video Super-Resolution
Dachun Kai, Yueyi Zhang, Jin Wang et al.
Towards Making Learnware Specification and Market Evolvable
Jian-Dong Liu, Zhi-Hao Tan, Zhi-Hua Zhou
ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression
Kai Yao, Zhaorui Tan, Tiandi Ye et al.
MLC-NC: Long-Tailed Multi-Label Image Classification Through the Lens of Neural Collapse
Zijian Tao, Shao-Yuan Li, Wenhai Wan et al.
DOGE-Train: Discrete Optimization on GPU with End-to-End Training
Ahmed Abbas, P. Swoboda
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang, Yunfei Gong, Daijie Chen et al.
WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration
Laibin Chang, Yunke Wang, Longxiang Deng et al.
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang, Na Zhao, Zhiyuan Han et al.
Generative Model-Based Feature Knowledge Distillation for Action Recognition
Guiqin Wang, Peng Zhao, Yanjiang Shi et al.
Text2Relight: Creative Portrait Relighting with Text Guidance
Junuk Cha, Mengwei Ren, Krishna Kumar Singh et al.
DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation
Songsong Yu, Yifan Wang, Yunzhi Zhuge et al.
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Jiahao Wang, Caixia Yan, Weizhan Zhang et al.
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
Hongsong Wang, Andi Xu, Pinle Ding et al.
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection
Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.
GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images
Chenglong Liu, Haoran Wei, Jinze Yang et al.
Component Fourier Neural Operator for Singularly Perturbed Differential Equations
Ye Li, Ting Du, Yiwen Pang et al.
QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition
Chengpeng Wang, Li Chen, Lili Wang et al.
Constrained Generative Modeling with Manually Bridged Diffusion Models
Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg et al.
AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks
Shibing Mo, Kai Wu, Qixuan Gao et al.
Advancing Video Synchronization with Fractional Frame Analysis: Introducing a Novel Dataset and Model
Yuxuan Liu, Haizhou Ai, Junliang Xing et al.
ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation
Hamed Ayoobi, Nico Potyka, Francesca Toni
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery
Zhengrui Chen, Liying Lu, Ziyang Yuan et al.
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
Haopeng Sun, Yingwei Zhang, Lumin Xu et al.
Population Aware Diffusion for Time Series Generation
Yang Li, Han Meng, Zhenyu Bi et al.
Self-Prompting Analogical Reasoning for UAV Object Detection
Nianxin Li, Mao Ye, Lihua Zhou et al.
Are Expressive Models Truly Necessary for Offline RL?
Guan Wang, Haoyi Niu, Jianxiong Li et al.
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
Yufan Shen, Chuwei Luo, Zhaoqing Zhu et al.
CSformer: Combining Channel Independence and Mixing for Robust Multivariate Time Series Forecasting
Haoxin Wang, Yipeng Mo, Kunlan Xiang et al.
TransGOP: Transformer-Based Gaze Object Prediction
Binglu Wang, Chenxi Guo, Yang Jin et al.
LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining
Huawen Shen, Gengluo Li, Jinwen Zhong et al.
Uncertainty Quantification in Heterogeneous Treatment Effect Estimation with Gaussian-Process-Based Partially Linear Model
Shunsuke Horii, Yoichi Chikahara
BBScore: A Brownian Bridge Based Metric for Assessing Text Coherence
Zhecheng Sheng, Tianhao Zhang, Chen Jiang et al.
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
Yujun Chen, Xin Tan, Zhizhong Zhang et al.
Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation
Yan Wang, Chuan-Xian Ren, Yi-Ming Zhai et al.
Real-Time Recurrent Reinforcement Learning
Julian Lemmel, Radu Grosu
Multi-Level Cross-Modal Alignment for Image Clustering
Liping Qiu, Qin Zhang, Xiaojun Chen et al.
FedCompetitors: Harmonious Collaboration in Federated Learning with Competing Participants
Shanli Tan, Hao Cheng, Xiaohu Wu et al.
Runtime Analysis for Multi-Objective Evolutionary Algorithms in Unbounded Integer Spaces
Benjamin Doerr, Martin S. Krejca, Günter Rudolph
FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning
Jian Li, Yong Liu, Wei Wang et al.
Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor
Han Liu, Siyang Zhao, Xiaotong Zhang et al.
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
YoungJoon Yoo, Jongwon Choi
Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation
Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection
Gyusam Chang, Wonseok Roh, Sujin Jang et al.
Locally Convex Global Loss Network for Decision-Focused Learning
Haeun Jeon, Hyunglip Bae, Minsu Park et al.
Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck
Xingcheng Fu, Yisen Gao, Beining Yang et al.
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
Zichen Geng, Zeeshan Hayder, Wei Liu et al.
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation
Hongwei Niu, Jie Hu, Jianghang Lin et al.
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback
Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.