Most Cited AAAI "superquadric primitives" Papers
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization
Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion
Jingyuan Chen, Fuchen Long, Jie An et al.
Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization:
Peijun Bao, Zihao Shao, Wenhan Yang et al.
GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians
Xiaobao Wei, Peng Chen, Ming Lu et al.
BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining
Minjun Kim, SeungWoo Song, Youhan Lee et al.
Scalable Surrogate Verification of Image-Based Neural Network Control Systems Using Composition and Unrolling
Feiyang Cai, Chuchu Fan, Stanley Bak
Let All Be Whitened: Multi-Teacher Distillation for Efficient Visual Retrieval
Zhe Ma, Jianfeng Dong, Shouling Ji et al.
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge
Seong-Tae Kim, Hyungil Kim, Y. Ro
OctOcc: High-Resolution 3D Occupancy Prediction with Octree
Wenzhe Ouyang, Xiaolin Song, Bailan Feng et al.
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong, Lujun Li, Zhenheng Tang et al.
Multi-Dimensional Fair Federated Learning
Cong Su, Guoxian Yu, Jun Wang et al.
Completing Priceable Committees: Utilitarian and Representation Guarantees for Proportional Multiwinner Voting
Markus Brill, Jannik Peters
Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding
Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.
ADBA: Approximation Decision Boundary Approach for Black-Box Adversarial Attacks
Feiyang Wang, Xingquan Zuo, Hai Huang et al.
Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation
Changshuo Wang, Shuting He, Xiang Fang et al.
B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation
Hao Wang, Qiang Song, Ruofeng Yin et al.
Cross-Modal Match for Language Conditioned 3D Object Grounding
Yachao Zhang, Runze Hu, Ronghui Li et al.
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
Kewei Wang, Yizheng Wu, Zhiyu Pan et al.
Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia et al.
Knowledge Graph Completion with Relation-Aware Anchor Enhancement
Duanyang Yuan, Sihang Zhou, Xiaoshu Chen et al.
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He, Kai Li, Yifan Zang et al.
FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting
Yulong Wang, Yushuo Liu, Xiaoyi Duan et al.
Colour Passing Revisited: Lifted Model Construction with Commutative Factors
Malte Luttermann, Tanya Braun, Ralf Möller et al.
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
Delivering Inflated Explanations
Yacine Izza, Alexey Ignatiev, Peter Stuckey et al.
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
Multi-Domain Recommendation to Attract Users via Domain Preference Modeling
Hyunjun Ju, SeongKu Kang, Dongha Lee et al.
Domain Generalization with Vital Phase Augmentation
Ingyun Lee, WooJu Lee, Hyun Myung
Symmetric Self-Paced Learning for Domain Generalization
Di Zhao, Yun Sing Koh, Gillian Dobbie et al.
Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study
Qiyu Kang, Kai Zhao, Yang Song et al.
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
Haoran Ye, Yuhang Xie, Yuanyi Ren et al.
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
Fan Yang, Hui Chen, Yuwei He et al.
Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks
Fuzhi Wu, Jiasong Wu, Youyong Kong et al.
Exact ASP Counting with Compact Encodings
Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans
Heng Guo, Jianfeng Zhang, Jiaxing Huang et al.
How to Use the Metropolis Algorithm for Multi-Objective Optimization?
Weijie Zheng, Mingfeng Li, Renzhong Deng et al.
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho
Poincaré Differential Privacy for Hierarchy-Aware Graph Embedding
Yuecen Wei, Haonan Yuan, Xingcheng Fu et al.
Towards Optimal Subsidy Bounds for Envy-Freeable Allocations
Yasushi Kawase, Kazuhisa Makino, Hanna Sumita et al.
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement
Renyuan Peng, Xinyue Cai, Hang Xu et al.
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning
Hang Du, Xuejun Yan, Jingjing Wang et al.
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA
Chengen Lai, Shengli Song, Shiqi Meng et al.
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian, Chenxu Zhao, Yangyi Li et al.
Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization
Kaan Gokcesu, Hakan Gökcesu
Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective
Kaifang Long, Guoyang Xie, Lianbo Ma et al.
Unraveling Batch Normalization for Realistic Test-Time Adaptation
Zixian Su, Jingwei Guo, Kai Yao et al.
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
Huatian Zhang, Lei Zhang, Kun Zhang et al.
Understanding and Improving Optimization in Predictive Coding Networks
Nicholas Alonso, Jeffrey Krichmar, Emre Neftci
CatFormer: Category-Level 6D Object Pose Estimation with Transformer
Sheng Yu, Dihua Zhai, Yuanqing Xia
Label-Free Backdoor Attacks in Vertical Federated Learning
Wei Shen, Wenke Huang, Guancheng Wan et al.
PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus
Florian Kluger, Bodo Rosenhahn
Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility
Fang Kong, Shuai Li
Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency
Fuhao Li, Ziyang Gong, Yupeng Deng et al.
DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval
Yating Liu, Zimo Liu, Xiangyuan Lan et al.
PowerMLP: An Efficient Version of KAN
Ruichen Qiu, Yibo Miao, Shiwen Wang et al.
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo
Hongjie Li, Yao Guo, Xianwei Zheng et al.
Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie et al.
Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron Classification
Minghui Liao, Guojia Wan, Bo Du
Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought
Li Zheng, Hao Fei, Fei Li et al.
SlerpFace: Face Template Protection via Spherical Linear Interpolation
Zhizhou Zhong, Yuxi Mi, Yuge Huang et al.
Confidence Estimation for Error Detection in Text-to-SQL Systems
Oleg Somov, Elena Tutubalina
PTMQ: Post-training Multi-Bit Quantization of Neural Networks
Ke Xu, Zhongcheng Li, Shanshan Wang et al.
Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion
Yan Rong, Li Liu
Contributing Dimension Structure of Deep Feature for Coreset Selection
Zhijing Wan, Zhixiang Wang, Yuran Wang et al.
RG-GAN: Dynamic Regenerative Pruning for Data-Efficient Generative Adversarial Networks
Divya Saxena, Jiannong Cao, Jiahao Xu et al.
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation
Ling-An Zeng, Guohong Huang, Gaojie Wu et al.
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang, Tianheng Cheng, Shusheng Yang et al.
Multi-View Dynamic Reflection Prior for Video Glass Surface Detection
Fang Liu, Yuhao Liu, Jiaying Lin et al.
Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation
Chenyang Wang, Junjun Jiang, Kui Jiang et al.
Causal Inference over Visual-Semantic-Aligned Graph for Image Classification
Lei Meng, Xiangxian Li, Xiaoshuo Yan et al.
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views
Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Hao Liu, Xin Li, Mingming Gong et al.
Self-Training Based Few-Shot Node Classification by Knowledge Distillation
Zongqian Wu, Yujie Mo, Peng Zhou et al.
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them
Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi et al.
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB
Shengheng Liu, Xingkang Li, Zihuan Mao et al.
A Primal-Dual Algorithm for Hybrid Federated Learning
Tom Overman, Garrett Blum, Diego Klabjan
KPL: Training-Free Medical Knowledge Mining of Vision-Language Models
Jiaxiang Liu, Tianxiang Hu, Jiawei Du et al.
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning
Hai-Ming Xu, Qi Chen, Lei Wang et al.
Semi-Supervised Multi-View Multi-Label Learning with View-Specific Transformer and Enhanced Pseudo-Label
Quanjiang Li, Tingjin Luo, Mingdie Jiang et al.
Improve Robustness of Reinforcement Learning against Observation Perturbations via l∞ Lipschitz Policy Networks
Buqing Nie, Jingtian Ji, Yangqing Fu et al.
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Mengyang Wu, Yuzhi Zhao, Jialun Cao et al.
Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
Mohammad Rafid Ul Islam, Prasad Tadepalli, Alan Fern
TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification
Rui Song, Fausto Giunchiglia, Yingji Li et al.
Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views
Shuai Guo, Qiuwen Wang, Yijie Gao et al.
Pre-Training Graph Neural Networks on Molecules by Using Subgraph-Conditioned Graph Information Bottleneck
Van Thuy Hoang, O-Joun Lee
Learning Robust Rationales for Model Explainability: A Guidance-Based Approach
Shuaibo Hu, Kui Yu
DIUSum: Dynamic Image Utilization for Multimodal Summarization
Min Xiao, Junnan Zhu, Feifei Zhai et al.
LIBA: Language Instructed Multi-granularity Bridge Assistant for 3D Visual Grounding
Yuan Wang, Ya-Li Li, W U Eastman Z Y et al.
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Han He, Qianchu Liu, Lei Xu et al.
PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation
Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.
Learning Subject-Aware Cropping by Outpainting Professional Photos
James Hong, Lu Yuan, Michaël Gharbi et al.
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera
Chengxu Liu, Xuan Wang, Yuanting Fan et al.
DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models
Yitian Liu, Zhouhui Lian
CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception
Senkang Hu, Yihang Tao, Guowen Xu et al.
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang, Pengfei Liu, Min Zhou et al.
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
Yujie Chen, Jiangyan Yi, Cunhang Fan et al.
Context Enhanced Transformer for Single Image Object Detection in Video Data
Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
Shiqi Huang, Shuting He, Bihan Wen
Link Prediction in Multilayer Networks via Cross-Network Embedding
Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.
Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction
Kangkang Lu, Yanhua Yu, Hao Fei et al.
Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from Neighbors
Xiao Teng, Long Lan, Dingyao Chen et al.
Intrinsic Phase-Preserving Networks for Depth Super Resolution
Xuanhong Chen, Hang Wang, Jinfan Liu et al.
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
Hantao Yang, Xutong Liu, Zhiyong Wang et al.
WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection
Yan Hong, Jianming Feng, Haoxing Chen et al.
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
Xiankang He, Guangkai Xu, Bo Zhang et al.
Comprehensive View Embedding Learning for Single-Cell Multimodal Integration
Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.
Multi-Focus Image Fusion via Explicit Defocus Blur Modelling
Yuhui Quan, Xi Wan, Zitao Tang et al.
Constrained Fair and Efficient Allocations
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
Learning Visual Abstract Reasoning through Dual-Stream Networks
Kai Zhao, Chang Xu, Bailu Si
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu, Zhaoheng Huang, Zhicheng Dou et al.
Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning
Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.
Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Hui-Yue Yang, Hui Chen, Ao Wang et al.
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
Jiaxiang Gou, Luping Ji, Pei Liu et al.
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception
Bohan Li, Yasheng Sun, Jingxin Dong et al.
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black, Jing Shi, Yifei Fan et al.
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
Bang Yang, Yong Dai, Xuxin Cheng et al.
MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition
Yang Yang, Xunde Dong, Yupeng Qiang
GLDL: Graph Label Distribution Learning
Yufei Jin, Richard Gao, Yi He et al.
MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context
Shuai Lyu, Rongchen Zhang, Zeqi Ma et al.
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu, Di Huang, Wenxuan Shi et al.
High-Fidelity Diffusion-Based Image Editing
Chen Hou, Guoqiang Wei, Zhibo Chen
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network
Xiang Fang, Wanlong Fang, Changshuo Wang et al.
ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision
Ke Xu, Tsun Wai Siu, Rynson W.H. Lau
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang, Jiaze Wang, Jinpeng Li et al.
Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
Ancong Wu, Wei-shi Zheng
Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis
Francesco Kriegel
Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization
Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Junkai Fan, Kun Wang, Zhiqiang Yan et al.
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Wenze Chen, Shiyu Huang, Yuan Chiang et al.
Axiomatic Aggregations of Abductive Explanations
Gagan Biradar, Yacine Izza, Elita Lobo et al.
All-in-One: Transferring Vision Foundation Models into Stereo Matching
Jingyi Zhou, Haoyu Zhang, Jiakang Yuan et al.
Noise-Free Optimization in Early Training Steps for Image Super-resolution
MinKyu Lee, Jae-Pil Heo
TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings
Alexander Shabalin, Viacheslav Meshchaninov, Egor Chimbulatov et al.
REGLO: Provable Neural Network Repair for Global Robustness Properties
Feisi Fu, Zhilu Wang, Weichao Zhou et al.
Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior
Youngjae Cho, HeeSun Bae, Seungjae Shin et al.
GENTEEL-NEGOTIATOR: LLM-Enhanced Mixture-of-Expert-Based Reinforcement Learning Approach for Polite Negotiation Dialogue
Priyanshu Priya, Rishikant Chigrupaatii, Mauajama Firdaus et al.
LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba
Yubo Cui, Zhiheng Li, Jiaqiang Wang et al.
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
Yiyuan Liang, Zhiying Yan, Liqun Chen et al.
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos
Shankhanil Mitra, Rajiv Soundararajan
A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis
Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee
Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images
Ruohua Shi, Lingyu Duan, Tiejun Huang et al.
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang, Junliang He, Pengyu Wang et al.
Improved Anonymous Multi Agent Path Finding Algorithm
Zain Alabedeen Ali, Konstantin Yakovlev
Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets
Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.
Live and Learn: Continual Action Clustering with Incremental Views
Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang, Caroline Wang, Xuesu Xiao et al.
MegActor-Sigma: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer
Shurong Yang, Huadong Li, Juhao Wu et al.
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Yirui Chen, Xudong Huang, Quan Zhang et al.
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
Ji Soo Lee, Jongha Kim, Jeehye Na et al.
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
Zongkai Liu, Qian Lin, Chao Yu et al.
Motion-adaptive Transformer for Event-based Image Deblurring
Senyan Xu, Zhijing Sun, Mingchen Zhong et al.
Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning
Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.
TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series Analysis
Jiexi Liu, Meng Cao, Songcan Chen
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob Hollenstein, Georg Martius, Justus Piater
Deep Evidential Hashing for Trustworthy Cross-Modal Retrieval
Yuan Li, Liangli Zhen, Yuan Sun et al.
Time-Aware Knowledge Representations of Dynamic Objects with Multidimensional Persistence
Baris Coskunuzer, Ignacio Segovia-Dominguez, Yuzhou Chen et al.
CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning
Qiwei Li, Jiahuan Zhou
Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning
Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain
Xiang He, Dongcheng Zhao, Yang Li et al.
Text2City: One-Stage Text-Driven Urban Layout Regeneration
Yiming Qin, Nanxuan Zhao, Bin Sheng et al.
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Jiahao Chen, Zhou Feng, Rui Zeng et al.
Hierarchical Aligned Multimodal Learning for NER on Tweet Posts
Peipei Liu, Hong Li, Yimo Ren et al.
Colorizing Monochromatic Radiance Fields
Yean Cheng, Renjie Wan, Shuchen Weng et al.
Dehaze-RetinexGAN: Real-World Image Dehazing via Retinex-based Generative Adversarial Network
Xinran Wang, Guang Yang, Tian Ye et al.
An Item Is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng, Weikang Qiu, Jinbin Bai et al.
OpenViewer: Openness-Aware Multi-View Learning
Shide Du, Zihan Fang, Yanchao Tan et al.
Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process
Mingjie Xu, Feng Lu
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile
Seokjun Lee, Seung-Won Jung, Hyunseok Seo
Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection
Chenxu Wang, Chunyan Xu, Xiang Li et al.
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
Geigh Zollicoffer, Minh N. Vu, Ben Nebgen et al.
Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments
Marharyta Domnich, Julius Välja, Rasmus Moorits Veski et al.
RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images
Benzhi Wang, Jingkai Zhou, Jingqi Bai et al.
Robust 3D Tracking with Quality-Aware Shape Completion
Jingwen Zhang, Zikun Zhou, Guangming Lu et al.
Unsupervised Audio-Visual Segmentation with Modality Alignment
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia et al.
Knowledge Enhanced Representation Learning for Drug Discovery
Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.
Mixture of Experts as Representation Learner for Deep Multi-View Clustering
Yunhe Zhang, Jinyu Cai, Zhihao Wu et al.
Asynchronous Federated Clustering with Unknown Number of Clusters
Yunfan Zhang, Yiqun Zhang, Yang Lu et al.
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang, Ziluo Ding, Zongqing Lu
Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding
Hongzhi Zang, Yulun Zhang, He Jiang et al.
Improved Metric Distortion via Threshold Approvals
Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.
EWMoE: An Effective Model for Global Weather Forecasting with Mixture-of-Experts
Lihao Gan, Xin Man, Chenghong Zhang et al.
Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
YongJin Yang, Taehyeon Kim, Se-Young Yun
Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding
Depeng Li, Tianqi Wang, Junwei Chen et al.
Residual Hyperbolic Graph Convolution Networks
Yangkai Xue, Jindou Dai, Zhipeng Lu et al.
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
(Almost Full) EFX for Three (and More) Types of Agents
Pratik Ghosal, Vishwa Prakash HV, Prajakta Nimbhorkar et al.
Reachability of Fair Allocations via Sequential Exchanges
Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.
Robust Policy Learning via Offline Skill Diffusion
Woo Kyung Kim, Minjong Yoo, Honguk Woo
Revisiting Gradient Pruning: A Dual Realization for Defending against Gradient Attacks
Lulu Xue, Shengshan Hu, Ruizhi Zhao et al.
IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers
Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.
Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need
Jiaxuan Chen, Yu Qi, Yueming Wang et al.
PokerBench: Training Large Language Models to Become Professional Poker Players
Richard Zhuang, Akshat Gupta, Richard Yang et al.
Expressive Forecasting of 3D Whole-Body Human Motions
Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.
DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors
Xiaze Zhang, Ziheng Ding, Qi Jing et al.
A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning
Xiaoming Hu, Zilei Wang
Graph Context Transformation Learning for Progressive Correspondence Pruning
Junwen Guo, Guobao Xiao, Shiping Wang et al.
Multi-Granular Multimodal Clue Fusion for Meme Understanding
Li Zheng, Hao Fei, Ting Dai et al.