Most Cited AAAI "text embeddings fusion" Papers
5,317 papers found • Page 14 of 27
Learning to Reweight for Generalizable Graph Neural Network
Zhengyu Chen, Teng Xiao, Kun Kuang et al.
Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution
Luoqian Jiang, Yong Guo, Bingna Xu et al.
Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement
Kai Shang, Mingwen Shao, Chao Wang et al.
Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
Lingjing Xu, Yang Gao, Wenfeng Song et al.
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo
Jianfei Jiang, Liyong Wang, Haochen Yu et al.
ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling
Jianan Jiang, Hao Tang, Zhilin Jiang et al.
When to Show a Suggestion? Integrating Human Feedback in AI-Assisted Programming
Hussein Mozannar, Gagan Bansal, Adam Fourney et al.
Granularity-Adaptive Spatial Evidence Tokenization for Video Question Answering
Hao Jiang, Yang Jin, Zhicheng Sun et al.
FlexiTex: Enhancing Texture Generation via Visual Guidance
Dadong Jiang, Xianghui Yang, Zibo Zhao et al.
CatmullRom Splines-Based Regression for Image Forgery Localization
Li Zhang, Mingliang Xu, Dong Li et al.
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
Yueru Jia, Aosong Cheng, Yuhui Yuan et al.
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks
Alexander Jaus, Constantin Marc Seibold, Simon Reiß et al.
Molecular Optimization Model with Patentability Constraint
Sally Turutov, Kira Radinsky
Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior
Lee Hyoseok, Kyeong Seon Kim, Kwon Byung-Ki et al.
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
Yongle Huang, Haodong Chen, Zhenbang Xu et al.
DeRDaVa: Deletion-Robust Data Valuation for Machine Learning
Xiao Tian, Rachael Hwee Ling Sim, Jue Fan et al.
T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration
Chuxiong Sun, Zehua Zang, Jiabao Li et al.
DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching
Xiaofei Huang, Wenting Chen, Jie Liu et al.
Efficient Constrained K-center Clustering with Background Knowledge
Longkun Guo, Chaoqi Jia, Kewen Liao et al.
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction
Xiang Huang, Qing Zhang, Jian-Fang Hu et al.
Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution
Tingxuan Huang, Jiacheng Miao, Shizhuo Deng et al.
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen, Chi Zhang, Xiaofeng Yang et al.
Quantum-Inspired Neural Network with Runge-Kutta Method
Zipeng Fan, Jing Zhang, Peng Zhang et al.
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding
Muye Huang, Han Lai, Xinyu Zhang et al.
Stable Model Semantics for Description Logic Terminologies
Federica Di Stefano, Mantas Simkus
AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models
Lifeng Huang, Tian Su, Chengying Gao et al.
DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation
Songsong Yu, Yifan Wang, Yunzhi Zhuge et al.
Optimal Mechanism in a Dynamic Stochastic Knapsack Environment
Jihyeok Jung, Chan-Oi Song, Deok-Joo Lee et al.
Cross-Constrained Progressive Inference for 3D Hand Pose Estimation
ZheHan Kan, Xueting Hu, Zihan Liao et al.
Intelligent Calibration for Bias Reduction in Sentiment Corpora Annotation Process
Idan Toker, David Sarne, Jonathan Schler
Embedded Feature Selection on Graph-Based Multi-View Clustering
Guangfei Li, Haizhou Yang, Quanxue Gao et al.
A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis
Rongchao Zhang, Yu Huang, Yiwei Lou et al.
On the Convergence of an Adaptive Momentum Method for Adversarial Attacks
Sheng Long, Wei Tao, Shuohao Li et al.
Well, Now We Know! Unveiling Sarcasm: Initiating and Exploring Multimodal Conversations with Reasoning
Gopendra Singh, Mauajama Firdaus, Dushyant Singh Chauhan et al.
Transferable Video Moment Localization by Moment-Guided Query Prompting
Hao Jiang, Yang Yizhang, Yadong Mu
Energy-Efficient Streaming Time Series Classification with Attentive Power Iteration
Hao Huang, Tapan Shah, Scott Evans et al.
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning
Jizhou Wu, Jianye Hao, Tianpei Yang et al.
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang, Yuqing Wen, Yucheng Zhao et al.
Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach
Yuyang Chai, Zhuang Li, Jiahui Liu et al.
M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy
Hansong Zhang, Shikun Li, Pengju Wang et al.
Principle Component Trees and Their Persistent Homology
Ben Kizaric, Daniel Pimentel-Alarcon
Knowledge-Aware Explainable Reciprocal Recommendation
Kai-Huang Lai, Zhe-Rui Yang, Pei-Yuan Lai et al.
Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation
Xiao Hu, Libo Long, Jochen Lang
How to Use the Metropolis Algorithm for Multi-Objective Optimization?
Weijie Zheng, Mingfeng Li, Renzhong Deng et al.
Improving the Robustness of Knowledge-Grounded Dialogue via Contrastive Learning
Jiaan Wang, Jianfeng Qu, Kexin Wang et al.
FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving
Jie Hou, Jianghong Ma, Xiangyu Mu et al.
A Sequentially Fair Mechanism for Multiple Sensitive Attributes
Francois Hu, Philipp Ratz, Arthur Charpentier
From GARCH to Neural Network for Volatility Forecast
Pengfei Zhao, Haoren Zhu, Wilfred Ng et al.
WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection
Yan Hong, Jianming Feng, Haoxing Chen et al.
Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation
Miran Heo, Seoung Wug Oh, Seon Joo Kim et al.
TETRIS: Towards Exploring the Robustness of Interactive Segmentation
Andrey Moskalenko, Vlad Shakhuro, Anna Vorontsova et al.
Strong Baselines for Parameter-Efficient Few-Shot Fine-Tuning
Samyadeep Basu, Shell Hu, Daniela Massiceti et al.
MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music
Shuai Yu
End-to-End Real-Time Vanishing Point Detection with Transformer
Xin Tong, Shi Peng, Yufei Guo et al.
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
Yuwen He, Wei Wang, Wanyu Wu et al.
Learning Hybrid Dynamics Models with Simulator-Informed Latent States
Katharina Ensinger, Sebastian Ziesche, Sebastian Trimpe
Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution
Ruian He, Ri Cheng, Xinkai Lyu et al.
Working Memory Capacity of ChatGPT: An Empirical Study
Dongyu Gong, Xingchen Wan, Dingmin Wang
Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement
Gang He, Guancheng Quan, Chang Wu et al.
AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model
Teng Hu, Jiangning Zhang, Ran Yi et al.
Sterling: Synergistic Representation Learning on Bipartite Graphs
Baoyu Jing, Yuchen Yan, Kaize Ding et al.
From Coarse to Fine: A Distillation Method for Fine-Grained Emotion-Causal Span Pair Extraction in Conversation
Xinhao Chen, Chong Yang, Changzhi Sun et al.
Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits
Qingsong Liu, Zhixuan Fang
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao, Junshu Tang, Jiangning Zhang et al.
S2CycleDiff: Spatial-Spectral-Bilateral Cycle-Diffusion Framework for Hyperspectral Image Super-resolution
Jiahui Qu, Jie He, Wenqian Dong et al.
AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
Zihao Han, Baoquan Zhang, Lisai Zhang et al.
ConditionVideo: Training-Free Condition-Guided Video Generation
Bo Peng, Xinyuan Chen, Yaohui Wang et al.
A Plug-and-Play Quaternion Message-Passing Module for Molecular Conformation Representation
Angxiao Yue, Dixin Luo, Hongteng Xu
3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands
Xuan Huang, Hanhui Li, Zejun Yang et al.
BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials
Xingrun Xing, Li Du, Xinyuan Wang et al.
Harnessing the Power of SVD: An SVA Module for Enhanced Signal Classification
Lei Zhai, Shuyuan Yang, Yitong Li et al.
OpenVIS: Open-vocabulary Video Instance Segmentation
Pinxue Guo, Hao Huang, Peiyang He et al.
Mixed Geometry Message and Trainable Convolutional Attention Network for Knowledge Graph Completion
Bin Shang, Yinliang Zhao, Jun Liu et al.
MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance
Jialong Guo, Ke Liu, Jiangchao Yao et al.
Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting
Haojie Guo, Junyu Gao, Yuan Yuan
Partial Multi-View Clustering via Self-Supervised Network
Qianqian Wang, Guoshuai Sheng, Quanxue Gao et al.
LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs
Yan Wang, Zhixuan Chu, Xin Ouyang et al.
MAPTree: Beating “Optimal” Decision Trees with Bayesian Decision Trees
Colin Sullivan, Mo Tiwari, Sebastian Thrun
Cross-Spectral Gaussian Splatting with Spatial Occupancy Consistency
Haipeng Guo, Huanyu Liu, Jiazheng Wen et al.
You Should Learn to Stop Denoising on Point Clouds in Advance
Chuchen Guo, Weijie Zhou, Zheng Liu et al.
Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains
Yu Zhang, Yunyi Zhang, Yanzhen Shen et al.
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer
Xianchao Guan, Yifeng Wang, Ye Zhang et al.
Divergence-Guided Simultaneous Speech Translation
Xinjie Chen, Kai Fan, Wei Luo et al.
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang et al.
Towards Inductive Robustness: Distilling and Fostering Wave-Induced Resonance in Transductive GCNs against Graph Adversarial Attacks
Ao Liu, Wenshan Li, Tao Li et al.
Learning Time Slot Preferences via Mobility Tree for Next POI Recommendation
Tianhao Huang, Xuan Pan, Xiangrui Cai et al.
Amodal Scene Analysis via Holistic Occlusion Relation Inference and Generative Mask Completion
Bowen Zhang, Qing Liu, Jianming Zhang et al.
Domain Generalized Medical Landmark Detection via Robust Boundary-Aware Pre-Training
Haifan Gong, Yu Lu, Xiang Wan et al.
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
Xianqiang Gao, Pingrui Zhang, Delin Qu et al.
Resource Democratization: Is Compute the Binding Constraint on AI Research?
Rebecca Gelles, Veronica Kinoshita, Micah Musser et al.
Imitation of Life: A Search Engine for Biologically Inspired Design
Hen Emuna, Nadav Borenstein, Xin Qian et al.
Efficient Look-Up Table from Expanded Convolutional Network for Accelerating Image Super-resolution
Kai Yin, Jie Shen
Quantum Interference Model for Semantic Biases of Glosses in Word Sense Disambiguation
Junwei Zhang, Ruifang He, Fengyu Guo et al.
Instance-Conditional Timescales of Decay for Nonstationary Learning
Nishant Jain, Pradeep Shenoy
Integer Is Enough: When Vertical Federated Learning Meets Rounding
Pengyu Qiu, Yuwen Pu, Yongchao Liu et al.
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning
Jun Gao, Qian Qiao, Tianxiang Wu et al.
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction
Lianqiang Gan, Junyu Lai, Jingze Ju et al.
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark
Keke Gai, Dongjue Wang, Jing Yu et al.
Your Career Path Matters in Person-Job Fit
Zhuocheng Gong, Yang Song, Tao Zhang et al.
Graph Neural Networks with Soft Association between Topology and Attribute
Yachao Yang, Yanfeng Sun, Shaofan Wang et al.
BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining
Chenlin Fu, Yingying Zhu
Simplifying Control Mechanism in Text-to-Image Diffusion Models
Zhida Feng, Li Chen, Yuenan Sun et al.
Deletion-Robust Submodular Maximization with Knapsack Constraints
Shuang Cui, Kai Han, He Huang
D3: A Methodological Exploration of Domain Division, Modeling, and Balance in Multi-Domain Recommendations
Pengyue Jia, Yichao Wang, Shanru Lin et al.
Big Learning Expectation Maximization
Yulai Cong, Sijia Li
HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation
Tonghui Feng, Chunsheng Yan, Qianru Wang et al.
United We Stand: Accelerating Privacy-Preserving Neural Inference by Conjunctive Optimization with Interleaved Nexus
Qiao Zhang, Tao Xiang, Chunsheng Xin et al.
SkeletonGait: Gait Recognition Using Skeleton Maps
Chao Fan, Jingzhe Ma, Dongyang Jin et al.
Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
Zhuqiang Lu, Kun Hu, Chaoyue Wang et al.
Video Frame Prediction from a Single Image and Events
Juanjuan Zhu, Zhexiong Wan, Yuchao Dai
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
Chun-Mei Feng, Yang Bai, Tao Luo et al.
CcDPM: A Continuous Conditional Diffusion Probabilistic Model for Inverse Design
Yanxuan Zhao, Peng Zhang, Guopeng Sun et al.
Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization:
Peijun Bao, Zihao Shao, Wenhan Yang et al.
Fine-Grained Multi-View Hand Reconstruction Using Inverse Rendering
Qijun Gan, Wentong Li, Jinwei Ren et al.
Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration
Huangxing Lin, Yuhang Dong, Xinghao Ding et al.
Curriculum-Enhanced Residual Soft An-Isotropic Normalization for Over-Smoothness in Deep GNNs
Jin Li, Qirong Zhang, Shuling Xu et al.
Friendly Attacks to Improve Channel Coding Reliability
Anastasiia Kurmukova, Deniz Gunduz
CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework
Han Fang, Kejiang Chen, Zijin Yang et al.
EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs
Zhen Fan, Peng Dai, Zhuo Su et al.
EventPillars: Pillar-based Efficient Representations for Event Data
Rui Fan, Weidong Hao, Juntao Guan et al.
Altruism in Facility Location Problems
Hau Chan, Minming Li, Houyu Zhou
Deep Semantic Graph Transformer for Multi-View 3D Human Pose Estimation
Lijun Zhang, Kangkang Zhou, Feng Lu et al.
Is a Large Language Model a Good Annotator for Event Extraction?
Ruirui Chen, Chengwei Qin, Weifeng Jiang et al.
Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization
Haozhi Fan, Yuan Cao
Dual-View Whitening on Pre-trained Text Embeddings for Sequential Recommendation
Lingzi Zhang, Xin Zhou, Zhiwei Zeng et al.
A Diffusion-Based Framework for Occluded Object Movement
Zheng-Peng Duan, Jiawei Zhang, Siyu Liu et al.
Clarifying the Behavior and the Difficulty of Adversarial Training
Xu Cheng, Hao Zhang, Yue Xin et al.
Improve Robustness of Reinforcement Learning against Observation Perturbations via l∞ Lipschitz Policy Networks
Buqing Nie, Jingtian Ji, Yangqing Fu et al.
PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology
Yuxuan Sun, Chenglu Zhu, Sunyi Zheng et al.
HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions
Keyu Du, Hao Xu, Haipeng Li et al.
Delegation-Relegation for Boolean Matrix Factorization
Florent Avellaneda, Roger Villemaire
GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach
Chenghu Du, Junyin Wang, Yi Rong et al.
DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations
Guojun Xiong, Gang Yan, Shiqiang Wang et al.
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.
Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models
Guanqi Ding, Chengyu Yang, Shuhui Wang et al.
Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval
Yu Liu, Guihe Qin, Haipeng Chen et al.
Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations
Lei Li, Jianxun Lian, Xiao Zhou et al.
Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation
Yuhui Deng, Yuqin Lu, Yangyang Xu et al.
Weisfeiler and Lehman Go Paths: Learning Topological Features via Path Complexes
Quang Truong, Peter Chin
OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion
Shangqi Deng, Jun Ma, Liang-Jian Deng et al.
Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu
DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu, Zhixin Cheng et al.
Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling
Hui Deng, Jiawei Shi, Zhen Qin et al.
Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion
Yuqin Dai, Wanlu Zhu, Ronghui Li et al.
Opponent-Model Search in Games with Incomplete Information
Junkang Li, Bruno Zanuttini, Véronique Ventos
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud
Tao Dai, Yanzi Wang, Jianyu Xiong et al.
MASS: Overcoming Language Bias in Image-Text Matching
Jiwan Chung, Seungwon Lim, Sangkyu Lee et al.
LaViP: Language-Grounded Visual Prompting
Nilakshan Kunananthaseelan, Jing Zhang, Mehrtash Harandi
Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion
Jisheng Chu, Wenrui Li, Xingtao Wang et al.
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
Yongjin Choi, Chanhun Park, Seung Jun Baek
Self-Interpretable Graph Learning with Sufficient and Necessary Explanations
Jiale Deng, Yanyan Shen
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning
Shaohui Peng, Xing Hu, Qi Yi et al.
Adaptive Graph Learning for Multimodal Conversational Emotion Detection
Geng Tu, Tian Xie, Bin Liang et al.
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
Aram Davtyan, Paolo Favaro
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces
Wonhyeok Choi, Kyumin Hwang, Minwoo Choi et al.
p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models
Haoyuan Wu, Xinyun Zhang, Peng Xu et al.
Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning
Zeping Min, Jinfeng Bai, Chengfei Li
Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
Chao Li, Yupeng Zhang, Jianqi Wang et al.
Inertial Algorithm with Dry Fraction and Convolutional Sparse Coding for 3D Localization with Light Field Microscopy
Xiaofan Wang, Zhiyuan Deng, Changle Wang et al.
RG-GAN: Dynamic Regenerative Pruning for Data-Efficient Generative Adversarial Networks
Divya Saxena, Jiannong Cao, Jiahao Xu et al.
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui, Aryan Deshwal, Nghia Hoang et al.
On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria
Brian Zhang, Tuomas Sandholm
AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye, Guang Liu, Xinya Wu et al.
Computing Nash Equilibria in Potential Games with Private Uncoupled Constraints
Nikolas Patris, Stelios Stavroulakis, Fivos Kalogiannis et al.
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation
Zesen Cheng, Kehan Li, Li Hao et al.
Effective Diffusion Transformer Architecture for Image Super-Resolution
Kun Cheng, Lei Yu, Zhijun Tu et al.
SeqRank: Sequential Ranking of Salient Objects
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions
Zhiyuan Chen, Jiajiong Cao, Zhiquan Chen et al.
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Zhipeng Chen, Lan Yang, Yonggang Qi et al.
Preference Aware Dual Contrastive Learning for Item Cold-Start Recommendation
Wenbo Wang, Bingquan Liu, Lili Shan et al.
Learning Performance Maximizing Ensembles with Explainability Guarantees
Vincent Pisztora, Jia Li
Communication Efficient Distributed Newton Method over Unreliable Networks
Ming Wen, Chengchang Liu, Yuedong Xu
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Zehao Chen, Rong Pan
Continual Vision-Language Retrieval via Dynamic Knowledge Rectification
Zhenyu Cui, Yuxin Peng, Xun Wang et al.
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
Yitong Chen, Wenhao Yao, Lingchen Meng et al.
Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution
Yutao Yuan, Chun Yuan
Inconsistency-Based Data-Centric Active Open-Set Annotation
Ruiyu Mao, Ouyang Xu, Yunhui Guo
Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention
Naman Shah, Siddharth Srivastava
Optimizing the Optimization of Planning Domains by Automatic Action Schema Splitting
Mojtaba Elahi, Jussi Rintanen
Generator Assisted Mixture of Experts for Feature Acquisition in Batch
Vedang Asgaonkar, Aditya Jain, Abir De
Efficient Learning in Polyhedral Games via Best-Response Oracles
Darshan Chakrabarti, Gabriele Farina, Christian Kroer
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
Yi Chen, Jian Xu, Xu-Yao Zhang et al.
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation
Jing Li, Junsong Fan, Yuran Yang et al.
From Toxic to Trustworthy: Using Self-Distillation and Semi-supervised Methods to Refine Neural Networks
Xianda Zhang, Baolin Zheng, Jianbao Hu et al.
MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs
Ke Liang, Lingyuan Meng, Sihang Zhou et al.
ACAMDA: Improving Data Efficiency in Reinforcement Learning through Guided Counterfactual Data Augmentation
Yuewen Sun, Erli Wang, Biwei Huang et al.
Towards Safe Policy Learning under Partial Identifiability: A Causal Approach
Shalmali Joshi, Junzhe Zhang, Elias Bareinboim
Composing Biases by Using CP to Decompose Minimal Functional Dependencies for Acquiring Complex Formulae
Ramiz Gindullin, Nicolas Beldiceanu, Jovial Cheukam Ngouonou et al.
SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents
Wei Xiang, Haoteng Yin, He Wang et al.
Diffusion Models for Attribution
Xiongren Chen, Jiuyong Li, Jixue Liu et al.
Computing the Why-Provenance for Datalog Queries via SAT Solvers
Haitong Luo, Xuying Meng, Suhang Wang et al.
Deep Incomplete Multi-View Learning Network with Insufficient Label Information
Zhangqi Jiang, Tingjin Luo, Xinyan Liang
Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution
Sen Chen, Hongying Liu, Chaowei Fang et al.
CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection
Qibo Chen, Weizhong Jin, Jianyue Ge et al.
Improving Audio-Visual Segmentation with Bidirectional Generation
Dawei Hao, Yuxin Mao, Bowen He et al.
Towards Multi-Mode Outlier Robust Tensor Ring Decomposition
Yuning Qiu, Guoxu Zhou, Andong Wang et al.
A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion Recognition
Han Lu, Xiahai Zhuang, Qiang Luo
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
Lu Chen, Shaofeng Li, Benhao Huang et al.