Most Cited AAAI "clip-activated learning" Papers
Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach
Ziliang Chen, Yongsen Zheng, Zhao-Rong Lai et al.
A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image Segmentation
Feilong Xu, Feiyang Yang, Xiongfei Li et al.
Scaling Few-Shot Learning for the Open World
Zhipeng Lin, Wenjing Yang, Haotian Wang et al.
MHBench: Demystifying Motion Hallucination in VideoLLMs
Ming Kong, Xianzhou Zeng, Luyuan Chen et al.
Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction
K. Chan, Fayao Liu, Guosheng Lin et al.
HiCM²: Hierarchical Compact Memory Modeling for Dense Video Captioning
Minkuk Kim, Hyeon Bae Kim, Jinyoung Moon et al.
CutFreq: Cut-and-Swap Frequency Components for Low-Level Vision Augmentation
Hongyang Chen, Kaisheng Ma
Dynamic Spiking Graph Neural Networks
Yin Nan, Mengzhu Wang, Zhenghan Chen et al.
C2PD: Continuity-Constrained Pixelwise Deformation for Guided Depth Super-Resolution
Jiahui Kang, Qing Cai, Runqing Tan et al.
Learning to Approximate Adaptive Kernel Convolution on Graphs
Jaeyoon Sim, Sooyeon Jeon, InJun Choi et al.
Decoupled Training: Return of Frustratingly Easy Multi-Domain Learning
Ximei Wang, Junwei Pan, Xingzhuo Guo et al.
Learning Multi-Object Positional Relationships via Emergent Communication
Yicheng Feng, Boshi An, Zongqing Lu
Swift-Mapping: Online Neural Implicit Dense Mapping in Urban Scenes
Ke Wu, Kaizhao Zhang, Mingzhe Gao et al.
Spatial-Related Sensors Matters: 3D Human Motion Reconstruction Assisted with Textual Semantics
Xueyuan Yang, Chao Yao, Xiaojuan Ban
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos
Yingying Jiao, Zhigang Wang, Sifan Wu et al.
Combining Multiple Supervision for Robust Zero-Shot Dense Retrieval
Yan Fang, Qingyao Ai, Jingtao Zhan et al.
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement
Nan Jiang, Shanchao Liang, Chengxiao Wang et al.
Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples
Yeyuan Wang, Dehong Gao, Lei Yi et al.
Friendly Attacks to Improve Channel Coding Reliability
Anastasiia Kurmukova, Deniz Gunduz
GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Alignment and Correspondence Distillation
Xu Wang, Zilei Wang, Zihan Lin
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo
Jianfei Jiang, Liyong Wang, Haochen Yu et al.
TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
Xingrui Wang, Xin Li, Yaosi Hu et al.
Big Learning Expectation Maximization
Yulai Cong, Sijia Li
TokenMatcher: Diverse Tokens Matching for Unsupervised Visible-Infrared Person Re-Identification
Xiao Wang, Lekai Liu, Bin Yang et al.
Relational Distant Supervision for Image Captioning without Image-Text Pairs
Yayun Qi, Wentian Zhao, Xinxiao Wu
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
Mingshen Wang, Zhao Zhang, Feng Li et al.
Debiased Distillation for Consistency Regularization
Lu Wang, Liuchi Xu, Xiong Yang et al.
Learning Diffusions under Uncertainty
Hao Huang, Qian Yan, Keqi Han et al.
PICNN: A Pathway towards Interpretable Convolutional Neural Networks
Wengang Guo, Jiayi Yang, HuiLin Yin et al.
WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Kenichiro Ando, S. Sekine, Mamoru Komachi
MA-Net: Rethinking Neural Unit in the Light of Astrocytes
Mengqiao Han, Liyuan Pan, Xiabi Liu
A Graph Dynamics Prior for Relational Inference
Liming Pan, Cheng Shi, Ivan Dokmanic
Evaluate Geometry of Radiance Fields with Low-Frequency Color Prior
Qihang Fang, Yafei Song, Keqiang Li et al.
CEDFlow: Latent Contour Enhancement for Dark Optical Flow Estimation
Fengyuan Zuo, Zhaolin Xiao, Haiyan Jin et al.
HUANG: A Robust Diffusion Model-based Targeted Adversarial Attack Against Deep Hashing Retrieval
Chihan Huang, Xiaobo Shen
Confusing Pair Correction Based on Category Prototype for Domain Adaptation under Noisy Environments
Churan Zhi, Junbao Zhuo, Shuhui Wang
A Training-free Synthetic Data Selection Method for Semantic Segmentation
Hao Tang, Siyue Yu, Jian Pang et al.
ALLVB: All-in-One Long Video Understanding Benchmark
Xichen Tan, Yuanjing Luo, Yunfan Ye et al.
ConSense: Continually Sensing Human Activity with WiFi via Growing and Picking
Rong Li, Tao Deng, Siwei Feng et al.
Revisiting Interpolation for Noisy Label Correction
Yuanzhuo Xu, Xiaoguang Niu, Jie Yang et al.
Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks
Bo Li, Wei Ye, Quansen Wang et al.
Scalable Motion Style Transfer with Constrained Diffusion Generation
Wenjie Yin, Yi Yu, Hang Yin et al.
MAPTree: Beating “Optimal” Decision Trees with Bayesian Decision Trees
Colin Sullivan, Mo Tiwari, Sebastian Thrun
Decentralized Federated Learning with Model Caching on Mobile Agents
Xiaoyu Wang, Guojun Xiong, Houwei Cao et al.
Unify Named Entity Recognition Scenarios via Contrastive Real-Time Updating Prototype
Yanhe Liu, Peng Wang, Ke Wenjun et al.
Competition among Pairwise Lottery Contests
Xiaotie Deng, Hangxin Gan, Ningyuan Li et al.
Optimal Bounds for Dissatisfaction in Perpetual Voting
Alexander Kozachinskiy, Alexander Shen, Tomasz Steifer
Conformal Inference of Individual Treatment Effects Using Conditional Density Estimates
Baozhen Wang, Xingye Qiao
Improved Maximin Share Approximations for Chores by Bin Packing
Jugal Garg, Xin Huang, Erel Segal-Halevi
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
Yuwen He, Wei Wang, Wanyu Wu et al.
Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training
Xi Chen, Chang Gao, Zuowen Wang et al.
Every Bit Helps: Achieving the Optimal Distortion with a Few Queries
Soroush Ebadian, Nisarg Shah
Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies
Hyunchai Jeong, Adiba Ejaz, Jin Tian et al.
The Value of Recall in Extensive-Form Games
Ratip Emin Berker, Emanuel Tewolde, Ioannis Anagnostides et al.
Proxyformer: Nystrom-Based Linear Transformer with Trainable Proxy Tokens
SangHo Lee, Hayun Lee, Dongkun Shin
EF2X Exists for Four Agents
Arash Ashuri, Vasilis Gkatzelis, Alkmini Sgouritsa
Operationalising Rawlsian Ethics for Fairness in Norm Learning Agents
Jessica Woodgate, Paul Marshall, Nirav Ajmeri
ConSequence: Synthesizing Logically Constrained Sequences for Electronic Health Record Generation
Brandon Theodorou, Shrusti Jain, Cao Xiao et al.
Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification
Zijie Zhou, Zhaoqi Lu, Xuekai Wei et al.
ExcluIR: Exclusionary Neural Information Retrieval
Wenhao Zhang, Mengqi Zhang, Shiguang Wu et al.
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
Junxuan Zhang, Zhengxue Cheng, Yan Zhao et al.
Distributed Manifold Hashing for Image Set Classification and Retrieval
Xiaobo Shen, Peizhuo Song, Yun-Hao Yuan et al.
Molecular Optimization Model with Patentability Constraint
Sally Turutov, Kira Radinsky
Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space
Xiaoyan Yu, Yifan Wei, Shuaishuai Zhou et al.
Disentangling Tabular Data Towards Better One-Class Anomaly Detection
Jianan Ye, Zhaorui Tan, Yijie Hu et al.
Divide-Solve-Combine: An Interpretable and Accurate Prompting Framework for Zero-shot Multi-Intent Detection
Libo Qin, Qiguang Chen, Jingxuan Zhou et al.
Mental-Perceiver: Audio-Textual Multi-Modal Learning for Estimating Mental Disorders
Jinghui Qin, Changsong Liu, Tianchi Tang et al.
ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks
Renshuai Tao, Manyi Le, Chuangchuang Tan et al.
Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay
Ruiheng Liu, Jinyu Zhang, Yanqi Song et al.
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
Suho Park, SuBeen Lee, Hyun Seok Seong et al.
Parameterized Projected Bellman Operator
Théo Vincent, Alberto Maria Metelli, Boris Belousov et al.
No Head Left Behind – Multi-Head Alignment Distillation for Transformers
Tianyang Zhao, Kunwar Singh, Srikar Appalaraju et al.
KAES: Multi-aspect Shared Knowledge Finding and Aligning for Cross-prompt Automated Scoring of Essay Traits
Xia Li, Wenjing Pan
Ahpatron: A New Budgeted Online Kernel Learning Machine with Tighter Mistake Bound
Yun Liao, Junfan Li, Shizhong Liao et al.
Strategyproof Mechanisms for Group-Fair Obnoxious Facility Location Problems
Jiaqian Li, Minming Li, Hau Chan
Pioneer: Physics-informed Riemannian Graph ODE for Entropy-increasing Dynamics
Li Sun, Ziheng Zhang, Zixi Wang et al.
UniFORM: Towards Unified Framework for Anomaly Detection on Graphs
Chuancheng Song, Xixun Lin, Hanyang Shen et al.
Deciphering Compatibility Relationships with Textual Descriptions via Extraction and Explanation
Yu Wang, Zexue He, Zhankui He et al.
END²: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions
Nan Sun, Han Fang, Yuxing Lu et al.
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
Quanmin Wei, Penglin Dai, Wei Li et al.
Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces
Xiaotian Hao, Jianye Hao, Chenjun Xiao et al.
Prior and Prediction Inverse Kernel Transformer for Single Image Defocus Deblurring
Peng Tang, Zhiqiang Xu, Chunlai Zhou et al.
GraSP: Simple Yet Effective Graph Similarity Predictions
Haoran Zheng, Jieming Shi, Renchi Yang
Semantic-Aware Data Augmentation for Text-to-Image Synthesis
Zhaorui Tan, Xi Yang, Kaizhu Huang
s-ID: Causal Effect Identification in a Sub-population
Amir Mohammad Abouei, Ehsan Mokhtarian, Negar Kiyavash
SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition
Haoran Zhang, Xiangdong Su, Xingxiang Zhou et al.
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Long-Fei Li, Peng Zhao, Zhi-Hua Zhou
SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection
Jiahao Wang, Caixia Yan, Weizhan Zhang et al.
CAKE: Category Aware Knowledge Extraction for Open-Vocabulary Object Detection
Shiyuan Ma, Donglin Qian, Kai Ye et al.
HI-DR: Exploiting Health Status-Aware Attention and an EHR Graph+ for Effective Medication Recommendation
Taeri Kim, Jiho Heo, Hyunjoon Kim et al.
An LLM-Empowered Adaptive Evolutionary Algorithm for Multi-Component Deep Learning Systems
Haoxiang Tian, Xingshuo Han, Guoquan Wu et al.
Conditional Diffusion Models Based Conditional Independence Testing
Yanfeng Yang, Shuai Li, Yingjie Zhang et al.
HyperMixer: Specializable Hypergraph Channel Mixing for Long-term Multivariate Time Series Forecasting
Changyuan Tian, Zhicong Lu, Zequn Zhang et al.
Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs
Junjie Huang, Jiarui Qin, Yong Yu et al.
Backward Responsibility in Transition Systems Using General Power Indices
Christel Baier, Roxane van den Bossche, Sascha Klüppelholz et al.
Single-View Graph Contrastive Learning with Soft Neighborhood Awareness
Qingqiang Sun, Chaoqi Chen, Ziyue Qiao et al.
Rethinking U-Net: Task-Adaptive Mixture of Skip Connections for Enhanced Medical Image Segmentation
Zichen Luo, Xinshan Zhu, Lan Zhang et al.
ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC)
Kartik Singhal, Gautam Shroff
Learning by Erasing: Conditional Entropy Based Transferable Out-of-Distribution Detection
Meng Xing, Zhiyong Feng, Yong Su et al.
Towards Generalizable Multi-Camera 3D Object Detection via Perspective Rendering
Hao Lu, Yunpeng Zhang, Guoqing Wang et al.
VPDETR: End-to-End Vanishing Point DEtection TRansformers
Taiyan Chen, Xianghua Ying, Jinfa Yang et al.
Enhancing Multi-Label Classification via Dynamic Label-Order Learning
Jiangnan Li, Yice Zhang, Shiwei Chen et al.
Neural Reasoning for Sure Through Constructing Explainable Models
Tiansi Dong, Mateja Jamnik, Pietro Liò
User Preference Meets Pareto-Optimality in Multi-Objective Bayesian Optimization
Joshua Hang Sai Ip, Ankush Chakrabarty, Ali Mesbah et al.
Max-Mahalanobis Anchors Guidance for Multi-View Clustering
Pei Zhang, Yuangang Pan, Siwei Wang et al.
Enhancing SQL Query Generation with Neurosymbolic Reasoning
Henrijs Princis, Cristina David, Alan Mycroft
On Corruption-Robustness in Performative Reinforcement Learning
Vasilis Pollatos, Debmalya Mandal, Goran Radanovic
SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning
Tianhao Peng, Xuhong Li, Haitao Yuan et al.
Benchmarking and Understanding Compositional Relational Reasoning of LLMs
Ruikang Ni, Da Xiao, Qingye Meng et al.
DualDynamics: Synergizing Implicit and Explicit Methods for Robust Irregular Time Series Analysis
YongKyung Oh, Dong-Young Lim, Sungil Kim
Adversarial Attacks on Federated-Learned Adaptive Bitrate Algorithms
Ruixiao Zhang, Tianchi Huang
Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions
Youngmin Oh, Hyunju Lee, Bumsub Ham
Improving the Lower Bound in Branch-and-Bound Algorithms for MaxSAT
Shuolin Li, Chu-Min Li, Jordi Coll et al.
Training Matting Models Without Alpha Labels
Wenze Liu, Zixuan Ye, Hao Lu et al.
InstructOCR: Instruction Boosting Scene Text Spotting
Chen Duan, Qianyi Jiang, Pei Fu et al.
Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization
Qi Zhang, Yi Zhou, Ashley Prater-Bennette et al.
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Tao Liu, Rongjie Li, Chongyu Wang et al.
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
Huan Ma, Yan Zhu, Changqing Zhang et al.
Bridge Diffusion Model: Bridge Chinese Text-to-Image Diffusion Model with English Communities
Shanyuan Liu, Bo Cheng, Yuhang Ma et al.
AGMixup: Adaptive Graph Mixup for Semi-supervised Node Classification
Weigang Lu, Ziyu Guan, Wei Zhao et al.
Epsilon: Exploring Comprehensive Visual-Semantic Projection for Multi-Label Zero-Shot Learning
Ziming Liu, Jingcai Guo, Song Guo et al.
AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries
Pengwei Liu, Pengkai Wang, Xingyu Ren et al.
Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective
Xingdi Chen, Yu Xiong, Kai Yang
Enhancing Masked Time-Series Modeling via Dropping Patches
Tianyu Qiu, Yi Xie, Hao Niu et al.
ReX: A Framework for Incorporating Temporal Information in Model-Agnostic Local Explanation Techniques
Junhao Liu, Xin Zhang
ProtoCar: Learning 3D Vehicle Prototypes from Single-View and Unconstrained Driving Scene Images
Hongyuan Liu, Haochen Yu, Bochao Zou et al.
Visual Reinforcement Learning with Residual Action
Zhenxian Liu, Peixi Peng, Yonghong Tian
Towards Scalable and Deep Graph Neural Networks via Noise Masking
Yuxuan Liang, Wentao Zhang, Zeang Sheng et al.
AgentMixer: Multi-Agent Correlated Policy Factorization
Zhiyuan Li, Wenshuai Zhao, Lijun Wu et al.
Sparse Variational Student-t Processes
Jian Xu, Delu Zeng
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration
Yunshuai Zhou, Junbo Qiao, Jincheng Liao et al.
Text to Point Cloud Localization with Multi-Level Negative Contrastive Learning
Dunqiang Liu, Shujun Huang, Wen Li et al.
Emergent Communication for Numerical Concepts Generalization
Enshuai Zhou, Yifan Hao, Rui Zhang et al.
Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control
Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang et al.
MindMap: Constructing Evidence Chains for Multi-Step Reasoning in Large Language Models
Yangyu Wu, Xu Han, Wei Song et al.
Semi-Supervised Online Cross-Modal Hashing
Xiao Kang, Xingbo Liu, Xuening Zhang et al.
Backdoor Attack on Propagation-based Rumor Detectors
Di Jin, Yujun Zhang, Bingdao Feng et al.
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation
Min Lin, Gangwei Xu, Yun Wang et al.
DCSF-KD: Dynamic Channel-wise Spatial Feature Knowledge Distillation for Object Detection
Tao Dai, Yang Lin, Hang Guo et al.
Limitations of Face Image Generation
Harrison Rosenberg, Shimaa Ahmed, Guruprasad Ramesh et al.
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization
Pedro Orvalho, Mikoláš Janota, Vasco M. Manquinho
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
Chaeyeon Chung, Sunghyun Park, Jeongho Kim et al.
Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework
Guiyu Zhao, Zhentao Guo, Zewen Du et al.
Exploiting Geometry for Treatment Effect Estimation via Optimal Transport
Yuguang Yan, Zeqin Yang, Weilin Chen et al.
RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning
Haorong Han, Jidong Yuan, Chixuan Wei et al.
DivGCL: A Graph Contrastive Learning Model for Diverse Recommendation
Wenwen Gong, Yangliao Geng, Dan Zhang et al.
UniCell: Universal Cell Nucleus Classification via Prompt Learning
Junjia Huang, Haofeng Li, Xiang Wan et al.
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
Wenbo Zhang, Lu Zhang, Ping Hu et al.
Enhancing Robustness in Incremental Learning with Adversarial Training
Seungju Cho, Hongsin Lee, Changick Kim
DeepSN: A Sheaf Neural Framework for Influence Maximization
Asela Hevapathige, Qing Wang, Ahad N. Zehmakan
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
Yayuan Li, Jintao Guo, Lei Qi et al.
PHFormer: Multi-Fragment Assembly Using Proxy-Level Hybrid Transformer
Wenting Cui, Runzhao Yao, Shaoyi Du
Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval
Cheol-Ho Cho, WonJun Moon, WooJin Jun et al.
Bayesian Low-Rank Learning (Bella): A Practical Approach to Bayesian Neural Networks
Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo et al.
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System
Ruining Zhang, Haoran Han, Maolong Lv et al.
Hyperbolic-Constraint Point Cloud Reconstruction from Single RGB-D Images
Wenrui Li, Zhe Yang, Wei Han et al.
SADBA: Self-Adaptive Distributed Backdoor Attack Against Federated Learning
Jun Feng, Yuzhe Lai, Hong Sun et al.
A Similarity Paradigm Through Textual Regularization Without Forgetting
Fangming Cui, Jan Fong, Rongfei Zeng et al.
Collaborative Evolution: Multi-Round Learning Between Large and Small Language Models for Emergent Fake News Detection
Ziyi Zhou, Xiaoming Zhang, Shenghan Tan et al.
Generalization Analysis for Deep Contrastive Representation Learning
Nong Minh Hieu, Antoine Ledent, Yunwen Lei et al.
Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors
Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang
Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers
Qi Deng, Shuaicheng Niu, Ronghao Zhang et al.
Universality of Real Minimal Complexity Reservoir
Robert Simon Fong, Boyu Li, Peter Tino
MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components
Min Wang, Xin Li, Leiji Zhang et al.
Revisiting CAD Model Generation by Learning Raster Sketch
Pu Li, Wenhao Zhang, Jianwei Guo et al.
Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
Dongyan Huo, Yudong Chen, Qiaomin Xie
Action-Agnostic Point-Level Supervision for Temporal Action Detection
Shuhei M. Yoshida, Takashi Shibata, Makoto Terao et al.
RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning
Kanghoon Yoon, Kibum Kim, Jaehyeong Jeon et al.
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models
Yupeng Chen, Penglin Chen, Xiaoyu Zhang et al.
When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning
Naheed Anjum Arafat, Debabrota Basu, Yulia Gel et al.
HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction
Angtian Wang, Yuanlu Xu, Nikolaos Sarafianos et al.
Active Fourier Auditor for Estimating Distributional Properties of ML Models
Ayoub Ajarra, Bishwamittra Ghosh, Debabrota Basu
AIF-SFDA: Autonomous Information Filter Driven Source-Free Domain Adaptation for Medical Image Segmentation
Haojin Li, Heng Li, Jianyu Chen et al.
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence
Hao Li, Hao Fei, Zechao Hu et al.
A Practical Approach to Causal Inference over Time
Martina Cinquini, Isacco Beretta, Salvatore Ruggieri et al.
NaviFormer: A Spatio-Temporal Context-Aware Transformer for Object Navigation
Wei Xie, Haobo Jiang, Yun Zhu et al.
Towards Audio-Visual Navigation in Noisy Environments: A Large-Scale Benchmark Dataset and an Architecture Considering Multiple Sound-Sources
Zhanbo Shi, Lin Zhang, Linfei Li et al.
A General Theoretical Framework for Learning Smallest Interpretable Models
Sebastian Ordyniak, Giacomo Paesani, Mateusz Banany et al.
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner
Aizierjiang Aiersilan
MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration
Yishuai Cai, Xinglin Chen, Zhongxuan Cai et al.
Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization
Jiyoung Kim, Kyuhong Shim, Insu Lee et al.
PBECount: Prompt-Before-Extract Paradigm for Class-Agnostic Counting
Canchen Yang, Tianyu Geng, Jian Peng et al.
MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks
Jingyuan Qi, Minqian Liu, Ying Shen et al.
Research Papers
Xiangci Li, Jessica Ouyang
Batch Normalization Is Blind to the First and Second Derivatives of the Loss
Zhanpeng Zhou, Wen Shen, Huixin Chen et al.
EvSTVSR: Event Guided Space-Time Video Super-Resolution
Haojie Yan, Zhan Lu, Zehao Chen et al.
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer
Xinyue Chen, Miaojing Shi, Zijian Zhou et al.
Navigating Label Ambiguity for Facial Expression Recognition in the Wild
JunGyu Lee, Yeji Choi, Haksub Kim et al.
FIRM: Flexible Interactive Reflection ReMoval
Xiao Chen, Xudong Jiang, Yunkang Tao et al.
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
Quang-Hung Le, Long Hoang Dang, Ngan Hoang Le et al.
Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA
Jian Lan, Diego Frassinelli, Barbara Plank
On the Computational Complexity of Plan Verification, (Bounded) Plan-Optimality Verification, and Bounded Plan Existence
Songtuan Lin, Conny Olz, Malte Helmert et al.
BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving
Sean Lamont, Michael Norrish, Amir Dezfouli et al.
Unpaired Multi-Domain Histopathology Virtual Staining Using Dual Path Prompted Inversion
Bing Xiong, Yue Peng, Ranran Zhang et al.
Fusing Conditional Submodular GAN and Programmatic Weak Supervision
Kumar Shubham, Pranav Sastry, AP Prathosh
Efficient Constraint Generation for Stochastic Shortest Path Problems
Johannes Schmalz, Felipe Trevizan
Spatial Annealing for Efficient Few-shot Neural Rendering
Yuru Xiao, Deming Zhai, Wenbo Zhao et al.
Pantypes: Diverse Representatives for Self-Explainable Models
Rune Kjærsgaard, Ahcène Boubekki, Line Clemmensen
Continuous Rotation Group Equivariant Network Inspired by Neural Population Coding
Zhiqiang Chen, Yang Chen, Xiaolong Zou et al.
CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection
Qibo Chen, Weizhong Jin, Jianyue Ge et al.