Most Cited AAAI "grasp-text-aligned dataset" Papers
5,317 papers found • Page 8 of 27
Conference
Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision
Kangsheng Yin, Quan Liu, Xuelin Shen et al.
Research Papers
Xiangci Li, Jessica Ouyang
Exploit Gradient Skewness to Circumvent Byzantine Defenses for Federated Learning
Yuchen Liu, Chen Chen, Lingjuan Lyu et al.
BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving
Sean Lamont, Michael Norrish, Amir Dezfouli et al.
Fusing Conditional Submodular GAN and Programmatic Weak Supervision
Kumar Shubham, Pranav Sastry, AP Prathosh
Continuous Rotation Group Equivariant Network Inspired by Neural Population Coding
Zhiqiang Chen, Yang Chen, xiaolong Zou et al.
Pantypes: Diverse Representatives for Self-Explainable Models
Rune Kjærsgaard, Ahcène Boubekki, Line Clemmensen
DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning
Won-Seok Choi, Hyundo Lee, Dong-Sig Han et al.
High-Dimensional Analysis for Generalized Nonlinear Regression: From Asymptotics to Algorithm
Jian Li, Yong Liu, Weiping Wang
Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective
Feiyu Yao, Zongkai Wu, Li Yi
Logic-Q: Improving Deep Reinforcement Learning-based Quantitative Trading via Program Sketch-based Tuning
Zhiming Li, Junzhe Jiang, Yushi Cao et al.
Symbolic Neural Ordinary Differential Equations
Xin Li, Chengli Zhao, Xue Zhang et al.
CALLIC: Content Adaptive Learning for Lossless Image Compression
Daxin Li, Yuanchao Bai, Kai Wang et al.
PBECount: Prompt-Before-Extract Paradigm for Class-Agnostic Counting
Canchen Yang, Tianyu Geng, Jian Peng et al.
Effective Causal Discovery under Identifiable Heteroscedastic Noise Model
Naiyu Yin, Tian Gao, Yue Yu et al.
A Sample-Level Evaluation and Generative Framework for Model Inversion Attacks
Haoyang Li, Li Bai, Qingqing Ye et al.
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer
Xinyue Chen, Miaojing Shi, Zijian Zhou et al.
RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy
Geonho Lee, Janghwan Lee, Sukjin Hong et al.
FIRM: Flexible Interactive Reflection ReMoval
Xiao Chen, Xudong Jiang, Yunkang Tao et al.
p-Mean Regret for Stochastic Bandits
Anand Krishna, Philips George John, Adarsh Barik et al.
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee, Seung Joon Park, Yunhao Tang et al.
A Unified Framework for Human-Allied Learning of Probabilistic Circuits
Athresh Karanam, Saurabh Mathur, Sahil Sidheekh et al.
VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting
Junhyeok Kang, Yooju Shin, Jae-Gil Lee
Balancing Humans and Machines: A Study on Integration Scale and Its Impact on Collaborative Performance
Rui Zou, Sannyuya Liu, Yawei Luo et al.
Unpaired Multi-Domain Histopathology Virtual Staining Using Dual Path Prompted Inversion
Bing Xiong, Yue Peng, Ranran Zhang et al.
AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers
Runqing Jiang, Ye Zhang, Longguang Wang et al.
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
Quang-Hung Le, Long Hoang Dang, Ngan Hoang Le et al.
Structural Entropy Guided Probabilistic Coding
Xiang Huang, Hao Peng, Li Sun et al.
Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA
Jian Lan, Diego Frassinelli, Barbara Plank
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation
Qi Chen, Changli Wu, Jiayi Ji et al.
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
Haocheng Huang, Jiaxin Chen, Jinyang Guo et al.
Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker
Xinyu Xiang, Qinglong Yan, Hao Zhang et al.
Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network
Zida Chen, Ziran Zhang, Haoying Li et al.
GSDiff: Synthesizing Vector Floorplans via Geometry-enhanced Structural Graph Generation
Sizhe Hu, Wenming Wu, Yuntao Wang et al.
Enhancing Close-up Novel View Synthesis via Pseudo-labeling
Jiatong Xia, Libo Sun, Lingqiao Liu
Domain Generalizable Person Search Using Unreal Dataset
Authors: Minyoung Oh, Duhyun Kim, Jae-Young Sim
Target Semantics Clustering via Text Representations for Robust Universal Domain Adaptation
Weinan He, Zilei Wang, Yixin Zhang
BAND: Biomedical Alert News Dataset
Zihao Fu, Meiru Zhang, Zaiqiao Meng et al.
Watch Your Head: Assembling Projection Heads to Save the Reliability of Federated Models
Authors: Jinqian Chen, Jihua Zhu, Qinghai Zheng et al.
A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter
Zirun Guo, Xize Cheng, Yangyang Wu et al.
SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network
Yuhang He, Zhuangzhuang Dai, Niki Trigoni et al.
Statistical Spatially Inhomogeneous Diffusion Inference
Yinuo Ren, Yiping Lu, Lexing Ying et al.
Out-of-Distribution Detection with Prototypical Outlier Proxy
Mingrong Gong, Chaoqi Chen, Qingqiang Sun et al.
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference
Jorge García-Carrasco, Alejandro Maté, Juan Trujillo
Auto-Regressive Moving Diffusion Models for Time Series Forecasting
Jiaxin Gao, Qinglong Cao, Yuntian Chen
Discrete Curvature Graph Information Bottleneck
Xingcheng Fu, Jian Wang, Yisen Gao et al.
Frequency-Masked Embedding Inference: A Non-Contrastive Approach for Time Series Representation Learning
En Fu, Yanyan Hu
GAS: Generative Activation-Aided Asynchronous Split Federated Learning
Jiarong Yang, Yuan Liu
Detection-Based Intermediate Supervision for Visual Question Answering
Yuhang Liu, Daowan Peng, Wei Wei et al.
Fast Incomplete Multi-view Clustering with Adaptive Similarity Completion and Reconstruction
Deng Xu, Chao Zhang, Cong Guo et al.
Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation
Jinda Xu, Yuhao Song, Daming Wang et al.
MCGAN: Enhancing GAN Training with Regression-Based Generator Loss
Baoren Xiao, Hao Ni, Weixin Yang
Neural Networks Perform Sufficient Dimension Reduction
Shuntuo Xu, Zhou Yu
PAC-Bayes Generalisation Bounds for Dynamical Systems including Stable RNNs
Deividas Eringis, John Leth, Zheng-Hua Tan et al.
HoneypotNet: Backdoor Attacks Against Model Extraction
Yixu Wang, Tianle Gu, Yan Teng et al.
Architecture-Aware Learning Curve Extrapolation via Graph Ordinary Differential Equation
Yanna Ding, Zijie Huang, Xiao Shou et al.
A General Search-Based Framework for Generating Textual Counterfactual Explanations
Daniel Gilo, Shaul Markovitch
SigStyle: Signature Style Transfer via Personalized Text-to-Image Models
Ye Wang, Tongyuan Bai, Xuping Xie et al.
Optimizing Human Pose Estimation Through Focused Human and Joint Regions
Yingying Jiao, Zhigang Wang, Zhenguang Liu et al.
DFF: Decision-Focused Fine-Tuning for Smarter Predict-Then-Optimize with Limited Data
Jiaqi Yang, Enming Liang, Zicheng Su et al.
Pre-Assignment Problem for Unique Minimum Vertex Cover on Bounded Clique-Width Graphs
Shinwoo An, Yeonsu Chang, Kyungjin Cho et al.
Convergence Rate in a Nonlinear Two-Time-Scale Stochastic Approximation with State (Time)-Dependence
Zixi Chen, Yumin Xu, Ruixun Zhang
Holistic Semantic Representation for Navigational Trajectory Generation
Ji Cao, Tongya Zheng, Qinghong Guo et al.
Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing
Pengfei Jiang, Mingbao Lin, Fei Chao
Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios
Yuxin Wang, Zunlei Feng, Haofei Zhang et al.
A Hierarchical Network for Multimodal Document-Level Relation Extraction
Lingxing Kong, Jiuliang Wang, Zheng Ma et al.
Autoregressive Sequence Modeling for 3D Medical Image Representation
Siwen Wang, Churan Wang, Fei Gao et al.
Meme Trojan: Backdoor Attacks Against Hateful Meme Detection via Cross-Modal Triggers
Ruofei Wang, Hongzhan Lin, Ziyuan Luo et al.
Medical Manifestation-Aware De-Identification
Yuan Tian, Shuo Wang, Guangtao Zhai
Navigating Label Ambiguity for Facial Expression Recognition in the Wild
JunGyu Lee, Yeji Choi, Haksub Kim et al.
SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing
Chen Chen, Liangjin Zhao, Yuanchun He et al.
DP-MemArc: Differential Privacy Transfer Learning for Memory Efficient Language Models
Yanming Liu, Xinyue Peng, Yuwei Zhang et al.
Transfer Learning of Real Image Features with Soft Contrastive Loss for Fake Image Detection
Ziyou Liang, Weifeng Liu, Run Wang et al.
Invariant Random Forest: Tree-Based Model Solution for OOD Generalization
Yufan LIAO, Qi Wu, Xing Yan
ML-GOOD: Towards Multi-Label Graph Out-Of-Distribution Detection
Tingyi Cai, Yunliang Jiang, Ming Li et al.
Accelerated Methods with Compressed Communications for Distributed Optimization Problems Under Data Similarity
Dmitry Bylinkin, Aleksandr Beznosikov
MEPNet: Medical Entity-Balanced Prompting Network for Brain CT Report Generation
Xiaodan Zhang, Yanzhao Shi, Junzhong Ji et al.
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property
I. Anagnostides, Ioannis Panageas, Gabriele Farina et al.
Efficient Self-Supervised Video Hashing with Selective State Spaces
Jinpeng Wang, Niu Lian, Jun Li et al.
Neural Combinatorial Clustered Bandits for Recommendation Systems
Baran Atalar, Carlee Joe-Wong
Recall-Oriented Continual Learning with Generative Adversarial Meta-Model
Haneol Kang, Dong-Wan Choi
Approximating Metric Magnitude of Point Sets
Rayna Andreeva, James Ward, Primoz Skraba et al.
DALDet: Depth-Aware Learning Based Object Detection for Autonomous Driving
K. Hu, Tongbo Cao, Yuan Li et al.
Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization
Lirong Wu, Haitao Lin, Yufei Huang et al.
Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition
Jielong Tang, Zhenxing Wang, ZiYang Gong et al.
FakeDiffer: Distributional Disparity Learning on Differentiated Reconstruction for Face Forgery Detection
Bo Wang, Zhao Zhang, Suiyi Zhao et al.
DECT: Harnessing LLM-assisted Fine-Grained Linguistic Knowledge and Label-Switched and Label-Preserved Data Generation for Diagnosis of Alzheimer’s Disease
Tingyu Mo, Jacqueline C. K. Lam, Victor O. K. Li et al.
Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models
Xiyu Liu, Zhengxiao Liu, Naibin Gu et al.
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset
Sithu Aung, Min-Cheol Sagong, Junghyun Cho
DialogDraw: Image Generation and Editing System Based on Multi-Turn Dialogue
Shichao Ma, Xinfeng Zhang, Zeng Zhao et al.
Representing Sounds as Neural Amplitude Fields: A Benchmark of Coordinate-MLPs and a Fourier Kolmogorov-Arnold Framework
Linfei Li, Lin Zhang, Zhong Wang et al.
Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images
Hongyu Yan, Yadong Mu
APIRL: Deep Reinforcement Learning for REST API Fuzzing
Myles Foley, Sergio Maffeis
LRM-LLaVA: Overcoming the Modality Gap of Multilingual Large Language-Vision Model for Low-Resource Languages
Junchen Li, Qing Yang, Bojian Jiang et al.
ChatterBox: Multimodal Referring and Grounding with Chain-of-Questions
Yunjie Tian, Tianren Ma, Lingxi Xie et al.
Correcting Large Language Model Behavior via Influence Function
Han Zhang, Zhuo Zhang, Yi Zhang et al.
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
Shoutao Guo, Shaolei Zhang, Zhengrui Ma et al.
Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame
Luyang Tang, Jiayu Yang, Rui Peng et al.
Gradient-Guided Credit Assignment and Joint Optimization for Dependency-Aware Spatial Crowdsourcing
Yafei Li, Wei Chen, Jinxing Yan et al.
BSDB-Net: Band-Split Dual-Branch Network with Selective State Spaces Mechanism for Monaural Speech Enhancement
Cunhang Fan, Enrui Liu, Andong Li et al.
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO
Daechul Ahn, Yura Choi, San Kim et al.
Incremental Quasi-Newton Methods with Faster Superlinear Convergence Rates
Zhuanghua Liu, Luo Luo, Bryan Kian Hsiang Low
Targeted Activation Penalties Help CNNs Ignore Spurious Signals
Dekai Zhang, Matt Williams, Francesca Toni
SoundBrush: Sound as a Brush for Visual Scene Editing
Kim Sung-Bin, Kim Jun-Seong, Junseok Ko et al.
Practical Offloading for Fine-Tuning LLM on Commodity GPU via Learned Sparse Projectors
Siyuan Chen, Zhuofeng Wang, Zelong Guan et al.
Improved Regret Bounds for Online Fair Division with Bandit Learning
Benjamin Schiffer, Shirley Zhang
ViT-Calibrator: Decision Stream Calibration for Vision Transformer
Lin Chen, Zhijie Jia, Lechao Cheng et al.
Sentence-level Aggregation of Lexical Metrics Correlates Stronger with Human Judgements than Corpus-level Aggregation
Paulo Cavalin, Pedro H. Domingues, Claudio Pinhanez
Fair Division with Social Impact
Michele Flammini, Gianluigi Greco, Giovanna Varricchio
Decentralized and Uncoordinated Learning of Stable Matchings: A Game-Theoretic Approach
S. Rasoul Etesami, R. Srikant
HDformer: A Higher
Dimensional Transformer for Detecting Diabetes Utilizing Long-Range Vascular Signals - Ella Lan
Private Blotto: Viewpoint Competition with Polarized Agents
Kate Donahue, Jon Kleinberg
Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium
Yuma Fujimoto, Kaito Ariu, Kenshi Abe
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network
Yuming Zhang, Shouxin Zhang, Peizhe Wang et al.
Controller-Guided Partial Label Consistency Regularization with Unlabeled Data
Qian-Wei Wang, Bowen Zhao, Mingyan Zhu et al.
Differentiable Information Enhanced Model-Based Reinforcement Learning
Xiaoyuan Zhang, Xinyan Cai, Bo Liu et al.
Toward Efficient Data-Free Unlearning
Chenhao Zhang, Shaofei Shen, Weitong Chen et al.
Zero-Shot Image Captioning with Multi-type Entity Representations
Delong Zeng, Ying Shen, Man Lin et al.
Contrastive Functional Principal Component Analysis
Eric Zhang, Didong Li
In-Context Policy Adaptation via Cross-Domain Skill Diffusion
Minjong Yoo, Woo Kyung Kim, Honguk Woo
From Pairwise to Ranking: Climbing the Ladder to Ideal Collaborative Filtering with Pseudo-Ranking
Yuhan Zhao, Rui Chen, Li Chen et al.
Symbolic Numeric Planning with Patterns
Matteo Cardellini, E. Giunchiglia, M. Maratea
HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation
Wentian Qu, Jiahe Li, Jian Cheng et al.
Predicting User Behavior in Smart Spaces with LLM-Enhanced Logs and Personalized Prompts
Yunpeng Song, Jiawei Li, Yiheng Bian et al.
Sequence Accumulation and Beyond: Infinite Context Length on Single GPU and Large Clusters
Weigao Sun, Yongtuo Liu, Xiaqiang Tang et al.
First-Order Federated Bilevel Learning
Yifan Yang, Peiyao Xiao, Shiqian Ma et al.
Controllable 3D Dance Generation Using Diffusion-Based Transformer U-Net
Puyuan Guo, Tuo Hao, Wenxin Fu et al.
A Scalable and Effective Alternative to Graph Transformers
Kaan Sancak, Zhigang Hua, Jin Fang et al.
Delay as Payoff in MAB
Ofir Schlisselberg, Ido Cohen, Tal Lancewicki et al.
PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF
Mohsen Gholami, Rabab Ward, Z. Jane Wang
DHAKR: Learning Deep Hierarchical Attention-Based Kernelized Representations for Graph Classification
Feifei Qian, Lu Bai, Lixin Cui et al.
In-depth Analysis of Low-rank Matrix Factorisation in a Federated Setting
Constantin Philippenko, Kevin Scaman, Laurent Massoulié
Rule-Guided Graph Neural Networks for Explainable Knowledge Graph Reasoning
Zhe Wang, Suxue Ma, Kewen Wang et al.
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda, Shalabh Bhatnagar
FaceCoresetNet: Differentiable Coresets for Face Set Recognition
Gil Shapira, Yosi Keller
BOIDS: High-Dimensional Bayesian Optimization via Incumbent-Guided Direction Lines and Subspace Embeddings
Lam Ngo, Huong Ha, Jeffrey Chan et al.
Improving Pareto Set Learning for Expensive Multi-objective Optimization via Stein Variational Hypernetworks
Minh-Duc Nguyen, Phuong Mai Dinh, Quang-Huy Nguyen et al.
Foreseeing Reconstruction Quality of Gradient Inversion: An Optimization Perspective
Hyeong Gwon Hong, Yooshin Cho, Hanbyel Cho et al.
Explaining Decisions of Agents in Mixed-Motive Games
Maayan Orner, Oleg Maksimov, Akiva Kleinerman et al.
Efficient Robustness Evaluation via Constraint Relaxation
Chao Pan, Yu Wu, Ke Tang et al.
Multi-Scale Contrastive Learning for Video Temporal Grounding
Thong Thanh Nguyen, Yi Bin, Xiaobao Wu et al.
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient 3D Shape Generation
Shentong Mo
Implicit Relative Labeling-Importance Aware Multi-Label Metric Learning
Jun-Xiang Mao, Yong Rui, Min-Ling Zhang
Batch Normalization Is Blind to the First and Second Derivatives of the Loss
Zhanpeng Zhou, Wen Shen, Huixin Chen et al.
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward
Haoxin Lin, Hongqiu Wu, Jiaji Zhang et al.
AD4CD: Causal-Guided Anomaly Detection for Enhancing Cognitive Diagnosis
Haiping Ma, Yue Yao, Changqian Wang et al.
Extract Free Dense Misalignment from CLIP
JeongYeon Nam, Jinbae Im, Wonjae Kim et al.
Neural Network Approximators for Marginal MAP in Probabilistic Circuits
Shivvrat Arya, Tahrima Rahman, Vibhav Gogate
Unified Graph Neural Networks Pre-training for Multi-domain Graphs
Mingkai Lin, Xiaobin Hong, Wenzhong Li et al.
Reward Penalties on Augmented States for Solving Richly Constrained RL Effectively
Hao Jiang, Tien Mai, Pradeep Varakantham et al.
Span Graph Transformer for Document-Level Named Entity Recognition
Hongli Mao, Xian-Ling Mao, Hanlin Tang et al.
Any-Way Meta Learning
JunHoo Lee, Yearim Kim, Hyunho Lee et al.
Novel View Synthesis Under Large-Deviation Viewpoint for Autonomous Driving
Xin Ma, Jiguang Zhang, Peng Lu et al.
CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Xin Jing, Yichen Jing, Yuhuan Lu et al.
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics
Xiaochuan Liu, Xin Cheng, Yuchong Sun et al.
Minimal Macro-Based Rewritings of Formal Languages: Theory and Applications in Ontology Engineering (and Beyond)
Christian Kindermann, Anne-Marie George, Bijan Parsia et al.
Double Auction on Diffusion Network
Miao Li, Yuhan Cao, Dengji Zhao
Future Sight and Tough Fights: Revolutionizing Sequential Recommendation with FENRec
Yu-Hsuan Huang, Ling Lo, Hongxia Xie et al.
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization
Mingtao Feng, Fenghao Tian, Jianqiao Luo et al.
Residual Diffusion Deblurring Model for Single Image Defocus Deblurring
Haoxuan Feng, Haohui Zhou, Tian Ye et al.
Auditable Algorithms for Approximate Model Counting
S Akshay, Supratik Chakraborty, Kuldeep S Meel
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
Jiahao Qiu, Hui Yuan, Jinghong Zhang et al.
Representation Space Augmentation for Effective Self-Supervised Learning on Tabular Data
Moonjung Eo, Kyungeun Lee, Hye-Seung Cho et al.
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks
Chen Feng, Ziquan Liu, Zhuo Zhi et al.
Enhancing Low-Light Images: A Synthetic Data Perspective on Practical and Generalizable Solutions
Yu Long, Qinghua Lin, Zhihua Wang et al.
Towards Projected and Incremental Pseudo-Boolean Model Counting
Suwei Yang, Kuldeep S. Meel
Generalizing Constraint Models in Constraint Acquisition
Dimos Tsouros, Senne Berden, Steven Prestwich et al.
What Does a Query Answer Tell You? Informativeness of Query Answers for Knowledge Bases
Luca Andolfi, Gianluca Cima, Marco Console et al.
MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder
Yaqi Liu, Shuhuan Chen, Haichao Shi et al.
Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition
13319 Bingjun Luo, Zewen Wang, Jinpeng Wang et al.
Exploring Task-Level Optimal Prompts for Visual In-Context Learning
Yan Zhu, Huan Ma, Changqing Zhang
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
Jingyu Liu, Minquan Wang, Ye Ma et al.
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection
HaoMiao Liu, Hao Xu, Chuhuai Yue et al.
Causal Discovery from Poisson Branching Structural Causal Model Using High-Order Cumulant with Path Analysis
Jie Qiao, Yu Xiang, Zhengming Chen et al.
Position-Aware Guided Point Cloud Completion with CLIP Model
Feng Zhou, Qi Zhang, Ju Dai et al.
Polarization Guided Mask-Free Shadow Removal
Chu Zhou, Chao Xu, Boxin Shi
IPVTON: Image-based 3D Virtual Try-on with Image Prompt Adapter
Xiaojing Zhong, Zhonghua Wu, Xiaofeng Yang et al.
Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models
Li Zheng, Liangbin Xie, Jiantao Zhou et al.
Greedy-Based Online Fair Allocation with Adversarial Input: Enabling Best-of-Many-Worlds Guarantees
Zongjun Yang, Luofeng Liao, Christian Kroer
Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis
Yu Zhu, Bo Lei, Chunfeng Song et al.
Heterogeneous Prompt-Guided Entity Inferring and Distilling for Scene-Text Aware Cross-Modal Retrieval
Zhiqian Zhao, Liang Li, Jiehua Zhang et al.
Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment
Qing Chang, Yao-Xiang Ding, Kun Zhou
Rethinking Robustness of Model Attributions
Sandesh Kamath, Sankalp Mittal, Amit Deshpande et al.
Zero-Shot Learning in Industrial Scenarios: New Large-Scale Benchmark, Challenges and Baseline
Zekai Zhang, Qinghui Chen, Maomao Xiong et al.
Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories
Xiaohan Zhang, Zhenyu Sun, Yukui Qiu et al.
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Xiang Zhang, Yufei Cui, Chenchen Fu et al.
DUSTED: Dual-Attention Enhanced Spatial Transcriptomics Denoiser
Jun Zhu, Yifu Li, Zhenchao Tang et al.
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos
Baoyu Liang, Qile Su, Shoutai Zhu et al.
Elevating Flow-Guided Video Inpainting with Reference Generation
Suhwan Cho, Seoung Wug Oh, Sangyoun Lee et al.
Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption
Yiding Chen, Xuezhou Zhang, Qiaomin Xie et al.
E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS
Ziyang Zhang, Yang Zhao, Ming-Ching Chang et al.
DCA: Dividing and Conquering Amnesia in Incremental Object Detection
Aoting Zhang, Dongbao Yang, Chang Liu et al.
Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration
Wonjeong Choi, Jungwuk Park, Dong-Jun Han et al.
The Logic of Doxastic Strategies
Junli Jiang, Pavel Naumov
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
Ruihuang Li, Liyi Chen, Zhengqiang Zhang et al.
FreeNet: Liberating Depth-Wise Separable Operations for Building Faster Mobile Vision Architectures
Hao Yu, Haoyu Chen, Wei Peng et al.
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
Zikang Chen, Tao Jiang, Xiaowan Hu et al.
On Alternating-Time Temporal Logic, Hyperproperties, and Strategy Sharing
Raven Beutner, Bernd Finkbeiner
Noisy Node Classification by Bi-level Optimization Based Multi-Teacher Distillation
Yujing Liu, Zongqian Wu, Zhengyu Lu et al.
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis
Zebin Yao, Fangxiang Feng, Ruifan Li et al.