Most Cited 2024 "grasp-text-aligned dataset" Papers
12,324 papers found • Page 52 of 62
Conference
NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching
Hongbo Zhang, Guang Wang, Xu Wang et al.
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos
Chen Liu, Peike Li, Qingtao Yu et al.
Adaptive Graph Learning for Multimodal Conversational Emotion Detection
Geng Tu, Tian Xie, Bin Liang et al.
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning
Shaohui Peng, Xing Hu, Qi Yi et al.
LaViP: Language-Grounded Visual Prompting
Nilakshan Kunananthaseelan, Jing Zhang, Mehrtash Harandi
Adaptive Reactive Synthesis for LTL and LTLf Modulo Theories
Andoni Rodríguez, César Sánchez
Mean-Shift Feature Transformer
Takumi Kobayashi
MKG-FENN: A Multimodal Knowledge Graph Fused End-to-End Neural Network for Accurate Drug–Drug Interaction Prediction
Di Wu, Wu Sun, Yi He et al.
Fast Adaptation for Human Pose Estimation via Meta-Optimization
Shengxiang Hu, Huaijiang Sun, Bin Li et al.
L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream
Jingtao Sun, Yaonan Wang, Mingtao Feng et al.
Submodel Enumeration for CTL Is Hard
Nicolas Fröhlich, Arne Meier
IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM
Minghao Yin, Shangzhe Wu, Kai Han
NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Resolution
Yexing Song, Meilin Wang, Zhijing Yang et al.
Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens
Zhiwen Chen, Zhiyu Zhu, Yifan Zhang et al.
Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement
Kangmin Xu, Liang Liao, Jing Xiao et al.
Exploring Orthogonality in Open World Object Detection
Zhicheng Sun, Jinghan Li, Yadong Mu
Discovering Sequential Patterns with Predictable Inter-event Delays
Joscha Cüppers, Paul Krieger, Jilles Vreeken
Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval
Yu Liu, Guihe Qin, Haipeng Chen et al.
Learning Small Decision Trees for Data of Low Rank-Width
Konrad K. Dabrowski, Eduard Eiben, Sebastian Ordyniak et al.
Inference and Learning in Dynamic Decision Networks Using Knowledge Compilation
Gabriele Venturato, Vincent Derkinderen, Pedro Zuidberg Dos Martires et al.
Latency Correction for Event-guided Deblurring and Frame Interpolation
Yixin Yang, Jinxiu Liang, Bohan Yu et al.
DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding
Xiaoxuan Yu, Hao Wang, Weiming Li et al.
Delegation-Relegation for Boolean Matrix Factorization
Florent Avellaneda, Roger Villemaire
ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images
Yiqi Shi, Duo Liu, Liguo Zhang et al.
Self-Supervised Representation Learning from Arbitrary Scenarios
Zhaowen Li, Yousong Zhu, Zhiyang Chen et al.
Adversarial Distillation Based on Slack Matching and Attribution Region Alignment
Shenglin Yin, Zhen Xiao, Mingxuan Song et al.
Clarifying the Behavior and the Difficulty of Adversarial Training
Xu Cheng, Hao Zhang, Yue Xin et al.
Dual-View Whitening on Pre-trained Text Embeddings for Sequential Recommendation
Lingzi Zhang, Xin Zhou, Zhiwei Zeng et al.
Is a Large Language Model a Good Annotator for Event Extraction?
Ruirui Chen, Chengwei Qin, Weifeng Jiang et al.
Task-Adaptive Prompted Transformer for Cross-Domain Few-Shot Learning
Jiamin Wu, Xin Liu, Xiaotian Yin et al.
AHIVE: Anatomy-aware Hierarchical Vision Encoding for Interactive Radiology Report Retrieval
Sixing Yan, William K. Cheung, Ivor Tsang et al.
SPU-PMD: Self-Supervised Point Cloud Upsampling via Progressive Mesh Deformation
Yanzhe Liu, Rong Chen, Yushi Li et al.
Enhancing the Power of OOD Detection via Sample-Aware Model Selection
Feng Xue, Zi He, Yuan Zhang et al.
Deep Semantic Graph Transformer for Multi-View 3D Human Pose Estimation
Lijun Zhang, Kangkang Zhou, Feng Lu et al.
MMA: Multi-Modal Adapter for Vision-Language Models
Lingxiao Yang, Ru-Yuan Zhang, Yanchen Wang et al.
Altruism in Facility Location Problems
Hau Chan, Minming Li, Houyu Zhou
Limited-Supervised Multi-Label Learning with Dependency Noise
Yejiang Wang, Yuhai Zhao, Zhengkui Wang et al.
A Category Agnostic Model for Visual Rearrangment
Yuyi Liu, Xinhang Song, Weijie Li et al.
Towards Progressive Multi-Frequency Representation for Image Warping
Jun Xiao, Zihang Lyu, Cong Zhang et al.
MultiSum: A Multi-Facet Approach for Extractive Social Summarization Utilizing Semantic and Sociological Relationships
Tanglong Zhao, Ruifang He, Jing Xu et al.
Molecular Data Programming: Towards Molecule Pseudo-labeling with Systematic Weak Supervision
Xin Juan, Kaixiong Zhou, Ninghao Liu et al.
OTE: Exploring Accurate Scene Text Recognition Using One Token
Jianjun Xu, Yuxin Wang, Hongtao Xie et al.
TTA-EVF: Test-Time Adaptation for Event-based Video Frame Interpolation via Reliable Pixel and Sample Estimation
Hoonhee Cho, Taewoo Kim, Yuhwan Jeong et al.
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
Chen Zhao, Tong Zhang, Zheng Dang et al.
CEGAR-Based Approach for Solving Combinatorial Optimization Modulo Quantified Linear Arithmetics Problems
Kerian Thuillier, Anne Siegel, Loïc Paulevé
Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics
Xingtao Wang, Hongliang Wei, Xiaopeng Fan et al.
MID-FiLD: MIDI Dataset for Fine-Level Dynamics
Jesung Ryu, Seungyeon Rhyu, Hong-Gyu Yoon et al.
DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation
Zihui Gu, Xingwu Sun, Fengzong Lian et al.
Risk-Conditioned Reinforcement Learning: A Generalized Approach for Adapting to Varying Risk Measures
Gwangpyo Yoo, Jinwoo Park, Honguk Woo
An Empirical Study of Scaling Law for Scene Text Recognition
Miao Rang, Zhenni Bi, Chuanjian Liu et al.
United We Stand: Accelerating Privacy-Preserving Neural Inference by Conjunctive Optimization with Interleaved Nexus
Qiao Zhang, Tao Xiang, Chunsheng Xin et al.
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation
Xiaoming Li, Xinyu Hou, Chen Change Loy
Differentiable Neural Surface Refinement for Modeling Transparent Objects
Weijian Deng, Dylan Campbell, Chunyi Sun et al.
Towards Co-Evaluation of Cameras HDR and Algorithms for Industrial-Grade 6DoF Pose Estimation
Agastya Kalra, Guy Stoppi, Dmitrii Marin et al.
Tune-An-Ellipse: CLIP Has Potential to Find What You Want
Jinheng Xie, Songhe Deng, Bing Li et al.
Graph Neural Networks with Soft Association between Topology and Attribute
Yachao Yang, Yanfeng Sun, Shaofan Wang et al.
Integer Is Enough: When Vertical Federated Learning Meets Rounding
Pengyu Qiu, Yuwen Pu, Yongchao Liu et al.
PairDETR : Joint Detection and Association of Human Bodies and Faces
Ammar Ali, Georgii Gaikov, Denis Rybalchenko et al.
Close Imitation of Expert Retouching for Black-and-White Photography
Seunghyun Shin, Jisu Shin, Jihwan Bae et al.
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking
Liu Liu, Anran Huang, Qi Wu et al.
Instance-Conditional Timescales of Decay for Nonstationary Learning
Nishant Jain, Pradeep Shenoy
Efficient Look-Up Table from Expanded Convolutional Network for Accelerating Image Super-resolution
Kai Yin, Jie Shen
A Separation and Alignment Framework for Black-Box Domain Adaptation
Mingxuan Xia, Junbo Zhao, Gengyu Lyu et al.
Amalgamating Multi-Task Models with Heterogeneous Architectures
Jidapa Thadajarassiri, Walter Gerych, Xiangnan Kong et al.
Amodal Scene Analysis via Holistic Occlusion Relation Inference and Generative Mask Completion
Bowen Zhang, Qing Liu, Jianming Zhang et al.
Bézier Everywhere All at Once: Learning Drivable Lanes as Bézier Graphs
Hugh Blayney, Hanlin Tian, Hamish Scott et al.
Ink Dot-Oriented Differentiable Optimization for Neural Image Halftoning
Hao Jiang, Bingfeng Zhou, Yadong Mu
Divergence-Guided Simultaneous Speech Translation
Xinjie Chen, Kai Fan, Wei Luo et al.
FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation
Zijia Lu, Ehsan Elhamifar
Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning
Shuai Shao, Yu Bai, Yan Wang et al.
Efficient Target Propagation by Deriving Analytical Solution
Yanhao Bao, Tatsukichi Shibuya, Ikuro Sato et al.
ShapeMatcher: Self-Supervised Joint Shape Canonicalization Segmentation Retrieval and Deformation
Yan Di, Chenyangguang Zhang, Chaowei Wang et al.
SVDTree: Semantic Voxel Diffusion for Single Image Tree Reconstruction
Yuan Li, Zhihao Liu, Bedrich Benes et al.
Patch2Self2: Self-supervised Denoising on Coresets via Matrix Sketching
Shreyas Fadnavis, Agniva Chowdhury, Joshua Batson et al.
Partial Multi-View Clustering via Self-Supervised Network
Qianqian Wang, Guoshuai Sheng, Quanxue Gao et al.
Mixed Geometry Message and Trainable Convolutional Attention Network for Knowledge Graph Completion
Bin Shang, Yinliang Zhao, Jun Liu et al.
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation
Sixian Zhang, Xinyao Yu, Xinhang Song et al.
Harnessing the Power of SVD: An SVA Module for Enhanced Signal Classification
Lei Zhai, Shuyuan Yang, Yitong Li et al.
PoseIRM: Enhance 3D Human Pose Estimation on Unseen Camera Settings via Invariant Risk Minimization
Yanlu Cai, Weizhong Zhang, Yuan Wu et al.
DNCs Require More Planning Steps
Yara Shamshoum, Nitzan Hodos, Yuval Sieradzki et al.
LoS: Local Structure-Guided Stereo Matching
Kunhong Li, Longguang Wang, Ye Zhang et al.
I/O Complexity of Attention, or How Optimal is FlashAttention?
Barna Saha, Christopher Ye
Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
Shayne Longpre, Robert Mahari, Naana Obeng-Marnu et al.
On Disentanglement of Asymmetrical Knowledge Transfer for Modality-Task Agnostic Federated Learning
Jiayi Chen, Aidong Zhang
ConditionVideo: Training-Free Condition-Guided Video Generation
Bo Peng, Xinyuan Chen, Yaohui Wang et al.
DiffForensics: Leveraging Diffusion Prior to Image Forgery Detection and Localization
Zeqin Yu, Jiangqun Ni, Yuzhen Lin et al.
VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
Syed Talal Wasim, Muzammal Naseer, Salman Khan et al.
S2CycleDiff: Spatial-Spectral-Bilateral Cycle-Diffusion Framework for Hyperspectral Image Super-resolution
Jiahui Qu, Jie He, Wenqian Dong et al.
Sheared Backpropagation for Fine-tuning Foundation Models
Zhiyuan Yu, Li Shen, Liang Ding et al.
From Coarse to Fine: A Distillation Method for Fine-Grained Emotion-Causal Span Pair Extraction in Conversation
Xinhao Chen, Chong Yang, Changzhi Sun et al.
Everything2Motion: Synchronizing Diverse Inputs via a Unified Framework for Human Motion Synthesis
Zhaoxin Fan, Longbin Ji, Pengxin Xu et al.
CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration
Fu-Zhao Ou, Chongyi Li, Shiqi Wang et al.
Differentiable Micro-Mesh Construction
Yishun Dou, Zhong Zheng, Qiaoqiao Jin et al.
Current Page
Direction-Aware Video Demoiréing with Temporal-Guided Bilateral Learning
Shuning Xu, Binbin SONG, Xiangyu Chen et al.
Can’t Make an Omelette Without Breaking Some Eggs: Plausible Action Anticipation Using Large Video-Language Models
Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo et al.
Unsupervised 3D Structure Inference from Category-Specific Image Collections
Weikang Wang, Dongliang Cao, Florian Bernard
Video2Game: Real-time Interactive Realistic and Browser-Compatible Environment from a Single Video
Hongchi Xia, Chih-Hao Lin, Wei-Chiu Ma et al.
End-to-End Real-Time Vanishing Point Detection with Transformer
Xin Tong, Shi Peng, Yufei Guo et al.
Are Conventional SNNs Really Efficient? A Perspective from Network Quantization
Guobin Shen, Dongcheng Zhao, Tenglong Li et al.
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation
Zeyuan Yang, LIU JIAGENG, Peihao Chen et al.
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
Samy Tafasca, Anshul Gupta, Jean-marc Odobez
Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data
Na Zhao, Gim Hee Lee
Dynamic Support Information Mining for Category-Agnostic Pose Estimation
Pengfei Ren, Yuanyuan Gao, Haifeng Sun et al.
MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Distillation
Zhicheng Zhang, Pancheng Zhao, Eunil Park et al.
Data Poisoning to Fake a Nash Equilibria for Markov Games
Young Wu, Jeremy McMahan, Xiaojin Zhu et al.
Threshold-Based Responsive Simulated Annealing for Directed Feedback Vertex Set Problem
Qingyun Zhang, YuMing Du, Zhipeng Lü et al.
Knowledge-Aware Explainable Reciprocal Recommendation
Kai-Huang Lai, Zhe-Rui Yang, Pei-Yuan Lai et al.
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
Yuxin Guo, Siyang Sun, Shuailei Ma et al.
Principle Component Trees and Their Persistent Homology
Ben Kizaric, Daniel Pimentel-Alarcon
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning
Jizhou Wu, Jianye Hao, Tianpei Yang et al.
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
Yicheng Xiao, Zhuoyan Luo, Yong Liu et al.
Energy-Efficient Streaming Time Series Classification with Attentive Power Iteration
Hao Huang, Tapan Shah, Scott Evans et al.
Transferable Video Moment Localization by Moment-Guided Query Prompting
Hao Jiang, Yang Yizhang, Yadong Mu
StegFormer: Rebuilding the Glory of Autoencoder-Based Steganography
Xiao Ke, Huanqi Wu, Wenzhong Guo
VS: Reconstructing Clothed 3D Human from Single Image via Vertex Shift
Leyuan Liu, Yuhan Li, Yunqi Gao et al.
Well, Now We Know! Unveiling Sarcasm: Initiating and Exploring Multimodal Conversations with Reasoning
Gopendra Singh, Mauajama Firdaus, Dushyant Singh Chauhan et al.
On the Convergence of an Adaptive Momentum Method for Adversarial Attacks
Sheng Long, Wei Tao, Shuohao LI et al.
Point Transformer V3: Simpler Faster Stronger
Xiaoyang Wu, Li Jiang, Peng-Shuai Wang et al.
A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis
Rongchao Zhang, Yu Huang, Yiwei Lou et al.
Decomposing Constraint Networks for Calculating c-Representations
Marco Wilhelm, Gabriele Kern-Isberner
Intelligent Calibration for Bias Reduction in Sentiment Corpora Annotation Process
Idan Toker, David Sarne, Jonathan Schler
Cross-Constrained Progressive Inference for 3D Hand Pose Estimation
ZheHan Kan, Xueting Hu, Zihan Liao et al.
EVS-assisted Joint Deblurring Rolling-Shutter Correction and Video Frame Interpolation through Sensor Inverse Modeling
Rui Jiang, Fangwen Tu, Yixuan Long et al.
Quantum-Inspired Neural Network with Runge-Kutta Method
Zipeng Fan, Jing Zhang, Peng Zhang et al.
Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance
Yu, Jie Huang, Li et al.
READ: Retrieval-Enhanced Asymmetric Diffusion for Motion Planning
Takeru Oba, Matthew Walter, Norimichi Ukita
MeshPose: Unifying DensePose and 3D Body Mesh Reconstruction
Eric-Tuan Le, Antonios Kakolyris, Petros Koutras et al.
MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation
Xiaolong Deng, Huisi Wu, Runhao Zeng et al.
Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI
Malek Mechergui, Sarath Sreedharan
Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic Segmentation
Huayu Mai, Rui Sun, Yuan Wang et al.
MFTN: Multi-Level Feature Transfer Network Based on MRI-Transformer for MR Image Super-resolution
Shuying Huang, Ge Chen, Yong Yang et al.
GxVAEs: Two Joint VAEs Generate Hit Molecules from Gene Expression Profiles
Chen Li, Yoshihiro Yamanishi
Learn How to See: Collaborative Embodied Learning for Object Detection and Camera Adjusting
Lingdong Shen, Chunlei Huo, Nuo Xu et al.
Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement
Kai Shang, Mingwen Shao, Chao Wang et al.
Hidden Follower Detection: How Is the Gaze-Spacing Pattern Embodied in Frequency Domain?
Shu Li, Ruimin Hu, Suhui Li et al.
LDS2AE: Local Diffusion Shared-Specific Autoencoder for Multimodal Remote Sensing Image Classification with Arbitrary Missing Modalities
Jiahui Qu, Yuanbo Yang, Wenqian Dong et al.
TIGER: Time-Varying Denoising Model for 3D Point Cloud Generation with Diffusion Process
Zhiyuan Ren, Minchul Kim, Feng Liu et al.
Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning
Wensheng Pan, Timin Gao, Yan Zhang et al.
Learning Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification
Zhenyu Cui, Jiahuan Zhou, Xun Wang et al.
A Unified Knowledge Transfer Network for Generalized Category Discovery
Wenkai Shi, Wenbin An, Feng Tian et al.
G2L-CariGAN: Caricature Generation from Global Structure to Local Features
Xin Huang, Yunfeng Bai, Dong Liang et al.
Z*: Zero-shot Style Transfer via Attention Reweighting
Yingying Deng, Xiangyu He, Fan Tang et al.
Using Clustering to Strengthen Decision Diagram Bounds for Discrete Optimization
Spike-guided Motion Deblurring with Unknown Modal Spatiotemporal Alignment
Jiyuan Zhang, Shiyan Chen, Yajing Zheng et al.
ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language Tasks
Andrea Rosasco, Stefano Berti, Giulia Pasquale et al.
Instance-aware Contrastive Learning for Occluded Human Mesh Reconstruction
Mi-Gyeong Gwon, Gi-Mun Um, Won-Sik Cheong et al.
UniMODE: Unified Monocular 3D Object Detection
Zhuoling Li, Xiaogang Xu, Ser-Nam Lim et al.
Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic Segmentation
Stratified GNN Explanations through Sufficient Expansion
IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-Regularized Soft Channel Pruning
Investigating Compositional Challenges in Vision-Language Models for Visual Grounding
Yunan Zeng, Yan Huang, Jinjin Zhang et al.
Multi-Constellation-Inspired Single-Shot Global LiDAR Localization
Tongzhou Zhang, Gang Wang, Yu Chen et al.
Bi-directional Adapter for Multimodal Tracking
Bing Cao, Junliang Guo, Pengfei Zhu et al.
Accept the Modality Gap: An Exploration in the Hyperbolic Space
Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham et al.
MirageRoom: 3D Scene Segmentation with 2D Pre-trained Models by Mirage Projection
Haowen Sun, Yueqi Duan, Juncheng Yan et al.
SHAP\@k: Efficient and Probably Approximately Correct (PAC) Identification of Top-K Features
Sanjay Kariyappa, Leonidas Tsepenekas, Freddy Lecue et al.
A Theoretical Analysis of Backdoor Poisoning Attacks in Convolutional Neural Networks
Boqi Li, Weiwei Liu
An Interpretable Approach to the Solutions of High-Dimensional Partial Differential Equations
Lulu Cao, Yufei Liu, Zhenzhong Wang et al.
LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis Functions
Atharva Pandey, Vishal Yadav, Rajendra Nagar et al.
Random Entangled Tokens for Adversarially Robust Vision Transformer
Huihui Gong, Minjing Dong, Siqi Ma et al.
DYSON: Dynamic Feature Space Self-Organization for Online Task-Free Class Incremental Learning
Yuhang He, YingJie Chen, Yuhan Jin et al.
Learned Trajectory Embedding for Subspace Clustering
Yaroslava Lochman, Christopher Zach, Carl Olsson
Weakly Supervised Video Individual Counting
Xinyan Liu, Guorong Li, Yuankai Qi et al.
Simultaneous Optimization of Bid Shading and Internal Auction for Demand-Side Platforms
Yadong Xu, Bonan Ni, Weiran Shen et al.
A Dynamic GCN with Cross-Representation Distillation for Event-Based Learning
Yongjian Deng, Hao Chen, Youfu Li
Multimodal Graph Neural Architecture Search under Distribution Shifts
Jie Cai, Xin Wang, Haoyang Li et al.
Paths, Proofs, and Perfection: Developing a Human-Interpretable Proof System for Constrained Shortest Paths
Konstantin Sidorov, Gonçalo Homem de Almeida Correia, Mathijs de Weerdt et al.
A Pedestrian is Worth One Prompt: Towards Language Guidance Person Re-Identification
Zexian Yang, Dayan Wu, Chenming Wu et al.
From SAM to CAMs: Exploring Segment Anything Model for Weakly Supervised Semantic Segmentation
Hyeokjun Kweon, Kuk-Jin Yoon
Curvature-Invariant Adversarial Attacks for 3D Point Clouds
Jianping Zhang, Wenwei Gu, Yizhan Huang et al.
Simplicity Bias in Overparameterized Machine Learning
KGDM: A Diffusion Model to Capture Multiple Relation Semantics for Knowledge Graph Embedding
Xiao Long, Liansheng Zhuang, Aodi Li et al.
A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency
Hao Zhang, Xuhui Zuo, Huabing Zhou et al.
R-Cyclic Diffuser: Reductive and Cyclic Latent Diffusion for 3D Clothed Human Digitalization
Kennard Chan, Fayao Liu, Guosheng Lin et al.
1066 Benchmarking Large Language Models on Controllable Generation under Diversified Instructions
Yihan Chen, Benfeng Xu, Quan Wang et al.
Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention Alignment and Prompt Tuning
Leslie Ching Ow Tiong, Dick Sigmund, Chen-Hui Chan et al.
964 Measuring Self-Supervised Representation Quality for Downstream Classification Using Discriminative Features
Neha Kalibhat, Kanika Narang, Hamed Firooz et al.
12181 What Do Hebbian Learners Learn? Reduction Axioms for Unstable Hebbian Learning
Caleb Schultz Kisby, Saúl Blanco, Larry Moss
Class Incremental Learning with Multi-Teacher Distillation
Haitao Wen, Lili Pan, Yu Dai et al.
3752 Decoupled Textual Embeddings for Customized Image Generation
Yufei Cai, Yuxiang Wei, Zhilong Ji et al.
Parameter Efficient Self-Supervised Geospatial Domain Adaptation
Linus Scheibenreif, Michael Mommert, Damian Borth
8284 Federated X-armed Bandit
Wenjie Li, Qifan Song, Jean Honorio et al.
780 Learning Discriminative Noise Guidance for Image Forgery Detection and Localization
Jiaying Zhu, Dong Li, Xueyang Fu et al.
Beyond Seen Primitive Concepts and Attribute-Object Compositional Learning
Nirat Saini, Khoi Pham, Abhinav Shrivastava
3515 Protein 3D Graph Structure Learning for Robust Structure-Based Protein Property Prediction
Yufei Huang, Siyuan Li, Lirong Wu et al.
10783 The Complexity of Optimizing Atomic Congestion
Cornelius Brand, Robert Ganian, Subrahmanyam Kalyanasundaram et al.
8318 Optimal Transport with Tempered Exponential Measures
Ehsan Amid, Frank Nielsen, Richard Nock et al.
11879 Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari, David Arbour, Georgios Theocharous et al.
2243 ADA-GAD: Anomaly-Denoised Autoencoders for Graph Anomaly Detection
Junwei He, Qianqian Xu, Yangbangyan Jiang et al.
6402 NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Linsheng Chen, Guangrun Wang, Liuchun Yuan et al.
5602 Consistency-GAN: Training GANs with Consistency Model
Yunpeng Wang, Meng Pang, Shengbo Chen et al.
10491 Multi-Granularity Causal Structure Learning
Tianyi Chu, Wei Xing, Jiafu Chen et al.
GraFITi: Graphs for Forecasting Irregularly Sampled Time Series
Vijaya Krishna Yalavarthi, Kiran Madhusudhanan, Randolf Scholz et al.
Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction
Jiaxin Huang, Qi Wu, Yazhou Ren et al.
AnyScene: Customized Image Synthesis with Composited Foreground
Ruidong Chen, Lanjun Wang, Weizhi Nie et al.
Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection
Wenjun Hui, Zhenfeng Zhu, Shuai Zheng et al.
NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning
Mustafa B Gurbuz, Jean Moorman, Constantine Dovrolis
A Fixed-Parameter Tractable Algorithm for Counting Markov Equivalence Classes with the Same Skeleton
Vidya Sagar Sharma
Noisy One-point Homographies are Surprisingly Good
Yaqing Ding, Jonathan Astermark, Magnus Oskarsson et al.