Most Cited 2024 "human motion retrieval" Papers
Conference
AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion
Beibei Jing, Youjia Zhang, Zikai Song et al.
Expressive Multi-Agent Communication via Identity-Aware Learning
Wei Du, Shifei Ding, Lili Guo et al.
Structured Probabilistic Coding
Dou Hu, Lingwei Wei, Yaxin Liu et al.
Learning to Manipulate Artistic Images
Wei Guo, Yuqi Zhang, De Ma et al.
Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation
Xingyu Liu, Pengfei Ren, Yuanyuan Gao et al.
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition
Jiadong Wang, Zexu Pan, Malu Zhang et al.
Pandora’s Problem with Deadlines
Ben Berger, Tomer Ezra, Michal Feldman et al.
Revisit Human-Scene Interaction via Space Occupancy
Xinpeng Liu, Haowen Hou, Yanchao Yang et al.
Zero-Shot Detection of AI-Generated Images
Davide Cozzolino, Giovanni Poggi, Matthias Niessner et al.
Transient Glimpses: Unveiling Occluded Backgrounds through the Spike Camera
Jiyuan Zhang, Shiyan Chen, Yajing Zheng et al.
SCP: Spherical-Coordinate-Based Learned Point Cloud Compression
Ao Luo, Linxin Song, Keisuke Nonaka et al.
AVSegFormer: Audio-Visual Segmentation with Transformer
Shengyi Gao, Zhe Chen, Guo Chen et al.
From Retrieval to Generation: A Simple and Unified Generative Model for End-to-End Task-Oriented Dialogue
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong, WonJun Moon, SuBeen Lee et al.
Class-Attribute Priors: Adapting Optimization to Heterogeneity and Fairness Objective
Xuechen Zhang, Mingchen Li, Jiasi Chen et al.
Dual-Channel Learning Framework for Drug-Drug Interaction Prediction via Relation-Aware Heterogeneous Graph Transformer
Xiaorui Su, Pengwei Hu, Zhu-Hong You et al.
Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space
Mohsin Hasan, Guojun Zhang, Kaiyang Guo et al.
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.
Time-Aware Knowledge Representations of Dynamic Objects with Multidimensional Persistence
Baris Coskunuzer, Ignacio Segovia-Dominguez, Yuzhou Chen et al.
R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Control Diffusion
Jinxiu Liu, Qi Liu
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving
Lei Gong, Yu Zhang, Yingqing Xia et al.
Neural Physical Simulation with Multi-Resolution Hash Grid Encoding
Haoxiang Wang, Tao Yu, Tianwei Yang et al.
Noise-Aware Image Captioning with Progressively Exploring Mismatched Words
Zhongtian Fu, Kefei Song, Luping Zhou et al.
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling, Zhihai Wang, Jie Wang
High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field
Minghan Qin, Yifan Liu, Yuelang Xu et al.
HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions
Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral et al.
FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds
Keke Tang, Lujie Huang, Weilong Peng et al.
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
Jinxing Zhou, Dan Guo, Yuxin Mao et al.
WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians
Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.
Decentralized Gradient-Free Methods for Stochastic Non-smooth Non-convex Optimization
Zhenwei Lin, Jingfan Xia, Qi Deng et al.
When Are Two Lists Better than One?: Benefits and Harms in Joint Decision-Making
Kate Donahue, Sreenivas Gollapudi, Kostas Kollias
Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection
Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.
EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding
Wenhua Wu, Qi Wang, Guangming Wang et al.
What Are the Rules? Discovering Constraints from Data
Boris Wiegand, Dietrich Klakow, Jilles Vreeken
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu, Cheng Zhou, Yizheng Zhang et al.
GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth
Aurélien Cecille, Stefan Duffner, Franck Davoine et al.
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels
Haoran Liu, Ying Ma, Ming Yan et al.
Decomposing Temporal Equilibrium Strategy for Coordinated Distributed Multi-Agent Reinforcement Learning
Chenyang Zhu, Wen Si, Jinyu Zhu et al.
Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text Classification
Wei Tan, Ngoc Dang Nguyen, Lan Du et al.
PaintHuman: Towards High-Fidelity Text-to-3D Human Texturing via Denoised Score Distillation
Jianhui Yu, Hao Zhu, Liming Jiang et al.
Parameterization of (Partial) Maximum Satisfiability above Matching in a Variable-Clause Graph
Vasily Alferov, Ivan Bliznets, Kirill Brilliantov
EAFormer: Scene Text Segmentation with Edge-Aware Transformers
Haiyang Yu, Teng Fu, Bin Li et al.
Pushing the Limit of Fine-Tuning for Few-Shot Learning: Where Feature Reusing Meets Cross-Scale Attention
Ying-Yu Chen, Jun-Wei Hsieh, Xin Li et al.
Locally Rainbow Paths
Till Fluschnik, Leon Kellerhals, Malte Renken
Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation
Nina Weng, Paraskevas Pegios, Eike Petersen et al.
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image
Nailei Hei, Qianyu Guo, Zihao Wang et al.
Federated Label-Noise Learning with Local Diversity Product Regularization
Xiaochen Zhou, Xudong Wang
Gramformer: Learning Crowd Counting via Graph-Modulated Transformer
Hui Lin, Zhiheng Ma, Xiaopeng Hong et al.
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation
ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.
GIN-SD: Source Detection in Graphs with Incomplete Nodes via Positional Encoding and Attentive Fusion
Le Cheng, Peican Zhu, Keke Tang et al.
PrefAce: Face-Centric Pretraining with Self-Structure Aware Distillation
Siyuan Hu, Zheng Wang, Peng Hu et al.
Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models
Taesup Kim, Donggeun Kim
DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations
Ruilu Wang, Yang Xue, Lianwen Jin
GSENet: Global Semantic Enhancement Network for Lane Detection
Junhao Su, Zhenghan Chen, Chenghao He et al.
Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition
Hanxuan Li, Bin Fu, Ruiping Wang et al.
A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis
Xiang Liu, Zhaoxiang Liu, Huan Hu et al.
Online Temporal Action Localization with Memory-Augmented Transformer
Youngkil Song, Dongkeun Kim, Minsu Cho et al.
SUMix: Mixup with Semantic and Uncertain Information
Huafeng Qin, Xin Jin, Hongyu Zhu et al.
Disentangled Generation and Aggregation for Robust Radiance Fields
Shihe Shen, Huachen Gao, Wangze Xu et al.
MusER: Musical Element-Based Regularization for Generating Symbolic Music with Emotion
Shulei Ji, Xinyu Yang
MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation
Jiaxi Jiang, Paul Streli, Xuejing Luo et al.
Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network
Spanning the Spectrum of Hatred Detection: A Persian Multi-Label Hate Speech Dataset with Annotator Rationales
MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts
Can Large Language Models Understand Real-World Complex Instructions?
The Irrelevance of Influencers: Information Diffusion with Re-Activation and Immunity Lasts Exponentially Long on Social Network Models
Reinforcement Learning and Data-Generation for Syntax-Guided Synthesis
Efficient NeRF Optimization - Not All Samples Remain Equally Hard
Juuso Korhonen, Goutham Rangu, Hamed Rezazadegan Tavakoli et al.
Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence
Hongyuan Wang, Lizhi Wang, Jiang Xu et al.
Learning Accurate and Bidirectional Transformation via Dynamic Embedding Transportation for Cross-Domain Recommendation
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger
Other Papers
Spline-based Transformers
Prashanth Chandran, Agon Serifi, Markus Gross et al.
Debiasing surgeon: fantastic weights and how to find them
Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen et al.
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi, Hyeonseop Song, Jaechul Kim et al.
Minibatch Stochastic Three Points Method for Unconstrained Smooth Minimization
Soumia Boucherouite, Grigory Malinovsky, Peter Richtarik et al.
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
Xiawei Li, Qingyuan Xu, Jing Zhang et al.
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
Fan Ma, Xiaojie Jin, Heng Wang et al.
Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually
Mazal Bethany, Brandon Wherry, Nishant Vishwamitra et al.
Supervision Interpolation via LossMix: Generalizing Mixup for Object Detection and Beyond
Thanh Vu, Baochen Sun, Bodi Yuan et al.
MagMax: Leveraging Model Merging for Seamless Continual Learning
Daniel Marczak, Bartlomiej Twardowski, Tomasz Trzcinski et al.
SceneTeller: Language-to-3D Scene Generation
Basak Melis Ocal, Maxim Tatarchenko, Sezer Karaoglu et al.
Online Vectorized HD Map Construction using Geometry
Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding et al.
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun, Hang Zhou, Wengang Zhou et al.
DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
Hanyang Kong, Dongze Lian, Michael Bi Mi et al.
RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation
Luis Li, Hubert P. H. Shum, Toby P Breckon
Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics
Shuai Yang, ZhiFei Chen, Pengguang Chen et al.
KGTS: Contrastive Trajectory Similarity Learning over Prompt Knowledge Graph Embedding
Zhen Chen, Dalin Zhang, Shanshan Feng et al.
DS-AL: A Dual-Stream Analytic Learning for Exemplar-Free Class-Incremental Learning
Huiping Zhuang, Run He, Kai Tong et al.
Open-Set Graph Domain Adaptation via Separate Domain Alignment
Yu Wang, Ronghang Zhu, Pengsheng Ji et al.
EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce
Li Yangning, Shirong Ma, Xiaobin Wang et al.
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi
Learning Cluster-Wise Anchors for Multi-View Clustering
Chao Zhang, Xiuyi Jia, Zechao Li et al.
TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text Arrangement
Yang Fan, Xiangping Wu, Qingcai Chen et al.
Painterly Image Harmonization by Learning from Painterly Objects
Li Niu, Junyan Cao, Yan Hong et al.
Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator
Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi et al.
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games
Pengdeng Li, Runsheng Yu, Xinrun Wang et al.
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
Zachary Horvitz, Ajay Patel, Chris Callison-Burch et al.
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning
Diffusion-Guided Weakly Supervised Semantic Segmentation
Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong et al.
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
Mengchen Zhang, Tong Wu, Tai Wang et al.
CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-On
Chenghu Du, Junyin Wang, Yi Rong et al.
GSN: Generalisable Segmentation in Neural Radiance Field
Siddharth Barman, Umang Bhaskar, Yeshwant Pandit et al.
Linking in Style: Understanding learned features in deep learning models
Maren Wehrheim, Pamela Osuna Vargas, Matthias Kaschube
Nearly Equitable Allocations beyond Additivity and Monotonicity
Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration
Youngjin Oh, Keuntek Lee, Jooyoung Lee et al.
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He, Anjie Zhu, Shuang Liang et al.
GridFormer: Point-Grid Transformer for Surface Reconstruction
Shengtao Li, Ge Gao, Yudong Liu et al.
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval
Shenshen Li, Chen He, Xing Xu et al.
Learning Multi-Task Sparse Representation Based on Fisher Information
Yayu Zhang, Yuhua Qian, Guoshuai Ma et al.
WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting
Zhiliang Wu, Changchang Sun, Hanyu Xuan et al.
Defeasible Normative Reasoning: A Proof-Theoretic Integration of Logical Argumentation
Ofer Arieli, Kees van Berkel, Christian Straßer
Compact 3D Scene Representation via Self-Organizing Gaussian Grids
Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.
Fully Authentic Visual Question Answering Dataset from Online Communities
Chongyan Chen, Mengchen Liu, Noel C Codella et al.
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding
Qi Zuo, Xiaodong Gu, Yuan Dong et al.
KG-TREAT: Pre-training for Treatment Effect Estimation by Synergizing Patient Data with Knowledge Graphs
Ruoqi Liu, Lingfei Wu, Ping Zhang
Cross-Domain Contrastive Learning for Time Series Clustering
Furong Peng, Jiachen Luo, Xuan Lu et al.
Teach CLIP to Develop a Number Sense for Ordinal Regression
Yao Du, Qiang Zhai, Weihang Dai et al.
HACDR-Net: Heterogeneous-Aware Convolutional Network for Diabetic Retinopathy Multi-Lesion Segmentation
QiHao Xu, Xiaoling Luo, Chao Huang et al.
A Twist for Graph Classification: Optimizing Causal Information Flow in Graph Neural Networks
Zhe Zhao, Pengkun Wang, HaiBin Wen et al.
Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning
Chen Bai, Jianwang Zhai, Yuzhe Ma et al.
Fully Sparse 3D Occupancy Prediction
Haisong Liu, Yang Chen, Haiguang Wang et al.
Designing Biological Sequences without Prior Knowledge Using Evolutionary Reinforcement Learning
Xi Zeng, Xiaotian Hao, Hongyao Tang et al.
Generation and Classification Reframing
Fenglong Ma, Hongyang Chen, Hong Yu et al.
Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation
Fangfu Liu, Hanyang Wang, Weiliang Chen et al.
Towards More Likely Models for AI Planning
Turgay Caglar, Sirine Belhaj, Tathagata Chakraborty et al.
Long-Tailed Partial Label Learning by Head Classifier and Tail Classifier Cooperation
Yuheng Jia, Xiaorui Peng, Ran Wang et al.
MagiCapture: High-Resolution Multi-Concept Portrait Customization
Junha Hyung, Jaeyo Shin, Jaegul Choo
Bayesian Inference with Complex Knowledge Graph Evidence
Armin Toroghi, Scott Sanner
Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging
Wenhua Wu, Kun Hu, Wenxi Yue et al.
Text to Layer-wise 3D Clothed Human Generation
Junting Dong, Qi Fang, Zehuan Huang et al.
Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates
Kyle Mana, Fernando Acero, Stephen Mak et al.
Structural Entropy Based Graph Structure Learning for Node Classification
Liang Duan, Xiang Chen, Wenjie Liu et al.
SEIT: Structural Enhancement for Unsupervised Image Translation in Frequency Domain
Zhifeng Zhu, Yaochen Li, Yifan Li et al.
Unsupervised Representation Learning by Balanced Self Attention Matching
Daniel Shalam, Simon Korman
Online Sensitivity Optimization in Differentially Private Learning
Filippo Galli, Catuscia Palamidessi, Tommaso Cucinotta
Local-Global Multi-Modal Distillation for Weakly-Supervised Temporal Video Grounding
Peijun Bao, Yong Xia, Wenhan Yang et al.
Robust Loss Functions for Training Decision Trees with Noisy Labels
Jonathan Wilton, Nan Ye
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval
Han Zhou, Wei Dong, Xiaohong Liu et al.
Occluded Person Re-identification via Saliency-Guided Patch Transfer
Lei Tan, Jiaer Xia, Wenfeng Liu et al.
Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Hu Cao, Zehua Zhang, Yan Xia et al.
WeakPCSOD: Overcoming the Bias of Box Annotations for Weakly Supervised Point Cloud Salient Object Detection
Jun Wei, S. Kevin Zhou, Shuguang Cui et al.
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning
Amandeep Kumar, Muhammad Awais, Sanath Narayan et al.
SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild
Pengfei Wang, Xiaofei Hui, Jing Wu et al.
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel, Ron Aharon Shapira Weber, Nir Mualem et al.
Robust Fitting on a Gate Quantum Computer
Frances Yang, Michele Sasdelli, Tat-Jun Chin
Unveiling Implicit Deceptive Patterns in Multi-Modal Fake News via Neuro-Symbolic Reasoning
Yiqi Dong, Dongxiao He, Xiaobao Wang et al.
Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion
Na Fan, Zeyue Tian, Amartansh Dubey et al.
WaveNet: Tackling Non-stationary Graph Signals via Graph Spectral Wavelets
Zhirui Yang, Yulan Hu, Sheng Ouyang et al.
High-Fidelity Gradient Inversion in Distributed Learning
Zipeng Ye, Wenjian Luo, Qi Zhou et al.
Novelty vs. Potential Heuristics: A Comparison of Hardness Measures for Satisficing Planning
Simon Dold, Malte Helmert
TaskLAMA: Probing the Complex Task Understanding of Language Models
Quan Yuan, Mehran Kazemi, Xin Xu et al.
Enhancing Multi-Scale Diffusion Prediction via Sequential Hypergraphs and Adversarial Learning
Pengfei Jiao, Hongqian Chen, Qing Bao et al.
R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Changhoon Kim, Kyle Min, Yezhou Yang
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.
Quality-Diversity Generative Sampling for Learning with Synthetic Data
Allen Chang, Matthew C. Fontaine, Serena Booth et al.
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
Dylan Li, Gyungin Shin
Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams
Ziqiang Wang, Zhixiang Chi, Yanan Wu et al.
SemTra: A Semantic Skill Translator for Cross-Domain Zero-Shot Policy Adaptation
Sangwoo Shin, Minjong Yoo, Jeongwoo Lee et al.
Ced-NeRF: A Compact and Efficient Method for Dynamic Neural Radiance Fields
Kinetic Typography Diffusion Model
Seonmi Park, Inhwan Bae, Seunghyun Shin et al.
Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling
Wonwoong Cho, Hareesh Ravi, Midhun Harikumar et al.
Zero-Sum Games between Mean-Field Teams: Reachability-Based Analysis under Mean-Field Sharing
Yue Guan, Mohammad Afshari, Panagiotis Tsiotras
Generalized Coverage for More Robust Low-Budget Active Learning
Wonho Bae, Junhyug Noh, Danica J. Sutherland
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Basile Van Hoorick, Rundi Wu, Ege Ozguroglu et al.
Towards Automated Chinese Ancient Character Restoration: A Diffusion-Based Method with a New Dataset
Haolong Li, Chenghao Du, Ziheng Jiang et al.
SDGAN: Disentangling Semantic Manipulation for Facial Attribute Editing
Wenmin Huang, Weiqi Luo, Jiwu Huang et al.
A General Implicit Framework for Fast NeRF Composition and Rendering
Xinyu Gao, Ziyi Yang, Yunlu Zhao et al.
Efficient Vision Transformers with Partial Attention
Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.
Theoretical Aspects of Generating Instances with Unique Solutions: Pre-assignment Models for Unique Vertex Cover
Takashi Horiyama, Yasuaki Kobayashi, Hirotaka Ono et al.
Heterogeneous Test-Time Training for Multi-Modal Person Re-identification
Zi Wang, Huaibo Huang, Aihua Zheng et al.
Successive POI Recommendation via Brain-Inspired Spatiotemporal Aware Representation
Gehua Ma, He Wang, Jingyuan Zhao et al.
Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection
Lianjun Wu, Jiangxiao Han, Zengqiang Zheng et al.
SIGMA: Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.
Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model
Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong
Low-Rank Kernel Tensor Learning for Incomplete Multi-View Clustering
Tingting Wu, Songhe Feng, Jiazheng Yuan
Privileged Prior Information Distillation for Image Matting
Cheng Lyu, Jiake Xie, Bo Xu et al.
Graph Learning in 4D: A Quaternion-Valued Laplacian to Enhance Spectral GCNs
Stefano Fiorini, Stefano Coniglio, Michele Ciavotta et al.
MorphVAE: Advancing Morphological Design of Voxel-Based Soft Robots with Variational Autoencoders
Junru Song, Yang Yang, Wei Peng et al.
RWMS: Reliable Weighted Multi-Phase for Semi-supervised Segmentation
Wensi Liu, Xiao-Yu Tang, Chong Yang et al.
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints
Rui Yin, Yulun Zhang, Zherong Pan et al.
Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling
Shujuan Li, Junsheng Zhou, Baorui Ma et al.
AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object Detection
Jingchun Zhou, Zongxin He, Kin-Man Lam et al.
AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack
Ruikui Wang, Yuanfang Guo, Yunhong Wang
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision
Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah
Sampling-Resilient Multi-Object Tracking
Zepeng Li, Dongxiang Zhang, Sai Wu et al.
Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views
Ningli Xu, Rongjun Qin
Federated Graph Learning under Domain Shift with Generalizable Prototypes
Guancheng Wan, Wenke Huang, Mang Ye
Mitigating Label Noise through Data Ambiguation
Julian Lienen, Eyke Hüllermeier
Hyperbolic Graph Diffusion Model
Lingfeng Wen, Xuan Tang, Mingjie Ouyang et al.
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar, Sachidanand VS, Sabariswaran Mani et al.
Mutual-Modality Adversarial Attack with Semantic Perturbation
Jingwen Ye, Ruonan Yu, Songhua Liu et al.
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
Jeongsol Kim, Geon Yeong Park, Jong Chul Ye
CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer
Yabing Wang, Fan Wang, Jianfeng Dong et al.
Adaptive Meta-Learning Probabilistic Inference Framework for Long Sequence Prediction
Jianping Zhu, Xin Guo, Yang Chen et al.
Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery
Chao Wang, Zhedong Zheng, Ruijie Quan et al.
You Only Read Once: Constituency-Oriented Relational Graph Convolutional Network for Multi-Aspect Multi-Sentiment Classification
Yongqiang Zheng, Xia Li
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen, Yue Ma, Yu Qiao et al.
An Implicit Trust Region Approach to Behavior Regularized Offline Reinforcement Learning
Zhe Zhang, Xiaoyang Tan