Most Cited 2024 "latent dimension alignment" Papers
Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores
Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania et al.
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model
Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Ming Yan, Yan Zhang, Shuqiang Cai et al.
Certifiably Robust Image Watermark
Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.
Motion Diversification Networks
Hee Jae Kim, Eshed Ohn-Bar
Equivariant Matrix Function Neural Networks
Ilyes Batatia, Lars Leon Schaaf, Gábor Csányi et al.
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception
Bohan Li, Yasheng Sun, Jingxin Dong et al.
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black, Jing Shi, Yifei Fan et al.
Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization
Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa
Self-supervised visual learning from interactions with objects
Arthur Aubret, Céline Teulière, Jochen Triesch
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang, Caroline Wang, Xuesu Xiao et al.
TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification
Rui Song, Fausto Giunchiglia, Yingji Li et al.
Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Mingyu Zhang, Jiting Cai, Mingyu Liu et al.
REGLO: Provable Neural Network Repair for Global Robustness Properties
Feisi Fu, Zhilu Wang, Weichao Zhou et al.
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes
Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.
Context Enhanced Transformer for Single Image Object Detection in Video Data
Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.
HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos
Lixin Xue, Chen Guo, Chengwei Zheng et al.
Link Prediction in Multilayer Networks via Cross-Network Embedding
Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.
Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction
Kangkang Lu, Yanhua Yu, Hao Fei et al.
DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models
Yitian Liu, Zhouhui Lian
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB
Shengheng Liu, Xingkang Li, Zihuan Mao et al.
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
Hantao Yang, Xutong Liu, Zhiyong Wang et al.
Learning Visual Abstract Reasoning through Dual-Stream Networks
Kai Zhao, Chang Xu, Bailu Si
Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning
Yingtao Zhang, Jialin Zhao, Wenjing Wu et al.
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.
ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision
Ke Xu, Tsun Wai Siu, Rynson W.H. Lau
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields
Fangyin Wei, Hanlin Chen, Gim Hee Lee
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
Wenlong Liu, Tianyu Yang, Yuhan Wang et al.
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.
Comprehensive View Embedding Learning for Single-Cell Multimodal Integration
Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.
PQ-SAM: Post-training Quantization for Segment Anything Model
Xiaoyu Liu, Xin Ding, Lei Yu et al.
Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis
Francesco Kriegel
CountFormer: Multi-View Crowd Counting Transformer
Hong Mo, Xiong Zhang, Jianchao Tan et al.
Axiomatic Aggregations of Abductive Explanations
Gagan Biradar, Yacine Izza, Elita Lobo et al.
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Bowen Shi, Xiaopeng Zhang, Yaoming Wang et al.
RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting
Qi Wang, Ruijie Lu, Xudong Xu et al.
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li, Yibo Yang, Xiangtai Li et al.
Hierarchical Correlation Clustering and Tree Preserving Embedding
Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA
Chengen Lai, Shengli Song, Shiqi Meng et al.
UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model
Shuai Yuan, Lei Luo, Zhuo Hui et al.
LDReg: Local Dimensionality Regularized Self-Supervised Learning
Hanxun Huang, Ricardo Campello, Sarah Erfani et al.
Neural structure learning with stochastic differential equations
Benjie Wang, Joel Jennings, Wenbo Gong
DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactual Explanations
Maximilian Augustin, Yannic Neuhaus, Matthias Hein
GLDL: Graph Label Distribution Learning
Yufei Jin, Richard Gao, Yi He et al.
Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.
Intrinsic Single-Image HDR Reconstruction
Sebastian Dille, Chris Careaga, Yagiz Aksoy
Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models
Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.
Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
Ancong Wu, Wei-shi Zheng
Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study
Qiyu Kang, Kai Zhao, Yang Song et al.
Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior
Youngjae Cho, HeeSun Bae, Seungjae Shin et al.
Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning
Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang, Junliang He, Pengyu Wang et al.
A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis
Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.
A Primal-Dual Algorithm for Hybrid Federated Learning
Tom Overman, Garrett Blum, Diego Klabjan
Boosting Flow-based Generative Super-Resolution Models via Learned Prior
Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang et al.
Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views
Shuai Guo, Qiuwen Wang, Yijie Gao et al.
DIUSum: Dynamic Image Utilization for Multimodal Summarization
Min Xiao, Junnan Zhu, Feifei Zhai et al.
CityGuessr: City-Level Video Geo-Localization on a Global Scale
Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera
Chengxu Liu, Xuan Wang, Yuanting Fan et al.
ProMotion: Prototypes As Motion Learners
Yawen Lu, Dongfang Liu, Qifan Wang et al.
Learning Robust Rationales for Model Explainability: A Guidance-Based Approach
Shuaibo Hu, Kui Yu
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.
Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner
Mengfei Xia, Yujun Shen, Changsong Lei et al.
Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI
Sean I. Young, Yaël Balbastre, Bruce Fischl et al.
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
Bang Yang, Yong Dai, Xuxin Cheng et al.
Making Visual Sense of Oracle Bones for You and Me
Runqi Qiao, Lan Yang, Kaiyue Pang et al.
Bilateral Event Mining and Complementary for Event Stream Super-Resolution
Zhilin Huang, Quanmin Liang, Yijie Yu et al.
High-Fidelity Diffusion-Based Image Editing
Chen Hou, Guoqiang Wei, Zhibo Chen
Live and Learn: Continual Action Clustering with Incremental Views
Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation
Shoumeng Qiu, Jie Chen, Xinrun Li et al.
Layer-Wise Relevance Propagation with Conservation Property for ResNet
Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.
Statewide Visual Geolocalization in the Wild
Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.
Quality Assured: Rethinking Annotation Strategies in Imaging AI
Tim Rädsch, Annika Reinke, Vivienn Weru et al.
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos
Shankhanil Mitra, Rajiv Soundararajan
Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images
Ruohua Shi, Lingyu Duan, Tiejun Huang et al.
Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
Bowen Zhang, Tianyu Yang, Yu Li et al.
Unlocking the Power of Representations in Long-term Novelty-based Exploration
Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.
O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation
Muer Tie, Julong Wei, Zhengjun Wang et al.
Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation
Wei Cong, Yang Cong, Yuyang Liu et al.
Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes
Ziqian Bai, Feitong Tan, Sean Fanello et al.
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Tianyu Huang, Yihan Zeng, Bowen Dong et al.
A Unified and Interpretable Emotion Representation and Expression Generation
Reni Paskaleva, Mykyta Holubakha, Andela Ilic et al.
AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale
Keenon Werling, Janelle M Kaneda, Tian Tan et al.
Linear Log-Normal Attention with Unbiased Concentration
Yury Nahshan, Joseph Kampeas, Emir Haleva
Learning Subject-Aware Cropping by Outpainting Professional Photos
James Hong, Lu Yuan, Michaël Gharbi et al.
GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields
Xiufeng Huang, Ka Chun Cheung, Simon See et al.
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation
Ruijie Xu, Chuyu Zhang, Hui Ren et al.
AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation
Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.
Relation Rectification in Diffusion Model
Yinwei Wu, Xingyi Yang, Xinchao Wang
Learning Diffusion Models for Multi-View Anomaly Detection
Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.
DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction
Yuxin Yao, Siyu Ren, Junhui Hou et al.
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.
Semantics-aware Motion Retargeting with Vision-Language Models
Haodong Zhang, ZhiKe Chen, Haocheng Xu et al.
Intrinsic Phase-Preserving Networks for Depth Super Resolution
Xuanhong Chen, Hang Wang, Jinfan Liu et al.
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Hancheng Ye, Chong Yu, Peng Ye et al.
Poly-View Contrastive Learning
Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
Xinzhi Mu, Li Chen, Bohan Chen et al.
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Zidong Wang, Zeyu Lu, Di Huang et al.
SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection
Anay Majee, Ryan X Sharp, Rishabh Iyer
Bellman Optimal Stepsize Straightening of Flow-Matching Models
Bao Nguyen, Binh Nguyen, Viet Anh Nguyen
Taming Lookup Tables for Efficient Image Retouching
Sidi Yang, Binxiao Huang, Mingdeng Cao et al.
EraseDraw: Learning to Insert Objects by Erasing Them from Images
Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang, Jiaze Wang, Jinpeng Li et al.
CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution
Qingguo Liu, Chenyi Zhuang, Pan Gao et al.
Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets
Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Wenze Chen, Shiyu Huang, Yuan Chiang et al.
Toward Tiny and High-quality Facial Makeup with Data Amplify Learning
Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.
TriNeRFLet: A Wavelet Based Triplane NeRF Representation
Rajaei Khatib, Raja Giryes
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.
ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference
Jason Chun Lok Li, Steven Luo, Le Xu et al.
Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie et al.
High-Quality Facial Geometry and Appearance Capture at Home
Yuxuan Han, Junfeng Lyu, Feng Xu
Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos
Sagnik Majumder, Ziad Al-Halah, Kristen Grauman
MinD-3D: Reconstruct High-quality 3D objects in Human Brain
Jianxiong Gao, Yuqian Fu, Yun Wang et al.
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
Runsong Zhu, Shi Qiu, Qianyi Wu et al.
Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM
David Hug, Ignacio Alzugaray Lopez, Margarita Chli
VidLA: Video-Language Alignment at Scale
Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan et al.
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan, Artur Shatveryan, David Kocharian et al.
Adversarially Robust Few-shot Learning via Parameter Co-distillation of Similarity and Class Concept Learners
Junhao Dong, Piotr Koniusz, Junxi Chen et al.
UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation
Yunfan Lu, Guoqiang Liang, Yusheng Wang et al.
PELA: Learning Parameter-Efficient Models with Low-Rank Approximation
Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli
COMO: Compact Mapping and Odometry
Eric Dexheimer, Andrew Davison
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang, Jinqing Zhang, Yanan Zhang et al.
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao Li, Yining Liu, Na Dong et al.
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian, Shuangrui Ding, Dahua Lin
General Point Model Pretraining with Autoencoding and Autoregressive
Zhe Li, Zhangyang Gao, Cheng Tan et al.
ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou et al.
FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN
Riccardo Santambrogio, Marco Cannici, Matteo Matteucci
Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment
Simon Weber, Je Hyeong Hong, Daniel Cremers
Time-Efficient Light-Field Acquisition Using Coded Aperture and Events
Shuji Habuchi, Keita Takahashi, Chihiro Tsutake et al.
Event-based Structure-from-Orbit
Ethan Elms, Yasir Latif, Tae Ha Park et al.
StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation
Yining Shi, Kun Jiang, Ke Wang et al.
Cross Initialization for Face Personalization of Text-to-Image Models
Lianyu Pang, Jian Yin, Haoran Xie et al.
Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective
Zhaoxin Wang, Handing Wang, Cong Tian et al.
Camera Calibration using a Collimator System
Shunkun Liang, Banglei Guan, Zhenbao Yu et al.
CoRe-GD: A Hierarchical Framework for Scalable Graph Visualization with GNNs
Florian Grötschla, Joël Mathys, Róbert Veres et al.
Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering
Benjamin Attal, Dor Verbin, Ben Mildenhall et al.
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang, Lu Bin, Xinyu Xiao et al.
Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation
Jinpeng Liu, Wenxun Dai, Chunyu Wang et al.
Self-Supervised Representation Learning for Adversarial Attack Detection
Yi Li, Plamen Angelov, Neeraj Suri
On the Limitations of Temperature Scaling for Distributions with Overlaps
Muthu Chidambaram, Rong Ge
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.
Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images
Zhangjin Huang, Zhihao Liang, Kui Jia
Uncertainty-aware Graph-based Hyperspectral Image Classification
Linlin Yu, Yifei Lou, Feng Chen
Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection
Suyeon Kim, Dongha Lee, SeongKu Kang et al.
Snuffy: Efficient Whole Slide Image Classifier
Hossein Jafarinia, Alireza Alipanah, Saeed Razavi et al.
Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable
En-Hui Yang, Linfeng Ye
Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models
Vitali Petsiuk, Kate Saenko
Towards Scene Graph Anticipation
Rohith Peddi, Saksham Singh, Saurabh . et al.
Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence
Yutong Chen, Yifan Zhan, Zhihang Zhong et al.
Task-Aware Encoder Control for Deep Video Compression
Xingtong Ge, Jixiang Luo, Xinjie Zhang et al.
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu, Seohyun Lim, Hyunjung Shim
Un-Mixing Test-Time Normalization Statistics: Combatting Label Temporal Correlation
Devavrat Tomar, Guillaume Vray, Jean-Philippe Thiran et al.
Towards Physical World Backdoor Attacks against Skeleton Action Recognition
Qichen Zheng, Yi Yu, Siyuan Yang et al.
Neural Auto-designer for Enhanced Quantum Kernels
Cong Lei, Yuxuan Du, Peng Mi et al.
Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets
Qin Lei, Jiang Zhong, Qizhu Dai
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann, Markus Ryll, Alex Bewley et al.
Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems
Yasar Utku Alcalar, Mehmet Akcakaya
Zero-Shot Image Feature Consensus with Deep Functional Maps
Xinle Cheng, Congyue Deng, Adam Harley et al.
Beam Enumeration: Probabilistic Explainability For Sample Efficient Self-conditioned Molecular Design
Jeff Guo, Philippe Schwaller
Prompting Future Driven Diffusion Model for Hand Motion Prediction
Bowen Tang, Kaihao Zhang, Wenhan Luo et al.
Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search
Haosen Sun, Lujun Li, Peijie Dong et al.
SLIM: Spuriousness Mitigation with Minimal Human Annotations
Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.
Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation
Xiaoyu Liu, Miaomiao Cai, Yinda Chen et al.
Clustering for Protein Representation Learning
Ruijie Quan, Wenguan Wang, Fan Ma et al.
RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
Tatiana Gaintseva, Martin Benning, Greg Slabaugh
D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On
Zhaotong Yang, Zicheng Jiang, Xinzhe Li et al.
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
Avery Ma, Yangchen Pan, Amir-massoud Farahmand
Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer
Xueyi Liu, Kangbo Lyu, Jieqiong Zhang et al.
Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning
Xialei Liu, Jiang-Tian Zhai, Andrew Bagdanov et al.
In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing
Yiran Xu, Zhixin Shu, Cameron Smith et al.
Relational Matching for Weakly Semi-Supervised Oriented Object Detection
Wenhao Wu, Hau San Wong, Si Wu et al.
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong, Min Zhao, Zebin You et al.
Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
Jie Ruan, Xiao Pu, Mingqi Gao et al.
Neural Reasoning about Agents’ Goals, Preferences, and Actions
Matteo Bortoletto, Lei Shi, Andreas Bulling
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain
Xiang He, Dongcheng Zhao, Yang Li et al.
Regret Analysis of Repeated Delegated Choice
Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.
Diffusion-FOF: Single-View Clothed Human Reconstruction via Diffusion-Based Fourier Occupancy Field
Yuanzhen Li, Fei Luo, Chunxia Xiao
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
Lebin Yu, Yunbo Qiu, Quanming Yao et al.
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
Jamie Watson, Filippo Aleotti, Mohamed Sayed et al.
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering
Charig Yang, Weidi Xie, Andrew Zisserman
Improved Anonymous Multi Agent Path Finding Algorithm
Zain Alabedeen Ali, Konstantin Yakovlev
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen, Ning Liu, Yichen Zhu et al.
Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning
Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.
Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process
Mingjie Xu, Feng Lu
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile
Seokjun Lee, Seung-Won Jung, Hyunseok Seo
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob Hollenstein, Georg Martius, Justus Piater
Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning
Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.
Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding
Depeng Li, Tianqi Wang, Junwei Chen et al.
Hierarchical Aligned Multimodal Learning for NER on Tweet Posts
Peipei Liu, Hong Li, Yimo Ren et al.
PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation
Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.
RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
Ming Chang, Xishan Zhang, Rui Zhang et al.
Residual Hyperbolic Graph Convolution Networks
Yangkai Xue, Jindou Dai, Zhipeng Lu et al.
Improved Metric Distortion via Threshold Approvals
Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.
IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers
Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.
Graph Context Transformation Learning for Progressive Correspondence Pruning
Junwen Guo, Guobao Xiao, Shiping Wang et al.
DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions
Yunxiao Shi, Manish Singh, Hong Cai et al.