🧬 Representation Learning

Disentangled Representations

Learning factorized representations

100 papers

2,121 total citations

Feb '24 to Jan '26: 668 papers

Top Conferences

ICLR: 35, AAAI: 24, CVPR: 23, ICCV: 6, NeurIPS: 6, ECCV: 4

Top Papers

#1

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Sihyun Yu, Sangkyung Kwak, Huiwon Jang et al.

ICLR 2025, arXiv:2410.06940

diffusion transformers, representation alignment, generative diffusion models, denoising networks, +3 more

308 citations

#2

EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations

Yi-Lun Liao, Brandon Wood, Abhishek Das et al.

Linearity of Relation Decoding in Transformer Language Models

Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.

Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining

Xiang Chen, Jinshan Pan, Jiangxin Dong

Frequency-Spatial Entanglement Learning for Camouflaged Object Detection

Yanguang Sun, Chunyan Xu, Jian Yang et al.

FINER: Flexible Spectral-bias Tuning in Implicit NEural Representation by Variable-periodic Activation Functions

Zhen Liu, Hao Zhu, Qi Zhang et al.

Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders

Yaohua Zha, Huizhen Ji, Jinmin Li et al.

AAAI 2024, arXiv:2312.10726

masked autoencoders, 3d representation learning, point cloud pre-training, transformer encoder, +4 more

61 citations

#8

Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks

Marc Rußwurm, Konstantin Klemmer, Esther Rolf et al.

Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion

Kiran Chhatre, Radek Danecek, Nikos Athanasiou et al.

Sonata: Self-Supervised Learning of Reliable Point Representations

Xiaoyang Wu, Daniel DeTone, Duncan Frost et al.

Scaling Language-Free Visual Representation Learning

David Fan, Shengbang Tong, Jiachen Zhu et al.

ICCV 2025, arXiv:2504.01017

visual self-supervised learning, contrastive language-image pretraining, multimodal representation learning, vision encoders, +2 more

39 citations

#12

Disentangled Prompt Representation for Domain Generalization

De Cheng, Zhipeng Xu, Xinyang Jiang et al.

Rethinking Generalizable Face Anti-spoofing via Hierarchical Prototype-guided Distribution Refinement in Hyperbolic Space

Chengyang Hu, Ke-Yue Zhang, Taiping Yao et al.

Random Feature Amplification: Feature Learning and Generalization in Neural Networks

Spencer Frei, Niladri Chatterji, Peter L. Bartlett

Dataset Distillation with Neural Characteristic Function: A Minmax Perspective

Shaobo Wang, Yicun Yang, Zhiyuan Liu et al.

DTL: Disentangled Transfer Learning for Visual Recognition

Minghao Fu, Ke Zhu, Jianxin Wu

AAAI 2024, arXiv:2312.07856

parameter-efficient transfer learning, visual recognition, gpu memory reduction, disentangled representation learning, +4 more

25 citations

#17

Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing

Xinghe Fu, Zhiyuan Yan, Taiping Yao et al.

FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning

Chenhao Li, Elijah Stanger-Jones, Steve Heim et al.

Learning Equi-angular Representations for Online Continual Learning

Minhyuk Seo, Hyunseo Koh, Wonje Jeung et al.

Rethinking Multi-view Representation Learning via Distilled Disentangling

Guanzhou Ke, Bo Wang, Xiao-Li Wang et al.

On the Provable Advantage of Unsupervised Pretraining

Jiawei Ge, Shange Tang, Jianqing Fan et al.

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Ben Eisner, Yi Yang, Todor Davchev et al.

FoldToken: Learning Protein Language via Vector Quantization and Beyond

Zhangyang Gao, Cheng Tan, Jue Wang et al.

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification

Chao Yi, Lu Ren, De-Chuan Zhan et al.

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

Fiona Ryan, Ajay Bati, Sangmin Lee et al.

TIME-FS: Joint Learning of Tensorial Incomplete Multi-View Unsupervised Feature Selection and Missing-View Imputation

Yanyong Huang, Minghui Lu, Wei Huang et al.

Personalized Federated Collaborative Filtering: A Variational AutoEncoder Approach

Zhiwei Li, Guodong Long, Tianyi Zhou et al.

Adaptive Length Image Tokenization via Recurrent Allocation

Shivam Duggal, Phillip Isola, Antonio Torralba et al.

Scalable Image Tokenization with Index Backpropagation Quantization

Fengyuan Shi, Zhuoyan Luo, Yixiao Ge et al.

Efficient Learning with Sine-Activated Low-Rank Matrices

Yiping Ji, Hemanth Saratchandran, Cameron Gordon et al.

Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective

Jinjing Zhao, Fangyun Wei, Chang Xu

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao et al.

Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning

Patrik Reizinger, Siyuan Guo, Ferenc Huszar et al.

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction

Zhaoxi Mu, Xinyu Yang, Sining Sun et al.

AAAI 2024, arXiv:2312.10305

disentangled representation learning, target speech extraction, speaker identity disentanglement, adaptive modulation transformer, +4 more

12 citations

#37

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Huiqun Li, Hanhan Zhou, Yifei Zou et al.

AAAI 2024, arXiv:2312.15555

value function factorization, multi-agent reinforcement learning, non-monotonic mixing functions, concave representations, +3 more

12 citations

#39

A Unifying Framework for Representation Learning

Shaden Alshammari, John Hershey, Axel Feldmann et al.

Unsupervised Foundation Model-Agnostic Slide-Level Representation Learning

Tim Lenz, Peter Neidlinger, Marta Ligero et al.

Long-Sequence Recommendation Models Need Decoupled Embeddings

Ningya Feng, Junwei Pan, Jialong Wu et al.

ICLR 2025, arXiv:2410.02604

long-sequence recommendation, attention mechanism, user behavior modeling, embedding decoupling, +2 more

11 citations

#42

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

Interaction Asymmetry: A General Principle for Learning Composable Abstractions

Jack Brady, Julius von Kügelgen, Sebastien Lachapelle et al.

VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers

Juncan Deng, Shuaiting Li, Zeyu Wang et al.

FedLF: Layer-Wise Fair Federated Learning

Zibin Pan, Chi Li, Fangchen Yu et al.

Colour Passing Revisited: Lifted Model Construction with Commutative Factors

Malte Luttermann, Tanya Braun, Ralf Möller et al.

AAAI 2024, arXiv:2309.11236

lifted probabilistic inference, symmetry detection, probabilistic model compression, colour passing algorithm, +4 more

10 citations

#47

Plastic Learning with Deep Fourier Features

Alex Lewandowski, Dale Schuurmans, Marlos C. Machado

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

Ancong Wu, Wei-shi Zheng

TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Rui Song, Fausto Giunchiglia, Yingji Li et al.

AAAI 2024, arXiv:2312.17263

cross-domain text classification, feature disentanglement, domain-invariant features, variational auto-encoders, +4 more

9 citations

#50

Deep Nonlinear Sufficient Dimension Reduction

Yinfeng Chen, Yuling Jiao, Rui Qiu et al.

The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?

Denis Sutter, Julian Minder, Thomas Hofmann et al.

Link Prediction in Multilayer Networks via Cross-Network Embedding

Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.

Memory-Scalable and Simplified Functional Map Learning

Robin Magnet, Maks Ovsjanikov

SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning

Seokju Yun, Seunghye Chae, Dongheon Lee et al.

CVPR 2025, arXiv:2412.04077

domain generalization, parameter efficient fine tuning, singular value decomposition, semantic segmentation, +4 more

8 citations

#55

REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

Ziqiao Wang, Wangbo Zhao, Yuhao Zhou et al.

Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning

Gangwei Jiang, Caigao Jiang, Zhaoyi Li et al.

Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks

Lukas Braun, Erin Grant, Andrew Saxe

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries

Chris Kolb, Tobias Weber, Bernd Bischl et al.

Synthetic Prior for Few-Shot Drivable Head Avatar Inversion

Wojciech Zielonka, Stephan J. Garbin, Alexandros Lattas et al.

Erase Then Rectify: A Training-Free Parameter Editing Approach for Cost-Effective Graph Unlearning

Zhe-Rui Yang, Jindong Han, Chang-Dong Wang et al.

FreSh: Frequency Shifting for Accelerated Neural Representation Learning

Adam Kania, Marko Mihajlovic, Sergey Prokudin et al.

Fast Encoding and Decoding for Implicit Video Representation

Hao Chen, Saining Xie, Ser-Nam Lim et al.

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Chenbei Lu, Laixi Shi, Zaiwei Chen et al.

Implicit Neural Representations and the Algebra of Complex Wavelets

T Mitchell Roddenberry, Vishwanath Saragadam, Maarten V de Hoop et al.

UNR-Explainer: Counterfactual Explanations for Unsupervised Node Representation Learning Models

Hyunju Kang, Geonhee Han, Hogun Park

Towards the Disappearing Truth: Fine-Grained Joint Causal Influences Learning with Hidden Variable-Driven Causal Hypergraphs

Kun Zhu, Chunhui Zhao

Capture Global Feature Statistics for One-Shot Federated Learning

Zenghao Guan, Yucan Zhou, Xiaoyan Gu

How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning

Arthur Jacot, Seok Hoan Choi, Yuxiao Wen

ICLR 2025, arXiv:2407.05664

curse of dimensionality, function composition learning, generalization bounds, covering number argument, +4 more

6 citations

#70

Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance

Ranthony A. Clark, Tom Needham, Thomas Weighill

Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations

Lorenzo Basile, Santiago Acevedo, Luca Bortolussi et al.

DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?

Victor Quetu, Enzo Tartaglione

AAAI 2024, arXiv:2303.01213

sparse double descent, model sparsity, neural network compression, generalization improvement, +3 more

6 citations

#73

Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations

Yejin Jeon, Yunsu Kim, Gary Geunbae Lee

AAAI 2024, arXiv:2401.02014

zero-shot tts, multi-speaker synthesis, speaker disentanglement, negation feature learning, +4 more

6 citations

#74

Diffusion Bridge AutoEncoders for Unsupervised Representation Learning

Yeongmin Kim, Kwanghyeon Lee, Minsang Park et al.

ICLR 2025, arXiv:2405.17111

diffusion models, unsupervised representation learning, information bottleneck, latent variable inference, +3 more

6 citations

#75

LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining

Huawen Shen, Gengluo Li, Jinwen Zhong et al.

Zeroth-Order Fine-Tuning of LLMs in Random Subspaces

Ziming Yu, Pan Zhou, Sike Wang et al.

DRL: Decomposed Representation Learning for Tabular Anomaly Detection

Hangting Ye, He Zhao, Wei Fan et al.

Efficient Multitask Dense Predictor via Binarization

Yuzhang Shang, Dan Xu, Gaowen Liu et al.

Combining Frame and GOP Embeddings for Neural Video Representation

Jens Eirik Saethre, Roberto Azevedo, Christopher Schroers

COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection

Jinqi Xiao, Shen Sang, Tiancheng Zhi et al.

An Intuitive Multi-Frequency Feature Representation for SO(3)-Equivariant Networks

Dongwon Son, Jaehyung Kim, Sanghyeon Son et al.

Hessian-Free Online Certified Unlearning

Xinbao Qiao, Meng Zhang, Ming Tang et al.

ICLR 2025, arXiv:2404.01712

machine unlearning, certified unlearning, online unlearning, hessian-free optimization, +4 more

5 citations

#83

Robust Feature Learning for Multi-Index Models in High Dimensions

Alireza Mousavi-Hosseini, Adel Javanmard, Murat A Erdogdu

Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space

Yufei Gu, Xiaoqing Zheng, Tomaso Aste

Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization

Vladimir Boza, Vladimir Macko

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

Towards a Theoretical Understanding of Why Local Search Works for Clustering with Fair-Center Representation

Zhen Zhang, Junfeng Yang, Limei Liu et al.

FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation

Tianyun Zhong, Chao Liang, Jianwen Jiang et al.

CVPR 2025, arXiv:2412.16915

diffusion models, audio-driven synthesis, talking avatar generation, model distillation, +4 more

5 citations

#89

DCT-CryptoNets: Scaling Private Inference in the Frequency Domain

Arjun Roy, Kaushik Roy

Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations

Chaofan Gan, Yuanpeng Tu, Xi Chen et al.

Optimal Spectral Transitions in High-Dimensional Multi-Index Models

Leonardo Defilippis, Yatin Dandi, Pierre Mergny et al.

Small Singular Values Matter: A Random Matrix Analysis of Transformer Models

Max Staats, Matthias Thamm, Bernd Rosenow

On the Joint Interaction of Models, Data, and Features

Yiding Jiang, Christina Baek, J Kolter

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels

Yujia Tong, Yuze Wang, Jingling Yuan et al.

Scaling Convex Neural Networks with Burer-Monteiro Factorization

Arda Sahiner, Tolga Ergen, Batu Ozturkler et al.

Disentangling Representations through Multi-task Learning

Pantelis Vafidis, Aman Bhargava, Antonio Rangel

Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation

Jie Xu, Na Zhao, Gang Niu et al.

Towards Learnable Anchor for Deep Multi-View Clustering

Bocheng Wang, Chusheng Zeng, Mulin Chen et al.

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

Zijian Li, Shunxing Fan, Yujia Zheng et al.

ICLR 2025

4 citations
