"contrastive learning" Papers

308 papers found • Page 6 of 7

Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning

Jihai Zhang, Xiang Lan, Xiaoye Qu et al.

ECCV 2024 • arXiv:2402.11816 • 5 citations

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Yuxiao Chen, Kai Li, Wentao Bao et al.

ECCV 2024 • arXiv:2409.16145 • 7 citations

Low-Rank Similarity Mining for Multimodal Dataset Distillation

Yue Xu, Zhilin Lin, Yusong Qiu et al.

ICML 2024 • arXiv:2406.03793 • 11 citations

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models

Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin et al.

ICML 2024 • arXiv:2402.01620 • 28 citations

MF-CLR: Multi-Frequency Contrastive Learning Representation for Time Series

Jufang Duan, Wei Zheng, Yangzhou Du et al.

ICML 2024

Mitigating the Impact of False Negative in Dense Retrieval with Contrastive Confidence Regularization

Shiqi Wang, Yeqin Zhang, Cam-Tu Nguyen

AAAI 2024 • paper • arXiv:2401.00165 • 4 citations

MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding

HaiTao Yu, Mofei Song

AAAI 2024 • paper • arXiv:2402.10002 • 18 citations

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024 • arXiv:2407.21654 • 11 citations

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation

Qiushi Zhu, Jie Zhang, Yu Gu et al.

AAAI 2024 • paper • arXiv:2401.03468 • 15 citations

Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model

Mingxin Li, Richong Zhang, Zhijie Nie et al.

AAAI 2024 • paper • arXiv:2309.06453 • 1 citation

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah

AAAI 2024 • paper • arXiv:2312.13008 • 12 citations

Non-parametric Representation Learning with Kernels

Hebaixu Wang, Meiqi Gong, Xiaoguang Mei et al.

AAAI 2024 • paper • arXiv:2309.02028 • 11 citations

Open-World Human-Object Interaction Detection via Multi-modal Prompts

Jie Yang, Bingliang Li, Ailing Zeng et al.

CVPR 2024 • arXiv:2406.07221 • 35 citations

PinNet: Pinpoint Instructive Information for Retrieval Augmented Code-to-Text Generation

Han Fu, Jian Tan, Pinhan Zhang et al.

ICML 2024

Poly-View Contrastive Learning

Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.

ICLR 2024 • arXiv:2403.05490 • 9 citations

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

Fahim Tajwar, Anikait Singh, Archit Sharma et al.

ICML 2024 • arXiv:2404.14367 • 179 citations

Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo

Stephen Zhao, Rob Brekelmans, Alireza Makhzani et al.

ICML 2024 • arXiv:2404.17546 • 56 citations

Referring Expression Counting

Siyang Dai, Jun Liu, Ngai-Man Cheung

CVPR 2024 • highlight • arXiv:2505.22850 • 3 citations

Region-centric Image-Language Pretraining for Open-Vocabulary Detection

Dahun Kim, Anelia Angelova, Weicheng Kuo

ECCV 2024 • arXiv:2310.00161 • 7 citations

Regularizing with Pseudo-Negatives for Continual Self-Supervised Learning

Sungmin Cha, Kyunghyun Cho, Taesup Moon

ICML 2024 • arXiv:2306.05101 • 5 citations

Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

Xiuquan Hou, Meiqin Liu, Senlin Zhang et al.

ECCV 2024 • arXiv:2407.11699 • 64 citations

Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

Bowen Gao, Minsi Ren, Yuyan Ni et al.

ICML 2024 • arXiv:2403.12987 • 9 citations

Revealing Vision-Language Integration in the Brain with Multimodal Networks

Vighnesh Subramaniam, Colin Conwell, Christopher Wang et al.

ICML 2024 • arXiv:2406.14481 • 18 citations

Root Cause Analysis in Microservice Using Neural Granger Causal Discovery

Cheng-Ming Lin, Ching Chang, Wei-Yao Wang et al.

AAAI 2024 • paper • arXiv:2402.01140 • 32 citations

SALSA: Semantically-Aware Latent Space Autoencoder

Kathryn Kirchoff, Travis Maxfield, Alexander Tropsha et al.

AAAI 2024 • paper • arXiv:2310.02744 • 3 citations

SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-Supervised Skeleton-Based Action Recognition

Cong Wu, Xiao-Jun Wu, Josef Kittler et al.

AAAI 2024 • paper • arXiv:2309.05834 • 26 citations

SCoRe: Submodular Combinatorial Representation Learning

Anay Majee, Suraj Kothawade, Krishnateja Killamsetty et al.

ICML 2024 • arXiv:2310.00165 • 5 citations

SECap: Speech Emotion Captioning with Large Language Model

Yaoxun Xu, Hangting Chen, Jianwei Yu et al.

AAAI 2024 • paper • arXiv:2312.10381 • 58 citations

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024 • arXiv:2403.04908 • 10 citations

Semantic-Aware Data Augmentation for Text-to-Image Synthesis

Zhaorui Tan, Xi Yang, Kaizhu Huang

AAAI 2024 • paper • arXiv:2312.07951 • 4 citations

SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals

Rahul Thapa, Bryan He, Magnus Ruud Kjaer et al.

ICML 2024 • arXiv:2405.17766 • 34 citations

Soft Contrastive Learning for Time Series

Seunghan Lee, Taeyoung Park, Kibok Lee

ICLR 2024 • oral • arXiv:2312.16424 • 52 citations

Spatial-Related Sensors Matters: 3D Human Motion Reconstruction Assisted with Textual Semantics

Xueyuan Yang, Chao Yao, Xiaojuan Ban

AAAI 2024 • paper • arXiv:2401.05412 • 4 citations

Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile

Seokjun Lee, Seung-Won Jung, Hyunseok Seo

AAAI 2024 • paper • arXiv:2403.05093 • 8 citations

Sterling: Synergistic Representation Learning on Bipartite Graphs

Baoyu Jing, Yuchen Yan, Kaize Ding et al.

AAAI 2024 • paper • arXiv:2302.05428 • 23 citations

Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning

Woo-Jin Ahn, Geun-Yeong Yang, Hyunduck Choi et al.

CVPR 2024 • arXiv:2403.06122 • 30 citations

SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining

Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon et al.

CVPR 2024 • arXiv:2404.01156 • 11 citations

Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

Da Luo, Yanglei Gan, Rui Hou et al.

AAAI 2024 • paper • arXiv:2312.12021 • 11 citations

Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint

Sixiang Chen, Tian Ye, Kai Zhang et al.

ECCV 2024 • arXiv:2409.15739 • 23 citations

Text2Loc: 3D Point Cloud Localization from Natural Language

Yan Xia, Letian Shi, Zifeng Ding et al.

CVPR 2024 • arXiv:2311.15977 • 56 citations

The Hard Positive Truth about Vision-Language Compositionality

Amita Kamath, Cheng-Yu Hsieh, Kai-Wei Chang et al.

ECCV 2024 • arXiv:2409.17958 • 16 citations

TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

Jiaxiang Dong, Haixu Wu, Yuxuan Wang et al.

ICML 2024 • oral • arXiv:2402.02475 • 21 citations

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training

Chaoya Jiang, Wei Ye, Haiyang Xu et al.

AAAI 2024 • paper • arXiv:2312.08846 • 6 citations

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

Qianrui Zhou, Hua Xu, Hao Li et al.

AAAI 2024 • paper • arXiv:2312.14667 • 35 citations

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

Souhail Hadgi, Lei Li, Maks Ovsjanikov

ECCV 2024 • arXiv:2403.17869

Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective

Yuxin Dong, Tieliang Gong, Hong Chen et al.

ICML 2024

Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA

Chengen Lai, Shengli Song, Shiqi Meng et al.

AAAI 2024 • paper • arXiv:2312.13594 • 10 citations

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Qing Jiang, Feng Li, Zhaoyang Zeng et al.

ECCV 2024 • arXiv:2403.14610 • 86 citations

Two-Stage Active Learning for Efficient Temporal Action Segmentation

Yuhao Su, Ehsan Elhamifar

ECCV 2024 • 6 citations

UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning

Shikun Feng, Yuyan Ni, Li et al.

ICML 2024 • arXiv:2405.10343 • 18 citations