All Papers

34,598 papers found • Page 557 of 692

InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists

Yulu Gan, Sung Woo Park, Alexander Schubert et al.

ICLR 2024 • arXiv:2310.00390

InstructDET: Diversifying Referring Object Detection with Generalized Instructions

Ronghao Dang, Jiangyan Feng, Haodong Zhang et al.

ICLR 2024 • arXiv:2310.05136

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Zigang Geng, Binxin Yang, Tiankai Hang et al.

CVPR 2024 • arXiv:2309.03895 • 162 citations

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

Ryota Tanaka, Taichi Iki, Kyosuke Nishida et al.

AAAI 2024 • paper • arXiv:2401.13313

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024 • arXiv:2403.05018

Instruct-Imagen: Image Generation with Multi-modal Instruction

Hexiang Hu, Kelvin C.K. Chan, Yu-Chuan Su et al.

CVPR 2024 • arXiv:2401.01952

Instruction Tuning for Secure Code Generation

Jingxuan He, Mark Vero, Gabriela Krasnopolska et al.

ICML 2024 • arXiv:2402.09497

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024 • arXiv:2408.05019

InstructIR: High-Quality Image Restoration Following Human Instructions

Marcos Conde, Gregor Geigle, Radu Timofte

ECCV 2024 • arXiv:2401.16468 • 118 citations

Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions

Taehyeon Kim, Joonkee Kim, Gihun Lee et al.

ICLR 2024 • spotlight • arXiv:2311.00233

InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image

Jianhui Li, Shilong Liu, Zidong Liu et al.

ICLR 2024 • arXiv:2311.02826

Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions

Weizhen He, Yiheng Deng, Shixiang Tang et al.

CVPR 2024 • arXiv:2306.07520

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

Boxin Wang, Wei Ping, Lawrence McAfee et al.

ICML 2024 • arXiv:2310.07713

InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior

Chenguo Lin, Yadong Mu

ICLR 2024 • spotlight • arXiv:2402.04717

InstructSpeech: Following Speech Editing Instructions via Large Language Models

Rongjie Huang, Ruofan Hu, Yongqi Wang et al.

ICML 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Hangjie Yuan, Shiwei Zhang, Xiang Wang et al.

CVPR 2024 • arXiv:2312.12490

InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Lichang Chen, Jiuhai Chen, Tom Goldstein et al.

ICML 2024 • arXiv:2306.03082

Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders

Debo Cheng, Ziqi Xu, Jiuyong Li et al.

AAAI 2024 • paper • arXiv:2312.07175

Integer Is Enough: When Vertical Federated Learning Meets Rounding

Pengyu Qiu, Yuwen Pu, Yongchao Liu et al.

AAAI 2024 • paper

Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection

Xinhao Luo, Man Yao, Yuhong Chou et al.

ECCV 2024 • arXiv:2407.20708

Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision

Weibo Gao, Qi Liu, Hao Wang et al.

AAAI 2024 • paper

Integrated Hardware Architecture and Device Placement Search

Irene Wang, Jakub Tarnawski, Amar Phanishayee et al.

ICML 2024 • spotlight • arXiv:2407.13143

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning

Tung Le, Khai Nguyen, Shanlin Sun et al.

CVPR 2024 • arXiv:2403.01781

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment

Xudong Li, Runze Hu, Jingyuan Zheng et al.

ICML 2024 • spotlight

Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization

Naiyu Yin, Hanjing Wang, Yue Yu et al.

ECCV 2024

Integrating Multimodal Data for Joint Generative Modeling of Complex Dynamics

Manuel Brenner, Florian Hess, Georgia Koppe et al.

ICML 2024 • oral • arXiv:2212.07892

Integrating Planning and Deep Reinforcement Learning via Automatic Induction of Task Substructures

Jung-Chun Liu, Chi-Hsien Chang, Shao-Hua Sun et al.

ICLR 2024

Integration of Global and Local Representations for Fine-grained Cross-modal Alignment

Seungwan Jin, Hoyoung Choi, Taehyung Noh et al.

ECCV 2024

Intelligent Calibration for Bias Reduction in Sentiment Corpora Annotation Process

Idan Toker, David Sarne, Jonathan Schler

AAAI 2024 • paper

Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

Chang Liu, Haoning Wu, Yujie Zhong et al.

CVPR 2024 • arXiv:2306.00973

Intelligent Switching for Reset-Free RL

Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.

ICLR 2024 • arXiv:2405.01684

Intensity-Robust Autofocus for Spike Camera

Changqing Su, Zhiyuan Ye, Yongsheng Xiao et al.

CVPR 2024

InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan et al.

CVPR 2024 • arXiv:2312.05849

Interacting Diffusion Processes for Event Sequence Forecasting

Mai Zeng, Florence Regol, Mark Coates

ICML 2024 • oral • arXiv:2310.17800

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation

Zhilin Huang, Ling Yang, Xiangxin Zhou et al.

ICML 2024

Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition

Yisong Wang, Nan Xi, Jingjing Meng et al.

ECCV 2024

Interactive3D: Create What You Want by Interactive 3D Generation

Shaocong Dong, Lihe Ding, Zhanpeng Huang et al.

CVPR 2024 • arXiv:2404.16510

Interactive 3D Object Detection with Prompts

Ruifei Zhang, Xiangru Lin, Wei Zhang et al.

ECCV 2024

Interactive Continual Learning: Fast and Slow Thinking

Biqing Qi, Xinquan Chen, Junqi Gao et al.

CVPR 2024 • arXiv:2403.02628

Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning

Joseph Giovanelli, Alexander Tornede, Tanja Tornede et al.

AAAI 2024 • paper • arXiv:2309.03581

Interactive Visual Task Learning for Robots

Weiwei Gu, Anant Sah, N. Gopalan

AAAI 2024 • paper • arXiv:2312.13219

Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks

Lingzhuang Meng, Mingwen Shao, Yuanjian Qiao et al.

ECCV 2024

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

Sisi Dai, Wenhao Li, Haowen Sun et al.

ECCV 2024 • arXiv:2403.15612

InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion

Jihyun Lee, Shunsuke Saito, Giljoo Nam et al.

CVPR 2024 • arXiv:2403.17422

Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection

Yongwei Nie, Hao Huang, Chengjiang Long et al.

ECCV 2024 • arXiv:2401.13551

InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised Learning

Zhe Huang, Xiaowei Yu, Dajiang Zhu et al.

ICML 2024 • arXiv:2403.10658

Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning

Yun-Hin Chan, Rui Zhou, Running Zhao et al.

ICLR 2024 • spotlight • arXiv:2308.11464

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Yi Wang, Yinan He, Yizhuo Li et al.

ICLR 2024 • spotlight • arXiv:2307.06942 • 419 citations

InternVideo2: Scaling Foundation Models for Multimodal Video Understanding

Yi Wang, Kunchang Li, Xinhao Li et al.

ECCV 2024 • arXiv:2403.15377 • 236 citations

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Zhe Chen, Jiannan Wu, Wenhai Wang et al.

CVPR 2024 • arXiv:2312.14238 • 2295 citations
