Most Cited 2024 "response calibration" Papers

12,324 papers found • Page 14 of 62

Filters:Most Cited 2024 response calibration Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2601

Segment and Caption Anything

Xiaoke Huang, Jianfeng Wang, Yansong Tang et al.

CVPR 2024arXiv:2312.00869

citations

#2602

Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors

Yu Zhang, Songpengcheng Xia, Lei Chu et al.

CVPR 2024arXiv:2312.02196

citations

#2603

Second-Order Uncertainty Quantification: A Distance-Based Approach

Yusuf Sale, Viktor Bengs, Michele Caprio et al.

ICML 2024spotlightarXiv:2312.00995

citations

#2604

ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis

Muhammad Hamza Mughal, Rishabh Dabral, Ikhsanul Habibie et al.

CVPR 2024arXiv:2403.17936

citations

#2605

SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training

Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.

ICLR 2024spotlightarXiv:2310.02227

citations

#2606

Language-driven Grasp Detection

An Dinh Vuong, Minh Nhat VU, Baoru Huang et al.

CVPR 2024arXiv:2406.09489

citations

#2607

RoHM: Robust Human Motion Reconstruction via Diffusion

Siwei Zhang, Bharat Lal Bhatnagar, Yuanlu Xu et al.

CVPR 2024arXiv:2401.08570

citations

#2608

Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection

Jiawei Liang, Siyuan Liang, Aishan Liu et al.

ICLR 2024spotlightarXiv:2402.11473

citations

#2609

Open-Vocabulary Semantic Segmentation with Image Embedding Balancing

Xiangheng Shan, Dongyue Wu, Guilin Zhu et al.

CVPR 2024arXiv:2406.09829

citations

#2610

Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

Jinxing Zhou, Dan Guo, Yuxin Mao et al.

ECCV 2024arXiv:2407.08126

citations

#2611

AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation

Xinzhou Wang, Yikai Wang, junliang ye et al.

ECCV 2024arXiv:2312.03795

citations

#2612

Urban Region Embedding via Multi-View Contrastive Prediction

Zechen Li, Weiming Huang, Kai Zhao et al.

AAAI 2024paperarXiv:2312.09681

citations

#2613

In value-based deep reinforcement learning, a pruned network is a good network

Johan Obando Ceron, Aaron Courville, Pablo Samuel Castro

ICML 2024arXiv:2402.12479

citations

#2614

Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

Gauthier Guinet, Behrooz Tehrani, Anoop Deoras et al.

ICML 2024arXiv:2405.13622

citations

#2615

G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model

Pan Xie, Qipeng Zhang, Peng Taiying et al.

AAAI 2024paperarXiv:2208.09141

citations

#2616

CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Sajid Javed, Arif Mahmood, IYYAKUTTI IYAPPAN GANAPATHI et al.

CVPR 2024arXiv:2406.05205

citations

#2617

Random Feature Amplification: Feature Learning and Generalization in Neural Networks

Spencer Frei, Niladri Chatterji, Peter L. Bartlett

ICLR 2024arXiv:2202.07626

citations

#2618

ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More

Jiazhou Zhou, Xu Zheng, Yuanhuiyi Lyu et al.

CVPR 2024highlightarXiv:2403.12534

citations

#2619

Logical Languages Accepted by Transformer Encoders with Hard Attention

Pablo Barcelo, Alexander Kozachinskiy, Anthony W. Lin et al.

ICLR 2024arXiv:2310.03817

citations

#2620

G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis

Yufei Ye, Abhinav Gupta, Kris Kitani et al.

CVPR 2024arXiv:2404.12383

citations

#2621

Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures

Yannick Kirchhoff, Maximilian Rokuss, Saikat Roy et al.

ECCV 2024arXiv:2404.03010

citations

#2622

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

Andong Wang, Bo Wu, Sunli Chen et al.

CVPR 2024arXiv:2405.09713

citations

#2623

Beyond TreeSHAP: Efficient Computation of Any-Order Shapley Interactions for Tree Ensembles

Maximilian Muschalik, Fabian Fumagalli, Barbara Hammer et al.

AAAI 2024paperarXiv:2401.12069

citations

#2624

LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits

Chen-Chia Chang, Yikang Shen, Shaoze Fan et al.

ICML 2024arXiv:2407.18269

citations

#2625

CABINET: Content Relevance-based Noise Reduction for Table Question Answering

Sohan Patnaik, Heril Changwal, Milan Aggarwal et al.

ICLR 2024spotlightarXiv:2402.01155

citations

#2626

MAS: Multi-view Ancestral Sampling for 3D Motion Generation Using 2D Diffusion

Roy Kapon, Guy Tevet, Daniel Cohen-Or et al.

CVPR 2024arXiv:2310.14729

citations

#2627

On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

Xiaobao Wu, Fengjun Pan, Thong Nguyen et al.

AAAI 2024paperarXiv:2401.14113

citations

#2628

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Zicong Fan, Takehiko Ohkawa, Linlin Yang et al.

ECCV 2024arXiv:2403.16428

citations

#2629

Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

Ziqing Fan, Shengchao Hu, Jiangchao Yao et al.

ICML 2024spotlightarXiv:2405.18890

citations

#2630

Provably Powerful Graph Neural Networks for Directed Multigraphs

Beni Egressy, Luc von Niederhäusern, Jovan Blanuša et al.

AAAI 2024paperarXiv:2306.11586

citations

#2631

The Hidden Language of Diffusion Models

Hila Chefer, Oran Lang, Mor Geva et al.

ICLR 2024arXiv:2306.00966

citations

#2632

Training Unbiased Diffusion Models From Biased Dataset

Yeongmin Kim, Byeonghu Na, Minsang Park et al.

ICLR 2024arXiv:2403.01189

citations

#2633

Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

Wenxiao Deng, Wenbin Li, Tianyu Ding et al.

CVPR 2024arXiv:2404.00563

citations

#2634

Privacy-Preserving Face Recognition Using Trainable Feature Subtraction

Yuxi Mi, Zhizhou Zhong, Yuge Huang et al.

CVPR 2024arXiv:2403.12457

citations

#2635

HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces

Haithem Turki, Vasu Agrawal, Samuel Rota Bulò et al.

CVPR 2024highlightarXiv:2312.03160

citations

#2636

CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables

Jiecheng Lu, Xu Han, Sun et al.

ICML 2024oralarXiv:2403.01673

citations

#2637

Resurrecting Old Classes with New Data for Exemplar-Free Continual Learning

Dipam Goswami, Albin Soutif, Yuyang Liu et al.

CVPR 2024arXiv:2405.19074

citations

#2638

TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection

Tianxiang Chen, Zhentao Tan, Qi Chu et al.

AAAI 2024paperarXiv:2402.02046

citations

#2639

Don't Play Favorites: Minority Guidance for Diffusion Models

Soobin Um, Suhyeon Lee, Jong Chul YE

ICLR 2024arXiv:2301.12334

citations

#2640

Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making

Vivek Myers, Chongyi Zheng, Anca Dragan et al.

ICML 2024oralarXiv:2406.17098

citations

#2641

Neural Monge Map estimation and its applications

Shaojun Ma, Yongxin Chen, Hao-Min Zhou et al.

ICLR 2024arXiv:2106.03812

citations

#2642

Multi-Prompts Learning with Cross-Modal Alignment for Attribute-Based Person Re-identification

Yajing Zhai, Yawen Zeng, Zhiyong Huang et al.

AAAI 2024paperarXiv:2312.16797

citations

#2643

Frequency-Adaptive Pan-Sharpening with Mixture of Experts

Xuanhua He, Keyu Yan, Rui Li et al.

AAAI 2024paperarXiv:2401.02151

citations

#2644

Rethinking Generalizable Face Anti-spoofing via Hierarchical Prototype-guided Distribution Refinement in Hyperbolic Space

Chengyang Hu, Ke-Yue Zhang, Taiping Yao et al.

CVPR 2024highlight

citations

#2645

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

Xiaoyi Bao, Jie Qin, Siyang Sun et al.

AAAI 2024paperarXiv:2312.06474

citations

#2646

Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

Atli Kosson, Bettina Messmer, Martin Jaggi

ICML 2024arXiv:2305.17212

citations

#2647

Relaxed Contrastive Learning for Federated Learning

Seonguk Seo, Jinkyu Kim, Geeho Kim et al.

CVPR 2024arXiv:2401.04928

citations

#2648

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection

Sifan Zhou, Liang Li, Xinyu Zhang et al.

ICLR 2024arXiv:2401.15865

citations

#2649

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

CVPR 2024arXiv:2403.17000

citations

#2650

AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents

Jieming Cui, Tengyu Liu, Nian Liu et al.

CVPR 2024arXiv:2403.12835

citations

#2651

Memorization Capacity of Multi-Head Attention in Transformers

Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis

ICLR 2024spotlightarXiv:2306.02010

citations

#2652

Graph Invariant Learning with Subgraph Co-mixup for Out-of-Distribution Generalization

Tianrui Jia, Haoyang Li, Cheng Yang et al.

AAAI 2024paperarXiv:2312.10988

citations

#2653

Graph-Aware Contrasting for Multivariate Time-Series Classification

Yucheng Wang, Yuecong Xu, Jianfei Yang et al.

AAAI 2024paperarXiv:2309.05202

citations

#2654

Explaining Generalization Power of a DNN Using Interactive Concepts

Huilin Zhou, Hao Zhang, Huiqi Deng et al.

AAAI 2024paperarXiv:2302.13091

citations

#2655

GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation

Chenxin Li, Xinyu Liu, Cheng Wang et al.

ECCV 2024arXiv:2407.05540

citations

#2656

AST-T5: Structure-Aware Pretraining for Code Generation and Understanding

Linyuan Gong, Mostafa Elhoushi, Alvin Cheung

ICML 2024arXiv:2401.03003

citations

#2657

PeFLL: Personalized Federated Learning by Learning to Learn

Jonathan Scott, Hossein Zakerinia, Christoph Lampert

ICLR 2024arXiv:2306.05515

citations

#2658

A Simple Recipe for Language-guided Domain Generalized Segmentation

Mohammad Fahes, TUAN-HUNG VU, Andrei Bursuc et al.

CVPR 2024arXiv:2311.17922

citations

#2659

Towards Efficient Exact Optimization of Language Model Alignment

Haozhe Ji, Cheng Lu, Yilin Niu et al.

ICML 2024arXiv:2402.00856

citations

#2660

SAM-guided Graph Cut for 3D Instance Segmentation

Haoyu Guo, He Zhu, Sida Peng et al.

ECCV 2024arXiv:2312.08372

citations

#2661

A Unified Recipe for Deriving (Time-Uniform) PAC-Bayes Bounds

Ben Chugg, Hongjian Wang, Aaditya Ramdas

ICML 2024arXiv:2302.03421

citations

#2662

Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation

Xiaoyang Wang, Huihui Bai, Limin Yu et al.

CVPR 2024arXiv:2403.06462

citations

#2663

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Bowen Zhang, Yiji Cheng, Chunyu Wang et al.

ECCV 2024arXiv:2407.06938

citations

#2664

Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network

ye junyan, Zhutao Lv, Li Weijia et al.

ECCV 2024arXiv:2408.05475

citations

#2665

Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis

Yuchao Gu, Xintao Wang, Yixiao Ge et al.

CVPR 2024arXiv:2212.03185

citations

#2666

Interpretable Diffusion via Information Decomposition

Xianghao Kong, Ollie Liu, Han Li et al.

ICLR 2024arXiv:2310.07972

citations

#2667

SpecNeRF: Gaussian Directional Encoding for Specular Reflections

Li Ma, Vasu Agrawal, Haithem Turki et al.

CVPR 2024highlightarXiv:2312.13102

citations

#2668

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024arXiv:2402.06118

citations

#2669

Exploring Diffusion Time-steps for Unsupervised Representation Learning

Zhongqi Yue, Zhongqi Yue, Jiankun Wang et al.

ICLR 2024arXiv:2401.11430

citations

#2670

Auto-Encoding Morph-Tokens for Multimodal LLM

Kaihang Pan, Siliang Tang, Juncheng Li et al.

ICML 2024spotlightarXiv:2405.01926

citations

#2671

AZ-NAS: Assembling Zero-Cost Proxies for Network Architecture Search

Junghyup Lee, Bumsub Ham

CVPR 2024arXiv:2403.19232

citations

#2672

Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation

Zhiwei Yang, Kexue Fu, Minghong Duan et al.

CVPR 2024arXiv:2402.18467

citations

#2673

Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning

XINYUAN GAO, Songlin Dong, Yuhang He et al.

ECCV 2024arXiv:2407.10281

citations

#2674

Inversion-Free Image Editing with Language-Guided Diffusion Models

Sihan Xu, Yidong Huang, Jiayi Pan et al.

CVPR 2024

citations

#2675

Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM

Pingping Zhang, Tianyu Yan, Yang Liu et al.

CVPR 2024highlightarXiv:2404.04996

citations

#2676

Domain-Controlled Prompt Learning

Qinglong Cao, Zhengqin Xu, Yuntian Chen et al.

AAAI 2024paperarXiv:2310.07730

citations

#2677

Case-Based or Rule-Based: How Do Transformers Do the Math?

Yi Hu, Xiaojuan Tang, Haotong Yang et al.

ICML 2024arXiv:2402.17709

citations

#2678

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Networks

Yongqi Ding, Lin Zuo, Mengmeng Jing et al.

AAAI 2024paperarXiv:2401.01912

citations

#2679

GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers

Takeru Miyato, Bernhard Jaeger, Max Welling et al.

ICLR 2024arXiv:2310.10375

citations

#2680

CPPO: Continual Learning for Reinforcement Learning with Human Feedback

Han Zhang, Yu Lei, Lin Gui et al.

ICLR 2024

citations

#2681

Towards Generalizable Multi-Object Tracking

Zheng Qin, Le Wang, Sanping Zhou et al.

CVPR 2024arXiv:2406.00429

citations

#2682

Single Domain Generalization for Crowd Counting

Zhuoxuan Peng, S.-H. Gary Chan

CVPR 2024arXiv:2403.09124

citations

#2683

Sieve: Multimodal Dataset Pruning using Image Captioning Models

Anas Mahmoud, Mostafa Elhoushi, Amro Abbas et al.

CVPR 2024arXiv:2310.02110

citations

#2684

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

Fan Zhang, Shaodi You, Yu Li et al.

CVPR 2024highlightarXiv:2312.12471

citations

#2685

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer

Han Fang, Zhihao Song, Paul Weng et al.

ICML 2024arXiv:2402.02317

citations

#2686

Towards Transferable Targeted 3D Adversarial Attack in the Physical World

Yao Huang, Yinpeng Dong, Shouwei Ruan et al.

CVPR 2024arXiv:2312.09558

citations

#2687

Light and Optimal Schrödinger Bridge Matching

Nikita Gushchin, Sergei Kholkin, Evgeny Burnaev et al.

ICML 2024arXiv:2402.03207

citations

#2688

Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting

Rong Dai, Yonggang Zhang, Ang Li et al.

ICLR 2024arXiv:2402.15070

citations

#2689

R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding

Ye Liu, Jixuan He, Wanhua Li et al.

ECCV 2024arXiv:2404.00801

citations

#2690

Position: Measure Dataset Diversity, Don't Just Claim It

Dora Zhao, Jerone Andrews, Orestis Papakyriakopoulos et al.

ICML 2024arXiv:2407.08188

citations

#2691

ViT-Lens: Towards Omni-modal Representations

Stan Weixian Lei, Yixiao Ge, Kun Yi et al.

CVPR 2024arXiv:2311.16081

citations

#2692

Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation

Debaditya Shome, Pritam Sarkar, Ali Etemad

AAAI 2024paperarXiv:2308.13568

citations

#2693

HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D

Sangmin Woo, byeongjun park, Hyojun Go et al.

CVPR 2024arXiv:2312.15980

citations

#2694

GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs

Mustafa Munir, William Avery, Md Mostafijur Rahman et al.

CVPR 2024arXiv:2405.06849

citations

#2695

TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Zirui Wang, Zhizhou Sha, Zheng Ding et al.

CVPR 2024arXiv:2312.03626

citations

#2696

A Non-parametric Graph Clustering Framework for Multi-View Data

Shengju Yu, Siwei Wang, Zhibin Dong et al.

AAAI 2024paper

citations

#2697

Adaptive Hardness Negative Sampling for Collaborative Filtering

Riwei Lai, Rui Chen, Qilong Han et al.

AAAI 2024paperarXiv:2401.05191

citations

#2698

Split-and-Denoise: Protect large language model inference with local differential privacy

Peihua Mai, Ran Yan, Zhe Huang et al.

ICML 2024arXiv:2310.09130

citations

#2699

Transductive Zero-Shot and Few-Shot CLIP

Ségolène Martin, Yunshi HUANG, Fereshteh Shakeri et al.

CVPR 2024highlightarXiv:2405.18437

citations

#2700

RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering Assisted Distillation

Haiming Zhang, Xu Yan, Dongfeng Bai et al.

AAAI 2024paperarXiv:2312.11829

citations

#2701

QAGait: Revisit Gait Recognition from a Quality Perspective

Zengbin Wang, Saihui Hou, Man Zhang et al.

AAAI 2024paperarXiv:2401.13531

citations

#2702

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024arXiv:2405.02774

citations

#2703

Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization

Yanan Wu, Zhixiang Chi, Yang Wang et al.

AAAI 2024paperarXiv:2312.10165

citations

#2704

Rethinking Graph Masked Autoencoders through Alignment and Uniformity

Liang Wang, Xiang Tao, Qiang Liu et al.

AAAI 2024paperarXiv:2402.07225

citations

#2705

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

Lorenzo Baraldi, Federico Cocchi, Marcella Cornia et al.

ECCV 2024arXiv:2407.20337

citations

#2706

A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution

Zhixiong Yang, Jingyuan Xia, Shengxi Li et al.

CVPR 2024arXiv:2404.15620

citations

#2707

VOODOO 3D: Volumetric Portrait Disentanglement For One-Shot 3D Head Reenactment

Phong Tran, Egor Zakharov, Long Nhat Ho et al.

CVPR 2024arXiv:2312.04651

citations

#2708

Lossy Image Compression with Foundation Diffusion Models

Lucas Relic, Roberto Azevedo, Markus Gross et al.

ECCV 2024arXiv:2404.08580

citations

#2709

CFR-ICL: Cascade-Forward Refinement with Iterative Click Loss for Interactive Image Segmentation

Shoukun Sun, Min Xian, Fei Xu et al.

AAAI 2024paperarXiv:2303.05620

citations

#2710

Distilling Autoregressive Models to Obtain High-Performance Non-autoregressive Solvers for Vehicle Routing Problems with Faster Inference Speed

Yubin Xiao, Di Wang, Boyang Li et al.

AAAI 2024paperarXiv:2312.12469

citations

#2711

CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli et al.

CVPR 2024arXiv:2403.19278

citations

#2712

SnAG: Scalable and Accurate Video Grounding

Fangzhou Mu, Sicheng Mo, Yin Li

CVPR 2024arXiv:2404.02257

citations

#2713

Root Cause Analysis in Microservice Using Neural Granger Causal Discovery

Cheng-Ming Lin, Ching Chang, Wei-Yao Wang et al.

AAAI 2024paperarXiv:2402.01140

citations

#2714

Audio Generation with Multiple Conditional Diffusion Model

Zhifang Guo, Jianguo Mao, Tao Rui et al.

AAAI 2024paperarXiv:2308.11940

citations

#2715

Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention

Saebom Leem, Hyunseok Seo

AAAI 2024paperarXiv:2402.04563

citations

#2716

CausalLM is not optimal for in-context learning

Nan Ding, Tomer Levinboim, Jialin Wu et al.

ICLR 2024arXiv:2308.06912

citations

#2717

Material Palette: Extraction of Materials from a Single Image

Ivan Lopes, Fabio Pizzati, Raoul de Charette

CVPR 2024arXiv:2311.17060

citations

#2718

OmniViD: A Generative Framework for Universal Video Understanding

Junke Wang, Dongdong Chen, Chong Luo et al.

CVPR 2024arXiv:2403.17935

citations

#2719

Plug and Play Active Learning for Object Detection

Chenhongyi Yang, Lichao Huang, Elliot Crowley

CVPR 2024arXiv:2211.11612

citations

#2720

LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Jaehyeong Jeon et al.

CVPR 2024arXiv:2310.10404

citations

#2721

RegionDrag: Fast Region-Based Image Editing with Diffusion Models

Jingyi Lu, Xinghui Li, Kai Han

ECCV 2024arXiv:2407.18247

citations

#2722

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation

Xiao Lin, Wenfei Yang, Yuan Gao et al.

CVPR 2024arXiv:2403.19527

citations

#2723

Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation

Renshuai Liu, Bowen Ma, Wei Zhang et al.

CVPR 2024highlightarXiv:2401.01207

citations

#2724

Color Shift Estimation-and-Correction for Image Enhancement

Yiyu Li, Ke Xu, Gerhard Hancke et al.

CVPR 2024arXiv:2405.17725

citations

#2725

Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape

Rundi Wu, Ruoshi Liu, Carl Vondrick et al.

ICLR 2024arXiv:2305.15399

citations

#2726

Exact Diffusion Inversion via Bidirectional Integration Approximation

Guoqiang Zhang, j.p. lewis, W. Bastiaan Kleijn

ECCV 2024

citations

#2727

Physical Property Understanding from Language-Embedded Feature Fields

Albert J. Zhai, Yuan Shen, Emily Y. Chen et al.

CVPR 2024arXiv:2404.04242

citations

#2728

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

Ruihai Wu, Haoran Lu, Yiyan Wang et al.

CVPR 2024arXiv:2405.06903

citations

#2729

NECO: NEural Collapse Based Out-of-distribution detection

Mouïn Ben Ammar, Nacim Belkhir, Sebastian Popescu et al.

ICLR 2024arXiv:2310.06823

citations

#2730

Modular Blind Video Quality Assessment

Wen Wen, Mu Li, Yabin ZHANG et al.

CVPR 2024arXiv:2402.19276

citations

#2731

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Siyuan Qi, Shuo Chen, Yexin Li et al.

ICLR 2024spotlightarXiv:2401.10568

citations

#2732

Learning to Transform Dynamically for Better Adversarial Transferability

Rongyi Zhu, Zeliang Zhang, Susan Liang et al.

CVPR 2024arXiv:2405.14077

citations

#2733

Learning Generalized Medical Image Segmentation from Decoupled Feature Queries

1207 Qi Bi, Jingjun Yi, Hao Zheng et al.

AAAI 2024paper

citations

#2734

Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning

Chenyu Zhang, Han Wang, Aritra Mitra et al.

ICLR 2024arXiv:2401.15273

citations

#2735

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024arXiv:2403.06351

citations

#2736

InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists

Yulu Gan, Sung Woo Park, Alexander Schubert et al.

ICLR 2024arXiv:2310.00390

citations

#2737

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

Jiaming Li, Jiacheng Zhang, Jichang Li et al.

CVPR 2024arXiv:2406.00510

citations

#2738

An Economic Framework for 6-DoF Grasp Detection

Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang et al.

ECCV 2024arXiv:2407.08366

citations

#2739

Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion

Otto Seiskari, Jerry Ylilammi, Valtteri Kaatrasalo et al.

ECCV 2024arXiv:2403.13327

citations

#2740

NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging

Takahiro Shirakawa, Seiichi Uchida

CVPR 2024arXiv:2403.03485

citations

#2741

Diffusion-based Blind Text Image Super-Resolution

Yuzhe Zhang, jiawei zhang, Hao Li et al.

CVPR 2024arXiv:2312.08886

citations

#2742

CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?

Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner et al.

ICLR 2024arXiv:2403.04547

citations

#2743

Can Biases in ImageNet Models Explain Generalization?

Paul Gavrikov, Janis Keuper

CVPR 2024arXiv:2404.01509

citations

#2744

AV-RIR: Audio-Visual Room Impulse Response Estimation

Anton Ratnarajah, Sreyan Ghosh, Sonal Kumar et al.

CVPR 2024arXiv:2312.00834

citations

#2745

Training Like a Medical Resident: Context-Prior Learning Toward Universal Medical Image Segmentation

Yunhe Gao

CVPR 2024arXiv:2306.02416

citations

#2746

Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts

Jiayi Chen, Benteng Ma, Hengfei Cui et al.

CVPR 2024arXiv:2312.02567

citations

#2747

Denoising Vision Transformers

Jiawei Yang, Katie Luo, Jiefeng Li et al.

ECCV 2024arXiv:2401.02957

citations

#2748

MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views

Wangze Xu, Huachen Gao, Shihe Shen et al.

ECCV 2024arXiv:2409.14316

citations

#2749

Representation Surgery: Theory and Practice of Affine Steering

Shashwat Singh, Shauli Ravfogel, Jonathan Herzig et al.

ICML 2024arXiv:2402.09631

citations

#2750

Initializing Models with Larger Ones

Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.

ICLR 2024spotlightarXiv:2311.18823

citations

#2751

Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation

Yuan Yuan, Chenyang Shao, Jingtao Ding et al.

ICLR 2024oralarXiv:2402.11922

citations

#2752

Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps

Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

CVPR 2024arXiv:2312.13216

citations

#2753

GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

Xueyi Liu, Li Yi

ICLR 2024arXiv:2402.14810

citations

#2754

Domain-Agnostic Molecular Generation with Chemical Feedback

Yin Fang, Ningyu Zhang, Zhuo Chen et al.

ICLR 2024arXiv:2301.11259

citations

#2755

Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

Jiahao Nie, Yun Xing, Gongjie Zhang et al.

CVPR 2024arXiv:2401.08407

citations

#2756

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

Minghan LI, Shuai Li, Xindong Zhang et al.

CVPR 2024arXiv:2402.18115

citations

#2757

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models

Yufei Zhan, Yousong Zhu, Zhiyang Chen et al.

ECCV 2024arXiv:2311.14552

citations

#2758

G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks

Anchun Gui, Jinqiang Ye, Han Xiao

AAAI 2024paperarXiv:2305.10329

citations

#2759

SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation

Qianxu Wang, Haotong Zhang, Congyue Deng et al.

ICLR 2024arXiv:2310.16838

citations

#2760

A Statistical Theory of Regularization-Based Continual Learning

Xuyang Zhao, Huiyuan Wang, Weiran Huang et al.

ICML 2024arXiv:2406.06213

citations

#2761

Localization Is All You Evaluate: Data Leakage in Online Mapping Datasets and How to Fix It

Adam Lilja, Junsheng Fu, Erik Stenborg et al.

CVPR 2024arXiv:2312.06420

citations

#2762

LILO: Learning Interpretable Libraries by Compressing and Documenting Code

Gabriel Grand, Lio Wong, Maddy Bowers et al.

ICLR 2024arXiv:2310.19791

citations

#2763

Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity

Santiago Pascual, Chunghsin YEH, Ioannis Tsiamas et al.

ECCV 2024arXiv:2407.10387

citations

#2764

APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation

Weizhao He, Yang Zhang, Wei Zhuo et al.

CVPR 2024arXiv:2406.08372

citations

#2765

Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning

xin zhang, Jiawei Du, Weiying Xie et al.

CVPR 2024arXiv:2311.13613

citations

#2766

InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion

Jihyun Lee, Shunsuke Saito, Giljoo Nam et al.

CVPR 2024arXiv:2403.17422

citations

#2767

Unified Language-driven Zero-shot Domain Adaptation

Senqiao Yang, Zhuotao Tian, Li Jiang et al.

CVPR 2024arXiv:2404.07155

citations

#2768

PTQ4SAM: Post-Training Quantization for Segment Anything

Chengtao Lv, Hong Chen, Jinyang Guo et al.

CVPR 2024arXiv:2405.03144

citations

#2769

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer

Wenqiao Zhang, Zheqi Lv

CVPR 2024arXiv:2311.12905

citations

#2770

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024arXiv:2311.10988

citations

#2771

Q-value Regularized Transformer for Offline Reinforcement Learning

Shengchao Hu, Ziqing Fan, Chaoqin Huang et al.

ICML 2024arXiv:2405.17098

citations

#2772

Active Statistical Inference

Tijana Zrnic, Emmanuel J Candes

ICML 2024arXiv:2403.03208

citations

#2773

SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases

Yang Liu, Jiashun Cheng, Haihong Zhao et al.

ICLR 2024spotlightarXiv:2308.13212

citations

#2774

Class-Imbalanced Graph Learning without Class Rebalancing

Zhining Liu, Ruizhong Qiu, Zhichen Zeng et al.

ICML 2024arXiv:2308.14181

citations

#2775

Fair and Efficient Contribution Valuation for Vertical Federated Learning

Zhenan Fan, Huang Fang, Xinglu Wang et al.

ICLR 2024arXiv:2201.02658

citations

#2776

Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling

Shentong Mo, Pedro Morgado

CVPR 2024arXiv:2312.01017

citations

#2777

Unifying Image Processing as Visual Prompting Question Answering

Yihao Liu, Xiangyu Chen, Xianzheng Ma et al.

ICML 2024arXiv:2310.10513

citations

#2778

Conformal Prediction Sets Improve Human Decision Making

Jesse Cresswell, yi sui, Bhargava Kumar et al.

ICML 2024arXiv:2401.13744

citations

#2779

Disentangled 3D Scene Generation with Layout Learning

Dave Epstein, Ben Poole, Ben Mildenhall et al.

ICML 2024arXiv:2402.16936

citations

#2780

Contextrast: Contextual Contrastive Learning for Semantic Segmentation

Changki Sung, Wanhee Kim, Jungho An et al.

CVPR 2024arXiv:2404.10633

citations

#2781

Transformer Fusion with Optimal Transport

Moritz Imfeld, Jacopo Graldi, Marco Giordano et al.

ICLR 2024arXiv:2310.05719

citations

#2782

Attention Calibration for Disentangled Text-to-Image Personalization

Yanbing Zhang, Mengping Yang, Qin Zhou et al.

CVPR 2024arXiv:2403.18551

citations

#2783

Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model

Runmin Dong, Shuai Yuan, Bin Luo et al.

CVPR 2024arXiv:2403.17460

citations

#2784

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

Fangru Lin, Emanuele La Malfa, Valentin Hofmann et al.

ICML 2024arXiv:2402.02805

citations

#2785

FlowIE: Efficient Image Enhancement via Rectified Flow

Yixuan Zhu, Wenliang Zhao, Ao Li et al.

CVPR 2024arXiv:2406.00508

citations

#2786

FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

Sepehr Dehdashtian, Lan Wang, Vishnu Boddeti

ICLR 2024arXiv:2403.15593

citations

#2787

Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

Luca Barsellotti, Roberto Amoroso, Marcella Cornia et al.

CVPR 2024arXiv:2404.06542

citations

#2788

EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

Shengjie Wang, Shaohuai Liu, Weirui Ye et al.

ICML 2024spotlightarXiv:2403.00564

citations

#2789

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

Subhadeep Koley, Ayan Kumar Bhunia, Deeptanshu Sekhri et al.

CVPR 2024arXiv:2403.07234

citations

#2790

Intraoperative 2D/3D Image Registration via Differentiable X-ray Rendering

Vivek Gopalakrishnan, Neel Dey, Polina Golland

CVPR 2024arXiv:2312.06358

citations

#2791

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

Jianzong Wu, Xiangtai Li, Chenyang Si et al.

CVPR 2024arXiv:2401.10226

citations

#2792

Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection

Xincheng Yao, Ruoqi Li, Zefeng Qian et al.

ECCV 2024arXiv:2403.13349

citations

#2793

PREGO: Online Mistake Detection in PRocedural EGOcentric Videos

Alessandro Flaborea, Guido M. D&amp, #x27 et al.

CVPR 2024arXiv:2404.01933

citations

#2794

Image Inpainting via Tractable Steering of Diffusion Models

Anji Liu, Mathias Niepert, Guy Van den Broeck

ICLR 2024arXiv:2401.03349

citations

#2795

View Selection for 3D Captioning via Diffusion Ranking

Tiange Luo, Justin Johnson, Honglak Lee

ECCV 2024arXiv:2404.07984

citations

#2796

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models

Mintong Kang, Nezihe Merve Gürel, Ning Yu et al.

ICML 2024arXiv:2402.03181

citations

#2797

Emergent Representations of Program Semantics in Language Models Trained on Programs

Charles Jin, Martin Rinard

ICML 2024arXiv:2305.11169

citations

#2798

Deep Contrastive Graph Learning with Clustering-Oriented Guidance

Mulin Chen, Bocheng Wang, Xuelong Li

AAAI 2024paperarXiv:2402.16012

citations

#2799

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024arXiv:2403.11835

citations

#2800

CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

Kangfu Mei, Mauricio Delbracio, Hossein Talebi et al.

CVPR 2024arXiv:2310.01407

citations

← Previous

1...12 13 14 15 16...62