Most Cited 2024 "large-scale graph dataset" Papers

12,324 papers found • Page 10 of 62

#1801

Adapting Short-Term Transformers for Action Detection in Untrimmed Videos

Min Yang, gaohuan, Ping Guo et al.

CVPR 2024 • poster • arXiv:2312.01897 • 17 citations
#1802

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Yichi Zhang, Yinpeng Dong, Siyuan Zhang et al.

CVPR 2024 • highlight • arXiv:2404.11207 • 17 citations
#1803

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Kai Huang, Hanyun Yin, Heng Huang et al.

ICLR 2024 • poster • arXiv:2309.13192 • 17 citations
#1804

Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding

Tatsunori Taniai, Ryo Igarashi, Yuta Suzuki et al.

ICLR 2024 • poster • arXiv:2403.11686 • 17 citations
#1805

DeiT-LT: Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

Harsh Rangwani, Pradipto Mondal, Mayank Mishra et al.

CVPR 2024 • poster • arXiv:2404.02900 • 17 citations
#1806

Understanding Video Transformers via Universal Concept Discovery

Matthew Kowal, Achal Dave, Rares Andrei Ambrus et al.

CVPR 2024 • highlight • arXiv:2401.10831 • 17 citations
#1807

Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation

Fahimeh Hosseini Noohdani, Parsa Hosseini, Aryan Yazdan Parast et al.

CVPR 2024 • poster • arXiv:2402.18919 • 17 citations
#1808

Keypoint Promptable Re-Identification

Vladimir Somers, Alexandre Alahi, Christophe De Vleeschouwer

ECCV 2024 • poster • arXiv:2407.18112 • 17 citations
#1809

Decomposing Semantic Shifts for Composed Image Retrieval

Xingyu Yang, Daqing Liu, Heng Zhang et al.

AAAI 2024 • paper • arXiv:2309.09531 • 17 citations
#1810

HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud

Wencan Cheng, Hao Tang, Luc Van Gool et al.

CVPR 2024 • highlight • arXiv:2404.03159 • 17 citations
#1811

UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

Jian Zou, Tianyu Huang, Guanglei Yang et al.

ECCV 2024 • poster • 17 citations
#1812

DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data

Qihao Liu, Yi Zhang, Song Bai et al.

CVPR 2024 • poster • arXiv:2406.04322 • 17 citations
#1813

CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data

Wei Fang, Yuxing Tang, Heng Guo et al.

CVPR 2024 • poster • arXiv:2404.04878 • 17 citations
#1814

InfMAE: A Foundation Model in The Infrared Modality

Fangcen Liu, Chenqiang Gao, Yaming Zhang et al.

ECCV 2024 • poster • arXiv:2402.00407 • 17 citations
#1815

Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal

Yeying Jin, Xin Li, Jiadong Wang et al.

ECCV 2024 • poster • arXiv:2407.16957 • 17 citations
#1816

Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation

Bolin Lai, Fiona Ryan, Wenqi Jia et al.

ECCV 2024 • poster • arXiv:2305.03907 • 17 citations
#1817

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images

Xihe Yang, Xingyu Chen, Daiheng Gao et al.

CVPR 2024 • poster • arXiv:2311.15672 • 17 citations
#1818

Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision

Hao Dong, Eleni Chatzi, Olga Fink

ECCV 2024 • poster • arXiv:2407.01518 • 17 citations
#1819

Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation

Tao Chen, Xiruo Jiang, Gensheng Pei et al.

ECCV 2024 • poster • arXiv:2407.02768 • 17 citations
#1820

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units

Zeyu Liu, Gourav Datta, Anni Li et al.

ICLR 2024 • poster • arXiv:2402.04882 • 17 citations
#1821

One-stage Prompt-based Continual Learning

Youngeun Kim, Yuhang Li, Priyadarshini Panda

ECCV 2024 • poster • arXiv:2402.16189 • 17 citations
#1822

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

Junyan Wang, Zhenhong Sun, Stewart Tan et al.

CVPR 2024 • poster • arXiv:2403.05239 • 17 citations
#1823

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Seunggeun Chi, Hyung-gun Chi, Hengbo Ma et al.

ECCV 2024 • poster • arXiv:2407.14502 • 17 citations
#1824

Unsupervised Layer-Wise Score Aggregation for Textual OOD Detection

Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Camara Gomes et al.

AAAI 2024 • paper • arXiv:2302.09852 • 17 citations
#1825

Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

Tien Toan Nguyen, Minh Nhat Vu, Baoru Huang et al.

ECCV 2024 • poster • arXiv:2407.13842 • 17 citations
#1826

MESA: Matching Everything by Segmenting Anything

Yesheng Zhang, Xu Zhao

CVPR 2024 • poster • arXiv:2401.16741 • 17 citations
#1827

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Thomas Hummel, Shyamgopal Karthik, Mariana-Iuliana Georgescu et al.

ECCV 2024 • poster • arXiv:2407.16658 • 17 citations
#1828

Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging

Zongliang Wu, Ruiying Lu, Ying Fu et al.

ECCV 2024 • poster • arXiv:2311.14280 • 17 citations
#1829

Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset

Yiming Li, Zhiheng Li, Nuo Chen et al.

CVPR 2024 • poster • arXiv:2406.09383 • 17 citations
#1830

Revisiting Adversarial Training Under Long-Tailed Distributions

Xinli Yue, Ningping Mou, Qian Wang et al.

CVPR 2024 • poster • arXiv:2403.10073 • 17 citations
#1831

Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models

Shaofei Shen, Chenhao Zhang, Yawen Zhao et al.

ICLR 2024 • poster • arXiv:2404.00506 • 17 citations
#1832

Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation

Friedhelm Hamann, Ziyun Wang, Ioannis Asmanis et al.

ECCV 2024 • poster • arXiv:2407.10802 • 17 citations
#1833

Weakly Supervised Semantic Segmentation for Driving Scenes

Dongseob Kim, Seungho Lee, Junsuk Choe et al.

AAAI 2024 • paper • arXiv:2312.13646 • 17 citations
#1834

CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning

Junghun Oh, Sungyong Baik, Kyoung Mu Lee

ECCV 2024 • poster • arXiv:2410.05627 • 17 citations
#1835

Diffusion Model is a Good Pose Estimator from 3D RF-Vision

Junqiao Fan, Jianfei Yang, Yuecong Xu et al.

ECCV 2024 • poster • arXiv:2403.16198 • 17 citations
#1836

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Xuelu Feng, Dongdong Chen, Junsong Yuan et al.

ECCV 2024 • poster • arXiv:2403.12042 • 17 citations
#1837

SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration

Kezheng Xiong, Maoji Zheng, Qingshan Xu et al.

AAAI 2024 • paper • arXiv:2312.08664 • 17 citations
#1838

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation

Yuchen Su, Zhineng Chen, Zhiwen Shao et al.

AAAI 2024 • paper • arXiv:2306.15142 • 17 citations
#1839

Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

Mingfang Zhang, Yifei Huang, Ruicong Liu et al.

ECCV 2024 • poster • arXiv:2407.06628 • 17 citations
#1840

Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

Brian Gordon, Yonatan Bitton, Yonatan Shafir et al.

ECCV 2024 • poster • arXiv:2312.03766 • 17 citations
#1841

PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling

Ruizhe Zhong, Junjie Ye, Zhentao Tang et al.

AAAI 2024 • paper • arXiv:2403.00012 • 17 citations
#1842

Adaptive VIO: Deep Visual-Inertial Odometry with Online Continual Learning

Youqi Pan, Wugen Zhou, Yingdian Cao et al.

CVPR 2024 • poster • arXiv:2405.16754 • 17 citations
#1843

Towards Understanding Factual Knowledge of Large Language Models

Xuming Hu, Junzhe Chen, Xiaochuan Li et al.

ICLR 2024 • oral • 17 citations
#1844

What Makes a Good Prune? Maximal Unstructured Pruning for Maximal Cosine Similarity

Gabryel Mason-Williams, Fredrik Dahlqvist

ICLR 2024 • poster • 17 citations
#1845

Condition-Aware Neural Network for Controlled Image Generation

Han Cai, Muyang Li, Qinsheng Zhang et al.

CVPR 2024 • poster • arXiv:2404.01143 • 17 citations
#1846

A Comprehensive Augmentation Framework for Anomaly Detection

Lin Jiang, Yaping Yan

AAAI 2024 • paper • arXiv:2308.15068 • 16 citations
#1847

Programmable Motion Generation for Open-Set Motion Control Tasks

Hanchao Liu, Xiaohang Zhan, Shaoli Huang et al.

CVPR 2024 • highlight • arXiv:2405.19283 • 16 citations
#1848

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

Matthew Kowal, Richard P. Wildes, Kosta Derpanis

CVPR 2024 • highlight • arXiv:2404.02233 • 16 citations
#1849

Object Pose Estimation via the Aggregation of Diffusion Features

Tianfu Wang, Guosheng Hu, Hongguang Wang

CVPR 2024 • highlight • arXiv:2403.18791 • 16 citations
#1850

IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance

Hongyi He, Longjun Liu, Haonan Zhang et al.

AAAI 2024 • paper • arXiv:2312.12648 • 16 citations
#1851

Lazy Diffusion Transformer for Interactive Image Editing

Yotam Nitzan, Zongze Wu, Richard Zhang et al.

ECCV 2024 • poster • arXiv:2404.12382 • 16 citations
#1852

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Yu-Chu Yu, Chi-Pin Huang, Jr-Jen Chen et al.

ECCV 2024 • poster • arXiv:2403.09296 • 16 citations
#1853

R-MAE: Regions Meet Masked Autoencoders

Duy-Kien Nguyen, Yanghao Li, Vaibhav Aggarwal et al.

ICLR 2024 • poster • arXiv:2306.05411 • 16 citations
#1854

Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes

Gaurav Shrivastava, Abhinav Shrivastava

CVPR 2024 • poster • 16 citations
#1855

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

Tongkun Guan, Chengyu Lin, Wei Shen et al.

ECCV 2024 • poster • arXiv:2407.07764 • 16 citations
#1856

Taming Latent Diffusion Model for Neural Radiance Field Inpainting

Chieh Lin, Changil Kim, Jia-Bin Huang et al.

ECCV 2024 • poster • arXiv:2404.09995 • 16 citations
#1857

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

Chen Duan, Pei Fu, Shan Guo et al.

CVPR 2024 • poster • arXiv:2403.00303 • 16 citations
#1858

Diversified and Personalized Multi-rater Medical Image Segmentation

Yicheng Wu, Xiangde Luo, Zhe Xu et al.

CVPR 2024 • highlight • arXiv:2403.13417 • 16 citations
#1859

LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

Saksham Suri, Matthew Walmer, Kamal Gupta et al.

ECCV 2024 • poster • arXiv:2403.14625 • 16 citations
#1860

City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web

Kaiwen Song, Xiaoyi Zeng, Chenqu Ren et al.

ECCV 2024 • poster • arXiv:2312.16457 • 16 citations
#1861

Day-Night Cross-domain Vehicle Re-identification

Hongchao Li, Jingong Chen, Aihua Zheng et al.

CVPR 2024 • poster • 16 citations
#1862

Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise

Yixin Liu, Kaidi Xu, Xun Chen et al.

AAAI 2024 • paper • arXiv:2311.13091 • 16 citations
#1863

C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

Yiqun Lin, Jiewen Yang, Hualiang Wang et al.

CVPR 2024 • poster • arXiv:2406.03902 • 16 citations
#1864

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders

Alexandre Eymaël, Renaud Vandeghen, Anthony Cioppa et al.

ECCV 2024 • poster • arXiv:2403.17823 • 16 citations
#1865

An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding

Wei Chen, Long Chen, Yu Wu

ECCV 2024 • poster • arXiv:2408.01120 • 16 citations
#1866

Learning to Optimize Permutation Flow Shop Scheduling via Graph-Based Imitation Learning

Longkang Li, Siyuan Liang, Zihao Zhu et al.

AAAI 2024 • paper • arXiv:2210.17178 • 16 citations
#1867

TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling

Jun Li, Zedong Zhang, Jian Yang

ECCV 2024 • poster • arXiv:2310.01819 • 16 citations
#1868

Three Heads Are Better than One: Complementary Experts for Long-Tailed Semi-supervised Learning

Chengcheng Ma, Ismail Elezi, Jiankang Deng et al.

AAAI 2024 • paper • arXiv:2312.15702 • 16 citations
#1869

Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search

Lujun Li, Haosen Sun, Shiwen Li et al.

ECCV 2024 • poster • 16 citations
#1870

Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning

Yibing Wei, Abhinav Gupta, Pedro Morgado

ECCV 2024 • poster • arXiv:2407.15837 • 16 citations
#1871

Osmosis: RGBD Diffusion Prior for Underwater Image Restoration

Opher Bar Nathan, Deborah Steinberger-Levy, Tali Treibitz et al.

ECCV 2024 • poster • arXiv:2403.14837 • 16 citations
#1872

Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization

Khiem Le, Tuan Long Ho, Cuong Do et al.

CVPR 2024 • poster • arXiv:2403.15605 • 16 citations
#1873

TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds

Dupont Elona, Kseniya Cherenkova, Dimitrios Mallis et al.

ECCV 2024 • poster • arXiv:2407.12702 • 16 citations
#1874

CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs

Haocheng Yuan, Jing Xu, Hao Pan et al.

CVPR 2024 • highlight • arXiv:2311.16703 • 16 citations
#1875

Joint Demosaicing and Denoising for Spike Camera

Yanchen Dong, Ruiqin Xiong, Jing Zhao et al.

AAAI 2024 • paper • 16 citations
#1876

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Jeongsoo Choi, Se Jin Park, Minsu Kim et al.

CVPR 2024 • highlight • arXiv:2312.02512 • 16 citations
#1877

Review-Enhanced Hierarchical Contrastive Learning for Recommendation

Ke Wang, Yanmin Zhu, Tianzi Zang et al.

AAAI 2024 • paper • 16 citations
#1878

Progressive Poisoned Data Isolation for Training-Time Backdoor Defense

Yiming Chen, Haiwei Wu, Jiantao Zhou

AAAI 2024 • paper • arXiv:2312.12724 • 16 citations
#1879

Context Diffusion: In-Context Aware Image Generation

Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey et al.

ECCV 2024 • poster • arXiv:2312.03584 • 16 citations
#1880

Transformer-Based Selective Super-resolution for Efficient Image Refinement

Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo et al.

AAAI 2024 • paper • arXiv:2312.05803 • 16 citations
#1881

AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking

Yuheng Li, Tianyu Luan, Yizhou Wu et al.

ECCV 2024 • poster • arXiv:2407.06468 • 16 citations
#1882

Revealing the Proximate Long-Tail Distribution in Compositional Zero-Shot Learning

Chenyi Jiang, Haofeng Zhang

AAAI 2024 • paper • arXiv:2312.15923 • 16 citations
#1883

Semi-supervised Active Learning for Video Action Detection

Ayush Singh, Aayush J Rana, Akash Kumar et al.

AAAI 2024 • paper • arXiv:2312.07169 • 16 citations
#1884

Learning Hierarchical Image Segmentation For Recognition and By Recognition

Tsung-Wei Ke, Sangwoo Mo, Stella Yu

ICLR 2024 • spotlight • arXiv:2210.00314 • 16 citations
#1885

Interactive3D: Create What You Want by Interactive 3D Generation

Shaocong Dong, Lihe Ding, Zhanpeng Huang et al.

CVPR 2024 • poster • arXiv:2404.16510 • 16 citations
#1886

Commonsense Prototype for Outdoor Unsupervised 3D Object Detection

Hai Wu, Shijia Zhao, Xun Huang et al.

CVPR 2024 • poster • arXiv:2404.16493 • 16 citations
#1887

Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Wanru Zhao, Yihong Chen, Royson Lee et al.

ICLR 2024 • poster • arXiv:2507.03003 • 16 citations
#1888

Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer

Junyi Wu, Bin Duan, Weitai Kang et al.

CVPR 2024 • poster • arXiv:2403.14552 • 16 citations
#1889

CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis

Xiaoxiao Sun, Xingjian Leng, Zijian Wang et al.

ICLR 2024 • poster • arXiv:2310.04414 • 16 citations
#1890

Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

Haoyuan Wang, Wenbo Hu, Lei Zhu et al.

CVPR 2024 • poster • arXiv:2403.16224 • 16 citations
#1891

Align Before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

Yifei Chen, Dapeng Chen, Ruijin Liu et al.

CVPR 2024 • poster • arXiv:2311.15619 • 16 citations
#1892

DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models

Sohyun An, Hayeon Lee, Jaehyeong Jo et al.

ICLR 2024 • poster • arXiv:2305.16943 • 16 citations
#1893

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

Zhenyu Li, Shariq Farooq Bhat, Peter Wonka

ECCV 2024 • poster • arXiv:2406.06679 • 16 citations
#1894

Iterated Learning Improves Compositionality in Large Vision-Language Models

Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi et al.

CVPR 2024 • poster • arXiv:2404.02145 • 16 citations
#1895

Music Style Transfer with Time-Varying Inversion of Diffusion Models

Sifei Li, Yuxin Zhang, Fan Tang et al.

AAAI 2024 • paper • arXiv:2402.13763 • 16 citations
#1896

CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding

Eslam Abdelrahman, Mohamed Ayman Mohamed, Mahmoud Ahmed et al.

ICLR 2024 • poster • arXiv:2310.06214 • 16 citations
#1897

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Xianghao Kong, Jinyu Chen, Wenguan Wang et al.

ECCV 2024 • poster • arXiv:2407.07433 • 16 citations
#1898

Frozen Feature Augmentation for Few-Shot Image Classification

Andreas Bär, Neil Houlsby, Mostafa Dehghani et al.

CVPR 2024 • poster • arXiv:2403.10519 • 16 citations
#1899

Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis

Qian Chen, Shihao Shu, Xiangzhi Bai

ECCV 2024 • poster • arXiv:2409.08042 • 16 citations
#1900

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving

Cheng Zhao, Su Sun, Ruoyu Wang et al.

ECCV 2024 • poster • arXiv:2404.02410 • 16 citations
#1901

Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Yixiao Wang, Chen Tang, Lingfeng Sun et al.

ECCV 2024 • poster • arXiv:2408.00766 • 16 citations
#1902

Weakly Supervised Open-Vocabulary Object Detection

Jianghang Lin, Yunhang Shen, Bingquan Wang et al.

AAAI 2024 • paper • arXiv:2312.12437 • 16 citations
#1903

Evidential Active Recognition: Intelligent and Prudent Open-World Embodied Perception

Lei Fan, Mingfu Liang, Yunxuan Li et al.

CVPR 2024 • poster • arXiv:2311.13793 • 16 citations
#1904

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold

Jun Chen, Haishan Ye, Mengmeng Wang et al.

ICLR 2024 • poster • arXiv:2308.10547 • 16 citations
#1905

Quadratic models for understanding catapult dynamics of neural networks

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICLR 2024 • poster • arXiv:2205.11787 • 16 citations
#1906

Comparing the Robustness of Modern No-Reference Image- and Video-Quality Metrics to Adversarial Attacks

Anastasia Antsiferova, Khaled Abud, Aleksandr Gushchin et al.

AAAI 2024 • paper • arXiv:2310.06958 • 16 citations
#1907

FRIH: Fine-Grained Region-Aware Image Harmonization

Jinlong Peng, Zekun Luo, Liang Liu et al.

AAAI 2024 • paper • arXiv:2205.06448 • 16 citations
#1908

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

Zewen Chen, Haina Qin, Juan Wang et al.

ECCV 2024 • poster • arXiv:2403.04993 • 16 citations
#1909

Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Inhee Lee, Byungjun Kim, Hanbyul Joo

CVPR 2024 • poster • arXiv:2404.14410 • 16 citations
#1910

DART: Implicit Doppler Tomography for Radar Novel View Synthesis

Tianshu Huang, John Miller, Akarsh Prabhakara et al.

CVPR 2024 • poster • arXiv:2403.03896 • 16 citations
#1911

ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open Vocabulary Object Detection

Joonhyun Jeong, Geondo Park, Jayeon Yoo et al.

AAAI 2024 • paper • arXiv:2312.07266 • 16 citations
#1912

SURER: Structure-Adaptive Unified Graph Neural Network for Multi-View Clustering

Jing Wang, Songhe Feng, Gengyu Lyu et al.

AAAI 2024 • paper • 16 citations
#1913

LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model

Dongkai Wang, Shiyu Xuan, Shiliang Zhang

CVPR 2024 • highlight • arXiv:2406.04659 • 16 citations
#1914

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Ming Zhong, Chenxin An, Weizhu Chen et al.

ICLR 2024 • poster • arXiv:2310.11451 • 16 citations
#1915

MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes

Casper van Engelenburg, Fatemeh Mostafavi, Emanuel Kuhn et al.

ECCV 2024 • poster • arXiv:2407.10121 • 16 citations
#1916

CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects

Yoonyoung Cho, Junhyek Han, Yoontae Cho et al.

ICLR 2024 • poster • arXiv:2403.10760 • 16 citations
#1917

Mirage: Model-agnostic Graph Distillation for Graph Classification

Mridul Gupta, Sahil Manchanda, Hariprasad Kodamana et al.

ICLR 2024 • poster • arXiv:2310.09486 • 16 citations
#1918

MaGGIe: Masked Guided Gradual Human Instance Matting

Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava et al.

CVPR 2024 • poster • arXiv:2404.16035 • 16 citations
#1919

SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild

Andreas Engelhardt, Amit Raj, Mark Boss et al.

CVPR 2024 • poster • arXiv:2401.10171 • 16 citations
#1920

SuperGaussian: Repurposing Video Models for 3D Super Resolution

Yuan Shen, Duygu Ceylan, Paul Guerrero et al.

ECCV 2024 • poster • arXiv:2406.00609 • 16 citations
#1921

GaussReg: Fast 3D Registration with Gaussian Splatting

Jiahao Chang, Yinglin Xu, Yihao Li et al.

ECCV 2024 • poster • arXiv:2407.05254 • 16 citations
#1922

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

Yixuan Zhu, Ao Li, Yansong Tang et al.

CVPR 2024 • poster • arXiv:2404.01424 • 16 citations
#1923

KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling

Yu Wang, Xin Li, Shengzhao Wen et al.

CVPR 2024 • poster • arXiv:2211.08071 • 16 citations
#1924

SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS

Yameng Peng, Andy Song, Haytham Fayek et al.

ICLR 2024 • spotlight • arXiv:2403.04161 • 16 citations
#1925

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

Hanrong Ye, Jason Wen Yong Kuen, Qing Liu et al.

ECCV 2024 • poster • arXiv:2311.03355 • 16 citations
#1926

Every Node Is Different: Dynamically Fusing Self-Supervised Tasks for Attributed Graph Clustering

Pengfei Zhu, Qian Wang, Yu Wang et al.

AAAI 2024 • paper • arXiv:2401.06595 • 16 citations
#1927

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation

Wenjie Zhuo, Fan Ma, Hehe Fan et al.

ECCV 2024 • poster • arXiv:2407.09822 • 16 citations
#1928

Grounded Object-Centric Learning

Avinash Kori, Francesco Locatello, Fabio De Sousa Ribeiro et al.

ICLR 2024 • poster • 16 citations
#1929

Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search

Meiyu Liang, Junping Du, Zhengyang Liang et al.

AAAI 2024 • paper • 16 citations
#1930

Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables

Haisong Gong, Weizhi Xu, Shu Wu et al.

AAAI 2024 • paper • arXiv:2402.13028 • 16 citations
#1931

PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration

Runzhao Yao, Shaoyi Du, Wenting Cui et al.

ECCV 2024 • poster • arXiv:2407.10142 • 16 citations
#1932

Unleashing the Potential of Fractional Calculus in Graph Neural Networks with FROND

Qiyu Kang, Kai Zhao, Qinxu Ding et al.

ICLR 2024 • spotlight • arXiv:2404.17099 • 16 citations
#1933

Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems

Hyungjin Chung, Jong Chul Ye

ECCV 2024 • poster • arXiv:2407.10641 • 16 citations
#1934

Get an A in Math: Progressive Rectification Prompting

Zhenyu Wu, Meng Jiang, Chao Shen

AAAI 2024 • paper • arXiv:2312.06867 • 15 citations
#1935

Versatile Medical Image Segmentation Learned from Multi-Source Datasets via Model Self-Disambiguation

Xiaoyang Chen, Hao Zheng, Yuemeng Li et al.

CVPR 2024 • poster • arXiv:2311.10696 • 15 citations
#1936

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation

Qiushi Zhu, Jie Zhang, Yu Gu et al.

AAAI 2024 • paper • arXiv:2401.03468 • 15 citations
#1937

MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis

Ziming Zhong, Yanyu Xu, Jing Li et al.

ECCV 2024 • poster • 15 citations
#1938

What, How, and When Should Object Detectors Update in Continually Changing Test Domains?

Jayeon Yoo, Dongkwan Lee, Inseop Chung et al.

CVPR 2024 • poster • arXiv:2312.08875 • 15 citations
#1939

One-Shot Structure-Aware Stylized Image Synthesis

Hansam Cho, Jonghyun Lee, Seunggyu Chang et al.

CVPR 2024 • poster • arXiv:2402.17275 • 15 citations
#1940

DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

Jincen Jiang, Qianyu Zhou, Yuhang Li et al.

ECCV 2024 • poster • arXiv:2407.08801 • 15 citations
#1941

Compositional Generative Inverse Design

Tailin Wu, Takashi Maruyama, Long Wei et al.

ICLR 2024 • spotlight • arXiv:2401.13171 • 15 citations
#1942

LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation

Ruida Zhang, Ziqin Huang, Gu Wang et al.

ECCV 2024 • poster • arXiv:2409.15727 • 15 citations
#1943

Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling

Baoquan Zhang, Huaibin Wang, Luo Chuyao et al.

CVPR 2024 • poster • arXiv:2403.10071 • 15 citations
#1944

Gaussian Shadow Casting for Neural Characters

Luis Bolanos, Shih-Yang Su, Helge Rhodin

CVPR 2024 • poster • arXiv:2401.06116 • 15 citations
#1945

Signed Graph Neural Ordinary Differential Equation for Modeling Continuous-Time Dynamics

Lanlan Chen, Kai Wu, Jian Lou et al.

AAAI 2024 • paper • arXiv:2312.11198 • 15 citations
#1946

Real-Time Simulated Avatar from Head-Mounted Sensors

Zhengyi Luo, Jinkun Cao, Rawal Khirodkar et al.

CVPR 2024 • highlight • arXiv:2403.06862 • 15 citations
#1947

Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation

Ilhoon Yoon, Hyeongjun Kwon, Jin Kim et al.

ECCV 2024 • poster • arXiv:2407.13524 • 15 citations
#1948

Enhancing Vision-Language Pre-training with Rich Supervisions

Yuan Gao, Kunyu Shi, Pengkai Zhu et al.

CVPR 2024 • highlight • arXiv:2403.03346 • 15 citations
#1949

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ECCV 2024 • poster • arXiv:2403.09037 • 15 citations
#1950

Diffusion Bridges for 3D Point Cloud Denoising

Mathias Vogel, Keisuke Tateno, Marc Pollefeys et al.

ECCV 2024 • poster • arXiv:2408.16325 • 15 citations
#1951

Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Xiyi Chen, Marko Mihajlovic, Shaofei Wang et al.

CVPR 2024 • poster • arXiv:2401.04728 • 15 citations
#1952

Adapters Strike Back

Jan-Martin Steitz, Stefan Roth

CVPR 2024 • poster • arXiv:2406.06820 • 15 citations
#1953

LookupViT: Compressing visual information to a limited number of tokens

Rajat Koner, Gagan Jain, Sujoy Paul et al.

ECCV 2024 • poster • arXiv:2407.12753 • 15 citations
#1954

Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment

Alireza Ganjdanesh, Shangqian Gao, Heng Huang

CVPR 2024 • poster • arXiv:2403.19490 • 15 citations
#1955

Tensorized Label Learning on Anchor Graph

Jing Li, Quanxue Gao, Qianqian Wang et al.

AAAI 2024 • paper • 15 citations
#1956

Learning MDL Logic Programs from Noisy Data

Céline Hocquette, Andreas Niskanen, Matti Järvisalo et al.

AAAI 2024 • paper • arXiv:2308.09393 • 15 citations
#1957

Learning Optimal Advantage from Preferences and Mistaking It for Reward

W Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson et al.

AAAI 2024 • paper • arXiv:2310.02456 • 15 citations
#1958

The Hard Positive Truth about Vision-Language Compositionality

Amita Kamath, Cheng-Yu Hsieh, Kai-Wei Chang et al.

ECCV 2024 • poster • arXiv:2409.17958 • 15 citations
#1959

HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High- and Low-Frequency Information of Parametric Models

Yifan Yang, Dong Liu, Shuhai Zhang et al.

CVPR 2024 • poster • arXiv:2404.04876 • 15 citations
#1960

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing

Jun-Kun Chen, Samuel Rota Bulò, Norman Müller et al.

CVPR 2024 • poster • arXiv:2406.09404 • 15 citations
#1961

Instance-Aware Group Quantization for Vision Transformers

Jaehyeon Moon, Dohyung Kim, Jun Yong Cheon et al.

CVPR 2024 • poster • arXiv:2404.00928 • 15 citations
#1962

OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation

Ganlong Zhao, Guanbin Li, Weikai Chen et al.

CVPR 2024 • poster • arXiv:2403.17334 • 15 citations
#1963

Adversarial Score Distillation: When score distillation meets GAN

Min Wei, Jingkai Zhou, Junyao Sun et al.

CVPR 2024 • poster • arXiv:2312.00739 • 15 citations
#1964

Cyclic Learning for Binaural Audio Generation and Localization

Zhaojian Li, Bin Zhao, Yuan Yuan

CVPR 2024 • poster • 15 citations
#1965

AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation

Yangchao Wu, Tian Yu Liu, Hyoungseob Park et al.

ECCV 2024 • poster • arXiv:2310.09739 • 15 citations
#1966

Improving Spectral Snapshot Reconstruction with Spectral-Spatial Rectification

Jiancheng Zhang, Haijin Zeng, Yongyong Chen et al.

CVPR 2024 • poster • 15 citations
#1967

Progressive Divide-and-Conquer via Subsampling Decomposition for Accelerated MRI

Chong Wang, Lanqing Guo, Yufei Wang et al.

CVPR 2024 • highlight • arXiv:2403.10064 • 15 citations
#1968

OmniMotionGPT: Animal Motion Generation with Limited Data

Zhangsihao Yang, Mingyuan Zhou, Mengyi Shan et al.

CVPR 2024 • poster • arXiv:2311.18303 • 15 citations
#1969

BaCon: Boosting Imbalanced Semi-supervised Learning via Balanced Feature-Level Contrastive Learning

Qianhan Feng, Lujing Xie, Shijie Fang et al.

AAAI 2024 • paper • arXiv:2403.12986 • 15 citations
#1970

Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment

Aobo Li, Jinjian Wu, Yongxu Liu et al.

CVPR 2024 • poster • arXiv:2405.04167 • 15 citations
#1971

Tackling Structural Hallucination in Image Translation with Local Diffusion

Seunghoi Kim, Chen Jin, Tom Diethe et al.

ECCV 2024 • poster • arXiv:2404.05980 • 15 citations
#1972

Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models

Gianni Franchi, Olivier Laurent, Maxence Leguéry et al.

CVPR 2024 • poster • arXiv:2312.15297 • 15 citations
#1973

Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos

Remy Sabathier, David Novotny, Niloy Mitra

ECCV 2024 • poster • arXiv:2403.17103 • 15 citations
#1974

TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes

Xuying Zhang, Bo-Wen Yin, Yuming Chen et al.

CVPR 2024 • poster • arXiv:2312.04248 • 15 citations
#1975

Semi-supervised Open-World Object Detection

Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer et al.

AAAI 2024 • paper • arXiv:2402.16013 • 15 citations
#1976

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field

Zhangkai Ni, Peiqi Yang, Wenhan Yang et al.

AAAI 2024 • paper • arXiv:2312.09095 • 15 citations
#1977

Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments

Liyuan Zhu, Shengyu Huang, Konrad Schindler et al.

CVPR 2024 • highlight • arXiv:2312.09138 • 15 citations
#1978

A Noisy Elephant in the Room: Is Your Out-of-Distribution Detector Robust to Label Noise?

Galadrielle Humblot-Renaux, Sergio Escalera, Thomas B. Moeslund

CVPR 2024 • poster • arXiv:2404.01775 • 15 citations
#1979

Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches

Qing Yu, Mikihiro Tanaka, Kent Fujiwara

CVPR 2024 • poster • arXiv:2405.04771 • 15 citations
#1980

Instant 3D Human Avatar Generation using Image Diffusion Models

Nikos Kolotouros, Thiemo Alldieck, Enric Corona et al.

ECCV 2024 • poster • arXiv:2406.07516 • 15 citations
#1981

Open Panoramic Segmentation

Junwei Zheng, Ruiping Liu, Yufan Chen et al.

ECCV 2024 • poster • arXiv:2407.02685 • 15 citations
#1982

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models

Hao Cheng, Erjia Xiao, Jindong Gu et al.

ECCV 2024 • poster • arXiv:2402.19150 • 15 citations
#1983

CREAD: A Classification-Restoration Framework with Error Adaptive Discretization for Watch Time Prediction in Video Recommender Systems

Jie Sun, Zhao Ying Ding, Xiaoshuang Chen et al.

AAAI 2024 • paper • arXiv:2401.07521 • 15 citations
#1984

Kill Two Birds with One Stone: Rethinking Data Augmentation for Deep Long-tailed Learning

Binwu Wang, Pengkun Wang, Wei Xu et al.

ICLR 2024 • poster • 15 citations
#1985

FRDiff: Feature Reuse for Universal Training-free Acceleration of Diffusion Models

Junhyuk So, Jungwon Lee, Eunhyeok Park

ECCV 2024 • poster • arXiv:2312.03517 • 15 citations
#1986

Self-Supervised Video Desmoking for Laparoscopic Surgery

Renlong Wu, Zhilu Zhang, Shuohao Zhang et al.

ECCV 2024 • poster • arXiv:2403.11192 • 15 citations
#1987

Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge

Dongjin Kim, Sung Jin Um, Sangmin Lee et al.

CVPR 2024 • poster • arXiv:2403.17420 • 15 citations
#1988

Adversarial Training Should Be Cast as a Non-Zero-Sum Game

Alex Robey, Fabian Latorre, George Pappas et al.

ICLR 2024 • poster • arXiv:2306.11035 • 15 citations
#1989

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Wei Su, Peihan Miao, Huanzhang Dou et al.

CVPR 2024 • poster • arXiv:2406.18048 • 15 citations
#1990

Quad Bayer Joint Demosaicing and Denoising Based on Dual Encoder Network with Joint Residual Learning

Bolun Zheng, Li Haoran, Quan Chen et al.

AAAI 2024 • paper • 15 citations
#1991

History Matters: Temporal Knowledge Editing in Large Language Model

Xunjian Yin, Jin Jiang, Liming Yang et al.

AAAI 2024 • paper • arXiv:2312.05497 • 15 citations
#1992

Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms

Joren Brunekreef, Eric Marcus, Ray Sheombarsing et al.

CVPR 2024 • poster • arXiv:2311.11837 • 15 citations
#1993

Accelerating Image Generation with Sub-path Linear Approximation Model

Chen Xu, Tianhui Song, Weixin Feng et al.

ECCV 2024 • poster • arXiv:2404.13903 • 15 citations
#1994

VEON: Vocabulary-Enhanced Occupancy Prediction

Jilai Zheng, Pin Tang, Zhongdao Wang et al.

ECCV 2024 • poster • arXiv:2407.12294 • 15 citations
#1995

Multimarginal Generative Modeling with Stochastic Interpolants

Michael Albergo, Nicholas Boffi, Michael Lindsey et al.

ICLR 2024 • poster • arXiv:2310.03695 • 15 citations
#1996

Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI

Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar

ICLR 2024 • poster • arXiv:2403.04551 • 15 citations
#1997

Bidirectional Autoregressive Diffusion Model for Dance Generation

Canyu Zhang, Youbao Tang, Ning Zhang et al.

CVPR 2024 • poster • 15 citations
#1998

Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models

Kota Sueyoshi, Takashi Matsubara

CVPR 2024 • highlight • arXiv:2311.16117 • 15 citations
#1999

GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Chenjian Gao, Boyan Jiang, Xinghui Li et al.

CVPR 2024 • poster • arXiv:2403.17782 • 15 citations
#2000

Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

Ba Khanh Trinh Le, Huy-Hung Nguyen, Long Hoang Pham et al.

ECCV 2024 • poster • arXiv:2407.16497 • 15 citations