Most Cited 2024 "cognitive reasoning" Papers

12,324 papers found • Page 50 of 62

#9801

Generative-Based Fusion Mechanism for Multi-Modal Tracking

Zhangyong Tang, Tianyang Xu, Xiaojun Wu et al.

AAAI 2024paperarXiv:2309.01728
#9802

Towards Epistemic-Doxastic Planning with Observation and Revision

Thorsten Engesser, Andreas Herzig, Elise Perrotin

AAAI 2024paper
#9803

Frequency Shuffling and Enhancement for Open Set Recognition

Lijun Liu, Rui Wang, Yuan Wang et al.

AAAI 2024paper
#9804

GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval

Yuting Wang, Jinpeng Wang, Bin Chen et al.

AAAI 2024paperarXiv:2310.05195
#9805

One Self-Configurable Model to Solve Many Abstract Visual Reasoning Problems

Mikołaj Małkiński, Jacek Mańdziuk

AAAI 2024paperarXiv:2312.09997
#9806

From Past to Future: Rethinking Eligibility Traces

Dhawal Gupta, Scott Jordan, Shreyas Chaudhari et al.

AAAI 2024paperarXiv:2312.12972
#9807

A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging

Liya Ji, ZheFan Rao, Sinno Jialin Pan et al.

AAAI 2024paper
#9808

Temporal-Distributed Backdoor Attack against Video Based Action Recognition

Xi Li, Songhe Wang, Ruiquan Huang et al.

AAAI 2024paperarXiv:2308.11070
#9809

Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

Junghun Cha, Ali Haider, Seoyun Yang et al.

AAAI 2024paperarXiv:2402.05350
#9810

SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D Images

Weihao Cheng, Yan-Pei Cao, Ying Shan

AAAI 2024paper
#9811

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

Jialu Zhang, Xiaoying Yang, Wentao He et al.

AAAI 2024paperarXiv:2312.15219
#9812

Collaborative Tooth Motion Diffusion Model in Digital Orthodontics

Yeying Fan, Guangshun Wei, Chen Wang et al.

AAAI 2024paper
#9813

An Information-Flow Perspective on Algorithmic Fairness

Samuel Teuber, Bernhard Beckert

AAAI 2024paperarXiv:2312.10128
#9814

KeDuSR: Real-World Dual-Lens Super-resolution via Kernel-Free Matching

Huanjing Yue, Zifan Cui, Kun Li et al.

AAAI 2024paperarXiv:2312.17050
#9815

Robustly Train Normalizing Flows via KL Divergence Regularization

Kun Song, Ruben Solozabal Ochoa de Retana, Hao Li et al.

AAAI 2024paper
#9816

CoVR: Learning Composed Video Retrieval from Web Video Captions

Lucas Ventura, Antoine Yang, Cordelia Schmid et al.

AAAI 2024paper
#9817

Double-Descent Curves in Neural Networks: A New Perspective Using Gaussian Processes

Ouns El Harzli, Bernardo Cuenca Grau, Guillermo Valle Perez et al.

AAAI 2024paperarXiv:2102.07238
#9818

Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Data

Heejo Kong, Suneung Kim, Ho-Joong Kim et al.

AAAI 2024paper
#9819

DeRDaVa: Deletion-Robust Data Valuation for Machine Learning

Xiao Tian, Rachael Hwee Ling Sim, Jue Fan et al.

AAAI 2024paperarXiv:2312.11413
#9820

CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation

Junao Shen, Kun Kuang, Jiaheng Wang et al.

AAAI 2024paper
#9821

Efficient Constrained K-center Clustering with Background Knowledge

Longkun Guo, Chaoqi Jia, Kewen Liao et al.

AAAI 2024paper
#9822

Taming the Sigmoid Bottleneck: Provably Argmaxable Sparse Multi-Label Classification

Andreas Grivas, Antonio Vergari, Adam Lopez

AAAI 2024paperarXiv:2310.10443
#9823

Detection and Defense of Unlearnable Examples

Yifan Zhu, lijia Yu, Xiao-Shan Gao

AAAI 2024paperarXiv:2312.08898
#9824

MEPSI: An MDL-Based Ensemble Pruning Approach with Structural Information

Xiao-Dong Bi, Shao-Qun Zhang, Yuan Jiang

AAAI 2024paper
#9825

Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits

Qingsong Liu, Zhixuan Fang

AAAI 2024paper
#9826

Hierarchical Multi-Marginal Optimal Transport for Network Alignment

Zhichen Zeng, Boxin Du, Si Zhang et al.

AAAI 2024paperarXiv:2310.04470
#9827

Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach

Yu Wang, Yuxuan Yin, Karthik Somayaji NS et al.

AAAI 2024paperarXiv:2310.13110
#9828

A Plug-and-Play Quaternion Message-Passing Module for Molecular Conformation Representation

Angxiao Yue, Dixin Luo, Hongteng Xu

AAAI 2024paper
#9829

3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands

Xuan Huang, Hanhui Li, Zejun Yang et al.

AAAI 2024paperarXiv:2401.00979
#9830

New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem

Koji Ichikawa, Shinji Ito, Daisuke Hatano et al.

AAAI 2024paperarXiv:2312.12400
#9831

Convolutional Channel-Wise Competitive Learning for the Forward-Forward Algorithm

Andreas Papachristodoulou, Christos Kyrkou, Stelios Timotheou et al.

AAAI 2024paperarXiv:2312.12668
#9832

CcDPM: A Continuous Conditional Diffusion Probabilistic Model for Inverse Design

Yanxuan Zhao, Peng Zhang, Guopeng Sun et al.

AAAI 2024paper
#9833

Universal Weak Coreset

Ragesh Jaiswal, Amit Kumar

AAAI 2024paperarXiv:2305.16890
#9834

Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation

Minqin Zhu, Anpeng Wu, Haoxuan Li et al.

AAAI 2024paperarXiv:2403.14232
#9835

DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations

Guojun Xiong, Gang Yan, Shiqiang Wang et al.

AAAI 2024paperarXiv:2312.10815
#9836

RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction

Yemin Yu, Luotian Yuan, Ying WEI et al.

AAAI 2024paperarXiv:2312.10900
#9837

Robust Visual Imitation Learning with Inverse Dynamics Representations

Siyuan Li, Xun Wang, Rongchang Zuo et al.

AAAI 2024paperarXiv:2310.14274
#9838

Offline Model-Based Optimization via Policy-Guided Gradient Search

Yassine Chemingui, Aryan Deshwal, Nghia Hoang et al.

AAAI 2024paperarXiv:2405.05349
#9839

Generator Assisted Mixture of Experts for Feature Acquisition in Batch

Vedang Asgaonkar, Aditya Jain, Abir De

AAAI 2024paperarXiv:2312.12574
#9840

MemoryBank: Enhancing Large Language Models with Long-Term Memory

Wanjun Zhong, Lianghong Guo, Qiqi Gao et al.

AAAI 2024paperarXiv:2305.10250
#9841

Formal Logic Enabled Personalized Federated Learning through Property Inference

Ziyan An, Taylor Johnson, Meiyi Ma

AAAI 2024paperarXiv:2401.07448
#9842

Secure Distributed Sparse Gaussian Process Models Using Multi-Key Homomorphic Encryption

Adil Nawaz, Guopeng Chen, Muhammad Umair Raza et al.

AAAI 2024paper
#9843

Learn to Follow: Decentralized Lifelong Multi-Agent Pathfinding via Planning and Learning

Alexey Skrynnik, Anton Andreychuk, Maria Nesterova et al.

AAAI 2024paperarXiv:2310.01207
#9844

CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification

Kaitao Chen, Shiliang Sun, Jing Zhao

AAAI 2024paper
#9845

DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior

Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee

AAAI 2024paper
#9846

Eliciting Honest Information from Authors Using Sequential Review

Yichi Zhang, Grant Schoenebeck, Weijie Su

AAAI 2024paperarXiv:2311.14619
#9847

Approximation Scheme for Weighted Metric Clustering via Sherali-Adams

Dmitrii Avdiukhin, Vaggos Chatziafratis, Konstantin Makarychev et al.

AAAI 2024paper
#9848

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

Maitreya Patel, Tejas Gokhale, Chitta Baral et al.

AAAI 2024paperarXiv:2306.04695
#9849

Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs

Seungjun Lee, TaeIL Oh

AAAI 2024paperarXiv:2312.10975
#9850

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning

Jiayu Chen, Zelai Xu, Yunfei Li et al.

AAAI 2024paperarXiv:2310.04796
#9851

Stochastic Bayesian Optimization with Unknown Continuous Context Distribution via Kernel Density Estimation

Xiaobin Huang, Lei Song, Ke Xue et al.

AAAI 2024paperarXiv:2312.10423
#9852

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning

Dianyu Zhong, Yiqin Yang, Qianchuan Zhao

AAAI 2024paperarXiv:2312.06258
#9853

$z$-SignFedAvg: A Unified Stochastic Sign-Based Compression for Federated Learning

Zhiwei Tang, Yanmeng Wang, Tsung-Hui Chang

AAAI 2024paperarXiv:2302.02589
#9854

Contextual Pandora’s Box

Alexia Atsidakou, Constantine Caramanis, Evangelia Gergatsouli et al.

AAAI 2024paper
#9855

Robust Distributed Gradient Aggregation Using Projections onto Gradient Manifolds

Kwang In Kim

AAAI 2024paper
#9856

Generative Model Perception Rectification Algorithm for Trade-Off between Diversity and Quality

Guipeng Lan, Shuai Xiao, Jiachen Yang et al.

AAAI 2024paper
#9857

Taming Binarized Neural Networks and Mixed-Integer Programs

Johannes Aspman, Georgios Korpas, Jakub Marecek

AAAI 2024paperarXiv:2310.04469
#9858

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

Jianhao Ding, Zhaofei Yu, Tiejun Huang et al.

AAAI 2024paper
#9859

Towards Dynamic Spatial-Temporal Graph Learning: A Decoupled Perspective

Binwu Wang, Pengkun Wang, Yudong Zhang et al.

AAAI 2024paper
#9860

A Closer Look at Curriculum Adversarial Training: From an Online Perspective

Lianghe Shi, Weiwei Liu

AAAI 2024paper
#9861

Provably Convergent Federated Trilevel Learning

Yang Jiao, Kai YANG, Tiancheng Wu et al.

AAAI 2024paperarXiv:2312.11835
#9862

Equity-Transformer: Solving NP-Hard Min-Max Routing Problems as Sequential Generation with Equity Context

Jiwoo Son, Minsu Kim, Sanghyeok Choi et al.

AAAI 2024paperarXiv:2306.02689
#9863

DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing

Conglong Li, Zhewei Yao, Xiaoxia Wu et al.

AAAI 2024paperarXiv:2212.03597
#9864

Dynamic Knowledge Injection for AIXI Agents

Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

AAAI 2024paperarXiv:2312.16184
#9865

Factored Online Planning in Many-Agent POMDPs

Maris Galesloot, Thiago Simão, Sebastian Junges et al.

AAAI 2024paperarXiv:2312.11434
#9866

Principal-Agent Reward Shaping in MDPs

Omer Ben-Porat, Yishay Mansour, Michal Moshkovitz et al.

AAAI 2024paperarXiv:2401.00298
#9867

Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection

AAAI 2024paper
#9868

Dialogues Are Not Just Text: Modeling Cognition for Dialogue Coherence Evaluation

AAAI 2024paper
#9869

A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities

AAAI 2024paper
#9870

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

AAAI 2024paper
#9871

LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack

AAAI 2024paperarXiv:2308.00319
#9872

A Novel Skip Orthogonal List for Dynamic Optimal Transport Problem

AAAI 2024paperarXiv:2310.18446
#9873

Mixed-Effects Contextual Bandits

Weiwei Xiao, Yongyong Chen, Qiben Shan et al.

AAAI 2024paper
#9874

Beyond Attention: Breaking the Limits of Transformer Context Length with Recurrent Memory

Aydar Bulatov, Yuri Kuratov, Yermek Kapushev et al.

AAAI 2024paper
#9875

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Zhen Qin, Feiyi Chen, Chen Zhi et al.

AAAI 2024paperarXiv:2309.16456
#9876

Transportable Representations for Domain Generalization

Kasra Jalaldoust, Elias Bareinboim

AAAI 2024paper
#9877

Exponential Hardness of Optimization from the Locality in Quantum Neural Networks

Hao-Kai Zhang, Chengkai Zhu, Geng Liu et al.

AAAI 2024paper
#9878

MFOS: Model-Free & One-Shot Object Pose Estimation

JongMin Lee, Yohann Cabon, Romain Brégier et al.

AAAI 2024paper
#9879

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

Jiangmeng Li, Yifan Jin, Hang Gao et al.

AAAI 2024paperarXiv:2312.14222
#9880

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Yige Yuan, Bingbing Xu, Bo Lin et al.

AAAI 2024paperarXiv:2305.15835
#9881

Towards Real-World Test-Time Adaptation: Tri-net Self-Training with Balanced Normalization

Yongyi Su, Xun Xu, Kui Jia

AAAI 2024paperarXiv:2309.14949
#9882

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Longchao Da, Porter Jenkins, Trevor Schwantes et al.

AAAI 2024paperarXiv:2312.11551
#9883

DRF: Improving Certified Robustness via Distributional Robustness Framework

Zekai Wang, Zhengyu Zhou, Weiwei Liu

AAAI 2024paper
#9884

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Ruiqian Nai, Zixin Wen, Ji Li et al.

AAAI 2024paperarXiv:2403.00352
#9885

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

Zhouhong Gu, Xiaoxuan Zhu, Haoning Ye et al.

AAAI 2024paperarXiv:2306.05783
#9886

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.

AAAI 2024paperarXiv:2303.11611
#9887

Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model

Zhenyu Xie, Yang Wu, Xuehao Gao et al.

AAAI 2024paperarXiv:2312.10960
#9888

Dirichlet-Based Prediction Calibration for Learning with Noisy Labels

Chen-Chen Zong, Ye-Wen Wang, Ming-Kun Xie et al.

AAAI 2024paperarXiv:2401.07062
#9889

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Hongbin Pei, Taile Chen, Chen A et al.

AAAI 2024paper
#9890

Unsupervised Template-assisted Point Cloud Shape Correspondence Network

Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

CVPR 2024arXiv:2403.16412
#9891

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Shuofeng Sun, Yongming Rao, Jiwen Lu et al.

CVPR 2024arXiv:2404.15010
#9892

Spectral and Polarization Vision: Spectro-polarimetric Real-world Dataset

Yujin Jeon, Eunsue Choi, Youngchan Kim et al.

CVPR 2024highlightarXiv:2311.17396
#9893

Efficient Model Stealing Defense with Noise Transition Matrix

Dong-Dong Wu, Chilin Fu, Weichang Wu et al.

CVPR 2024
#9894

HOIAnimator: Generating Text-prompt Human-object Animations using Novel Perceptive Diffusion Models

Wenfeng Song, Xinyu Zhang, Shuai Li et al.

CVPR 2024
#9895

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

bowen zhang, Xiaojie Jin, Weibo Gong et al.

CVPR 2024arXiv:2301.07868
#9896

Diffusion Models Without Attention

Jing Nathan Yan, Jiatao Gu, Alexander Rush

CVPR 2024arXiv:2311.18257
#9897

HDQMF: Holographic Feature Decomposition Using Quantum Algorithms

Prathyush Poduval, Zhuowen Zou, Mohsen Imani

CVPR 2024
#9898

DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan et al.

CVPR 2024arXiv:2312.07920
#9899

H-ViT: A Hierarchical Vision Transformer for Deformable Image Registration

Morteza Ghahremani, Mohammad Khateri, Bailiang Jian et al.

CVPR 2024highlight
#9900

Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models

Huimin Huang, Yawen Huang, Lanfen Lin et al.

CVPR 2024
#9901

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

Junyuan Zhang, Shuang Zeng, Miao Zhang et al.

CVPR 2024
#9902

MR-VNet: Media Restoration using Volterra Networks

Siddharth Roheda, Amit Unde, Loay Rashid

CVPR 2024
#9903

OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition

Jianqiang Wan, Sibo Song, Wenwen Yu et al.

CVPR 2024arXiv:2403.19128
#9904

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

Xu Peng, Junwei Zhu, Boyuan Jiang et al.

CVPR 2024arXiv:2312.06354
#9905

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang et al.

CVPR 2024arXiv:2404.02790
#9906

Dr.Hair: Reconstructing Scalp-Connected Hair Strands without Pre-Training via Differentiable Rendering of Line Segments

Yusuke Takimoto, Hikari Takehara, Hiroyuki Sato et al.

CVPR 2024highlightarXiv:2403.17496
#9907

CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning

Shiyu Tian, Hongxin Wei, Yiqun Wang et al.

CVPR 2024arXiv:2303.10365
#9908

PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

Kun Yuan, Hongbo Liu, Mading Li et al.

CVPR 2024arXiv:2405.17765
#9909

Improved Self-Training for Test-Time Adaptation

Jing Ma

CVPR 2024
#9910

Mudslide: A Universal Nuclear Instance Segmentation Method

Jun Wang

CVPR 2024highlight
#9911

Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline

Anas Al-lahham, Muhammad Zaigham Zaheer, Nurbek Tastan et al.

CVPR 2024arXiv:2404.00847
#9912

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Felix Wimbauer, Bichen Wu, Edgar Schoenfeld et al.

CVPR 2024arXiv:2312.03209
#9913

Rewrite the Stars

Xu Ma, Xiyang Dai, Yue Bai et al.

CVPR 2024arXiv:2403.19967
#9914

Virtual Immunohistochemistry Staining for Histological Images Assisted by Weakly-supervised Learning

Jiahan Li, Jiuyang Dong, Shenjin Huang et al.

CVPR 2024
#9915

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

Chenfeng Xu, Huan Ling, Sanja Fidler et al.

CVPR 2024arXiv:2311.04391
#9916

Model Adaptation for Time Constrained Embodied Control

Jaehyun Song, Minjong Yoo, Honguk Woo

CVPR 2024arXiv:2406.11128
#9917

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data

Chengxiang Fan, Muzhi Zhu, Hao Chen et al.

CVPR 2024arXiv:2405.10185
#9918

SPAD: Spatially Aware Multi-View Diffusers

Yash Kant, Aliaksandr Siarohin, Ziyi Wu et al.

CVPR 2024
#9919

SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

Kejia Yin, Varshanth Rao, Ruowei Jiang et al.

CVPR 2024arXiv:2405.18322
#9920

DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation

Chenyang Wang, Zerong Zheng, Tao Yu et al.

CVPR 2024
#9921

SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction

Pin Tang, Zhongdao Wang, Guoqing Wang et al.

CVPR 2024arXiv:2404.09502
#9922

Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion

Litu Rout, Yujia Chen, Abhishek Kumar et al.

CVPR 2024arXiv:2312.00852
#9923

Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training

Arun Reddy, William Paul, Corban Rivera et al.

CVPR 2024arXiv:2312.02914
#9924

RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection

Zhiwei Lin, Zhe Liu, Zhongyu Xia et al.

CVPR 2024arXiv:2403.16440
#9925

FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment

Jinglin Xu, Sibo Yin, Guohao Zhao et al.

CVPR 2024arXiv:2405.06887
#9926

SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes

Alexandros Delitzas, Ayça Takmaz, Federico Tombari et al.

CVPR 2024
#9927

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

Xu Cao, Tong Zhou, Yunsheng Ma et al.

CVPR 2024
#9928

Do Vision and Language Encoders Represent the World Similarly?

Mayug Maniparambil, Raiymbek Akshulakov, YASSER ABDELAZIZ DAHOU DJILALI et al.

CVPR 2024arXiv:2401.05224
#9929

Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle

Hyeokjun Kweon, Jihun Kim, Kuk-Jin Yoon

CVPR 2024
#9930

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

Runze He, Shaofei Huang, Xuecheng Nie et al.

CVPR 2024arXiv:2312.01663
#9931

Construct to Associate: Cooperative Context Learning for Domain Adaptive Point Cloud Segmentation

Guangrui Li

CVPR 2024
#9932

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

Hao Li, Xue Yang, Zhaokai Wang et al.

CVPR 2024arXiv:2312.09238
#9933

Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration

Chen Zhao, Weiling Cai, Chenyu Dong et al.

CVPR 2024arXiv:2311.16845
#9934

Generating Content for HDR Deghosting from Frequency View

Tao Hu, Qingsen Yan, Yuankai Qi et al.

CVPR 2024arXiv:2404.00849
#9935

Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion

Yuanxun Lu, Jingyang Zhang, Shiwei Li et al.

CVPR 2024arXiv:2311.15980
#9936

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers

Sheng Yang, Jiawang Bai, Kuofeng Gao et al.

CVPR 2024
#9937

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning

Sijin Chen, Xin Chen, Chi Zhang et al.

CVPR 2024
#9938

GenTron: Diffusion Transformers for Image and Video Generation

Shoufa Chen, Mengmeng Xu, Jiawei Ren et al.

CVPR 2024arXiv:2312.04557
#9939

Map-Relative Pose Regression for Visual Re-Localization

Shuai Chen, Tommaso Cavallari, Victor Adrian Prisacariu et al.

CVPR 2024highlightarXiv:2404.09884
#9940

Gradient-based Parameter Selection for Efficient Fine-Tuning

Zhi Zhang, Qizhe Zhang, Zijun Gao et al.

CVPR 2024arXiv:2312.10136
#9941

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Willi Menapace, Aliaksandr Siarohin, Ivan Skorokhodov et al.

CVPR 2024highlightarXiv:2402.14797
#9942

Backpropagation-free Network for 3D Test-time Adaptation

YANSHUO WANG, Ali Cheraghian, Zeeshan Hayder et al.

CVPR 2024arXiv:2403.18442
#9943

TransNeXt: Robust Foveal Visual Perception for Vision Transformers

Dai Shi

CVPR 2024arXiv:2311.17132
#9944

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors

Zhipeng Hu, Minda Zhao, Chaoyi Zhao et al.

CVPR 2024arXiv:2308.13223
#9945

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Zigang Geng, Binxin Yang, Tiankai Hang et al.

CVPR 2024arXiv:2309.03895
#9946

HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation

Linglin Jing, Yiming Ding, Yunpeng Gao et al.

CVPR 2024arXiv:2403.16788
#9947

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Minyoung Hwang, Luca Weihs, Chanwoo Park et al.

CVPR 2024arXiv:2312.09337
#9948

Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring

Xiaoqian Lv, Shengping Zhang, Chenyang Wang et al.

CVPR 2024
#9949

Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation

Yuan Xiao, Shiqing Ma, Juan Zhai et al.

CVPR 2024arXiv:2406.00699
#9950

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

Lingteng Qiu, Guanying Chen, Xiaodong Gu et al.

CVPR 2024highlightarXiv:2311.16918
#9951

Robust Synthetic-to-Real Transfer for Stereo Matching

Jiawei Zhang, Jiahe Li, Lei Huang et al.

CVPR 2024arXiv:2403.07705
#9952

Understanding and Improving Source-free Domain Adaptation from a Theoretical Perspective

Yu Mitsuzumi, Akisato Kimura, Hisashi Kashima

CVPR 2024
#9953

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding

Yonglu Li, Xiaoqian Wu, Xinpeng Liu et al.

CVPR 2024highlightarXiv:2304.00553
#9954

LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction

Linqing Zhao, Xiuwei Xu, Ziwei Wang et al.

CVPR 2024
#9955

Overcoming Generic Knowledge Loss with Selective Parameter Update

Wenxuan Zhang, Paul Janson, Rahaf Aljundi et al.

CVPR 2024arXiv:2308.12462
#9956

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Hao Ouyang, Qiuyu Wang, Yuxi Xiao et al.

CVPR 2024highlightarXiv:2308.07926
#9957

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning

Ruyang Liu, Chen Li, Yixiao Ge et al.

CVPR 2024arXiv:2309.15785
#9958

Video Frame Interpolation via Direct Synthesis with the Event-based Reference

Yuhan Liu, Yongjian Deng, Hao Chen et al.

CVPR 2024
#9959

Lane2Seq: Towards Unified Lane Detection via Sequence Generation

Kunyang Zhou

CVPR 2024arXiv:2402.17172
#9960

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

Bo-Yuan Sun, Yuqi Yang, Le Zhang et al.

CVPR 2024arXiv:2306.04300
#9961

Rethinking Boundary Discontinuity Problem for Oriented Object Detection

Hang Xu, Xinyuan Liu, Haonan Xu et al.

CVPR 2024arXiv:2305.10061
#9962

MCNet: Rethinking the Core Ingredients for Accurate and Efficient Homography Estimation

Haokai Zhu, Si-Yuan Cao, Jianxin Hu et al.

CVPR 2024
#9963

UniDepth: Universal Monocular Metric Depth Estimation

Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis et al.

CVPR 2024highlightarXiv:2403.18913
#9964

Diffusion Model Alignment Using Direct Preference Optimization

Bram Wallace, Meihua Dang, Rafael Rafailov et al.

CVPR 2024arXiv:2311.12908
#9965

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching

Xinghui Li, Jingyi Lu, Kai Han et al.

CVPR 2024arXiv:2310.17569
#9966

Uncertainty-Guided Never-Ending Learning to Drive

Lei Lai, Eshed Ohn-Bar, Sanjay Arora et al.

CVPR 2024
#9967

Feedback-Guided Autonomous Driving

Jimuyang Zhang, Zanming Huang, Arijit Ray et al.

CVPR 2024highlight
#9968

Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

Oren Kraus, Kian Kenyon-Dean, Saber Saberian et al.

CVPR 2024highlightarXiv:2404.10242
#9969

Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance

Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Dylan Campbell et al.

CVPR 2024
#9970

Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration

Shihao Zhou, Duosheng Chen, Jinshan Pan et al.

CVPR 2024
#9971

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos

Jiakai Sun, Han Jiao, Guangyuan Li et al.

CVPR 2024highlightarXiv:2403.01444
#9972

LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering

Jaehoon Choi, Rajvi Shah, Qinbo Li et al.

CVPR 2024
#9973

TextCraftor: Your Text Encoder Can be Image Quality Controller

Yanyu Li, Xian Liu, Anil Kag et al.

CVPR 2024arXiv:2403.18978
#9974

Geometry Transfer for Stylizing Radiance Fields

Hyunyoung Jung, Seonghyeon Nam, Nikolaos Sarafianos et al.

CVPR 2024arXiv:2402.00863
#9975

3D Human Pose Perception from Egocentric Stereo Videos

Hiroyasu Akada, Jian Wang, Vladislav Golyanik et al.

CVPR 2024highlightarXiv:2401.00889
#9976

QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

Ishak Ayad, Nicolas Larue, Mai K. Nguyen

CVPR 2024arXiv:2402.17951
#9977

Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation

Biao Gong, Siteng Huang, Yutong Feng et al.

CVPR 2024
#9978

Prompt3D: Random Prompt Assisted Weakly-Supervised 3D Object Detection

Xiaohong Zhang, Huisheng Ye, Jingwen Li et al.

CVPR 2024
#9979

Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation

Keonhee Han, Dominik Muhle, Felix Wimbauer et al.

CVPR 2024arXiv:2404.07933
#9980

Volumetric Environment Representation for Vision-Language Navigation

Liu, Wenguan Wang, Yi Yang

CVPR 2024highlightarXiv:2403.14158
#9981

CrossKD: Cross-Head Knowledge Distillation for Object Detection

JiaBao Wang, yuming chen, Zhaohui Zheng et al.

CVPR 2024arXiv:2306.11369
#9982

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation

Jiaming Liu, Ran Xu, Senqiao Yang et al.

CVPR 2024arXiv:2312.12480
#9983

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch et al.

CVPR 2024arXiv:2404.11120
#9984

Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion

Lalit Manam, Venu Madhav Govindu

CVPR 2024
#9985

CG-HOI: Contact-Guided 3D Human-Object Interaction Generation

Christian Diller, Angela Dai

CVPR 2024arXiv:2311.16097
#9986

Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?

Hanxin Zhu, Tianyu He, Xin Li et al.

CVPR 2024arXiv:2403.06092
#9987

Resurrecting Old Classes with New Data for Exemplar-Free Continual Learning

Dipam Goswami, Albin Soutif, Yuyang Liu et al.

CVPR 2024arXiv:2405.19074
#9988

DIEM: Decomposition-Integration Enhancing Multimodal Insights

Xinyi Jiang, Guoming Wang, Junhao Guo et al.

CVPR 2024
#9989

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

Jiazuo Yu, Yunzhi Zhuge, Lu Zhang et al.

CVPR 2024arXiv:2403.11549
#9990

HOI-M^3: Capture Multiple Humans and Objects Interaction within Contextual Environment

Juze Zhang, Jingyan Zhang, Zining Song et al.

CVPR 2024highlight
#9991

CORES: Convolutional Response-based Score for Out-of-distribution Detection

Keke Tang, Chao Hou, Weilong Peng et al.

CVPR 2024
#9992

Equivariant Multi-Modality Image Fusion

Zixiang Zhao, Haowen Bai, Jiangshe Zhang et al.

CVPR 2024arXiv:2305.11443
#9993

PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation

Jinfeng Xu, Siyuan Yang, Xianzhi Li et al.

CVPR 2024arXiv:2404.00979
#9994

NeISF: Neural Incident Stokes Field for Geometry and Material Estimation

Chenhao Li, Taishi Ono, Takeshi Uemori et al.

CVPR 2024highlightarXiv:2311.13187
#9995

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

Zheng Li, Xiang Li, xinyi fu et al.

CVPR 2024arXiv:2403.02781
#9996

DeMatch: Deep Decomposition of Motion Field for Two-View Correspondence Learning

Shihua Zhang, Zizhuo Li, Yuan Gao et al.

CVPR 2024
#9997

Domain Gap Embeddings for Generative Dataset Augmentation

Yinong Oliver Wang, Younjoon Chung, Chen Henry Wu et al.

CVPR 2024
#9998

Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation

Zhekai Du, Xinyao Li, Fengling Li et al.

CVPR 2024arXiv:2403.02899
#9999

TransLoc4D: Transformer-based 4D Radar Place Recognition

Guohao Peng, Heshan Li, Yangyang Zhao et al.

CVPR 2024
#10000

Higher-order Relational Reasoning for Pedestrian Trajectory Prediction

Sungjune Kim, Hyung-gun Chi, Hyerin Lim et al.

CVPR 2024