Most Cited 2024 "multi-modal crowd counting" Papers

12,324 papers found • Page 28 of 62

Filters:Most Cited 2024 multi-modal crowd counting Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#5401

Learning Invariant Inter-pixel Correlations for Superpixel Generation

Sen Xu, Shikui Wei, Tao Ruan et al.

AAAI 2024paperarXiv:2402.18201

#5402

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.

ECCV 2024arXiv:2403.13745

#5403

Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation

Bharat Srikishan, Anika Tabassum, Srikanth Allu et al.

AAAI 2024paperarXiv:2402.11760

#5404

Visual Redundancy Removal for Composite Images: A Benchmark Dataset and a Multi-Visual-Effects Driven Incremental Method

Miaohui Wang, Rong Zhang, Lirong Huang et al.

AAAI 2024paper

#5405

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024arXiv:2407.04036

#5406

Independency Adversarial Learning for Cross-Modal Sound Separation

Zhenkai Lin, Yanli Ji, Yang Yang

AAAI 2024paper

#5407

Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Zhengliang Shi, Shen Gao, Minghang Zhu et al.

AAAI 2024paperarXiv:2308.14034

#5408

Enhancing Semi-supervised Domain Adaptation via Effective Target Labeling

Jiujun He, Bin Liu, Guosheng Yin

AAAI 2024paper

#5409

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024

#5410

H2GFormer: Horizontal-to-Global Voxel Transformer for 3D Semantic Scene Completion

Yu Wang, Chao Tong

AAAI 2024paper

#5411

Jointly Improving the Sample and Communication Complexities in Decentralized Stochastic Minimax Optimization

Xuan Zhang, Gabriel Mancino-Ball, Necdet Serhat Aybat et al.

AAAI 2024paperarXiv:2307.09421

#5412

FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis

Meizhen Zheng, Peng Bai, Xiaodong Shi et al.

AAAI 2024paper

#5413

Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection

Yuhao Huang, Sanping Zhou, Junjie Zhang et al.

AAAI 2024paperarXiv:2304.02867

#5414

Structural Information Enhanced Graph Representation for Link Prediction

Lei Shi, Bin Hu, Deng Zhao et al.

AAAI 2024paper

#5415

Restoring Images in Adverse Weather Conditions via Histogram Transformer

Shangquan Sun, Wenqi Ren, Xinwei Gao et al.

ECCV 2024arXiv:2407.10172

#5416

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Yue Fan, Xiaojian Ma, Rujie Wu et al.

ECCV 2024arXiv:2403.11481

#5417

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024arXiv:2408.10739

#5418

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024arXiv:2409.17439

#5419

LGMRec: Local and Global Graph Learning for Multimodal Recommendation

Zhiqiang Guo, Jianjun Li, Guohui Li et al.

AAAI 2024paperarXiv:2312.16400

#5420

Data-Driven Knowledge-Aware Inference of Private Information in Continuous Double Auctions

Lvye Cui, Haoran Yu

AAAI 2024paper

#5421

Linear-Time Algorithms for Front-Door Adjustment in Causal Graphs

Marcel Wienöbst, Benito van der Zander, Maciej Liskiewicz

AAAI 2024paperarXiv:2211.16468

#5422

Discriminatively Fuzzy Multi-View K-means Clustering with Local Structure Preserving

Jun Yin, Shiliang Sun, Lai Wei et al.

AAAI 2024paper

#5423

Robust Blind Text Image Deblurring via Maximum Consensus Framework

Zijian Min, Gundu Hassan, GeunSik Jo

AAAI 2024paper

#5424

Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion

Siyuan Shan, Yang Li, Amartya Banerjee et al.

AAAI 2024paperarXiv:2308.06382

#5425

Continuous-Time Graph Representation with Sequential Survival Process

Abdulkadir Celikkanat, Nikolaos Nakis, Morten Mørup

AAAI 2024paperarXiv:2312.13068

#5426

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024

#5427

Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision

Wonjoon Chang, Dahee Kwon, Jaesik Choi

AAAI 2024paperarXiv:2312.17285

#5428

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024

#5429

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt

Jiaqi Liu, Kai Wu, Qiang Nie et al.

AAAI 2024paperarXiv:2401.01010

#5430

Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation

Xinshuo Hu, Dongfang Li, Zihao Zheng et al.

AAAI 2024paperarXiv:2308.08090

#5431

ProAgent: Building Proactive Cooperative Agents with Large Language Models

Ceyao Zhang, Kaijie Yang, Siyi Hu et al.

AAAI 2024paperarXiv:2308.11339

#5432

Encoding Constraints as Binary Constraint Networks Satisfying BTP

AAAI 2024paper

#5433

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024

#5434

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Yunbin Tu, Liang Li, Li Su et al.

ECCV 2024arXiv:2407.11683

#5435

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation

Zhuowei Chen, Shancheng Fang, Wei Liu et al.

AAAI 2024paper

#5436

F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis

Sitong Su, Jianzhi Liu, Lianli Gao et al.

AAAI 2024paperarXiv:2312.03459

#5437

Adaptive Feature Imputation with Latent Graph for Deep Incomplete Multi-View Clustering

Jingyu Pu, Chenhang Cui, Xinyue Chen et al.

AAAI 2024paper

#5438

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947

#5439

Variational Hybrid-Attention Framework for Multi-Label Few-Shot Aspect Category Detection

Cheng Peng, Ke Chen, Lidan Shou et al.

AAAI 2024paper

#5440

SNN-PDE: Learning Dynamic PDEs from Data with Simplicial Neural Networks

Jae Choi, Yuzhou Chen, Huikyo Lee et al.

AAAI 2024paper

#5441

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024arXiv:2404.03575

#5442

Resilience of Entropy Model in Distributed Neural Networks

Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.

ECCV 2024arXiv:2403.00942

#5443

Dynamic Reactive Spiking Graph Neural Network

AAAI 2024paper

#5444

Block Image Compressive Sensing with Local and Global Information Interaction

Xiaoyu Kong, Yongyong Chen, Feng Zheng et al.

AAAI 2024paper

#5445

StockMixer: A Simple Yet Strong MLP-Based Architecture for Stock Price Forecasting

Jinyong Fan, Yanyan Shen

AAAI 2024paper

#5446

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024arXiv:2403.10854

#5447

Efficient Nonparametric Tensor Decomposition for Binary and Count Data

Zerui Tao, Toshihisa Tanaka, Qibin Zhao

AAAI 2024paperarXiv:2401.07711

#5448

Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model

Zhengrui Chen, Liying Lu, Ziyang Yuan et al.

AAAI 2024paperarXiv:2312.12206

#5449

TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions

AAAI 2024paperarXiv:2403.11818

#5450

Learning Domain-Independent Heuristics for Grounded and Lifted Planning

AAAI 2024paperarXiv:2312.11143

#5451

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2407.10151

#5452

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024arXiv:2407.20657

#5453

OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning

Ziyu Shang, Ke Wenjun, Nana Xiu et al.

AAAI 2024paper

#5454

Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation

Matthieu Lin, Jenny Sheng, Yubin Hu et al.

AAAI 2024paper

#5455

DeblurSR: Event-Based Motion Deblurring under the Spiking Representation

Chen Song, Chandrajit Bajaj, Qixing Huang

AAAI 2024paperarXiv:2303.08977

#5456

Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-Agnostic Framework Based on Counterfactual Inference

Weiping Lin, Zhenfeng Zhuang, Lequan Yu et al.

AAAI 2024paper

#5457

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

Fan Wu, Rui Zhang, Qi Yi et al.

AAAI 2024paper

#5458

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024

#5459

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Zifan Wang, Zhuorui Ye, Haoran Wu et al.

AAAI 2024paperarXiv:2312.08054

#5460

Authors

- Xinshu Li, Lina Yao

AAAI 2024paperarXiv:2206.05016

#5461

Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation

AAAI 2024paper

#5462

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024

#5463

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024arXiv:2409.13475

#5464

Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective

Panjian Huang, Yunjie Peng, Saihui Hou et al.

ECCV 2024

#5465

Causal Strategic Learning with Competitive Selection

AAAI 2024paperarXiv:2308.16262

#5466

Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment

AAAI 2024paperarXiv:2403.02698

#5467

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422

#5468

Video-Language Aligned Transformer for Video Question Answering

AAAI 2024paper

#5469

PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

AAAI 2024paperarXiv:2306.08456

#5470

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387

#5471

Efficient Asynchronous Federated Learning with Prospective Momentum Aggregation and Fine-Grained Correction

AAAI 2024paper

#5472

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

AAAI 2024paperarXiv:2312.17492

#5473

On the Structural Hardness of Answer Set Programming: Can Structure Efficiently Confine the Power of Disjunctions?

Markus Hecher, Rafael Kiesel

AAAI 2024paperarXiv:2402.03539

#5474

Multi-View Randomized Kernel Classification via Nonconvex Optimization

AAAI 2024paper

#5475

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024

#5476

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024arXiv:2403.17377

#5477

Are You Concerned about Limited Function Evaluations: Data-Augmented Pareto Set Learning for Expensive Multi-Objective Optimization

AAAI 2024paper

#5478

Enhancing Representation of Spiking Neural Networks via Similarity-Sensitive Contrastive Learning

AAAI 2024paper

#5479

Learning Coalition Structures with Games

AAAI 2024paperarXiv:2312.09058

#5480

On Unsupervised Domain Adaptation: Pseudo Label Guided Mixup for Adversarial Prompt Tuning

AAAI 2024paper

#5481

Distribution-Conditioned Adversarial Variational Autoencoder for Valid Instrumental Variable Generation

AAAI 2024paper

#5482

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024arXiv:2402.19091

#5483

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024arXiv:2407.02797

#5484

Transfer and Alignment Network for Generalized Category Discovery

Wenbin An, Feng Tian, Wenkai Shi et al.

AAAI 2024paperarXiv:2312.16467

#5485

FedCD: Federated Semi-supervised Learning with Class Awareness Balance via Dual Teachers

Yuzhi Liu, Huisi Wu, Jing Qin

AAAI 2024paper

#5486

Semi-supervised TEE Segmentation via Interacting with SAM Equipped with Noise-Resilient Prompting

Sen Deng, Yidan Feng, Haoneng Lin et al.

AAAI 2024paper

#5487

Prompting Multi-Modal Image Segmentation with Semantic Grouping

AAAI 2024paper

#5488

TMFormer: Token Merging Transformer for Brain Tumor Segmentation with Missing Modalities

Zheyu Zhang, Gang Yang, Yueyi Zhang et al.

AAAI 2024paper

#5489

Multiscale Attention Wavelet Neural Operator for Capturing Steep Trajectories in Biochemical Systems

Jiayang Su, Junbo Ma, Songyang Tong et al.

AAAI 2024paper

#5490

An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization

Yuze Tan, Hecheng Cai, Shudong Huang et al.

AAAI 2024paper

#5491

Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning

Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park

ECCV 2024arXiv:2409.10956

#5492

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2312.14232

#5493

Primitive-Based 3D Human-Object Interaction Modelling and Programming

Siqi Liu, Yong-Lu Li, Zhou FANG et al.

AAAI 2024paperarXiv:2312.10714

#5494

Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference

Hongda Sun, Hongzhan Lin, Rui Yan

AAAI 2024paperarXiv:2312.14646

#5495

Decentralized Sum-of-Nonconvex Optimization

Zhuanghua Liu, Bryan Kian Hsiang Low

AAAI 2024paperarXiv:2402.02356

#5496

All Beings Are Equal in Open Set Recognition

Chaohua Li, Enhao Zhang, Chuanxing Geng et al.

AAAI 2024paperarXiv:2401.17654

#5497

PRP Rebooted: Advancing the State of the Art in FOND Planning

Christian Muise, Sheila McIlraith, J. Christopher Beck

AAAI 2024paperarXiv:2312.11675

#5498

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966

#5499

ExpeL: LLM Agents Are Experiential Learners

Andrew Zhao, Daniel Huang, Quentin Xu et al.

AAAI 2024paperarXiv:2308.10144

#5500

Multi-Cross Sampling and Frequency-Division Reconstruction for Image Compressed Sensing

Heping Song, Jingyao Gong, Hongying Meng et al.

AAAI 2024paper

#5501

Electron Microscopy Images as Set of Fragments for Mitochondrial Segmentation

Naisong Luo, Rui Sun, Yuwen Pan et al.

AAAI 2024paper

#5502

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024paperarXiv:2312.05784

#5503

MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models

Yan Cai, Linlin Wang, Ye Wang et al.

AAAI 2024paperarXiv:2312.12806

#5504

Sampling for Beyond-Worst-Case Online Ranking

Qingyun Chen, Sungjin Im, Benjamin Moseley et al.

AAAI 2024paper

#5505

PMET: Precise Model Editing in a Transformer

Xiaopeng Li, Shasha Li, Shezheng Song et al.

AAAI 2024paperarXiv:2308.08742

#5506

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024

#5507

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa, Shuming Liu, Bernard Ghanem

ECCV 2024arXiv:2407.13036

#5508

Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes

Yotam Amitai, Yael Friedler, Ofra Amir

AAAI 2024paperarXiv:2312.11118

#5509

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model

Pengwei Yin, Guanzhong Zeng, Jingjing Wang et al.

AAAI 2024paperarXiv:2403.05124

#5510

Point Deformable Network with Enhanced Normal Embedding for Point Cloud Analysis

Xingyilang Yin, Xi Yang, Liangchen Liu et al.

AAAI 2024paperarXiv:2312.13071

#5511

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge

Xuan Shen, Peiyan Dong, Lei Lu et al.

AAAI 2024paperarXiv:2312.05693

#5512

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024

#5513

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024

#5514

A Direct Approach to Viewing Graph Solvability

Federica Arrigoni, Andrea Fusiello, Tomas Pajdla

ECCV 2024

#5515

Selective Deep Autoencoder for Unsupervised Feature Selection

Wael Hassanieh, Abdallah Chehade

AAAI 2024paper

#5516

Variable Importance in High-Dimensional Settings Requires Grouping

Yifan Lu, Ziqi Zhang, Chunfeng Yuan et al.

AAAI 2024paper

#5517

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024

#5518

EG-NAS: Neural Architecture Search with Fast Evolutionary Exploration

AAAI 2024paper

#5519

Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering

AAAI 2024paper

#5520

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024

#5521

SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation

AAAI 2024paperarXiv:2401.11719

#5522

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

AAAI 2024paperarXiv:2312.15720

#5523

Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment

Luyao Wang, Pengnian Qi, Xigang Bao et al.

AAAI 2024paperarXiv:2403.01203

#5524

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024

#5525

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024arXiv:2312.08977

#5526

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks

Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.

ECCV 2024

#5527

Cocktail Universal Adversarial Attack on Deep Neural Networks

Shaoxin Li, Xiaofeng Liao, Xin Che et al.

ECCV 2024

#5528

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.

ECCV 2024arXiv:2401.05675

#5529

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024arXiv:2407.19675

#5530

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.

ECCV 2024

#5531

Orthogonal Dictionary Guided Shape Completion Network for Point Cloud

Pingping Cai, Deja Scott, Xiaoguang Li et al.

AAAI 2024paper

#5532

Progressive High-Frequency Reconstruction for Pan-Sharpening with Implicit Neural Representation

Ge Meng, Jingjia Huang, Yingying Wang et al.

AAAI 2024paper

#5533

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068

#5534

What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

XiaoHui Zhang, Jiangyan Yi, Chenglong Wang et al.

AAAI 2024paperarXiv:2312.09651

#5535

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated

Katherine Metcalf, Miguel Sarabia, Masha Fedzechkina et al.

AAAI 2024paper

#5536

GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework

Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.

AAAI 2024paperarXiv:2312.16429

#5537

Beyond Grounding: Extracting Fine-Grained Event Hierarchies across Modalities

Hammad Ayyubi, Christopher Thomas, Lovish Chum et al.

AAAI 2024paperarXiv:2206.07207

#5538

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763

#5539

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024arXiv:2404.07389

#5540

Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution

Emily McMilin

AAAI 2024paperarXiv:2210.00131

#5541

Enhancing Bilingual Lexicon Induction via Bi-directional Translation Pair Retrieving

Ding Qiuyu, Hailong Cao, Tiejun Zhao

AAAI 2024paper

#5542

Graph Reasoning Transformers for Knowledge-Aware Question Answering

Ruilin Zhao, Feng Zhao, Liang Hu et al.

AAAI 2024paper

#5543

Multi-Modal Hallucination Control by Visual Information Grounding

Alessandro Favero, Luca Zancato, Matthew Trager et al.

CVPR 2024arXiv:2403.14003

#5544

Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

Hao Wu, Yuxuan Liang, Wei Xiong et al.

AAAI 2024paper

#5545

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024

#5546

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024arXiv:2407.17671

#5547

Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels

Zhuohong Li, Wei He, Jiepan Li et al.

CVPR 2024highlightarXiv:2403.02746

#5548

Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement

Kangmin Xu, Liang Liao, Jing Xiao et al.

CVPR 2024

#5549

Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens

Zhiwen Chen, Zhiyu Zhu, Yifan Zhang et al.

CVPR 2024

#5550

IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM

Minghao Yin, Shangzhe Wu, Kai Han

CVPR 2024

#5551

MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling

Xuzhe Zhang, Yuhao Wu, Elsa Angelini et al.

CVPR 2024arXiv:2303.09373

#5552

Anomaly Heterogeneity Learning for Open-set Supervised Anomaly Detection

Jiawen Zhu, Choubo Ding, Yu Tian et al.

CVPR 2024arXiv:2310.12790

#5553

Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context

Haochong Xia, Shuo Sun, Xinrun Wang et al.

AAAI 2024paperarXiv:2309.07708

#5554

Fast Adaptation for Human Pose Estimation via Meta-Optimization

Shengxiang Hu, Huaijiang Sun, Bin Li et al.

CVPR 2024

#5555

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Dian Zheng, Xiao-Ming Wu, Shuzhou Yang et al.

CVPR 2024arXiv:2403.11157

#5556

Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention

Xin Yang, Wending Yan, Yuan Yuan et al.

AAAI 2024paperarXiv:2401.07459

#5557

Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data

Sai Niranjan Ramachandran, Rudrabha Mukhopadhyay, Madhav Agarwal et al.

AAAI 2024paper

#5558

Purified and Unified Steganographic Network

GuoBiao Li, Sheng Li, Zicong Luo et al.

CVPR 2024arXiv:2402.17210

#5559

Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud Attack

Daizong Liu, Wei Hu

AAAI 2024paper

#5560

KVQ: Kwai Video Quality Assessment for Short-form Videos

Yiting Lu, Xin Li, Yajing Pei et al.

CVPR 2024arXiv:2402.07220

#5561

Consistent Prompting for Rehearsal-Free Continual Learning

Zhanxin Gao, Jun Cen, Xiaobin Chang

CVPR 2024arXiv:2403.08568

#5562

ModWaveMLP: MLP-Based Mode Decomposition and Wavelet Denoising Model to Defeat Complex Structures in Traffic Forecasting

Ke Sun, Pei Liu, Pengfei Li et al.

AAAI 2024paper

#5563

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024arXiv:2409.07808

#5564

Mean-Shift Feature Transformer

Takumi Kobayashi

CVPR 2024

#5565

Tactile-Augmented Radiance Fields

Yiming Dou, Fengyu Yang, Yi Liu et al.

CVPR 2024arXiv:2405.04534

#5566

SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks

Xinyu Shi, Zecheng Hao, Zhaofei Yu

CVPR 2024arXiv:2403.14302

#5567

Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation

Daehee Park, Jaewoo Jeong, Kuk-Jin Yoon

AAAI 2024paperarXiv:2312.15906

#5568

One-Shot Open Affordance Learning with Foundation Models

Gen Li, Deqing Sun, Laura Sevilla-Lara et al.

CVPR 2024arXiv:2311.17776

#5569

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Yijun Yang, Tianyi Zhou, kanxue Li et al.

CVPR 2024arXiv:2311.16714

#5570

Hypercorrelation Evolution for Video Class-Incremental Learning

Sen Liang, Kai Zhu, Wei Zhai et al.

AAAI 2024paper

#5571

Inter-X: Towards Versatile Human-Human Interaction Analysis

Liang Xu, Xintao Lv, Yichao Yan et al.

CVPR 2024arXiv:2312.16051

#5572

Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos

Chen Liu, Peike Li, Qingtao Yu et al.

CVPR 2024

#5573

Information Design for Congestion Games with Unknown Demand

Svenja M. Griesbach, Martin Hoefer, Max Klimm et al.

AAAI 2024paperarXiv:2310.08314

#5574

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730

#5575

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457

#5576

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024

#5577

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024arXiv:2407.10330

#5578

DiVAS: Video and Audio Synchronization with Dynamic Frame Rates

Clara Maria Fernandez Labrador, Mertcan Akcay, Eitan Abecassis et al.

CVPR 2024

#5579

Holodeck: Language Guided Generation of 3D Embodied AI Environments

Yue Yang, Fan-Yun Sun, Luca Weihs et al.

CVPR 2024arXiv:2312.09067

#5580

PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation

Ardian Umam, Cheng-Kun Yang, Min-Hung Chen et al.

CVPR 2024arXiv:2312.04016

#5581

Detector-Free Structure from Motion

Xingyi He, Jiaming Sun, Yifan Wang et al.

CVPR 2024arXiv:2306.15669

#5582

Rethinking Human Motion Prediction with Symplectic Integral

Haipeng Chen, Kedi L yu, Zhenguang Liu et al.

CVPR 2024

#5583

Double Buffers CEM-TD3: More Efficient Evolution and Richer Exploration

Sheng Zhu, Chun Shen, Shuai Lü et al.

AAAI 2024paper

#5584

iToF-flow-based High Frame Rate Depth Imaging

Yu Meng, Zhou Xue, Xu Chang et al.

CVPR 2024

#5585

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing

Fan Yang, Tianyi Chen, XIAOSHENG HE et al.

CVPR 2024arXiv:2312.02209

#5586

C3: High-Performance and Low-Complexity Neural Compression from a Single Image or Video

Hyunjik Kim, Matthias Bauer, Lucas Theis et al.

CVPR 2024arXiv:2312.02753

#5587

Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution

Longguang Wang, Juncheng Li, Yingqian Wang et al.

CVPR 2024

#5588

L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream

Jingtao Sun, Yaonan Wang, Mingtao Feng et al.

CVPR 2024

#5589

On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

Xiaobao Wu, Fengjun Pan, Thong Nguyen et al.

AAAI 2024paperarXiv:2401.14113

#5590

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number

Ke Fan, Zechen Bai, Tianjun Xiao et al.

CVPR 2024arXiv:2406.09196

#5591

Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation

Jin Wang, Bingfeng Zhang, Jian Pang et al.

CVPR 2024arXiv:2405.08458

#5592

LiSA: LiDAR Localization with Semantic Awareness

Bochun Yang, Zijun Li, Wen Li et al.

CVPR 2024highlight

#5593

MmAP: Multi-Modal Alignment Prompt for Cross-Domain Multi-Task Learning

Yi Xin, Junlong Du, Qiang Wang et al.

AAAI 2024paperarXiv:2312.08636

#5594

Teaching Large Language Models to Translate with Comparison

Jiali Zeng, Fandong Meng, Yongjing Yin et al.

AAAI 2024paperarXiv:2307.04408

#5595

CausalPC: Improving the Robustness of Point Cloud Classification by Causal Effect Identification

Yuanmin Huang, Mi Zhang, Daizong Ding et al.

CVPR 2024

#5596

Adapting to Length Shift: FlexiLength Network for Trajectory Prediction

Yi Xu, Yun Fu

CVPR 2024arXiv:2404.00742

#5597

Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation

Dong Lao, Congli Wang, Alex Wong et al.

CVPR 2024highlightarXiv:2405.03662

#5598

Instruct-Imagen: Image Generation with Multi-modal Instruction

Hexiang Hu, Kelvin C.K. Chan, Yu-Chuan Su et al.

CVPR 2024arXiv:2401.01952

#5599

Rapid Motor Adaptation for Robotic Manipulator Arms

Yichao Liang, Kevin Ellis, João F. Henriques

CVPR 2024arXiv:2312.04670

#5600

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation

Yiying Yang, Fukun Yin, Wen Liu et al.

AAAI 2024paper

← Previous

1...26 27 28 29 30...62