Most Cited 2024 "multi-modal crowd counting" Papers

12,324 papers found • Page 28 of 62

#5401

Learning Invariant Inter-pixel Correlations for Superpixel Generation

Sen Xu, Shikui Wei, Tao Ruan et al.

AAAI 2024paperarXiv:2402.18201
#5402

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.

ECCV 2024arXiv:2403.13745
#5403

Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation

Bharat Srikishan, Anika Tabassum, Srikanth Allu et al.

AAAI 2024paperarXiv:2402.11760
#5404

Visual Redundancy Removal for Composite Images: A Benchmark Dataset and a Multi-Visual-Effects Driven Incremental Method

Miaohui Wang, Rong Zhang, Lirong Huang et al.

AAAI 2024paper
#5405

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024arXiv:2407.04036
#5406

Independency Adversarial Learning for Cross-Modal Sound Separation

Zhenkai Lin, Yanli Ji, Yang Yang

AAAI 2024paper
#5407

Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Zhengliang Shi, Shen Gao, Minghang Zhu et al.

AAAI 2024paperarXiv:2308.14034
#5408

Enhancing Semi-supervised Domain Adaptation via Effective Target Labeling

Jiujun He, Bin Liu, Guosheng Yin

AAAI 2024paper
#5409

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024
#5410

H2GFormer: Horizontal-to-Global Voxel Transformer for 3D Semantic Scene Completion

Yu Wang, Chao Tong

AAAI 2024paper
#5411

Jointly Improving the Sample and Communication Complexities in Decentralized Stochastic Minimax Optimization

Xuan Zhang, Gabriel Mancino-Ball, Necdet Serhat Aybat et al.

AAAI 2024paperarXiv:2307.09421
#5412

FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis

Meizhen Zheng, Peng Bai, Xiaodong Shi et al.

AAAI 2024paper
#5413

Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection

Yuhao Huang, Sanping Zhou, Junjie Zhang et al.

AAAI 2024paperarXiv:2304.02867
#5414

Structural Information Enhanced Graph Representation for Link Prediction

Lei Shi, Bin Hu, Deng Zhao et al.

AAAI 2024paper
#5415

Restoring Images in Adverse Weather Conditions via Histogram Transformer

Shangquan Sun, Wenqi Ren, Xinwei Gao et al.

ECCV 2024arXiv:2407.10172
#5416

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Yue Fan, Xiaojian Ma, Rujie Wu et al.

ECCV 2024arXiv:2403.11481
#5417

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024arXiv:2408.10739
#5418

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024arXiv:2409.17439
#5419

LGMRec: Local and Global Graph Learning for Multimodal Recommendation

Zhiqiang Guo, Jianjun Li, Guohui Li et al.

AAAI 2024paperarXiv:2312.16400
#5420

Data-Driven Knowledge-Aware Inference of Private Information in Continuous Double Auctions

Lvye Cui, Haoran Yu

AAAI 2024paper
#5421

Linear-Time Algorithms for Front-Door Adjustment in Causal Graphs

Marcel Wienöbst, Benito van der Zander, Maciej Liskiewicz

AAAI 2024paperarXiv:2211.16468
#5422

Discriminatively Fuzzy Multi-View K-means Clustering with Local Structure Preserving

Jun Yin, Shiliang Sun, Lai Wei et al.

AAAI 2024paper
#5423

Robust Blind Text Image Deblurring via Maximum Consensus Framework

Zijian Min, Gundu Hassan, GeunSik Jo

AAAI 2024paper
#5424

Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion

Siyuan Shan, Yang Li, Amartya Banerjee et al.

AAAI 2024paperarXiv:2308.06382
#5425

Continuous-Time Graph Representation with Sequential Survival Process

Abdulkadir Celikkanat, Nikolaos Nakis, Morten Mørup

AAAI 2024paperarXiv:2312.13068
#5426

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024
#5427

Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision

Wonjoon Chang, Dahee Kwon, Jaesik Choi

AAAI 2024paperarXiv:2312.17285
#5428

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024
#5429

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt

Jiaqi Liu, Kai Wu, Qiang Nie et al.

AAAI 2024paperarXiv:2401.01010
#5430

Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation

Xinshuo Hu, Dongfang Li, Zihao Zheng et al.

AAAI 2024paperarXiv:2308.08090
#5431

ProAgent: Building Proactive Cooperative Agents with Large Language Models

Ceyao Zhang, Kaijie Yang, Siyi Hu et al.

AAAI 2024paperarXiv:2308.11339
#5432

Encoding Constraints as Binary Constraint Networks Satisfying BTP

AAAI 2024paper
#5433

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024
#5434

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Yunbin Tu, Liang Li, Li Su et al.

ECCV 2024arXiv:2407.11683
#5435

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation

Zhuowei Chen, Shancheng Fang, Wei Liu et al.

AAAI 2024paper
#5436

F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis

Sitong Su, Jianzhi Liu, Lianli Gao et al.

AAAI 2024paperarXiv:2312.03459
#5437

Adaptive Feature Imputation with Latent Graph for Deep Incomplete Multi-View Clustering

Jingyu Pu, Chenhang Cui, Xinyue Chen et al.

AAAI 2024paper
#5438

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947
#5439

Variational Hybrid-Attention Framework for Multi-Label Few-Shot Aspect Category Detection

Cheng Peng, Ke Chen, Lidan Shou et al.

AAAI 2024paper
#5440

SNN-PDE: Learning Dynamic PDEs from Data with Simplicial Neural Networks

Jae Choi, Yuzhou Chen, Huikyo Lee et al.

AAAI 2024paper
#5441

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024arXiv:2404.03575
#5442

Resilience of Entropy Model in Distributed Neural Networks

Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.

ECCV 2024arXiv:2403.00942
#5443

Dynamic Reactive Spiking Graph Neural Network

AAAI 2024paper
#5444

Block Image Compressive Sensing with Local and Global Information Interaction

Xiaoyu Kong, Yongyong Chen, Feng Zheng et al.

AAAI 2024paper
#5445

StockMixer: A Simple Yet Strong MLP-Based Architecture for Stock Price Forecasting

Jinyong Fan, Yanyan Shen

AAAI 2024paper
#5446

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024arXiv:2403.10854
#5447

Efficient Nonparametric Tensor Decomposition for Binary and Count Data

Zerui Tao, Toshihisa Tanaka, Qibin Zhao

AAAI 2024paperarXiv:2401.07711
#5448

Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model

Zhengrui Chen, Liying Lu, Ziyang Yuan et al.

AAAI 2024paperarXiv:2312.12206
#5449

TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions

AAAI 2024paperarXiv:2403.11818
#5450

Learning Domain-Independent Heuristics for Grounded and Lifted Planning

AAAI 2024paperarXiv:2312.11143
#5451

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2407.10151
#5452

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024arXiv:2407.20657
#5453

OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning

Ziyu Shang, Ke Wenjun, Nana Xiu et al.

AAAI 2024paper
#5454

Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation

Matthieu Lin, Jenny Sheng, Yubin Hu et al.

AAAI 2024paper
#5455

DeblurSR: Event-Based Motion Deblurring under the Spiking Representation

Chen Song, Chandrajit Bajaj, Qixing Huang

AAAI 2024paperarXiv:2303.08977
#5456

Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-Agnostic Framework Based on Counterfactual Inference

Weiping Lin, Zhenfeng Zhuang, Lequan Yu et al.

AAAI 2024paper
#5457

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

Fan Wu, Rui Zhang, Qi Yi et al.

AAAI 2024paper
#5458

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024
#5459

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Zifan Wang, Zhuorui Ye, Haoran Wu et al.

AAAI 2024paperarXiv:2312.08054
#5460

Authors

- Xinshu Li, Lina Yao

AAAI 2024paperarXiv:2206.05016
#5461

Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation

AAAI 2024paper
#5462

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024
#5463

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024arXiv:2409.13475
#5464

Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective

Panjian Huang, Yunjie Peng, Saihui Hou et al.

ECCV 2024
#5465

Causal Strategic Learning with Competitive Selection

AAAI 2024paperarXiv:2308.16262
#5466

Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment

AAAI 2024paperarXiv:2403.02698
#5467

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422
#5468

Video-Language Aligned Transformer for Video Question Answering

AAAI 2024paper
#5469

PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

AAAI 2024paperarXiv:2306.08456
#5470

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387
#5471

Efficient Asynchronous Federated Learning with Prospective Momentum Aggregation and Fine-Grained Correction

AAAI 2024paper
#5472

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

AAAI 2024paperarXiv:2312.17492
#5473

On the Structural Hardness of Answer Set Programming: Can Structure Efficiently Confine the Power of Disjunctions?

Markus Hecher, Rafael Kiesel

AAAI 2024paperarXiv:2402.03539
#5474

Multi-View Randomized Kernel Classification via Nonconvex Optimization

AAAI 2024paper
#5475

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024
#5476

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024arXiv:2403.17377
#5477

Are You Concerned about Limited Function Evaluations: Data-Augmented Pareto Set Learning for Expensive Multi-Objective Optimization

AAAI 2024paper
#5478

Enhancing Representation of Spiking Neural Networks via Similarity-Sensitive Contrastive Learning

AAAI 2024paper
#5479

Learning Coalition Structures with Games

AAAI 2024paperarXiv:2312.09058
#5480

On Unsupervised Domain Adaptation: Pseudo Label Guided Mixup for Adversarial Prompt Tuning

AAAI 2024paper
#5481

Distribution-Conditioned Adversarial Variational Autoencoder for Valid Instrumental Variable Generation

AAAI 2024paper
#5482

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024arXiv:2402.19091
#5483

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024arXiv:2407.02797
#5484

Transfer and Alignment Network for Generalized Category Discovery

Wenbin An, Feng Tian, Wenkai Shi et al.

AAAI 2024paperarXiv:2312.16467
#5485

FedCD: Federated Semi-supervised Learning with Class Awareness Balance via Dual Teachers

Yuzhi Liu, Huisi Wu, Jing Qin

AAAI 2024paper
#5486

Semi-supervised TEE Segmentation via Interacting with SAM Equipped with Noise-Resilient Prompting

Sen Deng, Yidan Feng, Haoneng Lin et al.

AAAI 2024paper
#5487

Prompting Multi-Modal Image Segmentation with Semantic Grouping

AAAI 2024paper
#5488

TMFormer: Token Merging Transformer for Brain Tumor Segmentation with Missing Modalities

Zheyu Zhang, Gang Yang, Yueyi Zhang et al.

AAAI 2024paper
#5489

Multiscale Attention Wavelet Neural Operator for Capturing Steep Trajectories in Biochemical Systems

Jiayang Su, Junbo Ma, Songyang Tong et al.

AAAI 2024paper
#5490

An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization

Yuze Tan, Hecheng Cai, Shudong Huang et al.

AAAI 2024paper
#5491

Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning

Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park

ECCV 2024arXiv:2409.10956
#5492

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2312.14232
#5493

Primitive-Based 3D Human-Object Interaction Modelling and Programming

Siqi Liu, Yong-Lu Li, Zhou FANG et al.

AAAI 2024paperarXiv:2312.10714
#5494

Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference

Hongda Sun, Hongzhan Lin, Rui Yan

AAAI 2024paperarXiv:2312.14646
#5495

Decentralized Sum-of-Nonconvex Optimization

Zhuanghua Liu, Bryan Kian Hsiang Low

AAAI 2024paperarXiv:2402.02356
#5496

All Beings Are Equal in Open Set Recognition

Chaohua Li, Enhao Zhang, Chuanxing Geng et al.

AAAI 2024paperarXiv:2401.17654
#5497

PRP Rebooted: Advancing the State of the Art in FOND Planning

Christian Muise, Sheila McIlraith, J. Christopher Beck

AAAI 2024paperarXiv:2312.11675
#5498

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966
#5499

ExpeL: LLM Agents Are Experiential Learners

Andrew Zhao, Daniel Huang, Quentin Xu et al.

AAAI 2024paperarXiv:2308.10144
#5500

Multi-Cross Sampling and Frequency-Division Reconstruction for Image Compressed Sensing

Heping Song, Jingyao Gong, Hongying Meng et al.

AAAI 2024paper
#5501

Electron Microscopy Images as Set of Fragments for Mitochondrial Segmentation

Naisong Luo, Rui Sun, Yuwen Pan et al.

AAAI 2024paper
#5502

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024paperarXiv:2312.05784
#5503

MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models

Yan Cai, Linlin Wang, Ye Wang et al.

AAAI 2024paperarXiv:2312.12806
#5504

Sampling for Beyond-Worst-Case Online Ranking

Qingyun Chen, Sungjin Im, Benjamin Moseley et al.

AAAI 2024paper
#5505

PMET: Precise Model Editing in a Transformer

Xiaopeng Li, Shasha Li, Shezheng Song et al.

AAAI 2024paperarXiv:2308.08742
#5506

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024
#5507

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa, Shuming Liu, Bernard Ghanem

ECCV 2024arXiv:2407.13036
#5508

Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes

Yotam Amitai, Yael Friedler, Ofra Amir

AAAI 2024paperarXiv:2312.11118
#5509

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model

Pengwei Yin, Guanzhong Zeng, Jingjing Wang et al.

AAAI 2024paperarXiv:2403.05124
#5510

Point Deformable Network with Enhanced Normal Embedding for Point Cloud Analysis

Xingyilang Yin, Xi Yang, Liangchen Liu et al.

AAAI 2024paperarXiv:2312.13071
#5511

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge

Xuan Shen, Peiyan Dong, Lei Lu et al.

AAAI 2024paperarXiv:2312.05693
#5512

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024
#5513

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024
#5514

A Direct Approach to Viewing Graph Solvability

Federica Arrigoni, Andrea Fusiello, Tomas Pajdla

ECCV 2024
#5515

Selective Deep Autoencoder for Unsupervised Feature Selection

Wael Hassanieh, Abdallah Chehade

AAAI 2024paper
#5516

Variable Importance in High-Dimensional Settings Requires Grouping

Yifan Lu, Ziqi Zhang, Chunfeng Yuan et al.

AAAI 2024paper
#5517

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024
#5518

EG-NAS: Neural Architecture Search with Fast Evolutionary Exploration

AAAI 2024paper
#5519

Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering

AAAI 2024paper
#5520

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024
#5521

SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation

AAAI 2024paperarXiv:2401.11719
#5522

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

AAAI 2024paperarXiv:2312.15720
#5523

Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment

Luyao Wang, Pengnian Qi, Xigang Bao et al.

AAAI 2024paperarXiv:2403.01203
#5524

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024
#5525

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024arXiv:2312.08977
#5526

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks

Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.

ECCV 2024
#5527

Cocktail Universal Adversarial Attack on Deep Neural Networks

Shaoxin Li, Xiaofeng Liao, Xin Che et al.

ECCV 2024
#5528

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.

ECCV 2024arXiv:2401.05675
#5529

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024arXiv:2407.19675
#5530

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.

ECCV 2024
#5531

Orthogonal Dictionary Guided Shape Completion Network for Point Cloud

Pingping Cai, Deja Scott, Xiaoguang Li et al.

AAAI 2024paper
#5532

Progressive High-Frequency Reconstruction for Pan-Sharpening with Implicit Neural Representation

Ge Meng, Jingjia Huang, Yingying Wang et al.

AAAI 2024paper
#5533

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068
#5534

What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

XiaoHui Zhang, Jiangyan Yi, Chenglong Wang et al.

AAAI 2024paperarXiv:2312.09651
#5535

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated

Katherine Metcalf, Miguel Sarabia, Masha Fedzechkina et al.

AAAI 2024paper
#5536

GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework

Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.

AAAI 2024paperarXiv:2312.16429
#5537

Beyond Grounding: Extracting Fine-Grained Event Hierarchies across Modalities

Hammad Ayyubi, Christopher Thomas, Lovish Chum et al.

AAAI 2024paperarXiv:2206.07207
#5538

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763
#5539

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024arXiv:2404.07389
#5540

Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution

Emily McMilin

AAAI 2024paperarXiv:2210.00131
#5541

Enhancing Bilingual Lexicon Induction via Bi-directional Translation Pair Retrieving

Ding Qiuyu, Hailong Cao, Tiejun Zhao

AAAI 2024paper
#5542

Graph Reasoning Transformers for Knowledge-Aware Question Answering

Ruilin Zhao, Feng Zhao, Liang Hu et al.

AAAI 2024paper
#5543

Multi-Modal Hallucination Control by Visual Information Grounding

Alessandro Favero, Luca Zancato, Matthew Trager et al.

CVPR 2024arXiv:2403.14003
#5544

Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

Hao Wu, Yuxuan Liang, Wei Xiong et al.

AAAI 2024paper
#5545

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024
#5546

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024arXiv:2407.17671
#5547

Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels

Zhuohong Li, Wei He, Jiepan Li et al.

CVPR 2024highlightarXiv:2403.02746
#5548

Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement

Kangmin Xu, Liang Liao, Jing Xiao et al.

CVPR 2024
#5549

Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens

Zhiwen Chen, Zhiyu Zhu, Yifan Zhang et al.

CVPR 2024
#5550

IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM

Minghao Yin, Shangzhe Wu, Kai Han

CVPR 2024
#5551

MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling

Xuzhe Zhang, Yuhao Wu, Elsa Angelini et al.

CVPR 2024arXiv:2303.09373
#5552

Anomaly Heterogeneity Learning for Open-set Supervised Anomaly Detection

Jiawen Zhu, Choubo Ding, Yu Tian et al.

CVPR 2024arXiv:2310.12790
#5553

Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context

Haochong Xia, Shuo Sun, Xinrun Wang et al.

AAAI 2024paperarXiv:2309.07708
#5554

Fast Adaptation for Human Pose Estimation via Meta-Optimization

Shengxiang Hu, Huaijiang Sun, Bin Li et al.

CVPR 2024
#5555

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Dian Zheng, Xiao-Ming Wu, Shuzhou Yang et al.

CVPR 2024arXiv:2403.11157
#5556

Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention

Xin Yang, Wending Yan, Yuan Yuan et al.

AAAI 2024paperarXiv:2401.07459
#5557

Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data

Sai Niranjan Ramachandran, Rudrabha Mukhopadhyay, Madhav Agarwal et al.

AAAI 2024paper
#5558

Purified and Unified Steganographic Network

GuoBiao Li, Sheng Li, Zicong Luo et al.

CVPR 2024arXiv:2402.17210
#5559

Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud Attack

Daizong Liu, Wei Hu

AAAI 2024paper
#5560

KVQ: Kwai Video Quality Assessment for Short-form Videos

Yiting Lu, Xin Li, Yajing Pei et al.

CVPR 2024arXiv:2402.07220
#5561

Consistent Prompting for Rehearsal-Free Continual Learning

Zhanxin Gao, Jun Cen, Xiaobin Chang

CVPR 2024arXiv:2403.08568
#5562

ModWaveMLP: MLP-Based Mode Decomposition and Wavelet Denoising Model to Defeat Complex Structures in Traffic Forecasting

Ke Sun, Pei Liu, Pengfei Li et al.

AAAI 2024paper
#5563

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024arXiv:2409.07808
#5564

Mean-Shift Feature Transformer

Takumi Kobayashi

CVPR 2024
#5565

Tactile-Augmented Radiance Fields

Yiming Dou, Fengyu Yang, Yi Liu et al.

CVPR 2024arXiv:2405.04534
#5566

SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks

Xinyu Shi, Zecheng Hao, Zhaofei Yu

CVPR 2024arXiv:2403.14302
#5567

Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation

Daehee Park, Jaewoo Jeong, Kuk-Jin Yoon

AAAI 2024paperarXiv:2312.15906
#5568

One-Shot Open Affordance Learning with Foundation Models

Gen Li, Deqing Sun, Laura Sevilla-Lara et al.

CVPR 2024arXiv:2311.17776
#5569

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Yijun Yang, Tianyi Zhou, kanxue Li et al.

CVPR 2024arXiv:2311.16714
#5570

Hypercorrelation Evolution for Video Class-Incremental Learning

Sen Liang, Kai Zhu, Wei Zhai et al.

AAAI 2024paper
#5571

Inter-X: Towards Versatile Human-Human Interaction Analysis

Liang Xu, Xintao Lv, Yichao Yan et al.

CVPR 2024arXiv:2312.16051
#5572

Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos

Chen Liu, Peike Li, Qingtao Yu et al.

CVPR 2024
#5573

Information Design for Congestion Games with Unknown Demand

Svenja M. Griesbach, Martin Hoefer, Max Klimm et al.

AAAI 2024paperarXiv:2310.08314
#5574

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730
#5575

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457
#5576

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024
#5577

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024arXiv:2407.10330
#5578

DiVAS: Video and Audio Synchronization with Dynamic Frame Rates

Clara Maria Fernandez Labrador, Mertcan Akcay, Eitan Abecassis et al.

CVPR 2024
#5579

Holodeck: Language Guided Generation of 3D Embodied AI Environments

Yue Yang, Fan-Yun Sun, Luca Weihs et al.

CVPR 2024arXiv:2312.09067
#5580

PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation

Ardian Umam, Cheng-Kun Yang, Min-Hung Chen et al.

CVPR 2024arXiv:2312.04016
#5581

Detector-Free Structure from Motion

Xingyi He, Jiaming Sun, Yifan Wang et al.

CVPR 2024arXiv:2306.15669
#5582

Rethinking Human Motion Prediction with Symplectic Integral

Haipeng Chen, Kedi L yu, Zhenguang Liu et al.

CVPR 2024
#5583

Double Buffers CEM-TD3: More Efficient Evolution and Richer Exploration

Sheng Zhu, Chun Shen, Shuai Lü et al.

AAAI 2024paper
#5584

iToF-flow-based High Frame Rate Depth Imaging

Yu Meng, Zhou Xue, Xu Chang et al.

CVPR 2024
#5585

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing

Fan Yang, Tianyi Chen, XIAOSHENG HE et al.

CVPR 2024arXiv:2312.02209
#5586

C3: High-Performance and Low-Complexity Neural Compression from a Single Image or Video

Hyunjik Kim, Matthias Bauer, Lucas Theis et al.

CVPR 2024arXiv:2312.02753
#5587

Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution

Longguang Wang, Juncheng Li, Yingqian Wang et al.

CVPR 2024
#5588

L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream

Jingtao Sun, Yaonan Wang, Mingtao Feng et al.

CVPR 2024
#5589

On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

Xiaobao Wu, Fengjun Pan, Thong Nguyen et al.

AAAI 2024paperarXiv:2401.14113
#5590

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number

Ke Fan, Zechen Bai, Tianjun Xiao et al.

CVPR 2024arXiv:2406.09196
#5591

Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation

Jin Wang, Bingfeng Zhang, Jian Pang et al.

CVPR 2024arXiv:2405.08458
#5592

LiSA: LiDAR Localization with Semantic Awareness

Bochun Yang, Zijun Li, Wen Li et al.

CVPR 2024highlight
#5593

MmAP: Multi-Modal Alignment Prompt for Cross-Domain Multi-Task Learning

Yi Xin, Junlong Du, Qiang Wang et al.

AAAI 2024paperarXiv:2312.08636
#5594

Teaching Large Language Models to Translate with Comparison

Jiali Zeng, Fandong Meng, Yongjing Yin et al.

AAAI 2024paperarXiv:2307.04408
#5595

CausalPC: Improving the Robustness of Point Cloud Classification by Causal Effect Identification

Yuanmin Huang, Mi Zhang, Daizong Ding et al.

CVPR 2024
#5596

Adapting to Length Shift: FlexiLength Network for Trajectory Prediction

Yi Xu, Yun Fu

CVPR 2024arXiv:2404.00742
#5597

Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation

Dong Lao, Congli Wang, Alex Wong et al.

CVPR 2024highlightarXiv:2405.03662
#5598

Instruct-Imagen: Image Generation with Multi-modal Instruction

Hexiang Hu, Kelvin C.K. Chan, Yu-Chuan Su et al.

CVPR 2024arXiv:2401.01952
#5599

Rapid Motor Adaptation for Robotic Manipulator Arms

Yichao Liang, Kevin Ellis, João F. Henriques

CVPR 2024arXiv:2312.04670
#5600

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation

Yiying Yang, Fukun Yin, Wen Liu et al.

AAAI 2024paper