Most Cited 2024 "geometric unification" Papers

12,324 papers found • Page 53 of 62

#10401

A Unified Environmental Network for Pedestrian Trajectory Prediction

Guoqing Chao, Yi Jiang, Dianhui Chu

AAAI 2024 paper

#10402

End-to-End Verification for Subgraph Solving

AAAI 2024 paper

#10403

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024

#10404

Self-Calibrating Vicinal Risk Minimisation for Model Calibration

Jiawei Liu, Changkun Ye, Ruikai Cui et al.

CVPR 2024

#10405

CORE-MPI: Consistency Object Removal with Embedding MultiPlane Image

Donggeun Yoon, Donghyeon Cho

CVPR 2024

#10406

ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring

Yuan Xu, Xiaoxuan Ma, Jiajun Su et al.

CVPR 2024

#10407

EnMatch: Matchmaking for Better Player Engagement via Neural Combinatorial Optimization

Kai Wang, Haoyu Liu, Zhipeng Hu et al.

AAAI 2024 paper

#10408

Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy

Xiao Yun, Chenglong Xu, Kevin Riou et al.

AAAI 2024 paper

#10409

BilevelPruning: Unified Dynamic and Static Channel Pruning for Convolutional Neural Networks

Shangqian Gao, Yanfu Zhang, Feihu Huang et al.

CVPR 2024

#10410

DART: Dual-Modal Adaptive Online Prompting and Knowledge Retention for Test-Time Adaptation

Zichen Liu, Hongbo Sun, Yuxin Peng et al.

AAAI 2024 paper

#10411

CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem

Qian Chen, Taolin Zhang, Dongyang Li et al.

AAAI 2024 paper arXiv:2312.08157

#10412

DiffRAW: Leveraging Diffusion Model to Generate DSLR-Comparable Perceptual Quality sRGB from Smartphone RAW Images

Mingxin Yi, Kai Zhang, Pei Liu et al.

AAAI 2024 paper

#10413

Towards Molecular Structure Discovery from Cryo-ET Density Volumes via Modelling Auxiliary Semantic Prototypes

Ashwin Nair, Xingjian Li, Mostofa Rafid Uddin et al.

AAAI 2024 paper

#10414

A Computation-Aware Shape Loss Function for Point Cloud Completion

Shunran Zhang, Xiubo Zhang, Tsz Nam Chan et al.

AAAI 2024 paper

#10415

Device-Wise Federated Network Pruning

Shangqian Gao, Junyi Li, Zeyu Zhang et al.

CVPR 2024

#10416

Automated Defect Report Generation for Enhanced Industrial Quality Control

Jiayuan Xie, Zhiping Zhou, Zihan Wu et al.

AAAI 2024 paper

#10417

Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models

Kun Zhang, Jiali Zeng, Fandong Meng et al.

AAAI 2024 paper

#10418

Motion Deblurring via Spatial-Temporal Collaboration of Frames and Events

Wen Yang, Jinjian Wu, Jupo Ma et al.

AAAI 2024 paper

#10419

Online Conversion Rate Prediction via Multi-Interval Screening and Synthesizing under Delayed Feedback

Qiming Liu, Xiang Ao, Yuyao Guo et al.

AAAI 2024 paper

#10420

Neural Embeddings for kNN Search in Biological Sequence

Zhihao Chang, Linzhu Yu, Yanchao Xu et al.

AAAI 2024 paper

#10421

Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos

Yuhan Shen, Ehsan Elhamifar

CVPR 2024

#10422

Learning to Segment Referred Objects from Narrated Egocentric Videos

Yuhan Shen, Huiyu Wang, Xitong Yang et al.

CVPR 2024

#10423

Assessment via Transformer Text Prompting

AAAI 2024 paper

#10424

DanceMVP: Self-Supervised Learning for Multi-Task Primitive-Based Dance Performance

Yun Zhong, Yiannis Demiris

AAAI 2024 paper

#10425

Inlier Confidence Calibration for Point Cloud Registration

Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.

CVPR 2024

#10426

A Two-Stage Information Extraction Network for Incomplete Multi-View Multi-Label Classification

Xin Tan, Ce Zhao, Chengliang Liu et al.

AAAI 2024 paper

#10427

RetouchFormer: Semi-supervised High-Quality Face Retouching Transformer with Prior-Based Selective Self-Attention

Xue Wen, Lianxin Xie, Le Jiang et al.

AAAI 2024 paper

#10428

Optimal Quasi-clique: Hardness, Equivalence with Densest-$k$-Subgraph, and Quasi-partitioned Community Mining

Aritra Konar, Nicholas Sidiropoulos

AAAI 2024 paper

#10429

Enhancing the Efficiency of Altruism and Taxes in Affine Congestion Games through Signalling

Vittorio Bilò, Cosimo Vinci

AAAI 2024 paper

#10430

Content Filtering with Inattentive Information Consumers

Justin Grana, Alex Slivkins, Brendan Lucier et al.

AAAI 2024 paper arXiv:2205.14060

#10431

Structure-Aware Multimodal Sequential Learning for Visual Dialog

Youngjin Kim, Min-Jun Kim, Kyunghwan An et al.

AAAI 2024 paper

#10432

Manipulation-Robust Selection of Citizens’ Assemblies

Bailey Flanigan, Jennifer Liang, Ariel Procaccia et al.

AAAI 2024 paper

#10433

Complementary Knowledge Distillation for Robust and Privacy-Preserving Model Serving in Vertical Federated Learning

Dashan Gao, Sheng Wan, Lixin Fan et al.

AAAI 2024 paper

#10434

Your Transferability Barrier is Fragile: Free-Lunch for Transferring the Non-Transferable Learning

Ziming Hong, Li Shen, Tongliang Liu

CVPR 2024 highlight

#10435

RR-PU: A Synergistic Two-Stage Positive and Unlabeled Learning Framework for Robust Tax Evasion Detection

Shuzhi Cao, Jianfei Ruan, Bo Dong et al.

AAAI 2024 paper

#10436

MaxQ: Multi-Axis Query for N:M Sparsity Network

Jingyang Xiang, Siqi Li, Junhao Chen et al.

CVPR 2024

#10437

CTO-SLAM: Contour Tracking for Object-Level Robust 4D SLAM

Xiaohan Li, Dong Liu, Jun Wu

AAAI 2024 paper

#10438

Practical Privacy-Preserving MLaaS: When Compressive Sensing Meets Generative Networks

Jia Wang, Wuqiang Su, Zushu Huang et al.

AAAI 2024 paper

#10439

Efficient Scene Recovery Using Luminous Flux Prior

ZhongYu Li, Lei Zhang

CVPR 2024

#10440

Revisiting Global Translation Estimation with Feature Tracks

Peilin Tao, Hainan Cui, Mengqi Rong et al.

CVPR 2024

#10441

TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation

Xin Lin, Chong Shi, Yibing Zhan et al.

AAAI 2024 paper

#10442

Causal Representation Learning via Counterfactual Intervention

Xiutian Li, Siqi Sun, Rui Feng

AAAI 2024 paper

#10443

ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-guided Optimization

Hao Wang, Fang Liu, Licheng Jiao et al.

AAAI 2024 paper

#10444

Abstraction of Situation Calculus Concurrent Game Structures

Yves Lesperance, Giuseppe De Giacomo, Maryam Rostamigiv et al.

AAAI 2024 paper

#10445

LAMP: Learn A Motion Pattern for Few-Shot Video Generation

Rui-Qi Wu, Liangyu Chen, Tong Yang et al.

CVPR 2024

#10446

Repurposing Ensemble of Black-Box Models to New Task Domains

Minh Hoang, Nghia Hoang

AAAI 2024 paper

#10447

Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency

Yuqi Zhang, Han Luo, Yinjie Lei

CVPR 2024

#10448

Neural Fields as Distributions: Signal Processing Beyond Euclidean Space

Daniel Rebain, Soroosh Yazdani, Kwang Moo Yi et al.

CVPR 2024

#10449

PVALane: Prior-Guided 3D Lane Detection with View-Agnostic Feature Alignment

Zewen Zheng, Xuemin Zhang, Yongqiang Mou et al.

AAAI 2024 paper

#10450

Global and Hierarchical Geometry Consistency Priors for Few-shot NeRFs in Indoor Scenes

Xiaotian Sun, Qingshan Xu, Xinjie Yang et al.

CVPR 2024

#10451

The STVchrono Dataset: Towards Continuous Change Recognition in Time

Yanjun Sun, Yue Qiu, Mariia Khan et al.

CVPR 2024

#10452

Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection

Ke Li, Di Wang, Zhangyuan Hu et al.

CVPR 2024

#10453

Rethinking Two-Stage Referring Expression Comprehension: A Novel Grounding and Segmentation Method Modulated by Point

Peizhi Zhao, Shiyi Zheng, Wenye Zhao et al.

AAAI 2024 paper

#10454

Pixel-Aligned Language Model

Jiarui Xu, Xingyi Zhou, Shen Yan et al.

CVPR 2024

#10455

Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations

Cedric Derstroff, Jannis Brugger, Mattia Cerrato et al.

AAAI 2024 paper arXiv:2312.09950

#10456

QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos

Yogesh Kumar, Saswat Mallick, Anand Mishra et al.

AAAI 2024 paper

#10457

CAMEL: CAusal Motion Enhancement Tailored for Lifting Text-driven Video Editing

Guiwei Zhang, Tianyu Zhang, Guanglin Niu et al.

CVPR 2024

#10458

Mastering Context-to-Label Representation Transformation for Event Causality Identification with Diffusion Models

Hieu Man, Franck Dernoncourt, Thien Huu Nguyen

AAAI 2024 paper

#10459

A Physics-informed Low-rank Deep Neural Network for Blind and Universal Lens Aberration Correction

Jin Gong, Runzhao Yang, Weihang Zhang et al.

CVPR 2024

#10460

Non-excludable Bilateral Trade between Groups

Yixuan Even Xu, Hanrui Zhang, Vincent Conitzer

AAAI 2024 paper arXiv:2312.11800

#10461

NAPGuard: Towards Detecting Naturalistic Adversarial Patches

Siyang Wu, Jiakai Wang, Jiejie Zhao et al.

CVPR 2024

#10462

Bootstrapping SparseFormers from Vision Foundation Models

Ziteng Gao, Zhan Tong, Kevin Qinghong Lin et al.

CVPR 2024 arXiv:2312.01987

#10463

Large Occluded Human Image Completion via Image-Prior Cooperating

Hengrun Zhao, Yu Zeng, Huchuan Lu et al.

AAAI 2024 paper

#10464

A Joint Framework with Heterogeneous-Relation-Aware Graph and Multi-Channel Label Enhancing Strategy for Event Causality Extraction

Ruili Pu, Yang Li, Jun Zhao et al.

AAAI 2024 paper

#10465

Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End Pipeline

Yu Chen, Fei Gao, Yanguang Zhang et al.

CVPR 2024

#10466

Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding

Jingping Liu, Mingchuan Zhang, Weichen Li et al.

AAAI 2024 paper

#10467

Domain Separation Graph Neural Networks for Saliency Object Ranking

Zijian Wu, Jun Lu, Jing Han et al.

CVPR 2024

#10468

A Local-Ascending-Global Learning Strategy for Brain-Computer Interface

Dongrui Gao, Haokai Zhang, Pengrui Li et al.

AAAI 2024 paper

#10469

Resource-Efficient Transformer Pruning for Finetuning of Large Models

Fatih Ilhan, Gong Su, Selim Tekin et al.

CVPR 2024

#10470

Tail-STEAK: Improve Friend Recommendation for Tail Users via Self-Training Enhanced Knowledge Distillation

Yijun Ma, Chaozhuo Li, Xiao Zhou

AAAI 2024 paper

#10471

Deep-TROJ: An Inference Stage Trojan Insertion Algorithm through Efficient Weight Replacement Attack

Sabbir Ahmed, Ranyang Zhou, Shaahin Angizi et al.

CVPR 2024

#10472

Optimize & Reduce: A Top-Down Approach for Image Vectorization

Or Hirschorn, Amir Jevnisek, Shai Avidan

AAAI 2024 paper

#10473

Divide and Conquer: Hybrid Pre-training for Person Search

Yanling Tian, Di Chen, Yunan Liu et al.

AAAI 2024 paper arXiv:2312.07970

#10474

Language-aware Visual Semantic Distillation for Video Question Answering

Bo Zou, Chao Yang, Yu Qiao et al.

CVPR 2024

#10475

Multi-Prototype Space Learning for Commonsense-Based Scene Graph Generation

Lianggangxu Chen, Youqi Song, Yiqing Cai et al.

AAAI 2024 paper

#10476

DiLiGenRT: A Photometric Stereo Dataset with Quantified Roughness and Translucency

Heng Guo, Jieji Ren, Feishi Wang et al.

CVPR 2024

#10477

StyLitGAN: Image-Based Relighting via Latent Control

Anand Bhattad, James Soole, David Forsyth

CVPR 2024

#10478

Label-Efficient Group Robustness via Out-of-Distribution Concept Curation

Yiwei Yang, Anthony Liu, Robert Wolfe et al.

CVPR 2024

#10479

Video Event Extraction with Multi-View Interaction Knowledge Distillation

Kaiwen Wei, Du Runyan, Li Jin et al.

AAAI 2024 paper

#10480

Omnidirectional Image Super-resolution via Bi-projection Fusion

Jiangang Wang, Yuning Cui, Yawen Li et al.

AAAI 2024 paper

#10481

Efficient Algorithms for Non-gaussian Single Index Models with Generative Priors

Junren Chen, Zhaoqiang Liu

AAAI 2024 paper

#10482

Batch Normalization Alleviates the Spectral Bias in Coordinate Networks

Zhicheng Cai, Hao Zhu, Qiu Shen et al.

CVPR 2024

#10483

DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification

Tony Alex, Sara Ahmed, Armin Mustafa et al.

AAAI 2024 paper

#10484

Not All Classes Stand on Same Embeddings: Calibrating a Semantic Distance with Metric Tensor

Jae Hyeon Park, Gyoomin Lee, Seunggi Park et al.

CVPR 2024

#10485

Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

Zhe Li, Zerong Zheng, Lizhen Wang et al.

CVPR 2024

#10486

Improving the Adversarial Transferability of Vision Transformers with Virtual Dense Connection

Jianping Zhang, Yizhan Huang, Zhuoer Xu et al.

AAAI 2024 paper

#10487

Data-Augmented Curriculum Graph Neural Architecture Search under Distribution Shifts

Yang Yao, Xin Wang, Yijian Qin et al.

AAAI 2024 paper

#10488

Shuffled Deep Regression

AAAI 2024 paper

#10489

NB-GTR: Narrow-Band Guided Turbulence Removal

Yifei Xia, Chu Zhou, Chengxuan Zhu et al.

CVPR 2024

#10490

Positive-Unlabeled Learning by Latent Group-Aware Meta Disambiguation

Lin Long, Haobo Wang, Zhijie Jiang et al.

CVPR 2024

#10491

Sample-Constrained Black Box Optimization for Audio Personalization

Rajalaxmi Rajagopalan, Yu-Lin Wei, Romit Roy Choudhury

AAAI 2024 paper arXiv:2507.12773

#10492

Text-conditional Attribute Alignment across Latent Spaces for 3D Controllable Face Image Synthesis

FeiFan Xu, Rui Li, Si Wu et al.

CVPR 2024

#10493

Runtime Analysis of the (μ + 1) GA: Provable Speed-Ups from Strong Drift towards Diverse Populations

Benjamin Doerr, Aymen Echarghaoui, Mohammed Jamal et al.

AAAI 2024 paper

#10494

Selective and Orthogonal Feature Activation for Pedestrian Attribute Recognition

Junyi Wu, Yan Huang, Min Gao et al.

AAAI 2024 paper

#10495

Arbitrary-Scale Video Super-resolution Guided by Dynamic Context

Cong Huang, Jiahao Li, Lei Chu et al.

AAAI 2024 paper

#10496

MoML: Online Meta Adaptation for 3D Human Motion Prediction

Xiaoning Sun, Huaijiang Sun, Bin Li et al.

CVPR 2024

#10497

Learning with Structural Labels for Learning with Noisy Labels

Noo-ri Kim, Jin-Seop Lee, Jee-Hyong Lee

CVPR 2024

#10498

What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models

Letian Zhang, Xiaotong Zhai, Zhongkai Zhao et al.

CVPR 2024 arXiv:2310.06627

#10499

RLfOLD: Reinforcement Learning from Online Demonstrations in Urban Autonomous Driving

Daniel Coelho, Miguel Oliveira, Vitor Santos

AAAI 2024 paper

#10500

Incremental Nuclei Segmentation from Histopathological Images via Future-class Awareness and Compatibility-inspired Distillation

Huyong Wang, Huisi Wu, Jing Qin

CVPR 2024

#10501

Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection

Xiaowei Zhao, Xianglong Liu, Duorui Wang et al.

CVPR 2024

#10502

SA²VP: Spatially Aligned-and-Adapted Visual Prompt

Wenjie Pei, Tongqi Xia, Fanglin Chen et al.

AAAI 2024 paper

#10503

Generate Like Experts: Multi-Stage Font Generation by Incorporating Font Transfer Process into Diffusion Models

Bin Fu, Fanghua Yu, Anran Liu et al.

CVPR 2024

#10504

MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning

Mohamed Abdelfattah, Mariam Hassan, Alex Alahi

CVPR 2024

#10505

DiG-In-GNN: Discriminative Feature Guided GNN-based Fraud Detector against Inconsistencies in Multi-Relation Fraud Graph

Jinghui Zhang, Zhengjia Xu, Dingyang Lv et al.

AAAI 2024 paper

#10506

MAGICK: A Large-scale Captioned Dataset from Matting Generated Images using Chroma Keying

Ryan Burgert, Brian Price, Jason Kuen et al.

CVPR 2024

#10507

Scalable Enumeration of Trap Spaces in Boolean Networks via Answer Set Programming

Srikar Appalaraju, Peng Tang, Qi Dong et al.

AAAI 2024 paper

#10508

Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection

AAAI 2024 paper

#10509

RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing

AAAI 2024 paper

#10510

DMMR: Cross-Subject Domain Generalization for EEG-Based Emotion Recognition via Denoising Mixed Mutual Reconstruction

AAAI 2024 paper

#10511

Online Task-Free Continual Generative and Discriminative Learning via Dynamic Cluster Memory

Fei Ye, Adrian Bors

CVPR 2024

#10512

FADES: Fair Disentanglement with Sensitive Relevance

Taeuk Jang, Xiaoqian Wang

CVPR 2024

#10513

Improving Depth Completion via Depth Feature Upsampling

Yufei Wang, Ge Zhang, Shaoqian Wang et al.

CVPR 2024

#10514

Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning

AAAI 2024 paper

#10515

Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration

AAAI 2024 paper

#10516

Test-Time Adaptation via Style and Structure Guidance for Histological Image Registration

Shenglong Zhou, Zhiwei Xiong, Feng Wu

AAAI 2024 paper

#10517

MRFS: Mutually Reinforcing Image Fusion and Segmentation

Hao Zhang, Xuhui Zuo, Jie Jiang et al.

CVPR 2024

#10518

Reproduce, Replicate, Reevaluate. The Long but Safe Way to Extend Machine Learning Methods

Luisa Werner, Nabil Layaïda, Pierre Genevès et al.

AAAI 2024 paper

#10519

IIRP-Net: Iterative Inference Residual Pyramid Network for Enhanced Image Registration

Tai Ma, Suwei Zhang, Jiafeng Li et al.

CVPR 2024

#10520

SEED-Bench: Benchmarking Multimodal Large Language Models

Bohao Li, Yuying Ge, Yixiao Ge et al.

CVPR 2024

#10521

Active Domain Adaptation with False Negative Prediction for Object Detection

Yuzuru Nakamura, Yasunori Ishii, Takayoshi Yamashita

CVPR 2024 highlight

#10522

Approximate Distance Oracle for Fault-Tolerant Geometric Spanners

Kyungjin Cho, Jihun Shin, Eunjin Oh

AAAI 2024 paper

#10523

Stereo Vision Conversion from Planar Videos Based on Temporal Multiplane Images

Shanding Diao, Yuan Chen, Yang Zhao et al.

AAAI 2024 paper

#10524

Reg-PTQ: Regression-specialized Post-training Quantization for Fully Quantized Object Detector

Yifu Ding, Weilun Feng, Chuyan Chen et al.

CVPR 2024

#10525

Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection

Chuangchuang Tan, Huan Liu, Yao Zhao et al.

CVPR 2024 arXiv:2312.10461

#10526

PerFedRLNAS: One-for-All Personalized Federated Neural Architecture Search

Dixi Yao, Baochun Li

AAAI 2024 paper

#10527

AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion

Beibei Jing, Youjia Zhang, Zikai Song et al.

AAAI 2024 paper

#10528

UFC-Net: Unrolling Fixed-point Continuous Network for Deep Compressive Sensing

Xiaoyang Wang, Hongping Gan

CVPR 2024

#10529

Expressive Multi-Agent Communication via Identity-Aware Learning

Wei Du, Shifei Ding, Lili Guo et al.

AAAI 2024 paper

#10530

Learning to Manipulate Artistic Images

Wei Guo, Yuqi Zhang, De Ma et al.

AAAI 2024 paper arXiv:2401.13976

#10531

Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation

Xingyu Liu, Pengfei Ren, Yuanyuan Gao et al.

AAAI 2024 paper

#10532

Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition

Jiadong Wang, Zexu Pan, Malu Zhang et al.

AAAI 2024 paper

#10533

Pandora’s Problem with Deadlines

Ben Berger, Tomer Ezra, Michal Feldman et al.

AAAI 2024 paper

#10534

Transient Glimpses: Unveiling Occluded Backgrounds through the Spike Camera

Jiyuan Zhang, Shiyan Chen, Yajing Zheng et al.

AAAI 2024 paper

#10535

From Retrieval to Generation: A Simple and Unified Generative Model for End-to-End Task-Oriented Dialogue

AAAI 2024 paper

#10536

MaskPLAN: Masked Generative Layout Planning from Partial Input

Hang Zhang, Anton Savov, Benjamin Dillenburger

CVPR 2024

#10537

Dual-Channel Learning Framework for Drug-Drug Interaction Prediction via Relation-Aware Heterogeneous Graph Transformer

Xiaorui Su, Pengwei Hu, Zhu-Hong You et al.

AAAI 2024 paper

#10538

A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection

Hanshi Wang, Zhipeng Zhang, Jin Gao et al.

CVPR 2024

#10539

R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Control Diffusion

Jinxiu Liu, Qi Liu

AAAI 2024 paper

#10540

DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning

Haoran Xu, Peixi Peng, Guang Tan et al.

CVPR 2024

#10541

3D Feature Tracking via Event Camera

Siqi Li, Zhou Zhikuan, Zhou Xue et al.

CVPR 2024

#10542

Frequency-aware Event-based Video Deblurring for Real-World Motion Blur

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

CVPR 2024

#10543

FedHCA2: Towards Hetero-Client Federated Multi-Task Learning

Yuxiang Lu, Suizhi Huang, Yuwen Yang et al.

CVPR 2024

#10544

Improving Unsupervised Hierarchical Representation with Reinforcement Learning

Ruyi An, Yewen Li, Xu He et al.

CVPR 2024

#10545

Neural Physical Simulation with Multi-Resolution Hash Grid Encoding

Haoxiang Wang, Tao Yu, Tianwei Yang et al.

AAAI 2024 paper

#10546

Noise-Aware Image Captioning with Progressively Exploring Mismatched Words

Zhongtian Fu, Kefei Song, Luping Zhou et al.

AAAI 2024 paper

#10547

BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition

Yuxuan Zhou, Xudong Yan, Zhi-Qi Cheng et al.

CVPR 2024

#10548

Person-in-WiFi 3D: End-to-End Multi-Person 3D Pose Estimation with Wi-Fi

Kangwei Yan, Fei Wang, Bo Qian et al.

CVPR 2024

#10549

Relative Policy-Transition Optimization for Fast Policy Transfer

Jiawei Xu, Cheng Zhou, Yizheng Zhang et al.

AAAI 2024 paper arXiv:2206.06009

#10550

ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments

Jingyu Zhang, Kun Yang, Yilei Wang et al.

CVPR 2024

#10551

DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels

Haoran Liu, Ying Ma, Ming Yan et al.

AAAI 2024 paper

#10552

Decomposing Temporal Equilibrium Strategy for Coordinated Distributed Multi-Agent Reinforcement Learning

Chenyang Zhu, Wen Si, Jinyu Zhu et al.

AAAI 2024 paper

#10553

Parameterization of (Partial) Maximum Satisfiability above Matching in a Variable-Clause Graph

Vasily Alferov, Ivan Bliznets, Kirill Brilliantov

AAAI 2024 paper

#10554

DiffusionRegPose: Enhancing Multi-Person Pose Estimation using a Diffusion-Based End-to-End Regression Approach

Dayi Tan, Hansheng Chen, Wei Tian et al.

CVPR 2024

#10555

Pushing the Limit of Fine-Tuning for Few-Shot Learning: Where Feature Reusing Meets Cross-Scale Attention

Ying-Yu Chen, Jun-Wei Hsieh, Xin Li et al.

AAAI 2024 paper

#10556

Locally Rainbow Paths

Till Fluschnik, Leon Kellerhals, Malte Renken

AAAI 2024 paper arXiv:2402.12905

#10557

Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images

Wei Shao, YangYang Shi, Daoqiang Zhang et al.

CVPR 2024

#10558

Exact Fusion via Feature Distribution Matching for Few-shot Image Generation

Yingbo Zhou, Yutong Ye, Pengyu Zhang et al.

CVPR 2024

#10559

Affine Equivariant Networks Based on Differential Invariants

Yikang Li, Yeqing Qiu, Yuxuan Chen et al.

CVPR 2024

#10560

Federated Label-Noise Learning with Local Diversity Product Regularization

Xiaochen Zhou, Xudong Wang

AAAI 2024 paper

#10561

Improving Generalized Zero-Shot Learning by Exploring the Diverse Semantics from External Class Names

Yapeng Li, Yong Luo, Zengmao Wang et al.

CVPR 2024

#10562

Continual Learning for Motion Prediction Model via Meta-Representation Learning and Optimal Memory Buffer Retention Strategy

Dae Jun Kang, Dongsuk Kum, Sanmin Kim

CVPR 2024

#10563

FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models

Ao Luo, Xin Li, Fan Yang et al.

CVPR 2024 highlight

#10564

PrefAce: Face-Centric Pretraining with Self-Structure Aware Distillation

Siyuan Hu, Zheng Wang, Peng Hu et al.

AAAI 2024 paper

#10565

DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations

Ruilu Wang, Yang Xue, Lianwen Jin

AAAI 2024 paper

#10566

SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement

Tao Wang, Lei Jin, Zheng Wang et al.

CVPR 2024

#10567

Building Vision-Language Models on Solid Foundations with Masked Distillation

Sepehr Sameni, Kushal Kafle, Hao Tan et al.

CVPR 2024

#10568

GSENet: Global Semantic Enhancement Network for Lane Detection

Junhao Su, Zhenghan Chen, Chenghao He et al.

AAAI 2024 paper

#10569

Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition

Hanxuan Li, Bin Fu, Ruiping Wang et al.

AAAI 2024 paper

#10570

Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling

Jianan Li, Qiulei Dong

CVPR 2024

#10571

Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network

AAAI 2024 paper

#10572

Spanning the Spectrum of Hatred Detection: A Persian Multi-Label Hate Speech Dataset with Annotator Rationales

AAAI 2024 paper

#10573

MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts

AAAI 2024 paper

#10574

Can Large Language Models Understand Real-World Complex Instructions?

AAAI 2024 paper

#10575

1-Lipschitz Layers Compared: Memory Speed and Certifiable Robustness

Bernd Prach, Fabio Brau, Giorgio Buttazzo et al.

CVPR 2024

#10576

The Irrelevance of Influencers: Information Diffusion with Re-Activation and Immunity Lasts Exponentially Long on Social Network Models

AAAI 2024 paper

#10577

Learning Accurate and Bidirectional Transformation via Dynamic Embedding Transportation for Cross-Domain Recommendation

AAAI 2024 paper

#10578

SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger

AAAI 2024 paper

#10579

Other Papers

AAAI 2024 paper

#10580

M3-UDA: A New Benchmark for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection

Bin Pu, Liwen Wang, Jiewen Yang et al.

CVPR 2024

#10581

HIT: Estimating Internal Human Implicit Tissues from the Body Surface

Marilyn Keller, Vaibhav Arora, Abdelmouttaleb Dakri et al.

CVPR 2024

#10582

Stitching Segments and Sentences towards Generalization in Video-Text Pre-training

Fan Ma, Xiaojie Jin, Heng Wang et al.

AAAI 2024 paper

#10583

KGTS: Contrastive Trajectory Similarity Learning over Prompt Knowledge Graph Embedding

Zhen Chen, Dalin Zhang, Shanshan Feng et al.

AAAI 2024 paper

#10584

Open-Set Graph Domain Adaptation via Separate Domain Alignment

Yu Wang, Ronghang Zhu, Pengsheng Ji et al.

AAAI 2024 paper

#10585

Learning Cluster-Wise Anchors for Multi-View Clustering

Chao Zhang, Xiuyi Jia, Zechao Li et al.

AAAI 2024 paper

#10586

TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text Arrangement

Yang Fan, Xiangping Wu, Qingcai Chen et al.

AAAI 2024 paper arXiv:2312.11043

#10587

Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games

Pengdeng Li, Runsheng Yu, Xinrun Wang et al.

AAAI 2024 paper

#10588

Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning

AAAI 2024 paper

#10589

CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-On

Chenghu Du, Junyin Wang, Yi Rong et al.

AAAI 2024 paper

#10590

Regularized Parameter Uncertainty for Improving Generalization in Reinforcement Learning

Pehuen Moure, Longbiao Cheng, Joachim Ott et al.

CVPR 2024

#10591

Robust Noisy Correspondence Learning with Equivariant Similarity Consistency

Yuchen Yang, Erkun Yang, Likai Wang et al.

CVPR 2024

#10592

Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval

Shenshen Li, Chen He, Xing Xu et al.

AAAI 2024 paper

#10593

Learning Multi-Task Sparse Representation Based on Fisher Information

Yayu Zhang, Yuhua Qian, Guoshuai Ma et al.

AAAI 2024 paper

#10594

WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting

Zhiliang Wu, Changchang Sun, Hanyu Xuan et al.

AAAI 2024 paper

#10595

Task-Driven Wavelets using Constrained Empirical Risk Minimization

Eric Marcus, Ray Sheombarsing, Jan-Jakob Sonke et al.

CVPR 2024

#10596

Defeasible Normative Reasoning: A Proof-Theoretic Integration of Logical Argumentation

Ofer Arieli, Kees van Berkel, Christian Straßer

AAAI 2024 paper

#10597

Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion

Naishan Zheng, Man Zhou, Jie Huang et al.

CVPR 2024

#10598

Cross-Domain Contrastive Learning for Time Series Clustering

Furong Peng, Jiachen Luo, Xuan Lu et al.

AAAI 2024 paper

#10599

HACDR-Net: Heterogeneous-Aware Convolutional Network for Diabetic Retinopathy Multi-Lesion Segmentation

QiHao Xu, Xiaoling Luo, Chao Huang et al.

AAAI 2024 paper

#10600

Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning

Chen Bai, Jianwang Zhai, Yuzhe Ma et al.

AAAI 2024 paper
