Most Cited 2024 "geometric unification" Papers
Conference
A Unified Environmental Network for Pedestrian Trajectory Prediction
Guoqing Chao, Yi Jiang, Dianhui Chu
End-to-End Verification for Subgraph Solving
Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation
Peixi Xiong, Michael A Kozuch, Nilesh Jain
Self-Calibrating Vicinal Risk Minimisation for Model Calibration
Jiawei Liu, Changkun Ye, Ruikai Cui et al.
CORE-MPI: Consistency Object Removal with Embedding MultiPlane Image
Donggeun Yoon, Donghyeon Cho
ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring
Yuan Xu, Xiaoxuan Ma, Jiajun Su et al.
EnMatch: Matchmaking for Better Player Engagement via Neural Combinatorial Optimization
Kai Wang, Haoyu Liu, Zhipeng Hu et al.
Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy
Xiao Yun, Chenglong Xu, Kevin Riou et al.
BilevelPruning: Unified Dynamic and Static Channel Pruning for Convolutional Neural Networks
Shangqian Gao, Yanfu Zhang, Feihu Huang et al.
DART: Dual-Modal Adaptive Online Prompting and Knowledge Retention for Test-Time Adaptation
Zichen Liu, Hongbo Sun, Yuxin Peng et al.
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem
Qian Chen, Taolin Zhang, Dongyang Li et al.
DiffRAW: Leveraging Diffusion Model to Generate DSLR-Comparable Perceptual Quality sRGB from Smartphone RAW Images
Mingxin Yi, Kai Zhang, Pei Liu et al.
Towards Molecular Structure Discovery from Cryo-ET Density Volumes via Modelling Auxiliary Semantic Prototypes
Ashwin Nair, Xingjian Li, Mostofa Rafid Uddin et al.
A Computation-Aware Shape Loss Function for Point Cloud Completion
Shunran Zhang, Xiubo Zhang, Tsz Nam Chan et al.
Device-Wise Federated Network Pruning
Shangqian Gao, Junyi Li, Zeyu Zhang et al.
Automated Defect Report Generation for Enhanced Industrial Quality Control
Jiayuan Xie, Zhiping Zhou, Zihan Wu et al.
Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models
Kun Zhang, Jiali Zeng, Fandong Meng et al.
Motion Deblurring via Spatial-Temporal Collaboration of Frames and Events
Wen Yang, Jinjian Wu, Jupo Ma et al.
Online Conversion Rate Prediction via Multi-Interval Screening and Synthesizing under Delayed Feedback
Qiming Liu, Xiang Ao, Yuyao Guo et al.
Neural Embeddings for kNN Search in Biological Sequence
Zhihao Chang, Linzhu Yu, Yanchao Xu et al.
Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos
Yuhan Shen, Ehsan Elhamifar
Learning to Segment Referred Objects from Narrated Egocentric Videos
Yuhan Shen, Huiyu Wang, Xitong Yang et al.
DanceMVP: Self-Supervised Learning for Multi-Task Primitive-Based Dance Performance Assessment via Transformer Text Prompting
Yun Zhong, Yiannis Demiris
Inlier Confidence Calibration for Point Cloud Registration
Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.
A Two-Stage Information Extraction Network for Incomplete Multi-View Multi-Label Classification
Xin Tan, Ce Zhao, Chengliang Liu et al.
RetouchFormer: Semi-supervised High-Quality Face Retouching Transformer with Prior-Based Selective Self-Attention
Xue Wen, Lianxin Xie, Le Jiang et al.
Optimal Quasi-clique: Hardness, Equivalence with Densest-$k$-Subgraph, and Quasi-partitioned Community Mining
Aritra Konar, Nicholas Sidiropoulos
Enhancing the Efficiency of Altruism and Taxes in Affine Congestion Games through Signalling
Vittorio Bilò, Cosimo Vinci
Content Filtering with Inattentive Information Consumers
Justin Grana, Alex Slivkins, Brendan Lucier et al.
Structure-Aware Multimodal Sequential Learning for Visual Dialog
Youngjin Kim, Min-Jun Kim, Kyunghwan An et al.
Manipulation-Robust Selection of Citizens’ Assemblies
Bailey Flanigan, Jennifer Liang, Ariel Procaccia et al.
Complementary Knowledge Distillation for Robust and Privacy-Preserving Model Serving in Vertical Federated Learning
Dashan Gao, Sheng Wan, Lixin Fan et al.
Your Transferability Barrier is Fragile: Free-Lunch for Transferring the Non-Transferable Learning
Ziming Hong, Li Shen, Tongliang Liu
RR-PU: A Synergistic Two-Stage Positive and Unlabeled Learning Framework for Robust Tax Evasion Detection
Shuzhi Cao, Jianfei Ruan, Bo Dong et al.
MaxQ: Multi-Axis Query for N:M Sparsity Network
Jingyang Xiang, Siqi Li, Junhao Chen et al.
CTO-SLAM: Contour Tracking for Object-Level Robust 4D SLAM
Xiaohan Li, Dong Liu, Jun Wu
Practical Privacy-Preserving MLaaS: When Compressive Sensing Meets Generative Networks
Jia Wang, Wuqiang Su, Zushu Huang et al.
Efficient Scene Recovery Using Luminous Flux Prior
ZhongYu Li, Lei Zhang
Revisiting Global Translation Estimation with Feature Tracks
Peilin Tao, Hainan Cui, Mengqi Rong et al.
TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation
Xin Lin, Chong Shi, Yibing Zhan et al.
Causal Representation Learning via Counterfactual Intervention
Xiutian Li, Siqi Sun, Rui Feng
ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-guided Optimization
Hao Wang, Fang Liu, Licheng Jiao et al.
Abstraction of Situation Calculus Concurrent Game Structures
Yves Lesperance, Giuseppe De Giacomo, Maryam Rostamigiv et al.
LAMP: Learn A Motion Pattern for Few-Shot Video Generation
Rui-Qi Wu, Liangyu Chen, Tong Yang et al.
Repurposing Ensemble of Black-Box Models to New Task Domains
Minh Hoang, Nghia Hoang
Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency
Yuqi Zhang, Han Luo, Yinjie Lei
Neural Fields as Distributions: Signal Processing Beyond Euclidean Space
Daniel Rebain, Soroosh Yazdani, Kwang Moo Yi et al.
PVALane: Prior-Guided 3D Lane Detection with View-Agnostic Feature Alignment
Zewen Zheng, Xuemin Zhang, Yongqiang Mou et al.
Global and Hierarchical Geometry Consistency Priors for Few-shot NeRFs in Indoor Scenes
Xiaotian Sun, Qingshan Xu, Xinjie Yang et al.
The STVchrono Dataset: Towards Continuous Change Recognition in Time
Yanjun Sun, Yue Qiu, Mariia Khan et al.
Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection
Ke Li, Di Wang, Zhangyuan Hu et al.
Rethinking Two-Stage Referring Expression Comprehension: A Novel Grounding and Segmentation Method Modulated by Point
Peizhi Zhao, Shiyi Zheng, Wenye Zhao et al.
Pixel-Aligned Language Model
Jiarui Xu, Xingyi Zhou, Shen Yan et al.
Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations
Cedric Derstroff, Jannis Brugger, Mattia Cerrato et al.
QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos
Yogesh Kumar, Saswat Mallick, Anand Mishra et al.
CAMEL: CAusal Motion Enhancement Tailored for Lifting Text-driven Video Editing
Guiwei Zhang, Tianyu Zhang, Guanglin Niu et al.
Mastering Context-to-Label Representation Transformation for Event Causality Identification with Diffusion Models
Hieu Man, Franck Dernoncourt, Thien Huu Nguyen
A Physics-informed Low-rank Deep Neural Network for Blind and Universal Lens Aberration Correction
Jin Gong, Runzhao Yang, Weihang Zhang et al.
Non-excludable Bilateral Trade between Groups
Yixuan Even Xu, Hanrui Zhang, Vincent Conitzer
NAPGuard: Towards Detecting Naturalistic Adversarial Patches
Siyang Wu, Jiakai Wang, Jiejie Zhao et al.
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao, Zhan Tong, Kevin Qinghong Lin et al.
Large Occluded Human Image Completion via Image-Prior Cooperating
Hengrun Zhao, Yu Zeng, Huchuan Lu et al.
A Joint Framework with Heterogeneous-Relation-Aware Graph and Multi-Channel Label Enhancing Strategy for Event Causality Extraction
Ruili Pu, Yang Li, Jun Zhao et al.
Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End Pipeline
Yu Chen, Fei Gao, Yanguang Zhang et al.
Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding
Jingping Liu, Mingchuan Zhang, Weichen Li et al.
Domain Separation Graph Neural Networks for Saliency Object Ranking
Zijian Wu, Jun Lu, Jing Han et al.
A Local-Ascending-Global Learning Strategy for Brain-Computer Interface
Dongrui Gao, Haokai Zhang, Pengrui Li et al.
Resource-Efficient Transformer Pruning for Finetuning of Large Models
Fatih Ilhan, Gong Su, Selim Tekin et al.
Tail-STEAK: Improve Friend Recommendation for Tail Users via Self-Training Enhanced Knowledge Distillation
Yijun Ma, Chaozhuo Li, Xiao Zhou
Deep-TROJ: An Inference Stage Trojan Insertion Algorithm through Efficient Weight Replacement Attack
Sabbir Ahmed, Ranyang Zhou, Shaahin Angizi et al.
Optimize & Reduce: A Top-Down Approach for Image Vectorization
Or Hirschorn, Amir Jevnisek, Shai Avidan
Divide and Conquer: Hybrid Pre-training for Person Search
Yanling Tian, Di Chen, Yunan Liu et al.
Language-aware Visual Semantic Distillation for Video Question Answering
Bo Zou, Chao Yang, Yu Qiao et al.
Multi-Prototype Space Learning for Commonsense-Based Scene Graph Generation
Lianggangxu Chen, Youqi Song, Yiqing Cai et al.
DiLiGenRT: A Photometric Stereo Dataset with Quantified Roughness and Translucency
Heng Guo, Jieji Ren, Feishi Wang et al.
StyLitGAN: Image-Based Relighting via Latent Control
Anand Bhattad, James Soole, David Forsyth
Label-Efficient Group Robustness via Out-of-Distribution Concept Curation
Yiwei Yang, Anthony Liu, Robert Wolfe et al.
Video Event Extraction with Multi-View Interaction Knowledge Distillation
Kaiwen Wei, Du Runyan, Li Jin et al.
Omnidirectional Image Super-resolution via Bi-projection Fusion
Jiangang Wang, Yuning Cui, Yawen Li et al.
Efficient Algorithms for Non-gaussian Single Index Models with Generative Priors
Junren Chen, Zhaoqiang Liu
Batch Normalization Alleviates the Spectral Bias in Coordinate Networks
Zhicheng Cai, Hao Zhu, Qiu Shen et al.
DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
Tony Alex, Sara Ahmed, Armin Mustafa et al.
Not All Classes Stand on Same Embeddings: Calibrating a Semantic Distance with Metric Tensor
Jae Hyeon Park, Gyoomin Lee, Seunggi Park et al.
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling
Zhe Li, Zerong Zheng, Lizhen Wang et al.
Improving the Adversarial Transferability of Vision Transformers with Virtual Dense Connection
Jianping Zhang, Yizhan Huang, Zhuoer Xu et al.
Data-Augmented Curriculum Graph Neural Architecture Search under Distribution Shifts
Yang Yao, Xin Wang, Yijian Qin et al.
Shuffled Deep Regression
NB-GTR: Narrow-Band Guided Turbulence Removal
Yifei Xia, Chu Zhou, Chengxuan Zhu et al.
Positive-Unlabeled Learning by Latent Group-Aware Meta Disambiguation
Lin Long, Haobo Wang, Zhijie Jiang et al.
Sample-Constrained Black Box Optimization for Audio Personalization
Rajalaxmi Rajagopalan, Yu-Lin Wei, Romit Roy Choudhury
Text-conditional Attribute Alignment across Latent Spaces for 3D Controllable Face Image Synthesis
FeiFan Xu, Rui Li, Si Wu et al.
Runtime Analysis of the (μ + 1) GA: Provable Speed-Ups from Strong Drift towards Diverse Populations
Benjamin Doerr, Aymen Echarghaoui, Mohammed Jamal et al.
Selective and Orthogonal Feature Activation for Pedestrian Attribute Recognition
Junyi Wu, Yan Huang, Min Gao et al.
Arbitrary-Scale Video Super-resolution Guided by Dynamic Context
Cong Huang, Jiahao Li, Lei Chu et al.
MoML: Online Meta Adaptation for 3D Human Motion Prediction
Xiaoning Sun, Huaijiang Sun, Bin Li et al.
Learning with Structural Labels for Learning with Noisy Labels
Noo-ri Kim, Jin-Seop Lee, Jee-Hyong Lee
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models
Letian Zhang, Xiaotong Zhai, Zhongkai Zhao et al.
RLfOLD: Reinforcement Learning from Online Demonstrations in Urban Autonomous Driving
Daniel Coelho, Miguel Oliveira, Vitor Santos
Incremental Nuclei Segmentation from Histopathological Images via Future-class Awareness and Compatibility-inspired Distillation
Huyong Wang, Huisi Wu, Jing Qin
Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection
Xiaowei Zhao, Xianglong Liu, Duorui Wang et al.
SA²VP: Spatially Aligned-and-Adapted Visual Prompt
Wenjie Pei, Tongqi Xia, Fanglin Chen et al.
Generate Like Experts: Multi-Stage Font Generation by Incorporating Font Transfer Process into Diffusion Models
Bin Fu, Fanghua Yu, Anran Liu et al.
MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning
Mohamed Abdelfattah, Mariam Hassan, Alex Alahi
DiG-In-GNN: Discriminative Feature Guided GNN-based Fraud Detector against Inconsistencies in Multi-Relation Fraud Graph
Jinghui Zhang, Zhengjia Xu, Dingyang Lv et al.
MAGICK: A Large-scale Captioned Dataset from Matting Generated Images using Chroma Keying
Ryan Burgert, Brian Price, Jason Kuen et al.
Scalable Enumeration of Trap Spaces in Boolean Networks via Answer Set Programming
Srikar Appalaraju, Peng Tang, Qi Dong et al.
Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection
RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing
DMMR: Cross-Subject Domain Generalization for EEG-Based Emotion Recognition via Denoising Mixed Mutual Reconstruction
Online Task-Free Continual Generative and Discriminative Learning via Dynamic Cluster Memory
Fei Ye, Adrian Bors
FADES: Fair Disentanglement with Sensitive Relevance
Taeuk Jang, Xiaoqian Wang
Improving Depth Completion via Depth Feature Upsampling
Yufei Wang, Ge Zhang, Shaoqian Wang et al.
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning
Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration
Test-Time Adaptation via Style and Structure Guidance for Histological Image Registration
Shenglong Zhou, Zhiwei Xiong, Feng Wu
MRFS: Mutually Reinforcing Image Fusion and Segmentation
Hao Zhang, Xuhui Zuo, Jie Jiang et al.
Reproduce, Replicate, Reevaluate. The Long but Safe Way to Extend Machine Learning Methods
Luisa Werner, Nabil Layaïda, Pierre Genevès et al.
IIRP-Net: Iterative Inference Residual Pyramid Network for Enhanced Image Registration
Tai Ma, Suwei Zhang, Jiafeng Li et al.
SEED-Bench: Benchmarking Multimodal Large Language Models
Bohao Li, Yuying Ge, Yixiao Ge et al.
Active Domain Adaptation with False Negative Prediction for Object Detection
Yuzuru Nakamura, Yasunori Ishii, Takayoshi Yamashita
Approximate Distance Oracle for Fault-Tolerant Geometric Spanners
Kyungjin Cho, Jihun Shin, Eunjin Oh
Stereo Vision Conversion from Planar Videos Based on Temporal Multiplane Images
Shanding Diao, Yuan Chen, Yang Zhao et al.
Reg-PTQ: Regression-specialized Post-training Quantization for Fully Quantized Object Detector
Yifu Ding, Weilun Feng, Chuyan Chen et al.
Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection
Chuangchuang Tan, Huan Liu, Yao Zhao et al.
PerFedRLNAS: One-for-All Personalized Federated Neural Architecture Search
Dixi Yao, Baochun Li
AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion
Beibei Jing, Youjia Zhang, Zikai Song et al.
UFC-Net: Unrolling Fixed-point Continuous Network for Deep Compressive Sensing
Xiaoyang Wang, Hongping Gan
Expressive Multi-Agent Communication via Identity-Aware Learning
Wei Du, Shifei Ding, Lili Guo et al.
Learning to Manipulate Artistic Images
Wei Guo, Yuqi Zhang, De Ma et al.
Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation
Xingyu Liu, Pengfei Ren, Yuanyuan Gao et al.
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition
Jiadong Wang, Zexu Pan, Malu Zhang et al.
Pandora’s Problem with Deadlines
Ben Berger, Tomer Ezra, Michal Feldman et al.
Transient Glimpses: Unveiling Occluded Backgrounds through the Spike Camera
Jiyuan Zhang, Shiyan Chen, Yajing Zheng et al.
From Retrieval to Generation: A Simple and Unified Generative Model for End-to-End Task-Oriented Dialogue
MaskPLAN: Masked Generative Layout Planning from Partial Input
Hang Zhang, Anton Savov, Benjamin Dillenburger
Dual-Channel Learning Framework for Drug-Drug Interaction Prediction via Relation-Aware Heterogeneous Graph Transformer
Xiaorui Su, Pengwei Hu, Zhu-Hong You et al.
A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection
Hanshi Wang, Zhipeng Zhang, Jin Gao et al.
R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Control Diffusion
Jinxiu Liu, Qi Liu
DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning
Haoran Xu, Peixi Peng, Guang Tan et al.
3D Feature Tracking via Event Camera
Siqi Li, Zhou Zhikuan, Zhou Xue et al.
Frequency-aware Event-based Video Deblurring for Real-World Motion Blur
Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon
FedHCA2: Towards Hetero-Client Federated Multi-Task Learning
Yuxiang Lu, Suizhi Huang, Yuwen Yang et al.
Improving Unsupervised Hierarchical Representation with Reinforcement Learning
Ruyi An, Yewen Li, Xu He et al.
Neural Physical Simulation with Multi-Resolution Hash Grid Encoding
Haoxiang Wang, Tao Yu, Tianwei Yang et al.
Noise-Aware Image Captioning with Progressively Exploring Mismatched Words
Zhongtian Fu, Kefei Song, Luping Zhou et al.
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
Yuxuan Zhou, Xudong Yan, Zhi-Qi Cheng et al.
Person-in-WiFi 3D: End-to-End Multi-Person 3D Pose Estimation with Wi-Fi
Kangwei Yan, Fei Wang, Bo Qian et al.
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu, Cheng Zhou, Yizheng Zhang et al.
ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments
Jingyu Zhang, Kun Yang, Yilei Wang et al.
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels
Haoran Liu, Ying Ma, Ming Yan et al.
Decomposing Temporal Equilibrium Strategy for Coordinated Distributed Multi-Agent Reinforcement Learning
Chenyang Zhu, Wen Si, Jinyu Zhu et al.
Parameterization of (Partial) Maximum Satisfiability above Matching in a Variable-Clause Graph
Vasily Alferov, Ivan Bliznets, Kirill Brilliantov
DiffusionRegPose: Enhancing Multi-Person Pose Estimation using a Diffusion-Based End-to-End Regression Approach
Dayi Tan, Hansheng Chen, Wei Tian et al.
Pushing the Limit of Fine-Tuning for Few-Shot Learning: Where Feature Reusing Meets Cross-Scale Attention
Ying-Yu Chen, Jun-Wei Hsieh, Xin Li et al.
Locally Rainbow Paths
Till Fluschnik, Leon Kellerhals, Malte Renken
Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images
Wei Shao, YangYang Shi, Daoqiang Zhang et al.
Exact Fusion via Feature Distribution Matching for Few-shot Image Generation
Yingbo Zhou, Yutong Ye, Pengyu Zhang et al.
Affine Equivariant Networks Based on Differential Invariants
Yikang Li, Yeqing Qiu, Yuxuan Chen et al.
Federated Label-Noise Learning with Local Diversity Product Regularization
Xiaochen Zhou, Xudong Wang
Improving Generalized Zero-Shot Learning by Exploring the Diverse Semantics from External Class Names
Yapeng Li, Yong Luo, Zengmao Wang et al.
Continual Learning for Motion Prediction Model via Meta-Representation Learning and Optimal Memory Buffer Retention Strategy
Dae Jun Kang, Dongsuk Kum, Sanmin Kim
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models
Ao Luo, Xin Li, Fan Yang et al.
PrefAce: Face-Centric Pretraining with Self-Structure Aware Distillation
Siyuan Hu, Zheng Wang, Peng Hu et al.
DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations
Ruilu Wang, Yang Xue, Lianwen Jin
SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement
Tao Wang, Lei Jin, Zheng Wang et al.
Building Vision-Language Models on Solid Foundations with Masked Distillation
Sepehr Sameni, Kushal Kafle, Hao Tan et al.
GSENet: Global Semantic Enhancement Network for Lane Detection
Junhao Su, Zhenghan Chen, Chenghao He et al.
Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition
Hanxuan Li, Bin Fu, Ruiping Wang et al.
Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling
Jianan Li, Qiulei Dong
Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network
Spanning the Spectrum of Hatred Detection: A Persian Multi-Label Hate Speech Dataset with Annotator Rationales
MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts
Can Large Language Models Understand Real-World Complex Instructions?
1-Lipschitz Layers Compared: Memory, Speed, and Certifiable Robustness
Bernd Prach, Fabio Brau, Giorgio Buttazzo et al.
The Irrelevance of Influencers: Information Diffusion with Re-Activation and Immunity Lasts Exponentially Long on Social Network Models
Learning Accurate and Bidirectional Transformation via Dynamic Embedding Transportation for Cross-Domain Recommendation
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger
Other Papers
M3-UDA: A New Benchmark for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection
Bin Pu, Liwen Wang, Jiewen Yang et al.
HIT: Estimating Internal Human Implicit Tissues from the Body Surface
Marilyn Keller, Vaibhav Arora, Abdelmouttaleb Dakri et al.
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
Fan Ma, Xiaojie Jin, Heng Wang et al.
KGTS: Contrastive Trajectory Similarity Learning over Prompt Knowledge Graph Embedding
Zhen Chen, Dalin Zhang, Shanshan Feng et al.
Open-Set Graph Domain Adaptation via Separate Domain Alignment
Yu Wang, Ronghang Zhu, Pengsheng Ji et al.
Learning Cluster-Wise Anchors for Multi-View Clustering
Chao Zhang, Xiuyi Jia, Zechao Li et al.
TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text Arrangement
Yang Fan, Xiangping Wu, Qingcai Chen et al.
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games
Pengdeng Li, Runsheng Yu, Xinrun Wang et al.
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning
CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-On
Chenghu Du, Junyin Wang, Yi Rong et al.
Regularized Parameter Uncertainty for Improving Generalization in Reinforcement Learning
Pehuen Moure, Longbiao Cheng, Joachim Ott et al.
Robust Noisy Correspondence Learning with Equivariant Similarity Consistency
Yuchen Yang, Erkun Yang, Likai Wang et al.
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval
Shenshen Li, Chen He, Xing Xu et al.
Learning Multi-Task Sparse Representation Based on Fisher Information
Yayu Zhang, Yuhua Qian, Guoshuai Ma et al.
WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting
Zhiliang Wu, Changchang Sun, Hanyu Xuan et al.
Task-Driven Wavelets using Constrained Empirical Risk Minimization
Eric Marcus, Ray Sheombarsing, Jan-Jakob Sonke et al.
Defeasible Normative Reasoning: A Proof-Theoretic Integration of Logical Argumentation
Ofer Arieli, Kees van Berkel, Christian Straßer
Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion
Naishan Zheng, Man Zhou, Jie Huang et al.
Cross-Domain Contrastive Learning for Time Series Clustering
Furong Peng, Jiachen Luo, Xuan Lu et al.
HACDR-Net: Heterogeneous-Aware Convolutional Network for Diabetic Retinopathy Multi-Lesion Segmentation
QiHao Xu, Xiaoling Luo, Chao Huang et al.
Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning
Chen Bai, Jianwang Zhai, Yuzhe Ma et al.