Most Cited AAAI "neural activity modeling" Papers
Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning
Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.
Unifying Decision and Function Queries in Stochastic Boolean Satisfiability
Yu-Wei Fan, Jie-Hong Jiang
Achieving Ensemble-Like Performance in a Single Model: A Feature Diversification Framework for Image-Text Matching
Zhao Zhou, Yiqun Wang, Weizhong Zhang et al.
Improving Generalization of Deep Neural Networks by Optimum Shifting
Yuyan Zhou, Ye Li, Lei Feng et al.
AI-Powered Algorithm-Centric Quantum Processor Topology Design
Tian Li, Xiao-Yue Xu, Chen Ding et al.
Enhancing Training of Spiking Neural Network with Stochastic Latency
Srinivas Anumasa, Bhaskar Mukhoty, Velibor Bojkovic et al.
Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization
Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.
Spatiotemporal-Aware Neural Fields for Dynamic CT Reconstruction
Qingyang Zhou, Yunfan Ye, Zhiping Cai
GLIC: General Format Learned Image Compression
MingSheng Zhou, MingMing Kong
SeqRank: Sequential Ranking of Salient Objects
SceneX: Procedural Controllable Large-Scale Scene Generation
Mengqi Zhou, Yuxi Wang, Jun Hou et al.
Joint Class-level and Instance-level Relationship Modeling for Novel Class Discovery
Jiaying Zhou, Qingchao Chen
Bilateral Gradual Semantics for Weighted Argumentation
Core-to-Global Reasoning for Compositional Visual Question Answering
Hao Zhou, Tingjin Luo, Zhangqi Jiang
What Makes Quantization for Large Language Model Hard? An Empirical Study from the Lens of Perturbation
Huankang Guan, Rynson W.H. Lau
Uncertainty-Aware Yield Prediction with Multimodal Molecular Features
Jiayuan Chen, Kehan Guo, Zhen Liu et al.
Preference Aware Dual Contrastive Learning for Item Cold-Start Recommendation
Wenbo Wang, Bingquan Liu, Lili Shan et al.
Learning Performance Maximizing Ensembles with Explainability Guarantees
Vincent Pisztora, Jia Li
Communication Efficient Distributed Newton Method over Unreliable Networks
Ming Wen, Chengchang Liu, Yuedong Xu
DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning
Guojin Zhong, Jinhong Hu, Jiajun Chen et al.
MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection
Chuyi Zhong, Dingkang Yang, Peng Zhai et al.
Continual Vision-Language Retrieval via Dynamic Knowledge Rectification
Zhenyu Cui, Yuxin Peng, Xun Wang et al.
When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data
Rongjia Zheng, Qing Zhang, Yongwei Nie et al.
MuST: Robust Image Watermarking for Multi-Source Tracing
Guanjie Wang, Zehua Ma, Chang Liu et al.
Supportive Negatives Spectral Augmentation for Source-Free Cross-Domain Segmentation
Kexin Zheng, Haifeng Xia, Siyu Xia et al.
PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model
Hongsheng Zheng, Zhongyun Bao, Gang Fu et al.
Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention
Naman Shah, Siddharth Srivastava
Optimizing the Optimization of Planning Domains by Automatic Action Schema Splitting
Mojtaba Elahi, Jussi Rintanen
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Domain Learning
Chuangchuang Tan, Yao Zhao, Shikui Wei et al.
HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking
Zeyong Zhao, Yanchao Hao, Minghao Zhang et al.
PMRC: Prompt-Based Machine Reading Comprehension for Few-Shot Named Entity Recognition
Jin Huang, Danfeng Yan, Yuanqiang Cai
NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark
Yuxuan Zhao, Weijian Ruan, He Li et al.
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
Yucheng Zhao, Gengyu Lyu, Ke Li et al.
Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning
Xujian Zhao, Yixin Wang, Peiquan Jin
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation
Jing Li, Junsong Fan, Yuran Yang et al.
From Toxic to Trustworthy: Using Self-Distillation and Semi-supervised Methods to Refine Neural Networks
Xianda Zhang, Baolin Zheng, Jianbao Hu et al.
MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs
Ke Liang, Lingyuan Meng, Sihang Zhou et al.
Evolving Parameterized Prompt Memory for Continual Learning
Muhammad Rifki Kurniawan, Xiang Song, Zhiheng Ma et al.
ACAMDA: Improving Data Efficiency in Reinforcement Learning through Guided Counterfactual Data Augmentation
Yuewen Sun, Erli Wang, Biwei Huang et al.
Towards Safe Policy Learning under Partial Identifiability: A Causal Approach
Shalmali Joshi, Junzhe Zhang, Elias Bareinboim
HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning
Hongbin Pei, Taile Chen, Chen A et al.
Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching
Xuanpu Zhao, Dianmo Sheng, Zhentao Tan et al.
Excluding the Impossible for Open Vocabulary Semantic Segmentation
Shiyuan Zhao, Baodi Liu, Yu Bai et al.
Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation
Hongxu Zhao, Zelin Gao, Yue Wang et al.
Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition
Zhicheng Zhang, Hao Tang, Jinhui Tang
Computing the Why-Provenance for Datalog Queries via SAT Solvers
Haitong Luo, Xuying Meng, Suhang Wang et al.
FairTrade: Achieving Pareto-Optimal Trade-Offs between Balanced Accuracy and Fairness in Federated Learning
Maryam Badar, Sandipan Sikdar, Wolfgang Nejdl et al.
Training-Free Image Manipulation Localization Using Diffusion Models
Zhenfei Zhang, Ming-Ching Chang, Xin Li
RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack
Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.
Partial Point Cloud Registration with Multi-view 2D Image Learning
Yue Zhang, Yue Wu, Wenping Ma et al.
InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction
Yi Zhang, Xiaoyang Huang, Yishun Dou et al.
PhyCamo: A Robust Physical Camouflage via Contrastive Learning for Multi-View Physical Adversarial Attack
Ximin Zhang, Jinyin Chen, Haibin Zheng et al.
Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation
Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan et al.
Iterative Self-Training with Class-Aware Text-to-Image Synthesis for Visual Task Learning
Xiang Zhang, Wanqing Zhao, Pengyang Li et al.
Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views
Songchun Zhang, Chunhui Zhao
DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection
Shuo Zhang, Jiaming Huang, Wenbing Tang et al.
Towards Multi-Mode Outlier Robust Tensor Ring Decomposition
Yuning Qiu, Guoxu Zhou, Andong Wang et al.
Visual Perturbation for Text-Based Person Search
Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.
SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image
Peng Zhang, Yuan Li, Haotian Song et al.
PanoDiT: Panoramic Videos Generation with Diffusion Transformer
Muyang Zhang, Yuzhi Chen, Rongtao Xu et al.
A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion Recognition
Han Lu, Xiahai Zhuang, Qiang Luo
Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media
Mingyang Zhang, Junkang Zhang, Faming Fang et al.
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
Mingjin Zhang, Yuanjun Ouyang, Fei Gao et al.
IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection
Mingjin Zhang, Xiaolong Li, Fei Gao et al.
Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection
Kai Li, Wenqi Ren, Jianshu Li et al.
Cumulative Difference Learning VAE for Time-Series with Temporally Correlated Inflow-Outflow
Tianchun Li, Chengxiang Wu, Pengyi Shi et al.
Common Sense Bias Modeling for Classification Tasks
Miao Zhang, Zee Fryer, Ben Colman et al.
R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy
Li Zhang, Haonan Jiang, Yukang Huo et al.
Frame Semantic Role Labeling Using Arbitrary-Order Conditional Random Fields
When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach
Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.
Spatio-Temporal Pivotal Graph Neural Networks for Traffic Flow Forecasting
Xiangyang Miao, Guobao Xiao, Shiping Wang et al.
Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning
Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.
An Efficient Subgraph-Inferring Framework for Large-Scale Heterogeneous Graphs
Wei Zhou, Hong Huang, Ruize Shi et al.
DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields
Boyu Zhang, Zheng Zhu, Wenbo Xu
TGFormer: Transformer with Track Query Group for Multi-Object Tracking
Rui Zeng, Yuanzhou Huang, Songwei Pei
Efficient Neural Network Encoding for 3D Color Lookup Tables
Vahid Zehtab, David B. Lindell, Marcus A. Brubaker et al.
Rectangle Search: An Anytime Beam Search
Sofia Lemons, Wheeler Ruml, Rob Holte et al.
Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
Sen Pei, Shixiong Xu, Xiaojie Jin
Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language
Xiang Fang, Daizong Liu, Wanlong Fang et al.
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective
Xinmiao Yu, Xiaocheng Feng, Yun Li et al.
SkipDiff: Adaptive Skip Diffusion Model for High-Fidelity Perceptual Image Super-resolution
Xiaotong Luo, Yuan Xie, Yanyun Qu et al.
OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion
Wei Yu, Zonglin Li, Qinglin Liu et al.
Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering
Ting Yu, Zixuan Tong, Jun Yu et al.
STGC-NeRF: Spatial-Temporal Geometric Consistency for LiDAR Neural Radiance Fields in Dynamic Scenes
Shangshu Yu, Xiaotian Sun, Wen Li et al.
ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models
Qing Yu, Mikihiro Tanaka, Kent Fujiwara
Separating the Wheat from the Chaff: Spatio-Temporal Transformer with View-interweaved Attention for Photon-Efficient Depth Sensing
Letian Yu, Jiaxi Yang, Bo Dong et al.
SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation
Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.
FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications
Ellen Yi-Ge, Leo Shawn
ShareBERT: Embeddings Are Capable of Learning Hidden Layers
Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli et al.
Spatio-Temporal Fusion for Human Action Recognition via Joint Trajectory Graph
Yaolin Zheng, Hongbo Huang, Xiuying Wang et al.
Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation
Junyang Chen, Guoxuan Zou, Pan Zhou et al.
PromptHaze: Prompting Real-world Dehazing via Depth Anything Model
Tian Ye, Sixiang Chen, Haoyu Chen et al.
VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement
Haocun Ye, Xinlong Jiang, Chenlong Gao et al.
Sharpness-Aware Model-Agnostic Long-Tailed Domain Generalization
Houcheng Su, Weihao Luo, Daixian Liu et al.
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
Shuilian Yao, Yu Liu, Qi Jia et al.
MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking
Mufeng Yao, Jinlong Peng, Qingdong He et al.
RealPortrait: Realistic Portrait Animation with Diffusion Transformers
Zejun Yang, Huawei Wei, Zhisheng Wang
ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions
Xin Yang, Xin Zhang, Xinchao Wang
Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning
Xin Yang, Wending Yan, Yuan Yuan et al.
DriveGazen: Event-Based Driving Status Recognition Using Conventional Camera
Xiaoyin Yang, Xin Yang
Dual Information Purification for Lightweight SAR Object Detection
Xi Yang, Jiachen Sun, Songsong Duan et al.
Asymmetric Hierarchical Difference-aware Interaction Network for Event-guided Motion Deblurring
Wen Yang, Jinjian Wu, Leida Li et al.
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection
Enquan Yang, Peng Xing, Hanyang Sun et al.
Robust Image Hashing Based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment
Cundian Yang, Guibo Luo, Yuesheng Zhu et al.
Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models
YangTian Yan, Jinyu Tian
Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks
Yuliang Xue, Lei Tan, Guobiao Li et al.
StegaStyleGAN: Towards Generic and Practical Generative Image Steganography
Wenkang Su, Jiangqun Ni, Yiyan Sun
RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting
Wen Xue, Chun Ding, Ruotao Xu et al.
Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Zhongxing Xu, Feilong Tang, Zhe Chen et al.
Simple Weak Coresets for Non-decomposable Classification Measures
Jayesh Malaviya, Anirban Dasgupta, Rachit Chhaya
Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Gronding
Wenjia Geng, Yong Liu, Lei Chen et al.
A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search over Policy Trees
Marcus Hoerger, Hanna Kurniawati, Dirk Kroese et al.
FATE: Feature-Adapted Parameter Tuning for Vision-Language Models
Zhengqin Xu, Zelin Peng, Xiaokang Yang et al.
Explainable Origin-Destination Crowd Flow Interpolation via Variational Multi-Modal Recurrent Graph Auto-Encoder
Qiang Zhou, Xinjiang Lu, Jingjing Gu et al.
HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection
Yongchao Xu, Jiawei Liu, Sen Tao et al.
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection
Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.
Low Category Uncertainty and High Training Potential Instance Learning for Unsupervised Domain Adaptation
Xinyu Zhang, Meng Kang, Shuai Lü
Multiple Feature Refining Network for Visual Emotion Distribution Learning
Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.
Efficient Learning of PDEs via Taylor Expansion and Sparse Decomposition into Value and Fourier Domains
Md Nasim, Yexiang Xue
Discrepancy and Uncertainty Aware Denoising Knowledge Distillation for Zero-Shot Cross-Lingual Named Entity Recognition
Ling Ge, Chunming Hu, Guanghui Ma et al.
Foundations of Reactive Synthesis for Declarative Process Specifications
Andrey Rivkin, Luca Geatti, Marco Montali
3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation
FeiFan Xu, Tianyi Chen, Fan Yang et al.
Less Is More: Token Context-Aware Learning for Object Tracking
Chenlong Xu, Bineng Zhong, Qihua Liang et al.
FR²Seg: Continual Segmentation Across Multiple Sites via Fourier Style Replay and Adaptive Consistency Regularization
Cheng Xu, Weiwen Zhang, Hongrui Zhang et al.
Resource Efficient Deep Learning Hardware Watermarks with Signature Alignment
Joseph Clements, Yingjie Lao
DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles
Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.
Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation
Jingqiao Xiu, Mengze Li, Zongxin Yang et al.
IOFM: Using the Interpolation Technique on the Over-Fitted Models to Identify Clean-Annotated Samples
Dongha Kim, Yongchan Choi, Kunwoong Kim et al.
Improving Distinguishability of Class for Graph Neural Networks
Dongxiao He, Shuwei Liu, Zhizhi Yu et al.
Discrete Prior-Based Temporal-Coherent Content Prediction for Blind Face Video Restoration
Lianxin Xie, Bingbing Zheng, Wen Xue et al.
Omni-Query Active Learning for Source-Free Domain Adaptive Cross-Modality 3D Semantic Segmentation
Jianxiang Xie, Yao Wu, Yachao Zhang et al.
Boosting Vision State Space Model with Fractal Scanning
Haoke Xiao, Lv Tang, Peng-tao Jiang et al.
SMR-Net: Semantic-Guided Mutually Reinforcing Network for Cross-Modal Image Fusion and Salient Object Detection
Guobao Xiao, Xinyu Liu, Zebin Lin et al.
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks
Zhipeng Qian, Yiwei Ma, Jiayi Ji et al.
Iterative Regularization with K-support Norm: An Important Complement to Sparse Recovery
William de Vazelhes, Bhaskar Mukhoty, Xiaotong Yuan et al.
Learning GAI-Decomposable Utility Models for Multiattribute Decision Making
Margot Herin, Patrice Perny, Nataliya Sokolovska
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing
Xiaole Xian, Xilin He, Zenghao Niu et al.
PlaNet: Learning to Mitigate Atmospheric Turbulence in Planetary Images
Yifei Xia, Chu Zhou, Chengxuan Zhu et al.
‘Why Didn’t You Allocate This Task to Them?’ Negotiation-Aware Task Allocation and Contrastive Explanation Generation
Zahra Zahedi, Sailik Sengupta, Subbarao Kambhampati
RETRACTED: GEONet: Global Enhancement and Optimization Network for Lane Detection
Suyang Xi, Yunhao Liu, Hong Ding et al.
Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models
Zimeng Wu, Jiaxin Chen, Yunhong Wang
MUCD: Unsupervised Point Cloud Change Detection via Masked Consistency
Yue Wu, Zhipeng Wang, Yongzhe Yuan et al.
Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation
Yirui Wu, Yuhang Xia, Hao Li et al.
VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval
Peng Wu, Wanshun Su, Xiangteng He et al.
SVRMamba: Slice-to-Volume Reconstruction from Multiple MRI Stacks with Slice Sequence Guided Mamba
Jiangjie Wu, Hongjiang Wei, Yuyao Zhang
Spin: Diffusion-based Semantic Image Painting Through Independent Information Injection
Dantong Wu, Zhiqiang Chen, Tianjiao Du et al.
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
Yuanbo Wen, Tao Gao, Jing Zhang et al.
Mitigating Idiom Inconsistency: A Multi-Semantic Contrastive Learning Method for Chinese Idiom Reading Comprehension
Mingmin Wu, Yuxue Hu, Yongcheng Zhang et al.
Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning
Yang Wei, Jingyu Tan, Guowen Xu et al.
GlyphSR: A Simple Glyph-Aware Framework for Scene Text Image Super-Resolution
Baole Wei, Yuxuan Zhou, Liangcai Gao et al.
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
Zihao Wang, Yiming Huang, Gengyu Lyu et al.
GOALNET: Interleaving Neural Goal Predicate Inference with Classical Planning for Generalization in Robot Instruction Following
Jigyasa Gupta, Shreya Sharma, Shreshth Tuli et al.
Attention-Imperceptible Backdoor Attacks on Vision Transformers
Zhishen Wang, Rui Wang, Lihua Jing
Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model
Zhen Wang, Yaozu Wu, Dongyuan Li et al.
Style Nursing with Spatial and Semantic Guidance for Zero-Shot Traffic Scene Style Transfer
Zhen Wang, Zihang Lin, Meng Yuan et al.
Two-Stage Evolutionary Reinforcement Learning for Enhancing Exploration and Exploitation
DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision
Yun Wang, Jiahao Zheng, Chenghao Zhang et al.
Target Scanpath-Guided 360-Degree Image Enhancement
Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson
Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds
SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer
Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.
Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
Youjia Wang, Yiwen Wu, Hengan Zhou et al.
RefDetector: A Simple Yet Effective Matching-based Method for Referring Expression Comprehension
Yabing Wang, Zhuotao Tian, Zheng Qin et al.
From Coarse to Fine: A Matching and Alignment Framework for Unsupervised Cross-View Geo-Localization
Xueyi Wang, Lele Zhang, Zheng Fan et al.
Enhancing Neural Radiance Fields with Adaptive Multi-Exposure Fusion: A Bilevel Optimization Approach for Novel View Synthesis
Yang Zou, Xingyuan Li, Zhiying Jiang et al.
MIMTrack: In-Context Tracking via Masked Image Modeling
Xingmei Wang, Guohao Nie, Jiaxiang Meng et al.
Lifting Scheme-Based Implicit Disentanglement of Emotion-Related Facial Dynamics in the Wild
Xingjian Wang, Li Chai
DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy
Xi Wang, Xueyang Fu, Liang Li et al.
FreeGen: Bridging Visual-Linguistic Discrepancies Towards Diffusion-based Pixel-level Data Synthesis
Wenzhuang Wang, Mingcan Ma, Yong Chen et al.
Imagine: Image-Guided 3D Part Assembly with Structure Knowledge Graph
Weihao Wang, Yu Lan, Mingyu You et al.
The Parables of the Mustard Seed and the Yeast: Extremely Low-Budget, High-Performance Nighttime Semantic Segmentation
Shiqin Wang, Xin Xu, Haoyang Chen et al.
Deep Multi-modal Graph Clustering via Graph Transformer Network
Qianqian Wang, Haiming Xu, Zihao Zhang et al.
Tracking Everything Everywhere across Multiple Cameras
Li-Heng Wang, YuJu Cheng, Tyng-Luh Liu
EMControl: Adding Conditional Control to Text-to-Image Diffusion Models via Expectation-Maximization
He Wang, Longquan Dai, Jinhui Tang
msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression
Miaohui Wang, Runnan Huang, Hengjin Dong et al.
S³-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation
Gui Wang, Yuexiang Li, Wenting Chen et al.
Scene Graph-Grounded Image Generation
Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.
A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection
Fu Wang, Yanghao Zhang, Xiangyu Yin et al.
RA-GAR: A Richly Annotated Benchmark for Gait Attribute Recognition
Chenye Wang, Saihui Hou, Aoqi Li et al.
Chain-of-Thought Improves Text Generation with Citations in Large Language Models
Bin Ji, Huijun Liu, Mingzhe Du et al.
The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models
Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama
Hypergraph Neural Architecture Search
Wei Lin, Xu Peng, Zhengtao Yu et al.
Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text
Xuyang Chen, Dong Wang, Konrad Schindler et al.
Machine Learning-Powered Combinatorial Clock Auction
Ermis Nikiforos Soumalias, Jakob Weissteiner, Jakob Heiss et al.
Boosting Few-Shot Learning via Attentive Feature Regularization
Xingyu Zhu, Shuo Wang, Jinda Lu et al.
VOILA: Complexity-Aware Universal Segmentation of CT Images by Voxel Interacting with Language
Zishuo Wan, Yu Gao, Wanyuan Pang et al.
Memory-Augmented Re-Completion for 3D Semantic Scene Completion
Yu-Wen Tseng, Sheng-Ping Yang, Jhih-Ciang Wu et al.
LSTKC: Long Short-Term Knowledge Consolidation for Lifelong Person Re-identification
Kunlun Xu, Xu Zou, Jiahuan Zhou
Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds
Tuo Feng, Ruijie Quan, Xiaohan Wang et al.
Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos
Haitao Tian, Pierre Payeur
TraceEvader: Making DeepFakes More Untraceable via Evading the Forgery Model Attribution
Mengjie Wu, Jingui Ma, Run Wang et al.
3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling
Zichen Tang, Hongyu Yang, Hanchen Zhang et al.
From Representation Space to Prognostic Insights: Whole Slide Image Generation with Hierarchical Diffusion Model for Survival Prediction
Zhihao Tang, Xi Zhang, Chaozhuo Li
Learning Only When It Matters: Cost-Aware Long-Tailed Classification
Yu-Cheng He, Yao-Xiang Ding, Han-Jia Ye et al.
RAGG: Retrieval-Augmented Grasp Generation Model
Zhenhua Tang, Bin Zhu, Yanbin Hao et al.
MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment
Yequan Bie, Luyang Luo, Hao Chen
Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation
Yuyan Chen, Yichen Yuan, Panjun Liu et al.
NaMa: Neighbor-Aware Multi-Modal Adaptive Learning for Prostate Tumor Segmentation on Anisotropic MR Images
Runqi Meng, Xiao Zhang, Shijie Huang et al.
M2Flow: A Motion Information Fusion Framework for Enhanced Unsupervised Optical Flow Estimation in Autonomous Driving
Xunpei Sun, Gang Chen, Zuoxun Hou
Transferable Adversarial Attacks for Object Detection Using Object-Aware Significant Feature Distortion
Xinlong Ding, Jiansheng Chen, Hongwei Yu et al.
Taxonomy Driven Fast Adversarial Training
Kun Tong, Chengze Jiang, Jie Gui et al.