Most Cited AAAI "synthesized data generation" Papers
5,317 papers found • Page 18 of 27
Conference
RefDetector: A Simple Yet Effective Matching-based Method for Referring Expression Comprehension
Yabing Wang, Zhuotao Tian, Zheng Qin et al.
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
Yaxian Wang, Henghui Ding, Shuting He et al.
Breaking Barriers in Physical-World Adversarial Examples: Improving Robustness and Transferability via Robust Feature
Yichen Wang, Yuxuan Chou, Ziqi Zhou et al.
Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
Youjia Wang, Yiwen Wu, Hengan Zhou et al.
Re-Attentional Controllable Video Diffusion Editing
Yuanzhi Wang, Yong Li, Mengyi Liu et al.
MambaPro: Multi-Modal Object Re-identification with Mamba Aggregation and Synergistic Prompt
Yuhao Wang, Xuehu Liu, Tianyu Yan et al.
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
Yuji Wang, Jingchen Ni, Yong Liu et al.
Target Scanpath-Guided 360-Degree Image Enhancement
Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson
DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision
Yun Wang, Jiahao Zheng, Chenghao Zhang et al.
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
Zeyu Wang, Chen Li, Huiying Xu et al.
Style Nursing with Spatial and Semantic Guidance for Zero-Shot Traffic Scene Style Transfer
Zhen Wang, Zihang Lin, Meng Yuan et al.
Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model
Zhen Wang, Yaozu Wu, Dongyuan Li et al.
Attention-Imperceptible Backdoor Attacks on Vision Transformers
Zhishen Wang, Rui Wang, Lihua Jing
LLM-RG4: Flexible and Factual Radiology Report Generation Across Diverse Input Contexts
Zhuhao Wang, Yihua Sun, Zihan Li et al.
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
Zihao Wang, Yiming Huang, Gengyu Lyu et al.
GlyphSR: A Simple Glyph-Aware Framework for Scene Text Image Super-Resolution
Baole Wei, Yuxuan Zhou, Liangcai Gao et al.
Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning
Yang Wei, Jingyu Tan, Guowen Xu et al.
Achieving Lightweight Super-Resolution for Real-Time Computer Graphics
Yu Wen, Chen Zhang, Chenhao Xie et al.
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
Yuanbo Wen, Tao Gao, Jing Zhang et al.
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation
Wanjiang Weng, Hongsong Wang, Junbo Wang et al.
Spin: Diffusion-based Semantic Image Painting Through Independent Information Injection
Dantong Wu, Zhiqiang Chen, Tianjiao Du et al.
Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation
Dongyue Wu, Zilin Guo, Li Yu et al.
SVRMamba: Slice-to-Volume Reconstruction from Multiple MRI Stacks with Slice Sequence Guided Mamba
Jiangjie Wu, Hongjiang Wei, Yuyao Zhang
VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval
Peng Wu, Wanshun Su, Xiangteng He et al.
Realistic Noise Synthesis with Diffusion Models
Qi Wu, Mingyan Han, Ting Jiang et al.
PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening
RuoCheng Wu, Zien Zhang, Shangqi Deng et al.
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
Shengqiong Wu, Hao Fei, Liangming Pan et al.
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu, Yong Zhang, Xintao Wang et al.
Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation
Yirui Wu, Yuhang Xia, Hao Li et al.
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.
MUCD: Unsupervised Point Cloud Change Detection via Masked Consistency
Yue Wu, Zhipeng Wang, Yongzhe Yuan et al.
Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models
Zimeng Wu, Jiaxin Chen, Yunhong Wang
RETRACTED: GEONet: Global Enhancement and Optimization Network for Lane Detection
Suyang Xi, Yunhao Liu, Hong Ding et al.
PlaNet: Learning to Mitigate Atmospheric Turbulence in Planetary Images
Yifei Xia, Chu Zhou, Chengxuan Zhu et al.
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing
Xiaole Xian, Xilin He, Zenghao Niu et al.
ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters
Xunzhi Xiang, Haiwei Xue, Zonghong Dai et al.
SMR-Net: Semantic-Guided Mutually Reinforcing Network for Cross-Modal Image Fusion and Salient Object Detection
Guobao Xiao, Xinyu Liu, Zebin Lin et al.
Boosting Vision State Space Model with Fractal Scanning
Haoke Xiao, Lv Tang, Peng-tao Jiang et al.
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval
Jian Xiao, Zhenzhen Hu, Jia Li et al.
Cross-modulated Attention Transformer for RGBT Tracking
Yun Xiao, Jiacong Zhao, Andong Lu et al.
Omni-Query Active Learning for Source-Free Domain Adaptive Cross-Modality 3D Semantic Segmentation
Jianxiang Xie, Yao Wu, Yachao Zhang et al.
TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
Jingjing Xie, Yuxin Zhang, Jun Peng et al.
Discrete Prior-Based Temporal-Coherent Content Prediction for Blind Face Video Restoration
Lianxin Xie, Bingbing Zheng, Wen Xue et al.
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules
Peijin Xie, Lin Sun, Bingquan Liu et al.
PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
Yifan Xie, Tao Feng, Xin Zhang et al.
HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models
Zhifeng Xie, Hao Li, Huiming Ding et al.
Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation
Jingqiao Xiu, Mengze Li, Zongxin Yang et al.
DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles
Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.
FR²Seg: Continual Segmentation Across Multiple Sites via Fourier Style Replay and Adaptive Consistency Regularization
Cheng Xu, Weiwen Zhang, Hongrui Zhang et al.
Less Is More: Token Context-Aware Learning for Object Tracking
Chenlong Xu, Bineng Zhong, Qihua Liang et al.
3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation
FeiFan Xu, Tianyi Chen, Fan Yang et al.
Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model
Jiahua Xu, Dawei Zhou, Lei Hu et al.
OmniSR: Shadow Removal Under Direct and Indirect Lighting
Jiamin Xu, Zelong Li, Yuxin Zheng et al.
Multiple Feature Refining Network for Visual Emotion Distribution Learning
Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection
Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.
LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data
Shaocong Xu, Pengfei Li, Qianpu Sun et al.
Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Yifang Xu, Yunzhuo Sun, Benxiang Zhai et al.
HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection
Yongchao Xu, Jiawei Liu, Sen Tao et al.
OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-On
Yuhao Xu, Tao Gu, Weifeng Chen et al.
FLAME: Learning to Navigate with Multimodal LLM in Urban Environments
Yunzhe Xu, Yiyuan Pan, Zhe Liu et al.
FATE: Feature-Adapted Parameter Tuning for Vision-Language Models
Zhengqin Xu, Zelin Peng, Xiaokang Yang et al.
Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Zhongxing Xu, Feilong Tang, Zhe Chen et al.
RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting
Wen Xue, Chun Ding, Ruotao Xu et al.
Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks
Yuliang Xue, Lei Tan, Guobiao Li et al.
Towards Universal Rainy Image Restoration: Benchmark and Baseline
Hujie Yan
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
Ke Yan, Qing Cai, Fan Zhang et al.
Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models
YangTian Yan, Jinyu Tian
Robust Image Hashing Based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment
Cundian Yang, Guibo Luo, Yuesheng Zhu et al.
PlanLLM: Video Procedure Planning with Refinable Large Language Models
Dejie Yang, Zijing Zhao, Yang Liu
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection
Enquan Yang, Peng Xing, Hanyang Sun et al.
Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution
Jiarui Yang, Tao Dai, Yufei Zhu et al.
SMamba: Sparse Mamba for Event-based Object Detection
Nan Yang, Yang Wang, Zhanwen Liu et al.
One-Shot Reference-based Structure-Aware Image to Sketch Synthesis
Rui Yang, Honghong Yang, Li Zhao et al.
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding
Senqiao Yang, Jiaming Liu, Renrui Zhang et al.
Asymmetric Hierarchical Difference-aware Interaction Network for Event-guided Motion Deblurring
Wen Yang, Jinjian Wu, Leida Li et al.
Dual Information Purification for Lightweight SAR Object Detection
Xi Yang, Jiachen Sun, Songsong Duan et al.
DriveGazen: Event-Based Driving Status Recognition Using Conventional Camera
Xiaoyin Yang, Xin Yang
Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning
Xin Yang, Wending Yan, Yuan Yuan et al.
ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions
Xin Yang, Xin Zhang, Xinchao Wang
FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models
Xinye Yang, Yuxin Yang, Haoran Pang et al.
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang, Jianbiao Mei, Yukai Ma et al.
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
Yuxiang Yang, Hongjie Gu, Yingqi Deng et al.
RealPortrait: Realistic Portrait Animation with Diffusion Transformers
Zejun Yang, Huawei Wei, Zhisheng Wang
Single Image Rolling Shutter Removal with Diffusion Models
Zhanglei Yang, Haipeng Li, Mingbo Hong et al.
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
Zhifei Yang, Keyang Lu, Chao Zhang et al.
MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation
Zhiwei Yang, Yucong Meng, Kexue Fu et al.
MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking
Mufeng Yao, Jinlong Peng, Qingdong He et al.
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
Shuilian Yao, Yu Liu, Qi Jia et al.
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation
Chengyang Ye, Yunzhi Zhuge, Pingping Zhang
VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement
Haocun Ye, Xinlong Jiang, Chenlong Gao et al.
PromptHaze: Prompting Real-world Dehazing via Depth Anything Model
Tian Ye, Sixiang Chen, Haoyu Chen et al.
Optimized Gradient Clipping for Noisy Label Learning
Xichen Ye, Yifan Wu, Weizhong Zhang et al.
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Jeong Hun Yeo, Chae Won Kim, Hyunjun Kim et al.
FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications
Ellen Yi-Ge, Leo Shawn
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Seungdong Yoa, Seungjun Lee, Hye-Seung Cho et al.
FOCUS: Towards Universal Foreground Segmentation
Zuyao You, Lingyu Kong, Lingchen Meng et al.
SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation
Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.
Separating the Wheat from the Chaff: Spatio-Temporal Transformer with View-interweaved Attention for Photon-Efficient Depth Sensing
Letian Yu, Jiaxi Yang, Bo Dong et al.
ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models
Qing Yu, Mikihiro Tanaka, Kent Fujiwara
STGC-NeRF: Spatial-Temporal Geometric Consistency for LiDAR Neural Radiance Fields in Dynamic Scenes
Shangshu Yu, Xiaotian Sun, Wen Li et al.
KeyPose: Category-Level 6D Object Pose Estimation with Self-Adaptive Keypoints
Sheng Yu, Di-Hua Zhai, Yuanqing Xia
Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering
Ting Yu, Zixuan Tong, Jun Yu et al.
OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion
Wei Yu, Zonglin Li, Qinglin Liu et al.
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective
Xinmiao Yu, Xiaocheng Feng, Yun Li et al.
Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
Yating Yu, Congqi Cao, Yueran Zhang et al.
OLMD: Orientation-aware Long-term Motion Decoupling for Continuous Sign Language Recognition
Yiheng Yu, Sheng Liu, Yuan Feng et al.
Where Precision Meets Efficiency: Transformation Diffusion Model for Point Cloud Registration
Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.
Efficient Neural Network Encoding for 3D Color Lookup Tables
Vahid Zehtab, David B. Lindell, Marcus A. Brubaker et al.
Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation
Guanzhong Zeng, Jingjing Wang, Zefu Xu et al.
TGFormer: Transformer with Track Query Group for Multi-Object Tracking
Rui Zeng, Yuanzhou Huang, Songwei Pei
Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting
Zhaojie Zeng, Yuesong Wang, Lili Ju et al.
World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving
Mingliang Zhai, Cheng Li, Zengyuan Guo et al.
DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields
Boyu Zhang, Zheng Zhu, Wenbo Xu
Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning
Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.
When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach
Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
Jiaxin Zhang, Wentao Yang, Songxuan Lai et al.
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang, Penghui Wang, Chunxiao Liu et al.
R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy
Li Zhang, Haonan Jiang, Yukang Huo et al.
Common Sense Bias Modeling for Classification Tasks
Miao Zhang, Zee Fryer, Ben Colman et al.
IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection
Mingjin Zhang, Xiaolong Li, Fei Gao et al.
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
Mingjin Zhang, Yuanjun Ouyang, Fei Gao et al.
Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media
Mingyang Zhang, Junkang Zhang, Faming Fang et al.
PanoDiT: Panoramic Videos Generation with Diffusion Transformer
Muyang Zhang, Yuzhi Chen, Rongtao Xu et al.
SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image
Peng Zhang, Yuan Li, Haotian Song et al.
Visual Perturbation for Text-Based Person Search
Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.
Matching While Perceiving: Enhance Image Feature Matching with Applicable Semantic Amalgamation
Shihua Zhang, Zhenjie Zhu, Zizhuo Li et al.
DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection
Shuo Zhang, Jiaming Huang, Wenbing Tang et al.
Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views
Songchun Zhang, Chunhui Zhao
DZAD: Diffusion-based Zero-shot Anomaly Detection
Tianrui Zhang, Liang Gao, Xinyu Li et al.
Enhancing Implicit Neural Representations via Symmetric Power Transformation
Weixiang Zhang, Shuzhao Xie, Chengwei Ren et al.
Iterative Self-Training with Class-Aware Text-to-Image Synthesis for Visual Task Learning
Xiang Zhang, Wanqing Zhao, Pengyang Li et al.
Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation
Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan et al.
PhyCamo: A Robust Physical Camouflage via Contrastive Learning for Multi-View Physical Adversarial Attack
Ximin Zhang, Jinyin Chen, Haibin Zheng et al.
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
Xinjie Zhang, Shenyuan Gao, Zhening Liu et al.
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Xinyi Zhang, Qiqi Bao, Qinpeng Cui et al.
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Yan Zhang, Gangyan Zeng, Huawen Shen et al.
Category Prompt Mamba Network for Nuclei Segmentation and Classification
Ye Zhang, Zijie Fang, Yifeng Wang et al.
Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations
Yi Zhang, Chun-Wun Cheng, Junyi He et al.
InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction
Yi Zhang, Xiaoyang Huang, Yishun Dou et al.
Partial Point Cloud Registration with Multi-view 2D Image Learning
Yue Zhang, Yue Wu, Wenping Ma et al.
RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack
Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.
Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry
Zhaoxing Zhang, Junda Cheng, Gangwei Xu et al.
Training-Free Image Manipulation Localization Using Diffusion Models
Zhenfei Zhang, Ming-Ching Chang, Xin Li
Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition
Zhicheng Zhang, Hao Tang, Jinhui Tang
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao, Xiaofeng Wang, Zheng Zhu et al.
Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation
Hongxu Zhao, Zelin Gao, Yue Wang et al.
Excluding the Impossible for Open Vocabulary Semantic Segmentation
Shiyuan Zhao, Baodi Liu, Yu Bai et al.
KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Shu Zhao, Tan Yu, Xiaoshuai Hao et al.
Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching
Xuanpu Zhao, Dianmo Sheng, Zhentao Tan et al.
Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning
Xujian Zhao, Yixin Wang, Peiquan Jin
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
Yucheng Zhao, Gengyu Lyu, Ke Li et al.
NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark
Yuxuan Zhao, Weijian Ruan, He Li et al.
HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking
Zeyong Zhao, Yanchao Hao, Minghao Zhang et al.
PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model
Hongsheng Zheng, Zhongyun Bao, Gang Fu et al.
Breaking Information Isolation: Accelerating MRI via Inter-sequence Mapping and Progressive Masking
Jianwei Zheng, Xiaomin Yao, Guojiang Shen et al.
Supportive Negatives Spectral Augmentation for Source-Free Cross-Domain Segmentation
Kexin Zheng, Haifeng Xia, Siyu Xia et al.
When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data
Rongjia Zheng, Qing Zhang, Yongwei Nie et al.
A New Adversarial Perspective for LiDAR-based 3D Object Detection
Shijun Zheng, Weiquan Liu, Yu Guo et al.
Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment
Yuanfan Zheng, Jinlin Wu, Wuyang Li et al.
MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection
Chuyi Zhong, Dingkang Yang, Peng Zhai et al.
DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning
Guojin Zhong, Jinhong Hu, Jiajun Chen et al.
PointCFormer: A Relation-Based Progressive Feature Extraction Network for Point Cloud Completion
Yi Zhong, Weize Quan, Dong-Ming Yan et al.
Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression
Chuqin Zhou, Guo Lu, Jiangchuan Li et al.
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
Haitao Zhou, Chuang Wang, Rui Nie et al.
Core-to-Global Reasoning for Compositional Visual Question Answering
Hao Zhou, Tingjin Luo, Zhangqi Jiang
Joint Class-level and Instance-level Relationship Modeling for Novel Class Discovery
Jiaying Zhou, Qingchao Chen
GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Jingqiu Zhou, Lue Fan, Xuesong Chen et al.
SceneX: Procedural Controllable Large-Scale Scene Generation
Mengqi Zhou, Yuxi Wang, Jun Hou et al.
GLIC: General Format Learned Image Compression
MingSheng Zhou, MingMing Kong
Mitigating Feature Gap for Adversarial Robustness by Feature Disentanglement
Nuoyan Zhou, Dawei Zhou, Decheng Liu et al.
Spatiotemporal-Aware Neural Fields for Dynamic CT Reconstruction
Qingyang Zhou, Yunfan Ye, Zhiping Cai
Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization
Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.
Improving Generalization of Deep Neural Networks by Optimum Shifting
Yuyan Zhou, Ye Li, Lei Feng et al.
Achieving Ensemble-Like Performance in a Single Model: A Feature Diversification Framework for Image-Text Matching
Zhao Zhou, Yiqun Wang, Weizhong Zhang et al.
Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning
Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.
An Exemplar-based Framework for Chinese Text Recognition
Zhao Zhou, Xiangcheng Du, Yingbin Zheng et al.
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions
Ziqi Zhou, Weize Quan, Hailin Shi et al.
A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization
Jiaying Zhu, Dong Li, Xueyang Fu et al.
Thin-Plate Spline-based Interpolation for Animation Line Inbetweening
Tianyi Zhu, Wei Shang, Dongwei Ren
Mesh Watermark Removal Attack and Mitigation: A Novel Perspective of Function Space
Xingyu Zhu, Guanhui Ye, Chengdong Dong et al.
Mesoscopic Insights: Orchestrating Multi-Scale & Hybrid Architecture for Image Manipulation Localization
Xuekang Zhu, Xiaochen Ma, Lei Su et al.
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
Yitao Zhu, Sheng Wang, Mengjie Xu et al.
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
Jiedong Zhuang, Lu Lu, Ming Dai et al.
Dynamic Entity-Masked Graph Diffusion Model for Histopathology Image Representation Learning
Zhenfeng Zhuang, Min Cen, Yanfeng Li et al.
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction
Pufan Zou, Shijia Zhao, Weijie Huang et al.
L-Man: A Large Multi-modal Model Unifying Human-centric Tasks
Jialong Zuo, Ying Nie, Tianyu Guo et al.
Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning
Swann Bessa, Darius Dabert, Max Bourgeat et al.
Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound
Cătălin E. Brița, Jacobus G. M. van der Linden, Emir Demirović
Linear Equations with Min and Max Operators: Computational Complexity
Krishnendu Chatterjee, Ruichen Luo, Raimundo Saona et al.
GPU-Accelerated Parallel Bilevel Optimization for Roubst 6G ISAC
Xingdi Chen, Kai Yang
Proof Simulation via Round-based Strategy Extraction for QBF
Leroy Chew
Decentralized Projected Riemannian Stochastic Recursive Momentum Method for Nonconvex Optimization
Kangkang Deng, Jiang Hu
Parameterized Complexity of Caching in Networks
Robert Ganian, Fionn Mc Inerney, Dimitra Tsigkari
FFCG: Effective and Fast Family Column Generation for Solving Large-Scale Linear Program
Yi-Xiang Hu, Feng Wu, Shaoang Li et al.
DCC: Differentiable Cardinality Constraints for Partial Index Tracking
Wooyeon Jo, Hyunsouk Cho
Online Prompt Selection for Program Synthesis
Yixuan Li, Lewis Frampton, Federico Mora et al.
Search Strategy Generation for Branch and Bound Using Genetic Programming
Gwen Maudet, Grégoire Danoy
Towards Real-Time Approximate Counting
Yash Pote, Kuldeep S. Meel, Jiong Yang
Computationally Hard Problems Are Hard for QBF Proof Systems Too
Agnes Schleitzer, Olaf Beyersdorff