Most Cited 2025 "piecewise polynomial representation" Papers
22,274 papers found • Page 109 of 112
Conference
Teaching LLMs How to Learn with Contextual Fine-Tuning
Younwoo Choi, Muhammad Adil Asif, Ziwen Han et al.
Open-Vocabulary Customization from CLIP via Data-Free Knowledge Distillation
Yongxian Wei, Zixuan Hu, Li Shen et al.
Towards counterfactual fairness through auxiliary variables
Bowei Tian, Ziyao Wang, Shwai He et al.
Towards Continuous Reuse of Graph Models via Holistic Memory Diversification
Ziyue Qiao, Junren Xiao, Qingqiang Sun et al.
Cauchy-Schwarz Regularizers
Sueda Taner, Ziyi Wang, Christoph Studer
INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning
Yuqian Fu, Yuanheng Zhu, Jian Zhao et al.
FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models
Zhanwei Zhang, Shizhao Sun, Wenxiao Wang et al.
Improving Complex Reasoning with Dynamic Prompt Corruption: A Soft Prompt Optimization Approach
Sinan Fan, Liang Xie, Chen Shen et al.
Discriminator-Guided Embodied Planning for LLM Agent
Haofu Qian, Chenjia Bai, Jiatao Zhang et al.
Controllable Unlearning for Image-to-Image Generative Models via $\epsilon$-Constrained Optimization
XiaoHua Feng, Yuyuan Li, Chaochao Chen et al.
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding
Yanming Liu, Xinyue Peng, Jiannan Cao et al.
Greener GRASS: Enhancing GNNs with Encoding, Rewiring, and Attention
Tongzhou Liao, Barnabás Póczos
Generation and Comprehension Hand-in-Hand: Vision-guided Expression Diffusion for Boosting Referring Expression Generation and Comprehension
Jingcheng Ke, Jun-Cheng Chen, I-Hong Jhuo et al.
Sketch2Diagram: Generating Vector Diagrams from Hand-Drawn Sketches
Itsumi Saito, Haruto Yoshida, Keisuke Sakaguchi
BP-Modified Local Loss for Efficient Training of Deep Neural Networks
REN Lianhai, Qianxiao Li
DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models
Zeping Min, Xinshang Wang
Improving Neural Network Accuracy by Concurrently Training with a Twin Network
Benjamin Vandersmissen, Lucas Deckers, Jose Oramas
Iterative Sparse Attention for Long-sequence Recommendation
Guanyu Lin, Jinwei Luo, Yinfeng Li et al.
Coherency Improved Explainable Recommendation via Large Language Model
Shijie Liu, Ruixin Ding, Weihai Lu et al.
Designing Concise ConvNets with Columnar Stages
Ashish Kumar, Jaesik Park
Local convergence of simultaneous min-max algorithms to differential equilibrium on Riemannian manifold
Sixin Zhang
Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy
Zhenghai Xue, Bo An, Shuicheng YAN
On the Optimal Memorization Capacity of Transformers
Tokio Kajitsuka, Issei Sato
Do Stochastic, Feel Noiseless: Stable Stochastic Optimization via a Double Momentum Mechanism
Tehila Dahan, Kfir Y Levy
Reexamining the Aleatoric and Epistemic Uncertainty Dichotomy
Michael Kirchhof, Gjergji Kasneci, Enkelejda Kasneci
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions
Yan Ru Pei
How Feature Learning Can Improve Neural Scaling Laws
Blake Bordelon, Alexander Atanasov, Cengiz Pehlevan
Multi-LLM-Agents Debate - Performance, Efficiency, and Scaling Challenges
Hangfan Zhang, Zhiyao Cui, Qiaosheng Zhang et al.
Small-to-Large Generalization: Training Data Influences Models Consistently Across Scale
Alaa Khaddaj, Logan Engstrom, Aleksander Madry
Exploring The Forgetting in Adversarial Training: A Novel Method for Enhancing Robustness
Xianglu Wang, Hu Ding
Can LLM Simulations Truly Reflect Humanity? A Deep Dive
Qian Wang, Zhenheng Tang, Bingsheng He
Multi-objective antibody design with constrained preference optimization
Milong Ren, ZaiKai He, Haicang Zhang
Each Fake News Is Fake in Its Own Way: An Attribution Multi-Granularity Benchmark for Multimodal Fake News Detection
Hao Guo, Zihan Ma, Zhi Zeng et al.
Open-world Radio Frequency Fingerprint Identification via Augmented Semi-supervised Learning
Zehua Han, Jing Xiao, Qirui Zhao et al.
Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models
Haoran Li, Yulin Chen, Zihao Zheng et al.
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Junxian Li, Di Zhang, Xunzhi Wang et al.
Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection
Kai Li, Wenqi Ren, Jianshu Li et al.
Enhancing the Adversarial Robustness via Manifold Projection
Zhiting Li, Shibai Yin, Tai-Xiang Jiang et al.
Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy
Jian-Ping Mei, Weibin Zhang, Jie Chen et al.
Exploring Query Efficient Data Generation Towards Data-Free Model Stealing in Hard Label Setting
Gaozheng Pei, Shaojie Lyu, Ke Ma et al.
HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting
Feng Shibo, Peilin Zhao, Liu Liu et al.
Embedding Robust Watermarking into Pattern to Protect the Copyright of Ceramic Artifacts
Lei Tan, Yuliang Xue, Guobiao Li et al.
PScalpel: A Machine Learning-based Guider for Protein Phase-Separating Behaviour Alteration
Jia Wang, Liyan Zhu, Zhe Wang et al.
VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy
Ruoqi Wang, Haitao Wang, Qiong Luo et al.
DearLLM: Enhancing Personalized Healthcare via Large Language Models-Deduced Feature Correlations
Yongxin Xu, Xinke Jiang, Xu Chu et al.
Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model
Haozhen Zhang, Haodong Yue, Xi Xiao et al.
Motif-Oriented Representation Learning with Topology Refinement for Drug-Drug Interaction Prediction
Ran Zhang, Xuezhi Wang, Guannan Liu et al.
DeNC: Unleash Neural Codecs in Video Streaming with Diffusion Enhancement
Qihua Zhou, Ruibin Li, Jingcai Guo et al.
Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection
Linlin Zong, Wenmin Lin, Jiahui Zhou et al.
AFFAKT: A Hierarchical Optimal Transport Based Method for Affective Facial Knowledge Transfer in Video Deception Detection
Zihan Ji, Xuetao Tian, Ye Liu
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
Yu Liang, Wenjie Wei, Ammar Belatreche et al.
Knowledge-Enhanced Hierarchical Heterogeneous Graph for Personality Identification with Limited Training Data
Yuxuan Song, Qiudan Li, Yilin Wu et al.
Alignment of CNN and Human Judgments of Geometric and Topological Concepts
Neha Upadhyay, Vijay Marupudi, Kamala Varma et al.
Look Around Before Locating: Considering Content and Structure Information for Visual Grounding
Shiyi Zheng, Peizhi Zhao, Zhilong Zheng et al.
Bridge Then Begin Anew: Generating Target-Relevant Intermediate Model for Source-Free Visual Emotion Adaptation
Jiankun Zhu, Sicheng Zhao, Jing Jiang et al.
Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers
Lichen Bai, Zixuan Xiong, Hai Lin et al.
Plug-and-Play Tri-Branch Invertible Block for Image Rescaling
Jingwei Bao, Jinhua Hao, Pengcheng Xu et al.
Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis
Jian Bi, Qianliang Wu, Jianjun Qian et al.
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
Xiuli Bi, Jian Lu, Bo Liu et al.
Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model
Cong Cao, Huanjing Yue, Xin Liu et al.
Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation
Haipeng Chen, Sifan Wu, Zhigang Wang et al.
Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization
Kehua Chen, Zhenlong Yuan, Tianlu Mao et al.
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization
Nan Chen, Mengqi Huang, Zhuowei Chen et al.
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
Shunxin Chen, Ajian Liu, Junze Zheng et al.
DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models
Wei Chen, Jianwei Niu, Xuefeng Liu et al.
Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning
Xingchi Chen, Zhuoran Zheng, Xuerui Li et al.
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis
Yiliang Chen, Steven SC Ho, Cheng Xu et al.
Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution
Yuying Chen, Mingde Yao, Wenbo Li et al.
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
Zheng Chen, Yu Zeng, Zehui Chen et al.
Gradient Alignment Improves Test-Time Adaptation for Medical Image Segmentation
Ziyang Chen, Yiwen Ye, Yongsheng Pan et al.
Zero-Shot Scene Change Detection
Kyusik Cho, Dong Yeop Kim, Euntai Kim
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
Quan Dao, Hao Phung, Trung Tuan Dao et al.
Single Exposure Quantitative Phase Imaging with a Conventional Microscope Using Diffusion Models
Gabriel della Maggiora, Luis Alberto Croquevielle, Harry Horsley et al.
IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective
Guodong Fan, Zishu Yao, Guang-Yong Chen et al.
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes
Chaoran Feng, Wangbo Yu, Xinhua Cheng et al.
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration
Siyang Feng, Huadeng Wang, Chu Han et al.
Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking
Teng Fu, Haiyang Yu, Ke Niu et al.
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations
Mingze Gao, Jingyu Liu, Mingda Li et al.
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.
SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera
Yijia Guo, Liwen Hu, Yuanxi Bai et al.
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving
Wencheng Han, Dongqian Guo, Cheng-Zhong Xu et al.
Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail
Yina He, Lei Peng, Yongcun Zhang et al.
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation
Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.
Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References
Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
Jintong Hu, Bin Xia, Bin Chen et al.
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression
Qiang Hu, Houqiang Zhong, Zihan Zheng et al.
Identity-Text Video Corpus Grounding
Bin Huang, Xin Wang, Hong Chen et al.
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening
Jie Huang, Rui Huang, Jinghao Xu et al.
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Shaofei Huang, Rui Ling, Hongyu Li et al.
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors
Tianyu Huang, Haoze Zhang, Yihan Zeng et al.
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence
Wenbo Huang, Jinghui Zhang, Guang Li et al.
Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine
Xiaoshuang Huang, Lingdong Shen, Jia Liu et al.
Medical MLLM Is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Xijie Huang, Xinyuan Wang, Hantao Zhang et al.
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection
Xun Huang, Ziyu Xu, Hai Wu et al.
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model
Yunlong Huang, Junshuo Liu, Ke Xian et al.
SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation
Jimao Jiang, Diya Sun, Tianbing Wang et al.
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang, Younggeun Lee, Seungjun Oh et al.
PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling
Donghyun Kim, Hyeonkyeong Kwon, Yumin Kim et al.
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee
APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising
Hyunjun Kim, Nam Ik Cho
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder
Jungho Kim, Changwon Kang, Dongyoung Lee et al.
Stable Mean Teacher for Semi-supervised Video Action Detection
Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat
Color Transfer with Modulated Flows
Maria Larchenko, Alexander Lobashev, Dmitry Guskov et al.
MAMS: Model-Agnostic Module Selection Framework for Video Captioning
Sangho Lee, Il Yong Chun, Hogun Park
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques
Chunxiao Li, Xiaoxiao Wang, Boming Miao et al.
Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution
Guangyuan Li, Yongkang Wang, Junsheng Luan et al.
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization
Maodong Li, Chao Zheng, Jian Wang et al.
DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs
Shiyu Li, Pengxu Wei, Pengchong Qiao et al.
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium
Xinzhe Li, Jiahui Zhan, Shengfeng He et al.
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion
Li Liang, Naveed Akhtar, Jordan Vice et al.
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
Ente Lin, Xujie Zhang, Fuwei Zhao et al.
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
Jiayi Lin, Jiabo Huang, Jian Hu et al.
Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations
Decheng Liu, Zongqi Wang, Chunlei Peng et al.
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
Hongjian Liu, Qingsong Xie, Tianxiang Ye et al.
PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing
Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang et al.
Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering
Jiapeng Liu, Liang Li, Shihao Rao et al.
VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization
Tao Liu, Ziyang Ma, Qi Chen et al.
Unlocking the Potential of Reverse Distillation for Anomaly Detection
Xinyue Liu, Jianyuan Wang, Biao Leng et al.
Does VLM Classification Benefit from LLM Description Semantics?
Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.
A Trusted Lesion-assessment Network for Interpretable Diagnosis of Coronary Artery Disease in Coronary CT Angiography
Xinghua Ma, Xinyan Fang, Mingye Zou et al.
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
Yue Ma, Yingqing He, Hongfa Wang et al.
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem
Xinji Mai, Haoran Wang, Zeng Tao et al.
DMF-Net: Image-Guided Point Cloud Completion with Dual-Channel Modality Fusion and Shape-Aware Upsampling Transformer
Aihua Mao, Yuxuan Tang, Jiangtao Huang et al.
iMoT: Inertial Motion Transformer for Inertial Navigation
Son Minh Nguyen, Duc Viet Le, Paul Havinga
SPU-IMR: Self-supervised Arbitrary-scale Point Cloud Upsampling via Iterative Mask-recovery Network
Ziming Nie, Qiao Wu, Chenlei Lv et al.
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space
Linchao Pan, Can Gao, Jie Zhou et al.
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation
Qingtao Pan, Wenhao Qiao, Jingjiao Lou et al.
S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging
Yimu Pan, Sitao Zhang, Alison D. Gernand et al.
Partially Blinded Unlearning: Class Unlearning for Deep Networks from Bayesian Perspective
Subhodip Panda, Shashwat Sourav, Prathosh A.P.
Adaptive Dual-domain Learning for Underwater Image Enhancement
Lintao Peng, Liheng Bian
OAMaskFlow: Occlusion-Aware Motion Mask for Scene Flow
Xiongfeng Peng, Zhihua Liu, Weiming Li et al.
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
Wei Qian, Gaoji Su, Dan Guo et al.
HSOD-BIT-V2: A Challenging Benchmark for Hyperspectral Salient Object Detection
Yuhao Qiu, Shuyan Bai, Tingfa Xu et al.
GHOST: Gaussian Hypothesis Open-Set Technique
Ryan Rabinowitz, Steve Cruz, Manuel Günther et al.
CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer
Ran Ran, Jiwei Wei, Xiangyi Cai et al.
FunEditor: Achieving Complex Image Edits via Function Aggregation with Diffusion Models
Mohammadreza Samadi, Fred X. Han, Mohammad Salameh et al.
PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks
Sheng Shang, Chenglong Zhao, Ruixin Zhang et al.
HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection
Zican Shi, Jing Hu, Jie Ren et al.
CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks
Wenfeng Song, Yang Ding, Fei Hou et al.
ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps
Xingke Song, Xiaoying Yang, Chenglin Yao et al.
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer
Lei Su, Xiaochen Ma, Xuekang Zhu et al.
NeuralFlix: A Simple While Effective Framework for Semantic Decoding of Videos from Non-invasive Brain Recordings
Jingyuan Sun, Mingxiao Li, Marie-Francine Moens
Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval
Zelong Sun, Dong Jing, Guoxing Yang et al.
More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
Yuan Tang, Xu Han, Xianzhi Li et al.
M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images
Hongyi Wang, Xiuju Du, Jing Liu et al.
MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding
Jiaze Wang, Yi Wang, Ziyu Guo et al.
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang, Bin Chen, Bin Kang et al.
GFlow: Recovering 4D World from Monocular Video
Shizun Wang, Xingyi Yang, Qiuhong Shen et al.
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences
Weitao Wang, Haoran Xu, Yuxiao Yang et al.
From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Xilin Wang, Jia Zheng, Yuanchao Hu et al.
DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision
Yun Wang, Jiahao Zheng, Chenghao Zhang et al.
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
Zeyu Wang, Chen Li, Huiying Xu et al.
Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration
Yuanbo Wen, Tao Gao, Jing Zhang et al.
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation
Wanjiang Weng, Hongsong Wang, Junbo Wang et al.
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.
Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models
Zimeng Wu, Jiaxin Chen, Yunhong Wang
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing
Xiaole Xian, Xilin He, Zenghao Niu et al.
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval
Jian Xiao, Zhenzhen Hu, Jia Li et al.
Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation
Jingqiao Xiu, Mengze Li, Zongxin Yang et al.
DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles
Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.
Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model
Jiahua Xu, Dawei Zhou, Lei Hu et al.
Multiple Feature Refining Network for Visual Emotion Distribution Learning
Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection
Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.
HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection
Yongchao Xu, Jiawei Liu, Sen Tao et al.
OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-On
Yuhao Xu, Tao Gu, Weifeng Chen et al.
RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting
Wen Xue, Chun Ding, Ruotao Xu et al.
Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks
Yuliang Xue, Lei Tan, Guobiao Li et al.
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection
Enquan Yang, Peng Xing, Hanyang Sun et al.
ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions
Xin Yang, Xin Zhang, Xinchao Wang
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
Shuilian Yao, Yu Liu, Qi Jia et al.
Optimized Gradient Clipping for Noisy Label Learning
Xichen Ye, Yifan Wu, Weizhong Zhang et al.
SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation
Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.
World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving
Mingliang Zhai, Cheng Li, Zengyuan Guo et al.
DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields
Boyu Zhang, Zheng Zhu, Wenbo Xu
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang, Penghui Wang, Chunxiao Liu et al.
R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy
Li Zhang, Haonan Jiang, Yukang Huo et al.
IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection
Mingjin Zhang, Xiaolong Li, Fei Gao et al.
SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image
Peng Zhang, Yuan Li, Haotian Song et al.
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Xinyi Zhang, Qiqi Bao, Qinpeng Cui et al.
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.
Category Prompt Mamba Network for Nuclei Segmentation and Classification
Ye Zhang, Zijie Fang, Yifeng Wang et al.
Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations
Yi Zhang, Chun-Wun Cheng, Junyi He et al.
RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack
Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.
KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Shu Zhao, Tan Yu, Xiaoshuai Hao et al.
Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning
Xujian Zhao, Yixin Wang, Peiquan Jin
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
Yucheng Zhao, Gengyu Lyu, Ke Li et al.
Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization
Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.
Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning
Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
Yitao Zhu, Sheng Wang, Mengjie Xu et al.
Linear Equations with Min and Max Operators: Computational Complexity
Krishnendu Chatterjee, Ruichen Luo, Raimundo Saona et al.
Online Prompt Selection for Program Synthesis
Yixuan Li, Lewis Frampton, Federico Mora et al.
Towards Real-Time Approximate Counting
Yash Pote, Kuldeep S. Meel, Jiong Yang
Decomposed Quadratization: Efficient QUBO Formulation for Learning Bayesian Network
Yuta Shikuri
On Designing the Optimal Integrated Ad Auction in E-commerce Platforms
Yuchao Ma, Weian Li, Yuhan Wang et al.
Trust-GRS: A Trustworthy Training Framework for Graph Neural Network Based Recommender Systems Against Shilling Attacks
Lingyu Mu, Zhengxiao Liu, Zhitong Zhu et al.
Online Fraud Detection via Test-Time Retrieval-Based Representation Enrichment
Yiran Qiao, Ningtao Wang, Yuncong Gao et al.
GeoMamba: Towards Multi-granular POI Recommendation with Geographical State Space Model
Yifang Qin, Jiaxuan Xie, Zhiping Xiao et al.
Hyperparametric Robust and Dynamic Influence Maximization
Arkaprava Saha, Bogdan Cautis, Xiaokui Xiao et al.
Towards Loss-Resilient Image Coding for Unstable Satellite Networks
Hongwei Sha, Muchen Dong, Quanyou Luo et al.
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
Hao Shi, Weili Song, Xinting Zhang et al.