Most Cited 2024 "fourier embedding" Papers
12,324 papers found • Page 27 of 62
Conference
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li, Junfeng Wu, Weizhi Zhao et al.
Adaptive Proximal Gradient Methods Are Universal Without Approximation
Konstantinos Oikonomidis, Emanuel Laude, Puya Latafat et al.
Full Bayesian Significance Testing via Neural Networks
Zehua Liu, Zimeng Li, Jingyuan Wang et al.
FADAS: Towards Federated Adaptive Asynchronous Optimization
Yujia Wang, Shiqiang Wang, Songtao Lu et al.
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
Liao Shen, Tianqi Liu, Huiqiang Sun et al.
Sample-specific Masks for Visual Reprogramming-based Prompting
Chengyi Cai, Zesheng Ye, Lei Feng et al.
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
Jefferson Hernandez, Ruben Villegas, Vicente Ordonez
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.
PH-Net: Semi-Supervised Breast Lesion Segmentation via Patch-wise Hardness
Siyao Jiang, Huisi Wu, Junyang Chen et al.
Federated Causality Learning with Explainable Adaptive Optimization
Dezhi Yang, Xintong He, Jun Wang et al.
Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation
Xinliang Zhang, Lei Zhu, Hangzhou He et al.
Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation
Xiuding Cai, Yaoyao Zhu, Dong Miao et al.
Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics
Siqi Miao, Zhiyuan Lu, Mia Liu et al.
Learning Representations of Satellite Images From Metadata Supervision
Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.
FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval
Yanzhe Chen, Huasong Zhong, Xiangteng He et al.
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation
Thomas Merth, Qichen Fu, Mohammad Rastegari et al.
M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking
Jiaming Liu, Yue Wu, Maoguo Gong et al.
Recurrent Early Exits for Federated Learning with Heterogeneous Clients
Royson Lee, Javier Fernandez-Marques, Xu Hu et al.
F3Loc: Fusion and Filtering for Floorplan Localization
Changan Chen, Rui Wang, Christoph Vogel et al.
OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization
Xiang Meng, Shibal Ibrahim, Kayhan Behdin et al.
Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking
Yan Gao, Haojun Xu, Jie Li et al.
MagicEraser: Erasing Any Objects via Semantics-Aware Control
FAN LI, Zixiao Zhang, Yi Huang et al.
Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
Zeyang Zhao, Qilong Xue, Yifan Bai et al.
Learning Temporal Resolution in Spectrogram for Audio Classification
Haohe Liu, Xubo Liu, Qiuqiang Kong et al.
On the Utility of 3D Hand Poses for Action Recognition
Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.
Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates
Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui et al.
DiffSED: Sound Event Detection with Denoising Diffusion
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia et al.
Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging
Peirong Liu, Oula Puonti, Xiaoling Hu et al.
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Xinyu Hu, Pengfei Tang, Simiao Zuo et al.
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Qitan Lv, Jie Wang, Hanzhu Chen et al.
Object Recognition as Next Token Prediction
Kaiyu Yue, Bor-Chun Chen, Jonas Geiping et al.
Chronic Poisoning: Backdoor Attack against Split Learning
Fangchao Yu, Bo Zeng, Kai Zhao et al.
Generating Enhanced Negatives for Training Language-Based Object Detectors
Shiyu Zhao, Long Zhao, Vijay Kumar BG et al.
Model Inversion Robustness: Can Transfer Learning Help?
Sy-Tuyen Ho, Koh Jun Hao, Keshigeyan Chandrasegaran et al.
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding
Bowen Shi, Peisen Zhao, Zichen Wang et al.
Retro-fallback: retrosynthetic planning in an uncertain world
Austin Tripp, Krzysztof Maziarz, Sarah Lewis et al.
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
xinlu zhang, Shiyang Li, Xianjun Yang et al.
MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field
Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.
Restoring balance: principled under/oversampling of data for optimal classification
Emanuele Loffredo, Mauro Pastore, Simona Cocco et al.
Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning
Idan Achituve, Idit Diamant, Arnon Netzer et al.
FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data
Zikai Xiao, Zihan Chen, Liyinglan Liu et al.
PAC Prediction Sets Under Label Shift
Wenwen Si, Sangdon Park, Insup Lee et al.
3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting
Zhe Jun Tang, Tat-Jen Cham
Federated Online Adaptation for Deep Stereo
Matteo Poggi, Fabio Tosi
Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Zhixuan Chu, Mengxuan Hu, Qing Cui et al.
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Dachun Kai, Jiayao Lu, Yueyi Zhang et al.
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
Chenhang He, Ruihuang Li, Guowen Zhang et al.
Reward-Free Curricula for Training Robust World Models
Marc Rigter, Minqi Jiang, Ingmar Posner
Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers
Ruiyuan Zhang, Jiaxiang Liu, Zexi Li et al.
Improving Neural Additive Models with Bayesian Principles
Kouroche Bouchiat, Alexander Immer, Hugo Yèche et al.
SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes
Boshi Tang, Zhiyong Wu, Xixin Wu et al.
Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation
Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.
Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training
qiangqiang wu, Yan Xia, Jia Wan et al.
BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation
Zekai Xu, Kang You, Qinghai Guo et al.
InstructGIE: Towards Generalizable Image Editing
Zichong Meng, Changdi Yang, Jun Liu et al.
CHAI: Clustered Head Attention for Efficient LLM Inference
Saurabh Agarwal, Bilge Acun, Basil Hosmer et al.
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
Peng Jin, Hao Li, Zesen Cheng et al.
Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models
Taesup Kim, Donggeun Kim
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun, Hang Zhou, Wengang Zhou et al.
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
Pengkun Jiao, Na Zhao, Jingjing Chen et al.
From Posterior Sampling to Meaningful Diversity in Image Restoration
Noa Cohen, Hila Manor, Yuval Bahat et al.
BRUSLEATTACK: A QUERY-EFFICIENT SCORE- BASED BLACK-BOX SPARSE ADVERSARIAL ATTACK
Quoc Viet Vo, Ehsan Abbasnejad, Damith Ranasinghe
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.
Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views
Ningli Xu, Rongjun Qin
Mitigating Label Noise through Data Ambiguation
Julian Lienen, Eyke Hüllermeier
3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
Weijia Li, Haote Yang, Zhenghao Hu et al.
Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization
Jian Liang, Sheng, Zhengbo Wang et al.
What is Dataset Distillation Learning?
William Yang, Ye Zhu, Zhiwei Deng et al.
WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language
Zhenxiang Lin, Xidong Peng, peishan cong et al.
Fairness-aware Vision Transformer via Debiased Self-Attention
Yao Qiang, Chengyin Li, Prashant Khanduri et al.
STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay
Yu Yongcan, Lijun Sheng, Ran He et al.
Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation
Duo Peng, Zhengbo Zhang, Ping Hu et al.
Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss
Yahong Yang, Juncai He
Privacy-Preserving Optics for Enhancing Protection in Face De-Identification
Jhon Lopez, Carlos Hinojosa, Henry Arguello et al.
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu, Shan Ning, Xuming He
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu, Zhi Wang, Yan Zheng et al.
Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency
Meilong Xu, Xiaoling Hu, Saumya Gupta et al.
GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator
Hengyuan Zhao, Pan Zhou, Mike Zheng Shou
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
Qinyu Zhao, Ming Xu, Kartik Gupta et al.
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim, Yujin Oh, Jong Chul Ye
Few-shot Defect Image Generation based on Consistency Modeling
Qingfeng Shi, Jing Wei, Fei Shen et al.
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.
BAFFLE: A Baseline of Backpropagation-Free Federated Learning
Haozhe Feng, Tianyu Pang, Chao Du et al.
Look, Remember and Reason: Grounded Reasoning in Videos with Language Models
Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.
SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation
Jiaben Chen, Huaizu Jiang
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang, Peiwen Sun, Yuanchao Li et al.
GDA: Generalized Diffusion for Robust Test-time Adaptation
Yun-Yun Tsai, Fu-Chen Chen, Albert Chen et al.
Can OOD Object Detectors Learn from Foundation Models?
Jiahui Liu, Xin Wen, Shizhen Zhao et al.
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Prajwal K R, Bowen Shi, Matthew Le et al.
Tuning-Free Stochastic Optimization
Ahmed Khaled, Chi Jin
Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference
JIAN XU, Delu Zeng, John Paisley
3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views
Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation
Yueru Luo, Shuguang Cui, Zhen Li
Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation
Noel Loo, Ramin Hasani, Mathias Lechner et al.
GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework
Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
mude hui, Zihao Wei, Hongru Zhu et al.
On the hardness of learning under symmetries
Bobak Kiani, Thien Le, Hannah Lawrence et al.
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather
Junsung Park, Kyungmin Kim, Hyunjung Shim
Towards Image Ambient Lighting Normalization
Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.
Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning
Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang
Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning
Jing Xu, Jingzhao Zhang
Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain
Xuanhua He, Tao Hu, Guoli Wang et al.
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Tongjia Chen, Hongshan Yu, Zhengeng Yang et al.
Boosting Adversarial Training via Fisher-Rao Norm-based Regularization
Xiangyu Yin, Wenjie Ruan
A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models
Enshu Liu, Xuefei Ning, Huazhong Yang et al.
ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection
Yin Zhang, Yongqiang Zhang, Zian Zhang et al.
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning
Chen Jin, Ryutaro Tanno, Amrutha Saseendran et al.
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma T, Peihao Wang, Zhiwen Fan et al.
MultiDelete for Multimodal Machine Unlearning
Jiali Cheng, Hadi Amiri
CAD: Photorealistic 3D Generation via Adversarial Distillation
Ziyu Wan, Despoina Paschalidou, Ian Huang et al.
DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects
Dominik Bauer, Zhenjia Xu, Shuran Song
Brain Decodes Deep Nets
Huzheng Yang, James Gee, Jianbo Shi
Self-Guided Generation of Minority Samples Using Diffusion Models
Soobin Um, Jong Chul Ye
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Xiaobin Hu, Xu Peng, Donghao Luo et al.
On Characterizing the Trade-off in Invariant Representation Learning
Vishnu Boddeti, Sepehr Dehdashtian, Bashir Sadeghi
Move Anything with Layered Scene Diffusion
Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.
Hearing Anything Anywhere
Mason Wang, Ryosuke Sawata, Samuel Clarke et al.
Prediction Error-based Classification for Class-Incremental Learning
Michał Zając, Tinne Tuytelaars, Gido M van de Ven
Offline Multi-Objective Optimization
Ke Xue, Rong-Xi Tan, Xiaobin Huang et al.
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand, Amy Zhang, Ufuk Topcu
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models
Yili Wang, Kaixiong Zhou, Ninghao Liu et al.
Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks
Liam Collins, Hamed Hassani, Mahdi Soltanolkotabi et al.
Generalizing across Temporal Domains with Koopman Operators
QIUHAO Zeng, Wei Wang, Fan Zhou et al.
Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution
Qingping Zheng, Ling Zheng, Yuanfan Guo et al.
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
Juno Kim, Kakei Yamamoto, Kazusato Oko et al.
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Yufan Chen, Jiaming Zhang, Kunyu Peng et al.
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
Fangyun Wei, Xi Chen, Lin Luo
CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
Yingji Zhong, Lanqing Hong, Zhenguo Li et al.
An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains
George Eskandar
Audio-driven Talking Face Generation with Stabilized Synchronization Loss
Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.
Enhanced Fine-Grained Motion Diffusion for Text-Driven Human Motion Synthesis
Dong Wei, Xiaoning Sun, Huaijiang Sun et al.
Harnessing Density Ratios for Online Reinforcement Learning
Philip Amortila, Dylan Foster, Nan Jiang et al.
A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data
Wenqiang Li, Weijun Li, Lina Yu et al.
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.
Light Schrödinger Bridge
Alexander Korotin, Nikita Gushchin, Evgeny Burnaev
Conformal Inductive Graph Neural Networks
Soroush H. Zargarbashi, Aleksandar Bojchevski
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu, Yang Li, Yixing Lan et al.
CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception
Jiachen Sun, Haizhong Zheng, Qingzhao Zhang et al.
PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks
Junwei Su, Difan Zou, Chuan Wu
Wavelet Dynamic Selection Network for Inertial Sensor Signal Enhancement
Yifeng Wang, Yi Zhao
Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks
Dongyoung Lim, Sotirios Sabanis
LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation
Nisarg Shah, Vibashan VS, Vishal M. Patel
Effective Video Mirror Detection with Inconsistent Motion Cues
Alex Warren, Ke Xu, Jiaying Lin et al.
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design
Zhihai Wang, Lei Chen, Jie Wang et al.
Taylor Videos for Action Recognition
Lei Wang, Xiuyuan Yuan, Tom Gedeon et al.
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
Zijian Zhou, Zheng Zhu, Holger Caesar et al.
Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball
Simon Weber, Barış Zöngür, Nikita Araslanov et al.
Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning
Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.
ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention
Jiawei Wang, Changjian Li
Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling
Brooks(Ruijia) Niu, Dongxia Wu, Kai Kim et al.
MAFA: Managing False Negatives for Vision-Language Pre-training
Jaeseok Byun, Dohoon Kim, Taesup Moon
FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models
LIn Zhao, Tianchen Zhao, Zinan Lin et al.
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal et al.
SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression
Qingwen Bu, Sungrae Park, Minsoo Khang et al.
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
Zhilu Zhang, Haoyu Wang, Shuai Liu et al.
Community-Invariant Graph Contrastive Learning
Shiyin Tan, Dongyuan Li, Renhe Jiang et al.
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla, Ananye Agarwal, Deepak Pathak
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch et al.
Identifiability of Direct Effects from Summary Causal Graphs
Simon Ferreira, Charles Assaad
CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers
Shahaf Arica, Or Rubin, Sapir Gershov et al.
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai, Federico Tomasi, Sina Ghiassian
Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs
Langzhang Liang, Sunwoo Kim, Kijung Shin et al.
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
Chen Ju, Haicheng Wang, Haozhe Cheng et al.
Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability Composability and Decomposability from Anatomy via Self Supervision
Mohammad Reza Hosseinzadeh Taher, Michael Gotway, Jianming Liang
Rethinking Features-Fused-Pyramid-Neck for Object Detection
Hulin Li
Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Feihu Huang
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning
Yanqi Ge, Qiang Nie, Ye Huang et al.
SeMoLi: What Moves Together Belongs Together
Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni et al.
Retrieval-Augmented Score Distillation for Text-to-3D Generation
Junyoung Seo, Susung Hong, Wooseok Jang et al.
Non-convex Stochastic Composite Optimization with Polyak Momentum
Yuan Gao, Anton Rodomanov, Sebastian Stich
Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts
Kun Jin, Tongxin Yin, Zhongzhu Chen et al.
HYPO: Hyperspherical Out-Of-Distribution Generalization
Haoyue Bai, Yifei Ming, Julian Katz-Samuels et al.
Editable Image Elements for Controllable Synthesis
Jiteng Mu, Michael Gharbi, Richard Zhang et al.
Towards More Unified In-context Visual Understanding
Dianmo Sheng, Dongdong Chen, Zhentao Tan et al.
An Efficient Tester-Learner for Halfspaces
Aravind Gollakota, Adam Klivans, Konstantinos Stavropoulos et al.
FedWon: Triumphing Multi-domain Federated Learning Without Normalization
Weiming Zhuang, Lingjuan Lyu
Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery
Jialu Zhang, Xiaoying Yang, Wentao He et al.
SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field
Ru Li, Jia Liu, Guanghui Liu et al.
3D Multi-frame Fusion for Video Stabilization
Zhan Peng, Xinyi Ye, Weiyue Zhao et al.
Enhancing Ensemble Clustering with Adaptive High-Order Topological Weights
Jiaxuan Xu, Taiyong Li, Lei Duan
Neural Causal Abstractions
Kevin Xia, Elias Bareinboim
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal
Recurrent Distance Filtering for Graph Representation Learning
Yuhui Ding, Antonio Orvieto, Bobby He et al.
Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization
Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
Zhiyuan Ma, Guoli Jia, Bowen Zhou
Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training
Jianwu Li, Kaiyue Shi, Guo-Sen Xie et al.
GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent
DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences
Peidong Li, Wancheng Shen, Qihao Huang et al.
Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.
Measuring Vision-Language STEM Skills of Neural Models
Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.
Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching
Ruonan Yu, Songhua Liu, Jingwen Ye et al.
Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures
Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.
Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context
Shashank Agnihotri, Julia Grabinski, Margret Keuper
Parameterized Approximation Algorithms for Sum of Radii Clustering and Variants
Xianrun Chen, Dachuan Xu, Yicheng Xu et al.
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Müller, Georgios Kaissis, Daniel Rueckert
How to Train the Teacher Model for Effective Knowledge Distillation
Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.
Attacking Perceptual Similarity Metrics
Abhijay Ghildyal, Feng Liu
OHTA: One-shot Hand Avatar via Data-driven Implicit Priors
Xiaozheng Zheng, Chao Wen, Zhuo Su et al.
Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering
Zhaohe Liao, Jiangtong Li, Li Niu et al.