Most Cited 2025 "visual representation" Papers
22,274 papers found • Page 77 of 112
Conference
Relaxing partition admissibility in Cluster-DAGs: a causal calculus with arbitrary variable clustering
Clément Yvernes, Emilie Devijver, Adèle Ribeiro et al.
Sequential Multi-Agent Dynamic Algorithm Configuration
Chen Lu, Ke Xue, Lei Yuan et al.
Leveraging robust optimization for llm alignment under distribution shifts
Mingye Zhu, Yi Liu, Zheren Fu et al.
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
Kwan Yun, Chaelin Kim, Hangyeul Shin et al.
Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
Reza Shirkavand, Peiran Yu, Qi He et al.
Efficient Off-Policy Learning for High-Dimensional Action Spaces
Fabian Otto, Philipp Becker, Vien A Ngo et al.
CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing
Yifan Zhou, Tianshi Xu, Jue Hong et al.
EA3D: Online Open-World 3D Object Extraction from Streaming Videos
Xiaoyu Zhou, Jingqi Wang, Yuang Jia et al.
Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models
Sophia Han, Howard Dai, Stephen Xia et al.
Care-PD: A Multi-Site Anonymized Clinical Dataset for Parkinson’s Disease Gait Assessment
Vida Adeli, Ivan Klabučar, Javad Rajabi et al.
Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians
Changfeng Ma, Ran Bi, Jie Guo et al.
Contribution of task-irrelevant stimuli to drift of neural representations
Farhad Pashakhanloo
OURO: A Self-Bootstrapped Framework for Enhancing Multimodal Scene Understanding
Tianrun Xu, Guanyu Chen, Ye Li et al.
Efficient Kernelized Learning in Polyhedral Games beyond Full Information: From Colonel Blotto to Congestion Games
Andreas Kontogiannis, Vasilis Pollatos, Gabriele Farina et al.
Chain-of-Model Learning for Language Model
Xiaohua Wang, Kaitao Song, Xu Tan et al.
VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Ada Görgün, Bernt Schiele, Jonas Fischer
Least squares variational inference
Yvann Le Fay, Nicolas Chopin, Simon Barthelmé
Balancing Performance and Costs in Best Arm Identification
Michael Harding, Kirthevasan Kandasamy
Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation
Jie Liu, Jiayi Shen, Pan Zhou et al.
Test-time Augmentation Improves Efficiency in Conformal Prediction
Divya M Shanmugam, Helen Lu, Swami Sankaranarayanan et al.
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
Huanlin Gao, Ping Chen, Fuyuan Shi et al.
Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon
Tongtong Liang, Dan Qiao, Yu-Xiang Wang et al.
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D
Jiawei Tan, Hongxing Wang, Junwu Weng et al.
Identifying Macro Causal Effects in C-DMGs over DMGs
Simon Ferreira, Charles Assaad
Towards Robust Pseudo-Label Learning in Semantic Segmentation: An Encoding Perspective
Wangkai Li, Rui Sun, Zhaoyang Li et al.
NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation
Longtian Qiu, Shan Ning, Jiaxuan Sun et al.
FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones
Manfred Georg, Garrett Tanzer, Esha Uboweja et al.
The Rise of Parameter Specialization for Knowledge Storage in Large Language Models
Yihuai Hong, Yiran Zhao, Wei Tang et al.
Dataset Ownership Verification for Pre-trained Masked Models
Yuechen Xie, Jie Song, Yicheng Shan et al.
No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization
Wenhang Shi, Yiren Chen, Shuqing Bian et al.
DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing
Zixiang Li, Haoyu Wang, Wei Wang et al.
Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification
Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.
MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence
Yue Feng, Jinwei Hu, Qijia Lu et al.
Sketchy Bounding-box Supervision for 3D Instance Segmentation
qian deng, Le Hui, Jin Xie et al.
On the Robustness Tradeoff in Fine-Tuning
Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley et al.
Walking the Schrödinger Bridge: A Direct Trajectory for Text-to-3D Generation
Ziying Li, Xuequan Lu, Xinkui Zhao et al.
DecoyDB: A Dataset for Graph Contrastive Learning in Protein-Ligand Binding Affinity Prediction
Yupu Zhang, Zelin Xu, Tingsong Xiao et al.
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels
Jiahao Lu, Weitao Xiong, Jiacheng Deng et al.
Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)
Lennart Bastian, Mohammad Rashed, Nassir Navab et al.
Bandit and Delayed Feedback in Online Structured Prediction
Yuki Shibukawa, Taira Tsuchiya, Shinsaku Sakaue et al.
Real-time design of architectural structures with differentiable mechanics and neural networks
Rafael Pastrana, Eder Medina, Isabel M. de Oliveira et al.
FlashBias: Fast Computation of Attention with Bias
Haixu Wu, Minghao Guo, Yuezhou Ma et al.
RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects
Soumyaratna Debnath, Ashish Tiwari, Kaustubh Sadekar et al.
Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation
Andrea Simonelli, Norman Müller, Peter Kontschieder
Improving Graph Neural Networks by Learning Continuous Edge Directions
Seong Ho Pahng, Sahand Hormoz
Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Xiaomeng Fan, Yuchuan Mao, Zhi Gao et al.
Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models
Sofiane Ennadir, Levente Zólyomi, Oleg Smirnov et al.
Learning to Generate Human-Human-Object Interactions from Textual Descriptions
Jeonghyeon Na, Sangwon Baik, Inhee Lee et al.
Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras
Petr Hruby, Marc Pollefeys
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck, Florian Grötschla, Luca Lanzendörfer et al.
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow
Yiming Gong, Zhen Zhu, Minjia Zhang
Optimizing Distributional Geometry Alignment with Optimal Transport for Generative Dataset Distillation
Xiao Cui, Yulei Qin, Wengang Zhou et al.
Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection
Dongchan Cho, Jiho Han, Keumyeong Kang et al.
QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models
Yutong Wang, Haiyu Wang, Sai Qian Zhang
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
YU WEI, Jiahui Zhang, Xiaoqin Zhang et al.
Knowledge Distillation Detection for Open-weights Models
Qin Shi, Amber Yijia Zheng, Qifan Song et al.
Curriculum Abductive Learning
Wen-Chao Hu, Qi-Jie Li, Lin-Han Jia et al.
Scalable Decentralized Learning with Teleportation
Yuki Takezawa, Sebastian Stich
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization
Kuan Zhang, Chengliang Chai, Jingzhe Xu et al.
Learning 3D Scene Analogies with Neural Contextual Scene Maps
Junho Kim, Gwangtak Bae, Eun Sun Lee et al.
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis
Yinqi Cai, Jichang Li, Zhaolun Li et al.
Capability Localization: Capabilities Can be Localized rather than Individual Knowledge
Xiusheng Huang, Jiaxiang Liu, Yequan Wang et al.
From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries
Joy Hsu, Emily Jin, Jiajun Wu et al.
MoEMeta: Mixture-of-Experts Meta Learning for Few-Shot Relational Learning
Han Wu, Jie Yin
PairEdit: Learning Semantic Variations for Exemplar-based Image Editing
Haoguang Lu, Jiacheng Chen, Zhenguo Yang et al.
OmniDraft: A cross-vocabulary, online adaptive drafter for on-device speculative decoding
Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, Jay Zhuo et al.
Multitask Learning with Stochastic Interpolants
Hugo Negrel, Florentin Coeurdoux, Michael Albergo et al.
When majority rules, minority loses: bias amplification of gradient descent
François Bachoc, Jerome Bolte, Ryan Boustany et al.
HSI: A Holistic Style Injector for Arbitrary Style Transfer
Shuhao Zhang, Hui Kang, Yang Liu et al.
Generalization and Distributed Learning of GFlowNets
Tiago Silva, Amauri Souza, Omar Rivasplata et al.
On Linear Mode Connectivity of Mixture-of-Experts Architectures
Viet-Hoang Tran, Van Hoan Trinh, Khanh-Vinh Bui et al.
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models
Hoigi Seo, Dong Un Kang, Hyunjin Cho et al.
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena, Tommaso Apicella, Stefano Rosa et al.
Thresholds for sensitive optimality and Blackwell optimality in stochastic games
Stephane Gaubert, Julien Grand-Clément, Ricardo Katz
Polarized Color Screen Matting
Kenji Enomoto, Scott Cohen, Brian Price et al.
Voyaging into Perpetual Dynamic Scenes from a Single View
Fengrui Tian, Tianjiao Ding, Jinqi Luo et al.
Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design
Lianghong Chen, Dongkyu Kim, Mike Domaratzki et al.
Improving Editability in Image Generation with Layer-wise Memory
Daneul Kim, Jaeah Lee, Jaesik Park
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals
Changhao Peng
ML4CFD Competition: Results and Retrospective Analysis
Mouadh Yagoubi, David Danan, Milad LEYLI ABADI et al.
Fréchet Geodesic Boosting
Yidong Zhou, SU I IAO, Hans-Georg Müller
Interpreting vision transformers via residual replacement model
Jinyeong Kim, Junhyeok Kim, Yumin Shim et al.
Removing Cost Volumes from Optical Flow Estimators
Simon Kiefhaber, Stefan Roth, Simone Schaub-Meyer
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models
Haidong Xu, Guangwei Xu, Zhedong Zheng et al.
Purge-Gate: Efficient Backpropagation-Free Test-Time Adaptation for Point Clouds via Token purging
Moslem Yazdanpanah, Ali Bahri, Mehrdad Noori et al.
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yinsicheng Jiang, Yao Fu, Yeqi Huang et al.
Princeton365: A Diverse Dataset with Accurate Camera Pose
Karhan Kayan, Stamatis Alexandropoulos, Rishabh Jain et al.
Unveiling Environmental Sensitivity of Individual Gains in Influence Maximization
Xinyan Su, Zhiheng Zhang, Jiyan Qiu et al.
MetaGS: A Meta-Learned Gaussian-Phong Model for Out-of-Distribution 3D Scene Relighting
Yumeng He, Yunbo Wang
HiPoNet: A Multi-View Simplicial Complex Network for High Dimensional Point-Cloud and Single-Cell data
Siddharth Viswanath, Hiren Madhu, Dhananjay Bhaskar et al.
Blameless Users in a Clean Room: Defining Copyright Protection for Generative Models
Aloni Cohen
Enhancing CLIP Robustness via Cross-Modality Alignment
Xingyu Zhu, Beier Zhu, Shuo Wang et al.
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok, Skander Moalla, Caglar Gulcehre
MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
Yunkee Chae, Kyogu Lee
Informed Initialization for Bayesian Optimization and Active Learning
Carl Hvarfner, David Eriksson, Eytan Bakshy et al.
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
Shaofei Huang, Rui Ling, Tianrui Hui et al.
ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration
Andrea Conti, Matteo Poggi, Valerio Cambareri et al.
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation
Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma
Graph Neural Networks Gone Hogwild
Olga Solodova, Nick Richardson, Deniz Oktay et al.
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Giwon Lee, Wooseong Jeong, Daehee Park et al.
Interpretable Next-token Prediction via the Generalized Induction Head
Eunji Kim, Sriya Mantena, Weiwei Yang et al.
Regret Bounds for Episodic Risk-Sensitive Linear Quadratic Regulator
Wenhao Xu, Xuefeng Gao, Xuedong He
3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization
Yuze Hao, Linchao Zhu, Yi Yang
Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!
zihang zou, Boqing Gong, Liqiang Wang
PLMTrajRec: A Scalable and Generalizable Trajectory Recovery Method with Pre-trained Language Models
Tonglong Wei, Yan Lin, Youfang Lin et al.
A Few Moments Please: Scalable Graphon Learning via Moment Matching
Reza Ramezanpour, Victor Manuel Tenorio Gomez, Antonio G. Marques et al.
Unlabeled Data Can Provably Enhance In-Context Learning of Transformers
Renpu Liu, Jing Yang
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen, Zhichao Wang, Wei Huang et al.
Channel Simulation and Distributed Compression with Ensemble Rejection Sampling
Truong Buu Phan, Ashish Khisti
Rethink Sparse Signals for Pose-guided Text-to-image Generation
Wenjie Xuan, Jing Zhang, Juhua Liu et al.
Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation
Joohyun Kwon, Hanbyel Cho, Junmo Kim
PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks
Clinton A Mo, Kun Hu, Chengjiang Long et al.
FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning
Li Zhang, Zhongxuan Han, XiaoHua Feng et al.
Robust learning of halfspaces under log-concave marginals
Jane Lange, Arsen Vasilyan
Wasserstein Convergence of Critically Damped Langevin Diffusions
Stanislas Strasman, Sobihan Surendran, Claire Boyer et al.
A Geometric Analysis of PCA
Ayoub El Hanchi, Murat Erdogdu, Chris Maddison
Enhancing Temporal Understanding in Video-LLMs through Stacked Temporal Attention in Vision Encoders
Leibniz University Hannover, L3S Research Center Ali Rasekh, Erfan Soula, Omid Daliran et al.
AIComposer: Any Style and Content Image Composition via Feature Integration
Haowen Li, Zhenfeng Fan, Zhang Wen et al.
Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification
Daqian Shi, Xiaolei Diao, Xu Chen et al.
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Sagnik Bhattacharya, Abhiram Gorle, Ahsan Bilal et al.
Towards Reliable Identification of Diffusion-based Image Manipulations
Alex Costanzino, Woody Bayliss, Juil Sock et al.
Efficiently Verifiable Proofs of Data Attribution
Ari Karchmer, Seth Neel, Martin Pawelczyk
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes
Ludwic Leonard, Nils Thuerey, rüdiger westermann
End-to-End HOI Reconstruction Transformer with Graph-based Encoding
Zhenrong Wang, Qi Zheng, Sihan Ma et al.
Learning Reconfigurable Representations for Multimodal Federated Learning with Missing Data
Duong Nguyen, Nghia Hoang, Thanh Trung Huynh et al.
Understanding Flatness in Generative Models: Its Role and Benefits
Taehwan Lee, Kyeongkook Seo, Jaejun Yoo et al.
SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction
Xinran Yang, Donghao Ji, Yuanqi Li et al.
HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation
Lingxiao Li, Kaixuan Fan, Boqing Gong et al.
Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation
Yuxin Li, Zihao Zhu, Yuxiang Zhang et al.
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Khanh Nguyen, Ghulam Mubashar Hassan, Ajmal Mian
Do Contemporary Causal Inference Models Capture Real-World Heterogeneity? Findings from a Large-Scale Benchmark
Haining Yu, Yizhou Sun
TADA: Improved Diffusion Sampling with Training-free Augmented DynAmics
Tianrong Chen, Huangjie Zheng, David Berthelot et al.
Learning the Plasticity: Plasticity-Driven Learning Framework in Spiking Neural Networks
Guobin Shen, Dongcheng Zhao, Yiting Dong et al.
MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Elena Zamaraeva, Christopher Collins, George Darling et al.
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Tianhong Gao, Yannian Fu, Weiqun Wu et al.
Transfer Faster, Price Smarter: Minimax Dynamic Pricing under Cross-Market Preference Shift
Yi Zhang, Elynn Chen, Yujun Yan
DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration
Hebaixu Wang, Jing Zhang, Haonan Guo et al.
Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion
Xingyu Hu, Junjun Jiang, Chenyang Wang et al.
Oracle-Efficient Combinatorial Semi-Bandits
Jung-hun Kim, Milan Vojnovic, Min-hwan Oh
VideoCAD: A Dataset and Model for Learning Long‑Horizon 3D CAD UI Interactions from Video
King Yiu Brandon Man, Ghadi Nehme, Md Ferdous Alam et al.
Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation
Mingfeng Fan, Jianan Zhou, Yifeng Zhang et al.
CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation
Zhenhui Ding, Guilian Chen, Qin Zhang et al.
Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels
Hyeonsu Jeong, Hye Won Chung
HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration
Xiyu Zhang, Jiayi Ma, Jianwei Guo et al.
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Zhihang Yuan, Rui Xie, Yuzhang Shang et al.
WildAvatar: Learning In-the-wild 3D Avatars from the Web
Zihao Huang, Shoukang Hu, Guangcong Wang et al.
Causal Identification for Complex Functional Longitudinal Studies
Andrew Ying
The Quest for Universal Master Key Filters in DS-CNNs
Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.
Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation
Yuxin Liu, Zhenghao (Mark) Peng, Xuanhao Cui et al.
PALQO: Physics-informed model for Accelerating Large-scale Quantum Optimization
Yiming Huang, Yajie Hao, Yuxuan Du et al.
Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting
Chenchen Tan, Youyang Qu, Xinghao Li et al.
PASS: Path-selective State Space Model for Event-based Recognition
Jiazhou Zhou, Kanghao Chen, Lei Zhang et al.
Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging
Ibrahim Ethem Hamamci, Sezgin Er, Suprosanna Shit et al.
You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception
hao si, Ehsan Javanmardi, Manabu Tsukada
Causal Spatio-Temporal Prediction: An Effective and Efficient Multi-Modal Approach
Yuting Huang, Ziquan Fang, Zhihao Zeng et al.
PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data
CHANGHEE YANG, Hyeonseop Song, Seokhun Choi et al.
Minimax Optimal Two-Stage Algorithm For Moment Estimation Under Covariate Shift
Zhen Zhang, Xin Liu, Shaoli Wang et al.
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos
Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta
Magical: Medical Lay Language Generation via Semantic Invariance and Layperson-tailored Adaptation
Weibin Liao, Tianlong Wang, Yinghao Zhu et al.
On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts
Fanqi Yan, Huy Nguyen, Le Dung et al.
Variational Learning Finds Flatter Solutions at the Edge of Stability
Avrajit Ghosh, Bai Cong, Rio Yokota et al.
VSNet: Focusing on the Linguistic Characteristics of Sign Language
Yuhao Li, Xinyue Chen, Hongkai Li et al.
Impact of Layer Norm on Memorization and Generalization in Transformers
Rishi Singhal, Jung-Eun Kim
Learning Successor Features with Distributed Hebbian Temporal Memory
Evgenii Dzhivelikian, Petr Kuderov, Aleksandr Panov
LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching
Zhuo Cao, Xuan Zhao, Lena Krieger et al.
Channel Matters: Estimating Channel Influence for Multivariate Time Series
Muyao Wang, Zeke Xie, Bo Chen et al.
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Yoojin Jung, Byung Cheol Song
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
R.D. Lin, Pengcheng Weng, Yinqiao Wang et al.
V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models
Jisoo Kim, Wooseok Seo, Junwan Kim et al.
Towards Scalable Human-aligned Benchmark for Text-guided Image Editing
Suho Ryu, Kihyun Kim, Eugene Baek et al.
Scaling Up Active Testing to Large Language Models
Gabrielle Berrada, Jannik Kossen, Freddie Bickford Smith et al.
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification
Jianwei Zhao, XIN LI, Fan Yang et al.
Quantifying Uncertainty in the Presence of Distribution Shifts
Yuli Slavutsky, David Blei
Exploring Structural Degradation in Dense Representations for Self-supervised Learning
Siran Dai, Qianqian Xu, Peisong Wen et al.
Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum
Wenquan Lu, Jiaqi Zhang, Hugues Van Assel et al.
AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
Xinbiao Wang, Yuxuan Du, Zihan Lou et al.
C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction
Kuan Wei Huang, Brandon Li, Bharath Hariharan et al.
Homogeneous Dynamics Space for Heterogeneous Humans
Xinpeng Liu, Junxuan Liang, Chenshuo Zhang et al.
Towards Robust Zero-Shot Reinforcement Learning
Kexin ZHENG, Lauriane Teyssier, Yinan Zheng et al.
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness
Lucas Piper, Arlindo L Oliveira, Tiago Marques
All in One: Visual-Description-Guided Unified Point Cloud Segmentation
Zongyan Han, Mohamed El Amine Boudjoghra, Jiahua Dong et al.
On the rankability of visual embeddings
Ankit Sonthalia, Arnas Uselis, Seong Joon Oh
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai, Qihang Fan, Xuefeng Hu et al.
Breaking the Gradient Barrier: Unveiling Large Language Models for Strategic Classification
Xinpeng Lv, Yunxin Mao, Haoxuan Li et al.
Online Learning of Pure States is as Hard as Mixed States
Maxime Meyer, Soumik Adhikary, Naixu Guo et al.
On Universality Classes of Equivariant Networks
Marco Pacini, Gabriele Santin, Bruno Lepri et al.
Take the Bull by the Horns: Learning to Segment Hard Samples
Yuan Guo, Jingyu Kong, Yu Wang et al.
DISCO: Disentangled Communication Steering for Large Language Models
Max Torop, Aria Masoomi, Masih Eskandar et al.
GreenHyperSpectra: A multi-source hyperspectral dataset for global vegetation trait prediction
Eya Cherif, Arthur Ouaknine, Luke Brown et al.
Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation
Ting Wei, Biao Mei, Junliang Lyu et al.
Native Segmentation Vision Transformers
Guillem Brasó, Aljosa Osep, Laura Leal-Taixé
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior
Juncheng Mu, Chengwei REN, Weixiang Zhang et al.
Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios
Chunxiao Li, Xiaoxiao Wang, Meiling Li et al.
Implicit Counterfactual Learning for Audio-Visual Segmentation
Mingfeng Zha, Tianyu Li, Guoqing Wang et al.
A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation
Zheng Zhang, Guanchun Yin, Bo Zhang et al.
On topological descriptors for graph products
Mattie Ji, Amauri Souza, Vikas Garg
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance
Zhang Li, Biao Yang, Qiang Liu et al.
Training-free Detection of AI-generated images via Cropping Robustness
Sungik Choi, Hankook Lee, Moontae Lee
Non-Adaptive Adversarial Face Generation
Sunpill Kim, Seunghun Paik, Chanwoo Hwang et al.