Most Cited 2024 "genai model evaluation" Papers
Linear Log-Normal Attention with Unbiased Concentration
Yury Nahshan, Joseph Kampeas, Emir Haleva
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
Shilin Yan, Xiaohao Xu, Renrui Zhang et al.
Instance Tracking in 3D Scenes from Egocentric Videos
Yunhan Zhao, Haoyu Ma, Shu Kong et al.
Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization
Jiajun Hu, Jian Zhang, Lei Qi et al.
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu, Wenjie Wang, Yongqi Li et al.
Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow
Hanyu Zhou, Yi Chang, Zhiwei Shi
Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks
Chenyang Qiu, Guoshun Nan, Tianyu Xiong et al.
A Simple and Scalable Representation for Graph Generation
Yunhui Jang, Seul Lee, Sungsoo Ahn
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.
Multi-Sentence Grounding for Long-term Instructional Video
Zeqian Li, Qirui Chen, Tengda Han et al.
Language-Informed Visual Concept Learning
Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.
RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction
Baptiste Brument, Robin Bruneau, Yvain Queau et al.
R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning
Mengyuan Chen, Junyu Gao, Changsheng Xu
3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
Anh Thai, Weiyao Wang, Hao Tang et al.
MGNet: Learning Correspondences via Multiple Graphs
Dai Luanyuan, Xiaoyu Du, Hanwang Zhang et al.
PIDformer: Transformer Meets Control Theory
Tam Nguyen, Cesar Uribe, Tan Nguyen et al.
Controllable Prompt Tuning For Balancing Group Distributional Robustness
Hoang Phan, Andrew Wilson, Qi Lei
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Haichao Zhang, Yi Xu, Hongsheng Lu et al.
Realistic Human Motion Generation with Cross-Diffusion Models
Zeping Ren, Shaoli Huang, Xiu Li
A Plug-and-Play Image Registration Network
Junhao Hu, Weijie Gan, Zhixin Sun et al.
Image Content Generation with Causal Reasoning
Xiaochuan Li, Baoyu Fan, Run Zhang et al.
Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments
Ryo Ueda, Tadahiro Taniguchi
Kernel Diffusion: An Alternate Approach to Blind Deconvolution
Yash Sanghvi, Yiheng Chi, Stanley Chan
Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency
Sudeep Salgia, Sattar Vakili, Qing Zhao
Dirichlet-Based Prediction Calibration for Learning with Noisy Labels
Chen-Chen Zong, Ye-Wen Wang, Ming-Kun Xie et al.
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu, Chirui Chang, Peng Dai et al.
Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth
Zimin Xia, Yujiao Shi, Hongdong Li et al.
SINDER: Repairing the Singular Defects of DINOv2
Haoqi Wang, Tong Zhang, Mathieu Salzmann
Using AI Uncertainty Quantification to Improve Human Decision-Making
Laura Marusich, Jonathan Bakdash, Yan Zhou et al.
Memory-Efficient Reversible Spiking Neural Networks
Hong Zhang, Yu Zhang
Sparse and Structured Hopfield Networks
Saúl Santos, Vlad Niculae, Daniel McNamee et al.
Cauchy-Schwarz Divergence Information Bottleneck for Regression
Shujian Yu, Xi Yu, Sigurd Løkse et al.
SNeRV: Spectra-preserving Neural Representation for Video
Jina Kim, Jihoo Lee, Jewon Kang
RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations
Jiajun He, Gergely Flamich, Zongyu Guo et al.
Robust Test-Time Adaptation for Zero-Shot Prompt Tuning
Ding-Chu Zhang, Zhi Zhou, Yufeng Li
Improving Bird's Eye View Semantic Segmentation by Task Decomposition
Tianhao Zhao, Yongcan Chen, Yu Wu et al.
Generative Powers of Ten
Xiaojuan Wang, Janne Kontkanen, Brian Curless et al.
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision
Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah
Classes Are Not Equal: An Empirical Study on Image Recognition Fairness
Jiequan Cui, Beier Zhu, Xin Wen et al.
Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph
Zhengcen Li, Xinle Chang, Yueran Li et al.
Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting
Muyao Wang, Wenchao Chen, Bo Chen
MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation
Linyan Yang, Lukas Hoyer, Mark Weber et al.
S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video
Hao Zhang, Fang Li, Samyak Rawlekar et al.
ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild
Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers
Awni Altabaa, Taylor Webb, Jonathan Cohen et al.
Point2SSM: Learning Morphological Variations of Anatomies from Point Clouds
Jadie Adams, Shireen Elhabian
Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models
Thomas Zollo, Todd Morrill, Zhun Deng et al.
Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing
Lokesh Nagalapatti, Akshay Iyer, Abir De et al.
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
Yixuan Weng, Minjun Zhu, Fei Xia et al.
Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning
Jin Hwa Lee, Stefano Mannelli, Andrew Saxe
EX-Graph: A Pioneering Dataset Bridging Ethereum and X
Qian Wang, Zhen Zhang, Zemin Liu et al.
DiffFAS: Face Anti-Spoofing via Generative Diffusion Models
Xinxu Ge, Xin Liu, Zitong Yu et al.
Conditional Instrumental Variable Regression with Representation Learning for Causal Inference
Debo Cheng, Ziqi Xu, Jiuyong Li et al.
Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection
Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie, Qi Chen, Sinuo Wang et al.
Transformer as Linear Expansion of Learngene
Shiyu Xia, Miaosen Zhang, Xu Yang et al.
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park, Hojun Choi, U Kang
Discounted Adaptive Online Learning: Towards Better Regularization
Zhiyu Zhang, David Bombara, Heng Yang
Physical-Based Event Camera Simulator
Haiqian Han, Jiacheng Lyu, Jianing Li et al.
Benchmarking Spurious Bias in Few-Shot Image Classifiers
Guangtao Zheng, Wenqian Ye, Aidong Zhang
Temporally Consistent Stereo Matching
Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.
Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation
Unsupervised Gaze Representation Learning from Multi-view Face Images
Yiwei Bao, Feng Lu
DUPLEX: Dual GAT for Complex Embedding of Directed Graphs
Zhaoru Ke, Hang Yu, Jianguo Li et al.
COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
Liu He, Daniel Aliaga
MultiPhys: Multi-Person Physics-aware 3D Motion Estimation
Nicolás Ugrinovic, Boxiao Pan, Georgios Pavlakos et al.
STARC: A General Framework For Quantifying Differences Between Reward Functions
Joar Skalse, Lucy Farnik, Sumeet Motwani et al.
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation
Jacob Si, Wendy Yusi Cheng, Michael Cooper et al.
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
Zhekai Chen, Wen Wang, Zhen Yang et al.
Exploring Transformer Extrapolation
Zhen Qin, Yiran Zhong, Hui Deng
Improving Interpretation Faithfulness for Vision Transformers
Lijie Hu, Yixin Liu, Ninghao Liu et al.
Bridging Vision and Language Spaces with Assignment Prediction
Jungin Park, Jiyoung Lee, Kwanghoon Sohn
Discover and Mitigate Multiple Biased Subgroups in Image Classifiers
Zeliang Zhang, Mingqian Feng, Zhiheng Li et al.
CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning
Qingsong Yan, Qiang Wang, Kaiyong Zhao et al.
Robustly Learning Single-Index Models via Alignment Sharpness
Nikos Zarifis, Puqian Wang, Ilias Diakonikolas et al.
Weisfeiler and Lehman Go Paths: Learning Topological Features via Path Complexes
Quang Truong, Peter Chin
Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
Wen Yin, Jian Lou, Pan Zhou et al.
Consistent Long-Term Forecasting of Ergodic Dynamical Systems
Vladimir Kostic, Karim Lounici, Prune Inzerilli et al.
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
Sunil Hwang, Jaehong Yoon, Youngwan Lee et al.
Compositional Image Decomposition with Diffusion Models
Jocelin Su, Nan Liu, Yanbo Wang et al.
PANDA: Expanded Width-Aware Message Passing Beyond Rewiring
Jeongwhan Choi, Sumin Park, Hyowon Wi et al.
Online Algorithms with Uncertainty-Quantified Predictions
Bo Sun, Jerry Huang, Nicolas Christianson et al.
Explaining Graph Neural Networks via Structure-aware Interaction Index
Ngoc Bui, Trung Hieu Nguyen, Viet Anh Nguyen et al.
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti, Maxence Faldor, Borja G. León et al.
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun, Hongyu Zang, Xin Li et al.
Self-Consistency Training for Density-Functional-Theory Hamiltonian Prediction
He Zhang, Chang Liu, wang et al.
Minimum-Norm Interpolation Under Covariate Shift
Neil Mallinar, Austin Zane, Spencer Frei et al.
Emergent Equivariance in Deep Ensembles
Jan Gerken, Pan Kessel
Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis
Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky et al.
Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference
Yujin Han, Difan Zou
Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products
Guy Bar Shalom, Beatrice Bevilacqua, Haggai Maron
Scaling Laws for the Value of Individual Data Points in Machine Learning
Ian Covert, Wenlong Ji, Tatsunori Hashimoto et al.
Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models
Akhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia et al.
Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior
Shuyu Cheng, Yibo Miao, Yinpeng Dong et al.
Expressivity and Generalization: Fragment-Biases for Molecular GNNs
Tom Wollschläger, Niklas Kemper, Leon Hetzel et al.
Diversified Batch Selection for Training Acceleration
Feng Hong, Yueming Lyu, Jiangchao Yao et al.
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
David T. Hoffmann, Simon Schrodi, Jelena Bratulić et al.
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity
Xudong Li, Timin Gao, Runze Hu et al.
Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm
Batiste Le Bars, Aurélien Bellet, Marc Tommasi et al.
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen, Chenfei Wu, Xiao Liu et al.
Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order Optimization
Ruizhong Qiu, Hanghang Tong
Online Linear Regression in Dynamic Environments via Discounting
Andrew Jacobsen, Ashok Cutkosky
Allocation Requires Prediction Only if Inequality Is Low
Ali Shirali, Rediet Abebe, Moritz Hardt
Bringing Motion Taxonomies to Continuous Domains via GPLVM on Hyperbolic Manifolds
Noémie Jaquier, Leonel Rozo, Miguel González-Duque et al.
Sliding Down the Stairs: How Correlated Latent Variables Accelerate Learning with Neural Networks
Lorenzo Bardone, Sebastian Goldt
Neural operators meet conjugate gradients: The FCG-NO method for efficient PDE solving
Alexander Rudikov, Vladimir Fanaskov, Ekaterina Muravleva et al.
Self-attention Networks Localize When QK-eigenspectrum Concentrates
Han Bao, Ryuichiro Hataya, Ryo Karakida
Predictive Linear Online Tracking for Unknown Targets
Anastasios Tsiamis, Aren Karapetyan, Yueshan Li et al.
Diffusion Rejection Sampling
Byeonghu Na, Yeongmin Kim, Minsang Park et al.
Differentiability and Optimization of Multiparameter Persistent Homology
Luis Scoccola, Siddharth Setlur, David Loiseaux et al.
Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models
Som Sagar, Aditya Taparia, Ransalu Senanayake
A Global Geometric Analysis of Maximal Coding Rate Reduction
Peng Wang, Huikang Liu, Druv Pai et al.
Trained Random Forests Completely Reveal your Dataset
Julien Ferry, Ricardo Fukasawa, Timothée Pascal et al.
OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model
Runyi Li, Xuhan Sheng, Weiqi Li et al.
FD3D: Exploiting Foreground Depth Map for Feature-Supervised Monocular 3D Object Detection
Zizhang Wu, Yuanzhu Gan, Yunzhe Wu et al.
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He, Kai Li, Yifan Zang et al.
Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention
Xin Yang, Wending Yan, Yuan Yuan et al.
Workflow Discovery from Dialogues in the Low Data Regime
David Vazquez, Stefania Raimondo, Christopher Pal et al.
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai, Yuhang Liu, Zhen Zhang et al.
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene
Ruiyang Zhang, Hu Zhang, Hang Yu et al.
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models
Wei Wu, Qingnan Fan, Shuai Qin et al.
On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters
Matthias Lanzinger, Pablo Barcelo
Cinematic Behavior Transfer via NeRF-based Differentiable Filming
Xuekun Jiang, Anyi Rao, Jingbo Wang et al.
Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis
Juyeon Ko, Inho Kong, Dogyun Park et al.
Delving into Differentially Private Transformer
Youlong Ding, Xueyang Wu, Yining Meng et al.
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na, Yunkyeong Seo, Il-chul Moon
Specularity Factorization for Low-Light Enhancement
Saurabh Saini, P. J. Narayanan
Robust Depth Enhancement via Polarization Prompt Fusion Tuning
Kei Ikemura, Yiming Huang, Felix Heide et al.
Delivering Inflated Explanations
Yacine Izza, Alexey Ignatiev, Peter Stuckey et al.
CR-SAM: Curvature Regularized Sharpness-Aware Minimization
Tao Wu, Tie Luo, Donald Wunsch
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Yuanhao Zhai, Kevin Lin, Linjie Li et al.
Fundamental Benefit of Alternating Updates in Minimax Optimization
Jaewook Lee, Hanseul Cho, Chulhee Yun
BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion
Zhaochen Liu, Zhixuan Li, Tingting Jiang
Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures
Jiaqi He, Zhihua Wang, Leon Wang et al.
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.
Open-Set Facial Expression Recognition
Yuhang Zhang, Yue Yao, Xuannan Liu et al.
Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval
Long Tang, Dengpan Ye, Yunna Lv et al.
TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning
Huabin Liu, Xiao Ma, Cheng Zhong et al.
Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction
Da Luo, Yanglei Gan, Rui Hou et al.
DeIL: Direct-and-Inverse CLIP for Open-World Few-Shot Learning
Shuai Shao, Yu Bai, Yan Wang et al.
Disentangled Pre-training for Human-Object Interaction Detection
Zhuolong Li, Xingao Li, Changxing Ding et al.
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
Mengchen Zhang, Tong Wu, Tai Wang et al.
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
Xiaobao Wei, Renrui Zhang, Jiarui Wu et al.
NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model
Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang, Hongliang Li, Yuliang Liu et al.
How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization
Andrew Lowy, Jonathan Ullman, Stephen Wright
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
Chenglong Wang, Hang Zhou, Yimin Hu et al.
JointSQ: Joint Sparsification-Quantization for Distributed Learning
Weiying Xie, Haowei Li, Ma Jitao et al.
Toward Tiny and High-quality Facial Makeup with Data Amplify Learning
Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.
Dynamic Sub-graph Distillation for Robust Semi-supervised Continual Learning
Yan Fan, Yu Wang, Pengfei Zhu et al.
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Wonje Choi, Woo Kyung Kim, Minjong Yoo et al.
Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions
Weng Fei Low, Gim Hee Lee
DREAM: Diffusion Rectification and Estimation-Adaptive Models
Jinxin Zhou, Tianyu Ding, Tianyi Chen et al.
Hindsight PRIORs for Reward Learning from Human Preferences
Mudit Verma, Katherine Metcalf
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior
Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya et al.
MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment
Kanglei Zhou, Liyuan Wang, Xingxing Zhang et al.
Test-Time Zero-Shot Temporal Action Localization
Benedetta Liberatori, Alessandro Conti, Paolo Rota et al.
CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering
Haidong Zhu, Tianyu Ding, Tianyi Chen et al.
Federated Representation Learning in the Under-Parameterized Regime
Renpu Liu, Cong Shen, Jing Yang
SEA: Sparse Linear Attention with Estimated Attention Mask
Heejun Lee, Jina Kim, Jeff Willette et al.
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
Jeongho Kim, Min-Jung Kim, Junsoo Lee et al.
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
Olivier Laurent, Emanuel Aldea, Gianni Franchi
Pedestrian Attribute Recognition as Label-balanced Multi-label Learning
Yibo Zhou, Hai-Miao Hu, Yirong Xiang et al.
Single Mesh Diffusion Models with Field Latents for Texture Generation
Thomas W. Mitchel, Carlos Esteves, Ameesh Makadia
From Generalization Analysis to Optimization Designs for State Space Models
Fusheng Liu, Qianxiao Li
Dynamic Layer Tying for Parameter-Efficient Transformers
Tamir David-Hay, Lior Wolf
Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning
Kostadin Garov, Dimitar I. Dimitrov, Nikola Jovanović et al.
A Graph is Worth 1-bit Spikes: When Graph Contrastive Learning Meets Spiking Neural Networks
Jintang Li, Huizhe Zhang, Ruofan Wu et al.
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors
Wei Shang, Dongwei Ren, Wanying Zhang et al.
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Sihan Chen, Xingjian He, Handong Li et al.
Unraveling Batch Normalization for Realistic Test-Time Adaptation
Zixian Su, Jingwei Guo, Kai Yao et al.
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang et al.
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
Han Zhong, Jiachen Hu, Yecheng Xue et al.
Continuous Piecewise-Affine Based Motion Model for Image Animation
Hexiang Wang, Fengqi Liu, Qianyu Zhou et al.
Class-Agnostic Object Counting with Text-to-Image Diffusion Model
Xiaofei Hui, Qian Wu, Hossein Rahmani et al.
TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing
Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
Jan-Niklas Dihlmann, Andreas Engelhardt, Hendrik Lensch
Uncertainty Regularized Evidential Regression
Kai Ye, Tiejin Chen, Hua Wei et al.
Towards Understanding and Improving Adversarial Robustness of Vision Transformers
Samyak Jain, Tanima Dutta
Federated Causal Discovery from Heterogeneous Data
Loka Li, Ignavier Ng, Gongxu Luo et al.
Learning Invariant Inter-pixel Correlations for Superpixel Generation
Sen Xu, Shikui Wei, Tao Ruan et al.
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.
Bi-Causal: Group Activity Recognition via Bidirectional Causality
Youliang Zhang, Wenxuan Liu, Danni Xu et al.
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling, Zhihai Wang, Jie Wang
Critical Learning Periods Emerge Even in Deep Linear Networks
Michael Kleinman, Alessandro Achille, Stefano Soatto
Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior
Kai Cui, Sascha Hauck, Christian Fabian et al.
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li, Shilong Liu, Zidong Liu et al.
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
Lehong Wu, Lilang Lin, Jiahang Zhang et al.
Neural Spectral Decomposition for Dataset Distillation
Yang Shaolei, Shen Cheng, Mingbo Hong et al.
Nonverbal Interaction Detection
Jianan Wei, Tianfei Zhou, Yi Yang et al.
SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining
Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon et al.
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das, Xinting Hu, Li Jiang et al.
Stochastic Modified Equations and Dynamics of Dropout Algorithm
Zhongwang Zhang, Yuqing Li, Tao Luo et al.
Knowledge-Aware Parameter Coaching for Personalized Federated Learning
Mingjian Zhi, Yuanguo Bi, Wenchao Xu et al.
CrIBo: Self-Supervised Learning via Cross-Image Object-Level Bootstrapping
Tim Lebailly, Thomas Stegmüller, Behzad Bozorgtabar et al.
GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models
Haitao Yang, Xiangru Huang, Bo Sun et al.
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer
Linglin Jing, Ying Xue, Xu Yan et al.