Most Cited 2024 Poster Papers
12,324 papers found • Page 45 of 62
Conference
Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Yibo Yang, Xiaojie Li, Motasem Alfarra et al.
Understanding MLP-Mixer as a wide and sparse MLP
Tomohiro Hayase, Ryo Karakida
Self-attention Networks Localize When QK-eigenspectrum Concentrates
Han Bao, Ryuichiro Hataya, Ryo Karakida
Faster Streaming and Scalable Algorithms for Finding Directed Dense Subgraphs in Large Graphs
Slobodan Mitrovic, Theodore Pan
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa, Shreyas Saxena, Abhay Gupta et al.
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
Khurram Javed, Haseeb Shah, Richard Sutton et al.
Effect-Invariant Mechanisms for Policy Generalization
Sorawit Saengkyongam, Niklas Pfister, Predag Klasnja et al.
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
Peng, Xinyi Ling, Ziru Chen et al.
Differentiable Combinatorial Scheduling at Scale
Mingju Liu, Yingjie Li, Jiaqi Yin et al.
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang, Haotian Fu, Yilin Miao et al.
Efficient Mixture Learning in Black-Box Variational Inference
Alexandra Hotti, Oskar Kviman, Ricky Molén et al.
Indirectly Parameterized Concrete Autoencoders
Alfred Nilsson, Klas Wijk, Sai bharath chandra Gutha et al.
MLI Formula: A Nearly Scale-Invariant Solution with Noise Perturbation
Bowen Tao, Xin-Chun Li, De-Chuan Zhan
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
Lu Yin, You Wu, Zhenyu Zhang et al.
Quantum Positional Encodings for Graph Neural Networks
Slimane Thabet, Mehdi Djellabi, Igor Sokolov et al.
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo, Laixi Shi, Gauri Joshi et al.
Improved Dimensionality Dependence for Zeroth-Order Optimisation over Cross-Polytopes
Weijia Shao
Position: Opportunities Exist for Machine Learning in Magnetic Fusion Energy
Lucas Spangher, Allen Wang, Andrew Maris et al.
Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data
Yvonne Zhou, Mingyu Liang, Ivan Brugere et al.
Major-Minor Mean Field Multi-Agent Reinforcement Learning
Kai Cui, Christian Fabian, Anam Tahir et al.
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf Cassel, Haipeng Luo, Aviv Rosenberg et al.
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling
Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman et al.
Instruction Tuning for Secure Code Generation
Jingxuan He, Mark Vero, Gabriela Krasnopolska et al.
Asymmetry in Low-Rank Adapters of Foundation Models
Jiacheng Zhu, Kristjan Greenewald, Kimia Nadjahi et al.
Slicing Mutual Information Generalization Bounds for Neural Networks
Kimia Nadjahi, Kristjan Greenewald, Rickard Gabrielsson et al.
Information Complexity of Stochastic Convex Optimization: Applications to Generalization, Memorization, and Tracing
Idan Attias, Gintare Karolina Dziugaite, Mahdi Haghifam et al.
Breadth-First Exploration on Adaptive Grid for Reinforcement Learning
Youngsik Yoon, Gangbok Lee, Sungsoo Ahn et al.
Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training
Ming-Kun Xie, Jia-Hao Xiao, Pei Peng et al.
Learning to Model the World With Language
Jessy Lin, Yuqing Du, Olivia Watkins et al.
The Merit of River Network Topology for Neural Flood Forecasting
Nikolas Kirschstein, Yixuan Sun
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Fengdi Che, Chenjun Xiao, Jincheng Mei et al.
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers, Chongyi Zheng, Anca Dragan et al.
Disguised Copyright Infringement of Latent Diffusion Models
Yiwei Lu, Matthew Yang, Zuoqiu Liu et al.
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun, Hongyu Zang, Xin Li et al.
Stealing part of a production language model
Nicholas Carlini, Daniel Paleka, Krishnamurthy Dvijotham et al.
Clifford-Steerable Convolutional Neural Networks
Maksim Zhdanov, David Ruhe, Maurice Weiler et al.
Dynamic Metric Embedding into lp Space
Kiarash Banihashem, MohammadTaghi Hajiaghayi, Dariusz Kowalski et al.
Diffusion Language Models Are Versatile Protein Learners
Xinyou Wang, Zaixiang Zheng, Fei YE et al.
BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges
Hoyong Choi, Nohyun Ki, Hye Won Chung
Localizing Task Information for Improved Model Merging and Compression
Ke Wang, Nikolaos Dimitriadis, Guillermo Ortiz-Jimenez et al.
Detecting and Identifying Selection Structure in Sequential Data
Yujia Zheng, Zeyu Tang, Yiwen Qiu et al.
Data Engineering for Scaling Language Models to 128K Context
Yao Fu, Rameswar Panda, Xinyao Niu et al.
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Fuzhao Xue, Zian Zheng, Yao Fu et al.
The Emergence of Reproducibility and Consistency in Diffusion Models
Huijie Zhang, Jinfan Zhou, Yifu Lu et al.
Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning
Zheng Huang, Qihui Yang, Dawei Zhou et al.
CHAI: Clustered Head Attention for Efficient LLM Inference
Saurabh Agarwal, Bilge Acun, Basil Hosmer et al.
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann et al.
Policy-conditioned Environment Models are More Generalizable
Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun CEN, Chenfei Wu, Xiao Liu et al.
SMaRt: Improving GANs with Score Matching Regularity
Mengfei Xia, Yujun Shen, Ceyuan Yang et al.
ODIN: Disentangled Reward Mitigates Hacking in RLHF
Lichang Chen, Chen Zhu, Jiuhai Chen et al.
Interplay of ROC and Precision-Recall AUCs: Theoretical Limits and Practical Implications in Binary Classification
Martin Mihelich, François Castagnos, Charles Dognin
On Discrete Prompt Optimization for Diffusion Models
Ruochen Wang, Ting Liu, Cho-Jui Hsieh et al.
Nearest Neighbour Score Estimators for Diffusion Generative Models
Matthew Niedoba, Dylan Green, Saeid Naderiparizi et al.
Thermometer: Towards Universal Calibration for Large Language Models
Maohao Shen, Subhro Das, Kristjan Greenewald et al.
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk, Lijun Yu, Xiuye Gu et al.
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Boyi Wei, Kaixuan Huang, Yangsibo Huang et al.
Position: LLMs Can’t Plan, But Can Help Planning in LLM-Modulo Frameworks
Subbarao Kambhampati, Karthik Valmeekam, Lin Guan et al.
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Bairu Hou, Yujian Liu, Kaizhi Qian et al.
Feature Reuse and Scaling: Understanding Transfer Learning with Protein Language Models
Francesca-Zhoufan Li, Ava Amini, Yisong Yue et al.
Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning
Arvi Jonnarth, Jie Zhao, Michael Felsberg
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami, Konrad Szafer, Arjun Choudhry et al.
Counterfactual Image Editing
Yushu Pan, Elias Bareinboim
Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining
Aaron Li, Robin Netzorg, Zhihan Cheng et al.
On The Statistical Complexity of Offline Decision-Making
Thanh Nguyen-Tang, Raman Arora
Variational Learning is Effective for Large Deep Networks
Yuesong Shen, Nico Daheim, Bai Cong et al.
Controlled Decoding from Language Models
Sidharth Mudgal, Jong Lee, Harish Ganapathy et al.
Liouville Flow Importance Sampler
Yifeng Tian, Nishant Panda, Yen Ting Lin
Scaling Exponents Across Parameterizations and Optimizers
Katie Everett, Lechao Xiao, Mitchell Wortsman et al.
Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution
Rui Wang, Elyssa Hofgard, Han Gao et al.
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Arshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar et al.
Extreme Compression of Large Language Models via Additive Quantization
Vage Egiazarian, Andrei Panferov, Denis Kuznedelev et al.
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
Yuji Roh, Qingyun Liu, Huan Gui et al.
Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation
Randall Balestriero, Romain Cosentino, Sarath Shekkizhar
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan et al.
Particle Denoising Diffusion Sampler
Angus Phillips, Hai-Dang Dau, Michael Hutchinson et al.
LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits
Chen-Chia Chang, Yikang Shen, Shaoze Fan et al.
How do Transformers Perform In-Context Autoregressive Learning ?
Michael Sander, Raja Giryes, Taiji Suzuki et al.
Vision Transformers as Probabilistic Expansion from Learngene
Qiufeng Wang, Xu Yang, Haokun Chen et al.
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation
Can Yaras, Peng Wang, Laura Balzano et al.
Graph Positional and Structural Encoder
Semih Cantürk, Renming Liu, Olivier Lapointe-Gagné et al.
Modeling Language Tokens as Functionals of Semantic Fields
Zhengqi Pei, Anran Zhang, Shuhui Wang et al.
Score-Based Causal Discovery of Latent Variable Causal Models
Ignavier Ng, Xinshuai Dong, Haoyue Dai et al.
Language Models as Science Tutors
Alexis Chevalier, Jiayi Geng, Alexander Wettig et al.
Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition
Michael Valancius, Maxwell Lennon, Junier Oliva
Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models
Mingjia Huo, Sai Ashish Somayajula, Youwei Liang et al.
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
Xingrun Xing, Zheng Zhang, Ziyi Ni et al.
Integrated Hardware Architecture and Device Placement Search
Irene Wang, Jakub Tarnawski, Amar Phanishayee et al.
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
Kwang-Sung Jun, Jungtaek Kim
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Jon Saad-Falcon, Daniel Y Fu, Simran Arora et al.
Diffusion Models Encode the Intrinsic Dimension of Data Manifolds
Jan Stanczuk, Georgios Batzolis, Teo Deveney et al.
Two Fists, One Heart: Multi-Objective Optimization Based Strategy Fusion for Long-tailed Learning
Zhe Zhao, Pengkun Wang, HaiBin Wen et al.
What’s the score? Automated Denoising Score Matching for Nonlinear Diffusions
raghav singhal, Mark Goldstein, Rajesh Ranganath
Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior
Shuyu Cheng, Yibo Miao, Yinpeng Dong et al.
Self-Rewarding Language Models
Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho et al.
In-Context Learning Agents Are Asymmetric Belief Updaters
Johannes A. Schubert, Akshay Kumar Jagadish, Marcel Binz et al.
Efficient Algorithms for Empirical Group Distributionally Robust Optimization and Beyond
Dingzhi Yu, Yunuo Cai, Wei Jiang et al.
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Junhong Shen, Neil Tenenholtz, James Hall et al.
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Jiabin Lin, Shana Moothedath, Namrata Vaswani
A Closer Look at the Limitations of Instruction Tuning
Sreyan Ghosh, Chandra Kiran Evuru, Sonal Kumar et al.
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
songyang gao, Qiming Ge, Wei Shen et al.
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Zhiheng Xi, Wenxiang Chen, Boyang Hong et al.
Is Kernel Prediction More Powerful than Gating in Convolutional Neural Networks?
Lorenz K. Muller
Incorporating probabilistic domain knowledge into deep multiple instance learning
Ghadi S. Al Hajj, Aliaksandr Hubin, Chakravarthi Kanduri et al.
Mean-field Analysis on Two-layer Neural Networks from a Kernel Perspective
Shokichi Takakura, Taiji Suzuki
In-Context Language Learning: Architectures and Algorithms
Ekin Akyürek, Bailin Wang, Yoon Kim et al.
Gated Linear Attention Transformers with Hardware-Efficient Training
Songlin Yang, Bailin Wang, Yikang Shen et al.
Agnostic Sample Compression Schemes for Regression
Idan Attias, Steve Hanneke, Aryeh Kontorovich et al.
Masked Face Recognition with Generative-to-Discriminative Representations
Shiming Ge, Weijia Guo, Chenyu Li et al.
Recovering the Pre-Fine-Tuning Weights of Generative Models
Eliahu Horwitz, Jonathan Kahana, Yedid Hoshen
Plug-in Performative Optimization
Licong Lin, Tijana Zrnic
Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge
Yufei Huang, Odin Zhang, Lirong Wu et al.
Estimating Canopy Height at Scale
Jan Pauls, Max Zimmer, Una Kelly et al.
Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
Golnoosh Farnadi, Mohammad Havaei, Negar Rostamzadeh
A Global Geometric Analysis of Maximal Coding Rate Reduction
Peng Wang, Huikang Liu, Druv Pai et al.
A Language Model’s Guide Through Latent Space
Dimitri von Rütte, Sotiris Anagnostidis, Gregor Bachmann et al.
One Meta-tuned Transformer is What You Need for Few-shot Learning
Xu Yang, Huaxiu Yao, Ying WEI
Conformal Prediction for Deep Classifier via Label Ranking
Jianguo Huang, HuaJun Xi, Linjun Zhang et al.
Position: TrustLLM: Trustworthiness in Large Language Models
Yue Huang, Lichao Sun, Haoran Wang et al.
Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems
Ta Duy Nguyen, Alina Ene
Representation Surgery: Theory and Practice of Affine Steering
Shashwat Singh, Shauli Ravfogel, Jonathan Herzig et al.
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Dongyang Liu, Renrui Zhang, Longtian Qiu et al.
From Classification Accuracy to Proper Scoring Rules: Elicitability of Probabilistic Top List Predictions
Johannes Resin
A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?
Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta et al.
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman, Alon Cohen, Tomer Koren et al.
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Abhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova et al.
Gibbs Sampling of Continuous Potentials on a Quantum Computer
Arsalan Motamedi, Pooya Ronagh
D-Flow: Differentiating through Flows for Controlled Generation
Heli Ben-Hamu, Omri Puny, Itai Gat et al.
Classification Under Strategic Self-Selection
Guy Horowitz, Yonatan Sommer, Moran Koren et al.
USTAD: Unified Single-model Training Achieving Diverse Scores for Information Retrieval
Seungyeon Kim, Ankit Singh Rawat, Manzil Zaheer et al.
StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization
Songhua Liu, Xin Jin, Xingyi Yang et al.
On the Role of Edge Dependency in Graph Generative Models
Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos et al.
Consistent Long-Term Forecasting of Ergodic Dynamical Systems
Vladimir Kostic, Karim Lounici, Prune Inzerilli et al.
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Soroush Nasiriany, Fei Xia, Wenhao Yu et al.
A Persuasive Approach to Combating Misinformation
Safwan Hossain, Andjela Mladenovic, Yiling Chen et al.
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski et al.
Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models
Akhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia et al.
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding
Zhaorun Chen, Zhuokai Zhao, HONGYIN LUO et al.
A Unified Adaptive Testing System Enabled by Hierarchical Structure Search
Junhao Yu, Yan Zhuang, Zhenya Huang et al.
Complexity Matters: Feature Learning in the Presence of Spurious Correlations
GuanWen Qiu, Da Kuang, Surbhi Goel
Copyright Traps for Large Language Models
Matthieu Meeus, Igor Shilov, Manuel Faysse et al.
Policy Evaluation for Variance in Average Reward Reinforcement Learning
Shubhada Agrawal, Prashanth L.A., Siva Maguluri
Robust Yet Efficient Conformal Prediction Sets
Soroush H. Zargarbashi, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski
Differentiable Weightless Neural Networks
Alan Bacellar, Zachary Susskind, Mauricio Breternitz Jr et al.
Adaptive Observation Cost Control for Variational Quantum Eigensolvers
Christopher J. Anders, Kim A. Nicoli, Bingting Wu et al.
Kepler codebook
Junrong Lian, Ziyue Dong, Pengxu Wei et al.
Tandem Transformers for Inference Efficient LLMs
Aishwarya P S, Pranav Nair, Yashas Samaga et al.
Momentum Auxiliary Network for Supervised Local Learning
Junhao Su, Changpeng Cai, Feiyu Zhu et al.
AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems
Roye Katzav, Amit Giloni, Edita Grolman et al.
On the Approximation Risk of Few-Shot Class-Incremental Learning
Xuan Wang, Zhong Ji, Xiyao Liu et al.
4D Contrastive Superflows are Dense 3D Representation Learners
Xiang Xu, Lingdong Kong, Hui Shuai et al.
Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
Taekyung Ki, Dongchan Min, Gyeongsu Chae
Disentangling Masked Autoencoders for Unsupervised Domain Generalization
An Zhang, Han Wang, Xiang Wang et al.
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Pilhyeon Lee, Hyeran Byun
BRAVE: Broadening the visual encoding of vision-language models
Oguzhan Fatih Kar, Alessio Tonioni, Petra Poklukar et al.
High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs
Ruikang Xu, Mingde Yao, Yue Li et al.
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects
Xintao Lv, Liang Xu, Yichao Yan et al.
Merlin: Empowering Multimodal LLMs with Foresight Minds
En Yu, liang zhao, YANA WEI et al.
Spectral Subsurface Scattering for Material Classification
Haejoon Lee, Aswin C. Sankaranarayanan
Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation
Peixi Xiong, Michael A Kozuch, Nilesh Jain
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Lin Chen, Jinsong Li, Xiaoyi Dong et al.
Cross-Input Certified Training for Universal Perturbations
Changming Xu, Gagandeep Singh
LiDAR-Event Stereo Fusion with Hallucinations
Luca Bartolomei, Matteo Poggi, Andrea Conti et al.
Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds
Shengtao Li, Ge Gao, Yudong Liu et al.
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model
Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.
Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis
Chirag Vashist, Shichong Peng, Ke Li
RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation
Luis Li, Hubert P. H. Shum, Toby P Breckon
3D Single-object Tracking in Point Clouds with High Temporal Variation
Qiao Wu, Kun Sun, Pei An et al.
LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers
Ziling Huang, Shin’ichi Satoh
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities
CHENMING ZHU, Tai Wang, Wenwei Zhang et al.
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.
ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild
Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.
Plain-Det: A Plain Multi-Dataset Object Detector
cheng Shi, yuchen zhu, Sibei Yang
Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization
Naiyu Yin, Hanjing Wang, Yue Yu et al.
DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
Shrey Singh, Prateek Keserwani, Masakazu Iwamura et al.
Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification
Linhao Qu, Dingkang Yang, Dan Huang et al.
AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering
Xiuyuan Chen, Yuan Lin, Yuchen Zhang et al.
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.
TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.
Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel, Ron Aharon Shapira Weber, Nir Mualem et al.
DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
Hanyang Kong, Dongze Lian, Michael Bi Mi et al.
WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians
Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
Jinxing Zhou, Dan Guo, Yuxin Mao et al.
Think before Placement: Common Sense Enhanced Transformer for Object Placement
Yaxuan Qin, Jiayu Xu, Ruiping Wang et al.
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Jing Wu, Jiawang Bian, Xinghui Li et al.
Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation
Genki Kinoshita, Ko Nishino
GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views
Vinayak Gupta, Rongali Simhachala Venkata Girish, Mukund Varma T et al.
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Omer Dahary, Or Patashnik, Kfir Aberman et al.
Scaling Backwards: Minimal Synthetic Pre-training?
Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.
General and Task-Oriented Video Segmentation
Mu Chen, Liulei Li, Wenguan Wang et al.
Human Hair Reconstruction with Strand-Aligned 3D Gaussians
Egor Zakharov, Vanessa Sklyarova, Michael J. Black et al.
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders
Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.
Rethinking Image Super Resolution from Training Data Perspectives
Go Ohtani, Ryu Tadokoro, Ryosuke Yamada et al.
Uni3DL: A Unified Model for 3D Vision-Language Understanding
Xiang Li, Jian Ding, Zhaoyang Chen et al.
G3R: Gradient Guided Generalizable Reconstruction
Yun Chen, Jingkang Wang, Ze Yang et al.
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning
Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.
Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
Tianqi Liu, Guangcong Wang, Shoukang Hu et al.