Most Cited 2025 Poster Papers

22,274 papers found • Page 21 of 112

#4001

Prompt-based Unifying Inference Attack on Graph Neural Networks

Yuecen Wei, Xingcheng Fu, Lingyun Liu et al.

AAAI 2025paperarXiv:2412.15735
6
citations
#4002

Turbo3D: Ultra-fast Text-to-3D Generation

Hanzhe Hu, Tianwei Yin, Fujun Luan et al.

CVPR 2025posterarXiv:2412.04470
6
citations
#4003

ConMix: Contrastive Mixup at Representation Level for Long-tailed Deep Clustering

Zhixin Li, Yuheng Jia

ICLR 2025poster
6
citations
#4004

Mixture of Experts Based Multi-Task Supervise Learning from Crowds

Tao Han, Huaixuan Shi, Xinyi Ding et al.

AAAI 2025paperarXiv:2407.13268
6
citations
#4005

AlphaPre: Amplitude-Phase Disentanglement Model for Precipitation Nowcasting

Kenghong Lin, Baoquan Zhang, Demin Yu et al.

CVPR 2025poster
6
citations
#4006

Detail-Preserving Latent Diffusion for Stable Shadow Removal

Jiamin Xu, Yuxin Zheng, Zelong Li et al.

CVPR 2025posterarXiv:2412.17630
6
citations
#4007

Validating LLM-as-a-Judge Systems under Rating Indeterminacy

Luke Guerdan, Solon Barocas, Kenneth Holstein et al.

NEURIPS 2025posterarXiv:2503.05965
6
citations
#4008

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Xinyi Wang, Na Zhao, Zhiyuan Han et al.

AAAI 2025paperarXiv:2501.09428
6
citations
#4009

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Jingli Lin, Chenming Zhu, Runsen Xu et al.

NEURIPS 2025oralarXiv:2507.07984
6
citations
#4010

Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets

Benjamin Dupuis, Paul Viallard, George Deligiannidis et al.

NEURIPS 2025posterarXiv:2404.17442
6
citations
#4011

Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation

Aishik Konwer, Zhijian Yang, Erhan Bas et al.

CVPR 2025posterarXiv:2503.04639
6
citations
#4012

Truth over Tricks: Measuring and Mitigating Shortcut Learning in Misinformation Detection

Herun Wan, Jiaying Wu, Minnan Luo et al.

NEURIPS 2025posterarXiv:2506.02350
6
citations
#4013

Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach

Chunxu Zhang, Guodong Long, Hongkuan Guo et al.

AAAI 2025paperarXiv:2412.16969
6
citations
#4014

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Xuran Ma, Yexin Liu, Yaofu LIU et al.

ICCV 2025posterarXiv:2504.03140
6
citations
#4015

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Zeyu Liu, Zanlin Ni, Yeguo Hua et al.

ICCV 2025posterarXiv:2503.17760
6
citations
#4016

Graph Coarsening via Supervised Granular-Ball for Scalable Graph Neural Network Training

Shuyin Xia, Xinjun Ma, Zhiyuan Liu et al.

AAAI 2025paperarXiv:2412.13842
6
citations
#4017

FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models

Jintao Tong, Wenwei Jin, Pengda Qin et al.

NEURIPS 2025posterarXiv:2505.19536
6
citations
#4018

GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting

Yusen XIE, Zhenmin Huang, Jin Wu et al.

ICCV 2025posterarXiv:2410.17084
6
citations
#4019

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.

AAAI 2025paperarXiv:2408.11481
6
citations
#4020

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Yu Zhang, Jialei Zhou, Xinchen Li et al.

NEURIPS 2025posterarXiv:2505.19261
6
citations
#4021

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

Kim Sung-Bin, Jeongsoo Choi, Puyuan Peng et al.

ICCV 2025posterarXiv:2504.02386
6
citations
#4022

LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining

Huawen Shen, Gengluo Li, Jinwen Zhong et al.

AAAI 2025paperarXiv:2412.14596
6
citations
#4023

EVOS: Efficient Implicit Neural Training via EVOlutionary Selector

Weixiang Zhang, Shuzhao Xie, Chengwei Ren et al.

CVPR 2025posterarXiv:2412.10153
6
citations
#4024

ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving

Yuhang Lu, Jiadong Tu, Yuexin Ma et al.

ICCV 2025posterarXiv:2507.12499
6
citations
#4025

EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation

Hongwei Niu, Jie Hu, Jianghang Lin et al.

AAAI 2025paperarXiv:2412.08628
6
citations
#4026

``Principal Components" Enable A New Language of Images

Xin Wen, Bingchen Zhao, Ismail Elezi et al.

ICCV 2025poster
6
citations
#4027

ELICIT: LLM Augmentation Via External In-context Capability

Futing Wang, Jianhao (Elliott) Yan, Yue Zhang et al.

ICLR 2025posterarXiv:2410.09343
6
citations
#4028

Massively Parallel Continuous Local Search for Hybrid SAT Solving on GPUs

Yunuo Cen, Zhiwei Zhang, Xuanyao Fong

AAAI 2025paperarXiv:2308.15020
6
citations
#4029

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Jiankang Chen, Tianke Zhang, Changyi Liu et al.

ICLR 2025posterarXiv:2502.09925
6
citations
#4030

Dual-Process Image Generation

Grace Luo, Jonathan Granskog, Aleksander Holynski et al.

ICCV 2025posterarXiv:2506.01955
6
citations
#4031

1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering

Yuheng Yuan, Qiuhong Shen, Xingyi Yang et al.

NEURIPS 2025oralarXiv:2503.16422
6
citations
#4032

Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction

Yunheng Li, Yuxuan Li, Quan-Sheng Zeng et al.

ICCV 2025posterarXiv:2412.06244
6
citations
#4033

Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation

Chaoyang Wang, Ashkan Mirzaei, Vidit Goel et al.

NEURIPS 2025oralarXiv:2506.18839
6
citations
#4034

Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

Andong Deng, Zhongpai Gao, Anwesa Choudhuri et al.

CVPR 2025posterarXiv:2411.16932
6
citations
#4035

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers For Motion Transfer

Qingyu Shi, Jianzong Wu, Jinbin Bai et al.

ICCV 2025posterarXiv:2503.17350
6
citations
#4036

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

Jinlu Zhang, Jiji Tang, Rongsheng Zhang et al.

AAAI 2025paperarXiv:2412.07375
6
citations
#4037

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Siqi Luo, Haoran Yang, Yi Xin et al.

ICCV 2025posterarXiv:2507.22872
6
citations
#4038

A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Arnav Kumar Jain, Vibhakar Mohta, Subin Kim et al.

NEURIPS 2025oralarXiv:2506.05294
6
citations
#4039

R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception

Jonas Mirlach, Lei Wan, Andreas Wiedholz et al.

ICCV 2025posterarXiv:2503.17122
6
citations
#4040

Federated Class-Incremental Learning: A Hybrid Approach Using Latent Exemplars and Data-Free Techniques to Address Local and Global Forgetting

Milad Khademi Nori, IL-MIN KIM, Guanghui Wang

ICLR 2025posterarXiv:2501.15356
6
citations
#4041

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147
6
citations
#4042

Benign Overfitting in Single-Head Attention

Roey Magen, Shuning Shang, Zhiwei Xu et al.

NEURIPS 2025posterarXiv:2410.07746
6
citations
#4043

CompCap: Improving Multimodal Large Language Models with Composite Captions

Xiaohui Chen, Satya Narayan Shukla, Mahmoud Azab et al.

ICCV 2025posterarXiv:2412.05243
6
citations
#4044

StableCodec: Taming One-Step Diffusion for Extreme Image Compression

Tianyu Zhang, Xin Luo, Li Li et al.

ICCV 2025posterarXiv:2506.21977
6
citations
#4045

Event-based Tiny Object Detection: A Benchmark Dataset and Baselines

Nuo Chen, Chao Xiao, Yimian Dai et al.

ICCV 2025posterarXiv:2506.23575
6
citations
#4046

Bringing RNNs Back to Efficient Open-Ended Video Understanding

Weili Xu, Enxin Song, Wenhao Chai et al.

ICCV 2025posterarXiv:2507.02591
6
citations
#4047

Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning

Yiming Yang, Yueru Luo, Bingkun He et al.

AAAI 2025paperarXiv:2502.08974
6
citations
#4048

MIEB: Massive Image Embedding Benchmark

Chenghao Xiao, Isaac Chung, Imene Kerboua et al.

ICCV 2025posterarXiv:2504.10471
6
citations
#4049

Prediction-Feedback DETR for Temporal Action Detection

Jihwan Kim, Miso Lee, Cheol-Ho Cho et al.

AAAI 2025paperarXiv:2408.16729
6
citations
#4050

SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting

Mengjiao Ma, Qi Ma, Yue Li et al.

NEURIPS 2025posterarXiv:2506.08710
6
citations
#4051

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Yanrui Bin, Wenbo Hu, Haoyuan Wang et al.

ICCV 2025posterarXiv:2504.11427
6
citations
#4052

CausalPFN: Amortized Causal Effect Estimation via In-Context Learning

Vahid Balazadeh, Hamidreza Kamkari, Valentin Thomas et al.

NEURIPS 2025spotlightarXiv:2506.07918
6
citations
#4053

SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning

Zhi Chen, Zecheng Zhao, Jingcai Guo et al.

ICCV 2025posterarXiv:2503.10252
6
citations
#4054

REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents

Rui Tian, Qi Dai, Jianmin Bao et al.

ICCV 2025posterarXiv:2411.13552
6
citations
#4055

Auto-Regressive Diffusion for Generating 3D Human-Object Interactions

Zichen Geng, Zeeshan Hayder, Wei Liu et al.

AAAI 2025paperarXiv:2503.16801
6
citations
#4056

Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning

Hung Le, Dung Nguyen, Kien Do et al.

ICLR 2025posterarXiv:2410.10132
6
citations
#4057

SpotActor: Training-Free Layout-Controlled Consistent Image Generation

Jiahao Wang, Caixia Yan, Weizhan Zhang et al.

AAAI 2025paperarXiv:2409.04801
6
citations
#4058

Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection

Hongsong Wang, Andi Xu, Pinle Ding et al.

AAAI 2025paperarXiv:2412.17210
6
citations
#4059

BadRobot: Jailbreaking Embodied LLM Agents in the Physical World

Hangtao Zhang, Chenyu Zhu, Xianlong Wang et al.

ICLR 2025poster
6
citations
#4060

QCS:Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition

Chengpeng Wang, Li Chen, Lili Wang et al.

AAAI 2025paperarXiv:2411.01988
6
citations
#4061

CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension

Rui Li, Zeyu Zhang, Xiaohe Bo et al.

NEURIPS 2025posterarXiv:2510.05520
6
citations
#4062

Seg4Diff: Unveiling Open-Vocabulary Semantic Segmentation in Text-to-Image Diffusion Transformers

Chaehyun Kim, Heeseong Shin, Eunbeen Hong et al.

NEURIPS 2025poster
6
citations
#4063

HUMOTO: A 4D Dataset of Mocap Human Object Interactions

Jiaxin Lu, Chun-Hao Huang, Uttaran Bhattacharya et al.

ICCV 2025posterarXiv:2504.10414
6
citations
#4064

AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs

Sanjoy Chowdhury, Hanan Gani, Nishit Anand et al.

ICCV 2025posterarXiv:2503.23219
6
citations
#4065

HiBug2: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging

Muxi Chen, Chenchen Zhao, Qiang Xu

ICLR 2025posterarXiv:2501.16751
6
citations
#4066

Space Group Equivariant Crystal Diffusion

Rees Chang, Angela Pak, Alex Guerra et al.

NEURIPS 2025posterarXiv:2505.10994
6
citations
#4067

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer

Haopeng Sun, Yingwei Zhang, Lumin Xu et al.

AAAI 2025paperarXiv:2412.10181
6
citations
#4068

Bridging the Gap between Database Search and \emph{De Novo} Peptide Sequencing with SearchNovo

Jun Xia, Sizhe Liu, Jingbo Zhou et al.

ICLR 2025poster
6
citations
#4069

Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment

Haoyuan Wu, Haisheng Zheng, Yuan Pu et al.

ICLR 2025posterarXiv:2502.12732
6
citations
#4070

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Yufan Shen, Chuwei Luo, Zhaoqing Zhu et al.

AAAI 2025paperarXiv:2407.12358
6
citations
#4071

AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Javier Tirado-Garín, Javier Civera

ICCV 2025posterarXiv:2503.12701
6
citations
#4072

Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers

Zhengliang Shi, Lingyong Yan, Dawei Yin et al.

NEURIPS 2025posterarXiv:2505.20128
6
citations
#4073

Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification

Hsun-Yu Kuo, Yin-Hsiang Liao, Yu-Chieh Chao et al.

ICLR 2025posterarXiv:2410.21526
6
citations
#4074

H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving

Siran Chen, Yuxiao Luo, Yue Ma et al.

AAAI 2025paperarXiv:2501.04302
6
citations
#4075

DyMU: Dynamic Merging and Virtual Unmerging for Efficient Variable-Length VLMs

Zhenhailong Wang, Senthil Purushwalkam, Caiming Xiong et al.

NEURIPS 2025poster
6
citations
#4076

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

Jianqun Zhou, Yuanlei Zheng, Wei Chen et al.

ICLR 2025posterarXiv:2410.23841
6
citations
#4077

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

AAAI 2025paperarXiv:2412.01857
6
citations
#4078

Learned Image Compression with Hierarchical Progressive Context Modeling

Yuqi Li, Haotian Zhang, Li Li et al.

ICCV 2025posterarXiv:2507.19125
6
citations
#4079

Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

Pengcheng Jiang, Cao Xiao, Tianfan Fu et al.

AAAI 2025paperarXiv:2306.01631
6
citations
#4080

Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification

Yucong Meng, Zhiwei Yang, Yonghong Shi et al.

AAAI 2025paperarXiv:2412.10776
6
citations
#4081

Federated Continual Instruction Tuning

Haiyang Guo, Fanhu Zeng, Fei Zhu et al.

ICCV 2025posterarXiv:2503.12897
6
citations
#4082

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Yansong Guo, Jie Hu, Yansong Qu et al.

ICCV 2025posterarXiv:2503.08407
6
citations
#4083

Growth Inhibitors for Suppressing Inappropriate Image Concepts in Diffusion Models

Die Chen, Zhiwen Li, Mingyuan Fan et al.

ICLR 2025posterarXiv:2408.01014
6
citations
#4084

Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models

Cameron Tice, Philipp Kreer, Nathan Helm-Burger et al.

NEURIPS 2025posterarXiv:2412.01784
6
citations
#4085

GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning

Minghao Xu, Yunteng Geng, Yihang Zhang et al.

ICLR 2025posterarXiv:2405.16206
6
citations
#4086

Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment

Yang Liu, Mengyuan Liu, Shudong Huang et al.

AAAI 2025paperarXiv:2503.06974
6
citations
#4087

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao et al.

NEURIPS 2025posterarXiv:2507.00833
6
citations
#4088

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Xianhang Li, Yanqing Liu, Haoqin Tu et al.

ICCV 2025posterarXiv:2505.04601
6
citations
#4089

Uncertain Multimodal Intention and Emotion Understanding in the Wild

Qu Yang, QingHongYa Shi, Tongxin Wang et al.

CVPR 2025poster
6
citations
#4090

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Runze Zhang, Guoguang Du, Xiaochuan Li et al.

ICCV 2025highlightarXiv:2503.06053
6
citations
#4091

Tight Clusters Make Specialized Experts

Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.

ICLR 2025posterarXiv:2502.15315
6
citations
#4092

HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

Jingyu Lin, Jiaqi Gu, Lubin Fan et al.

CVPR 2025posterarXiv:2412.03844
6
citations
#4093

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Fucai Ke, Vijay Kumar b g, Xingjian Leng et al.

ICCV 2025posterarXiv:2503.19263
6
citations
#4094

Multimodal LLMs as Customized Reward Models for Text-to-Image Generation

Shijie Zhou, Ruiyi Zhang, Huaisheng Zhu et al.

ICCV 2025posterarXiv:2507.21391
6
citations
#4095

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

Jingjing Hu, Dan Guo, Zhan Si et al.

AAAI 2025paperarXiv:2412.16483
6
citations
#4096

Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

Itay Benou, Tammy Riklin Raviv

CVPR 2025highlightarXiv:2502.20134
6
citations
#4097

Týr-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization

Guanchen Li, Yixing Xu, Zeping Li et al.

NEURIPS 2025posterarXiv:2503.09657
6
citations
#4098

Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation

Shivam Duggal, Yushi Hu, Oscar Michel et al.

CVPR 2025posterarXiv:2504.18509
6
citations
#4099

Breaking Neural Network Scaling Laws with Modularity

Akhilan Boopathy, Sunshine Jiang, William Yue et al.

ICLR 2025posterarXiv:2409.05780
6
citations
#4100

Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework

Jian-Jian Jiang, Xiao-Ming Wu, Yi-Xiang He et al.

ICCV 2025posterarXiv:2503.09186
6
citations
#4101

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

Yunheng Li, Jing Cheng, Shaoyong Jia et al.

NEURIPS 2025oralarXiv:2509.18056
6
citations
#4102

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Xilin He, Cheng Luo, Xiaole Xian et al.

ICCV 2025posterarXiv:2410.09865
6
citations
#4103

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Jun Zhang, Jue Wang, Huan Li et al.

ICLR 2025posterarXiv:2502.13533
6
citations
#4104

FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution

Gene Chou, Wenqi Xian, Guandao Yang et al.

ICCV 2025highlightarXiv:2504.07093
6
citations
#4105

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

Zhiwei Xu, Zhiyu Ni, Yixin Wang et al.

ICLR 2025posterarXiv:2504.13292
6
citations
#4106

FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors

Changlong Shi, He Zhao, Bingjie Zhang et al.

CVPR 2025posterarXiv:2503.15842
6
citations
#4107

POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation

Jian Wang, Tianhong Dai, Bingfeng Zhang et al.

CVPR 2025poster
6
citations
#4108

3D-MVP: 3D Multiview Pretraining for Manipulation

Shengyi Qian, Kaichun Mo, Valts Blukis et al.

CVPR 2025poster
6
citations
#4109

Zebra-Llama: Towards Extremely Efficient Hybrid Models

Mingyu Yang, Mehdi Rezagholizadeh, Guihong Li et al.

NEURIPS 2025posterarXiv:2505.17272
6
citations
#4110

Gradient descent with generalized Newton’s method

Zhiqi Bu, Shiyun Xu

ICLR 2025posterarXiv:2407.02772
6
citations
#4111

Doubly Contrastive Learning for Source-Free Domain Adaptive Person Search

Yizhen Jia, Rong Quan, Yue Feng et al.

AAAI 2025paper
6
citations
#4112

OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents

Zhaolin Hu, Yixiao Zhou, Zhongan Wang et al.

ICLR 2025poster
6
citations
#4113

Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness

Boqian Wu, Qiao Xiao, Shunxin Wang et al.

ICLR 2025posterarXiv:2410.03030
6
citations
#4114

FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations

Hmrishav Bandyopadhyay, Yi-Zhe Song

CVPR 2025posterarXiv:2411.10818
6
citations
#4115

Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation

Reza Qorbani, Gianluca Villani, Theodoros Panagiotakopoulos et al.

CVPR 2025posterarXiv:2503.21780
6
citations
#4116

CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning

Jiangpeng He, Zhihao Duan, Fengqing Zhu

CVPR 2025posterarXiv:2505.24816
6
citations
#4117

Rethinking Spiking Self-Attention Mechanism: Implementing α-XNOR Similarity Calculation in Spiking Transformers

Yichen Xiao, Shuai Wang, Dehao Zhang et al.

CVPR 2025poster
6
citations
#4118

Stealthy Shield Defense: A Conditional Mutual Information-Based Approach against Black-Box Model Inversion Attacks

Tianqu Zhuang, Hongyao Yu, Yixiang Qiu et al.

ICLR 2025poster
6
citations
#4119

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Zheng-Peng Duan, Jiawei Zhang, Zheng Lin et al.

AAAI 2025paperarXiv:2407.03757
6
citations
#4120

Neural Eulerian Scene Flow Fields

Kyle Vedder, Neehar Peri, Ishan Khatri et al.

ICLR 2025posterarXiv:2410.02031
6
citations
#4121

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models

Jingcheng Deng, Zihao Wei, Liang Pang et al.

ICLR 2025posterarXiv:2405.15349
6
citations
#4122

Expressivity of Neural Networks with Random Weights and Learned Biases

Ezekiel Williams, Alexandre Payeur, Avery Ryoo et al.

ICLR 2025posterarXiv:2407.00957
6
citations
#4123

Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis

Letian Zhang, Quan Cui, Bingchen Zhao et al.

ICCV 2025posterarXiv:2503.08741
6
citations
#4124

WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration

Laibin Chang, Yunke Wang, Longxiang Deng et al.

AAAI 2025paper
6
citations
#4125

ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation

Hamed Ayoobi, Nico Potyka, Francesca Toni

AAAI 2025paperarXiv:2311.15438
6
citations
#4126

Multi-modal Knowledge Distillation-based Human Trajectory Forecasting

Jaewoo Jeong, Seohee Lee, Daehee Park et al.

CVPR 2025posterarXiv:2503.22201
6
citations
#4127

Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models

Chen Chen, Daochang Liu, Mubarak Shah et al.

CVPR 2025posterarXiv:2504.18032
6
citations
#4128

DRL: Decomposed Representation Learning for Tabular Anomaly Detection

Hangting Ye, He Zhao, Wei Fan et al.

ICLR 2025poster
6
citations
#4129

Precedence-Constrained Winter Value for Effective Graph Data Valuation

Hongliang Chi, Wei Jin, Charu Aggarwal et al.

ICLR 2025posterarXiv:2402.01943
6
citations
#4130

Dissecting Generalized Category Discovery: Multiplex Consensus under Self-Deconstruction

Luyao Tang, Kunze Huang, Yuxuan Yuan et al.

ICCV 2025highlightarXiv:2508.10731
6
citations
#4131

Where, What, Why: Towards Explainable Driver Attention Prediction

Yuchen Zhou, Jiayu Tang, Xiaoyan Xiao et al.

ICCV 2025highlightarXiv:2506.23088
6
citations
#4132

Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference

Denis Blessing, Julius Berner, Lorenz Richter et al.

NEURIPS 2025spotlightarXiv:2508.12511
6
citations
#4133

Federated Residual Low-Rank Adaption of Large Language Models

Yunlu Yan, Chun-Mei Feng, Wangmeng Zuo et al.

ICLR 2025poster
6
citations
#4134

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Yi Ding, Ruqi Zhang

NEURIPS 2025posterarXiv:2505.22651
6
citations
#4135

Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution

Siwei Tu, Ben Fei, Weidong Yang et al.

CVPR 2025highlightarXiv:2502.07814
6
citations
#4136

Federated Learning with Domain Shift Eraser

Zheng Wang, Zihui Wang, Zheng Wang et al.

CVPR 2025posterarXiv:2503.13063
6
citations
#4137

Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning

Yuxiang Lu, Shengcao Cao, Yu-Xiong Wang

ICLR 2025posterarXiv:2410.14633
6
citations
#4138

SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints

Ziqi Sheng, Wei Lu, Xiangyang Luo et al.

AAAI 2025paperarXiv:2412.09981
6
citations
#4139

Stiefel Flow Matching for Moment-Constrained Structure Elucidation

Austin H Cheng, Alston Lo, Kin Long Kelvin Lee et al.

ICLR 2025posterarXiv:2412.12540
6
citations
#4140

Tuning-Free Bilevel Optimization: New Algorithms and Convergence Analysis

Yifan Yang, Hao Ban, Minhui Huang et al.

ICLR 2025posterarXiv:2410.05140
6
citations
#4141

Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges

Meixia He, Peican Zhu, Keke Tang et al.

AAAI 2025paperarXiv:2412.18365
6
citations
#4142

Visual Persona: Foundation Model for Full-Body Human Customization

Jisu Nam, Soowon Son, Zhan Xu et al.

CVPR 2025posterarXiv:2503.15406
6
citations
#4143

Keyframe-Guided Creative Video Inpainting

Yuwei Guo, Ceyuan Yang, Anyi Rao et al.

CVPR 2025poster
6
citations
#4144

Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment

Yuze Zhao, Tianyun Ji, Wenjun Feng et al.

ICLR 2025posterarXiv:2502.13170
6
citations
#4145

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Guannan Lai, Yujie Li, Xiangkun Wang et al.

CVPR 2025posterarXiv:2502.20032
6
citations
#4146

CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation

Reza Abbasi, Ali Nazari, Aminreza Sefid et al.

CVPR 2025posterarXiv:2502.19842
6
citations
#4147

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

Tongda Xu, Jiahao Li, Bin Li et al.

CVPR 2025posterarXiv:2505.05853
6
citations
#4148

MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception

Wenzhuo Liu, Wenshuo Wang, Yicheng Qiao et al.

CVPR 2025posterarXiv:2504.02264
6
citations
#4149

PhysSplat: Efficient Physics Simulation for 3D Scenes via MLLM-Guided Gaussian Splatting

Haoyu Zhao, Hao Wang, Xingyue Zhao et al.

ICCV 2025poster
6
citations
#4150

Joint Out-of-Distribution Filtering and Data Discovery Active Learning

Sebastian Schmidt, Leonard Schenk, Leo Schwinn et al.

CVPR 2025posterarXiv:2503.02491
6
citations
#4151

Mask in the Mirror: Implicit Sparsification

Tom Jacobs, Rebekka Burkholz

ICLR 2025posterarXiv:2408.09966
6
citations
#4152

VALLR: Visual ASR Language Model for Lip Reading

Marshall Thomas, Edward Fish, Richard Bowden

ICCV 2025posterarXiv:2503.21408
6
citations
#4153

FedTMOS: Efficient One-Shot Federated Learning with Tsetlin Machine

Shannon How, Jagmohan Chauhan, Geoff Merrett et al.

ICLR 2025poster
6
citations
#4154

HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models

Yu Zhou, Xingyu Wu, Jibin Wu et al.

NEURIPS 2025spotlightarXiv:2409.18893
6
citations
#4155

RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings

Aayush Dhakal, Srikumar Sastry, Subash Khanal et al.

CVPR 2025posterarXiv:2502.19781
6
citations
#4156

IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

Quan Zhang, Yuxin Qi, Xi Tang et al.

ICLR 2025posterarXiv:2502.02454
6
citations
#4157

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Fusheng Liu, Qianxiao Li

ICLR 2025oralarXiv:2411.19455
6
citations
#4158

EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Kaizhi Zheng, Xiaotong Chen, Xuehai He et al.

ICLR 2025posterarXiv:2410.12836
6
citations
#4159

Guiding Human-Object Interactions with Rich Geometry and Relations

Mengqing Xue, Yifei Liu, Ling Guo et al.

CVPR 2025posterarXiv:2503.20172
6
citations
#4160

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Enshu Liu, Junyi Zhu, Zinan Lin et al.

ICLR 2025posterarXiv:2404.02241
6
citations
#4161

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Tomas Soucek, Prajwal Gatti, Michael Wray et al.

CVPR 2025posterarXiv:2412.01987
6
citations
#4162

SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training

Yehonathan Refael, Guy Smorodinsky, Tom Tirer et al.

NEURIPS 2025posterarXiv:2505.24749
6
citations
#4163

SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization

Xiaofeng Tan, Hongsong Wang, Xin Geng et al.

NEURIPS 2025posterarXiv:2412.05095
6
citations
#4164

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

Mingju Gao, Yike Pan, Huan-ang Gao et al.

CVPR 2025posterarXiv:2503.19913
6
citations
#4165

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Siyuan Li, Luyuan Zhang, Zedong Wang et al.

CVPR 2025posterarXiv:2504.00999
6
citations
#4166

Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild

Junhyeong Cho, Kim Youwang, Hunmin Yang et al.

CVPR 2025posterarXiv:2403.14539
6
citations
#4167

Learning-Augmented Search Data Structures

Chunkai Fu, Brandon G. Nguyen, Jung Seo et al.

ICLR 2025posterarXiv:2402.10457
6
citations
#4168

Provable Scaling Laws for the Test-Time Compute of Large Language Models

Yanxi Chen, Xuchen Pan, Yaliang Li et al.

NEURIPS 2025posterarXiv:2411.19477
6
citations
#4169

GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping

Jinfeng Liu, Lingtong Kong, Bo Li et al.

CVPR 2025posterarXiv:2503.10143
6
citations
#4170

BHViT: Binarized Hybrid Vision Transformer

Tian Gao, Yu Zhang, Zhiyuan Zhang et al.

CVPR 2025posterarXiv:2503.02394
6
citations
#4171

End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler

Denis Blessing, Xiaogang Jia, Gerhard Neumann

ICLR 2025posterarXiv:2503.00524
6
citations
#4172

Multimodal Tabular Reasoning with Privileged Structured Information

Jun-Peng Jiang, Yu Xia, Hai-Long Sun et al.

NEURIPS 2025posterarXiv:2506.04088
6
citations
#4173

Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment

Yang Bai, Yucheng Ji, Min Cao et al.

CVPR 2025poster
6
citations
#4174

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

ICLR 2025posterarXiv:2410.03097
6
citations
#4175

Decoupling Angles and Strength in Low-rank Adaptation

Massimo Bini, Leander Girrbach, Zeynep Akata

ICLR 2025posterarXiv:2503.18225
6
citations
#4176

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Xingzhuo Guo, Yu Zhang, Baixu Chen et al.

ICLR 2025oralarXiv:2503.00951
6
citations
#4177

ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models

Yassir Bendou, Amine Ouasfi, Vincent Gripon et al.

CVPR 2025posterarXiv:2501.11175
6
citations
#4178

Improving Energy Natural Gradient Descent through Woodbury, Momentum, and Randomization

Andrés Guzmán-Cordero, Felix Dangel, Gil Goldshlager et al.

NEURIPS 2025posterarXiv:2505.12149
6
citations
#4179

Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

Xiaoqian Shen, Wenxuan Zhang, Jun Chen et al.

NEURIPS 2025oralarXiv:2510.14032
6
citations
#4180

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025posterarXiv:2506.05615
6
citations
#4181

Spreading Out-of-Distribution Detection on Graphs

Daeho Um, Jongin Lim, Sunoh Kim et al.

ICLR 2025poster
6
citations
#4182

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Hongwei Li, Yuheng Tang, Shiqi Wang et al.

ICML 2025posterarXiv:2502.02747
6
citations
#4183

Hierarchical Graph Tokenization for Molecule-Language Alignment

Yongqiang Chen, QUANMING YAO, Juzheng Zhang et al.

ICML 2025posterarXiv:2406.14021
6
citations
#4184

Prediction-Powered E-Values

Daniel Csillag, Claudio Struchiner, Guilherme Tegoni Goedert

ICML 2025posterarXiv:2502.04294
6
citations
#4185

Golden Cudgel Network for Real-Time Semantic Segmentation

Guoyu Yang, Yuan Wang, Daming Shi et al.

CVPR 2025posterarXiv:2503.03325
6
citations
#4186

Revisiting a Design Choice in Gradient Temporal Difference Learning

Xiaochi Qian, Shangtong Zhang

ICLR 2025oralarXiv:2308.01170
6
citations
#4187

RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance

Yuheng Jiang, Zhehao Shen, Chengcheng Guo et al.

CVPR 2025posterarXiv:2503.12242
6
citations
#4188

Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

Zhenqing Ling, Daoyuan Chen, Liuyi Yao et al.

NEURIPS 2025posterarXiv:2502.04380
6
citations
#4189

Augmented Deep Contexts for Spatially Embedded Video Coding

Yifan Bian, Chuanbo Tang, Li Li et al.

CVPR 2025highlightarXiv:2505.05309
6
citations
#4190

Scene-Centric Unsupervised Panoptic Segmentation

Oliver Hahn, Christoph Reich, Nikita Araslanov et al.

CVPR 2025highlightarXiv:2504.01955
6
citations
#4191

3D-GSW: 3D Gaussian Splatting for Robust Watermarking

Youngdong Jang, Hyunje Park, Feng Yang et al.

CVPR 2025posterarXiv:2409.13222
6
citations
#4192

On the Identification of Temporal Causal Representation with Instantaneous Dependence

Zijian Li, Yifan Shen, Kaitao Zheng et al.

ICLR 2025oralarXiv:2405.15325
6
citations
#4193

Student-Informed Teacher Training

Nico Messikommer, Jiaxu Xing, Elie Aljalbout et al.

ICLR 2025posterarXiv:2412.09149
6
citations
#4194

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

Yangyu Huang, Tianyi Gao, Haoran Xu et al.

CVPR 2025posterarXiv:2501.06184
6
citations
#4195

GroupMamba: Efficient Group-Based Visual State Space Model

Abdelrahman Shaker, Syed Talal Wasim, Salman Khan et al.

CVPR 2025posterarXiv:2407.13772
6
citations
#4196

COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection

Jinqi Xiao, Shen Sang, Tiancheng Zhi et al.

CVPR 2025posterarXiv:2412.00071
6
citations
#4197

Generative Sparse-View Gaussian Splatting

Hanyang Kong, Xingyi Yang, Xinchao Wang

CVPR 2025poster
6
citations
#4198

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025posterarXiv:2412.10827
6
citations
#4199

Reference-Based 3D-Aware Image Editing with Triplanes

Bahri Batuhan Bilecen, Yiğit Yalın, Ning Yu et al.

CVPR 2025highlightarXiv:2404.03632
6
citations
#4200

Momentum Multi-Marginal Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Augustinos Saravanos, Evangelos Theodorou et al.

NEURIPS 2025oralarXiv:2506.10168
6
citations