Most Cited 2025 "mixed tail generic chaining" Papers

22,274 papers found • Page 14 of 112

#2601

MiniPLM: Knowledge Distillation for Pre-training Language Models

Yuxian Gu, Hao Zhou, Fandong Meng et al.

ICLR 2025arXiv:2410.17215
18
citations
#2602

Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts

Hongcheng Gao, Tianyu Pang, Chao Du et al.

ICCV 2025arXiv:2410.12777
18
citations
#2603

Structured Preconditioners in Adaptive Optimization: A Unified Analysis

Shuo Xie, Tianhao Wang, Sashank J. Reddi et al.

ICML 2025arXiv:2503.10537
18
citations
#2604

Reasoning of Large Language Models over Knowledge Graphs with Super-Relations

Song Wang, Junhong Lin, Xiaojie Guo et al.

ICLR 2025arXiv:2503.22166
18
citations
#2605

Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs

Soonbin Lee, Fangwen Shu, Yago Sanchez de la Fuente et al.

ICCV 2025arXiv:2501.03399
18
citations
#2606

Improving Reasoning Performance in Large Language Models via Representation Engineering

Bertram Højer, Oliver Jarvis, Stefan Heinrich

ICLR 2025arXiv:2504.19483
18
citations
#2607

3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views

Xiaobiao Du, Yida Wang, Haiyang Sun et al.

ICCV 2025arXiv:2406.04875
18
citations
#2608

Wasserstein Flow Matching: Generative Modeling Over Families of Distributions

Doron Haviv, Aram-Alexandre Pooladian, Dana Pe'er et al.

ICML 2025arXiv:2411.00698
18
citations
#2609

Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

Junha Hyung, Kinam Kim, Susung Hong et al.

CVPR 2025arXiv:2411.18664
18
citations
#2610

ProteinBench: A Holistic Evaluation of Protein Foundation Models

Fei YE, Zaixiang Zheng, Dongyu Xue et al.

ICLR 2025arXiv:2409.06744
18
citations
#2611

Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving

Xiang Li, Pengfei Li, Yupeng Zheng et al.

ICLR 2025oralarXiv:2502.07309
18
citations
#2612

Boosting MLLM Reasoning with Text-Debiased Hint-GRPO

Qihan Huang, Weilong Dai, Jinlong Liu et al.

ICCV 2025arXiv:2503.23905
18
citations
#2613

AgentAuditor: Human-level Safety and Security Evaluation for LLM Agents

Hanjun Luo, Shenyu Dai, Chiming Ni et al.

NEURIPS 2025arXiv:2506.00641
18
citations
#2614

Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency

Jerry Yao-Chieh Hu, Wei-Po Wang, Ammar Gilani et al.

ICLR 2025arXiv:2411.16525
18
citations
#2615

MagicArticulate: Make Your 3D Models Articulation-Ready

Chaoyue Song, Jianfeng Zhang, Xiu Li et al.

CVPR 2025arXiv:2502.12135
18
citations
#2616

DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra

Montgomery Bohde, Mrunali Manjrekar, Runzhong Wang et al.

ICML 2025arXiv:2502.09571
18
citations
#2617

S^3cMath: Spontaneous Step-Level Self-Correction Makes Large Language Models Better Mathematical Reasoners

Yuchen Yan, Jin Jiang, Yang Liu et al.

AAAI 2025paperarXiv:2409.01524
18
citations
#2618

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Wenxiang Guo, Yu Zhang, Changhao Pan et al.

AAAI 2025paperarXiv:2502.12572
18
citations
#2619

Ref-GS: Directional Factorization for 2D Gaussian Splatting

Youjia Zhang, Anpei Chen, Yumin Wan et al.

CVPR 2025arXiv:2412.00905
18
citations
#2620

What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning

Yiran Ma, Zui Chen, Tianqiao Liu et al.

AAAI 2025paperarXiv:2412.15904
18
citations
#2621

VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation

Shoubin Yu, Difan Liu, Ziqiao Ma et al.

ICCV 2025arXiv:2503.14350
18
citations
#2622

INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations

Yongming Zhu, Longhao Zhang, Zhengkun Rong et al.

CVPR 2025arXiv:2412.04037
18
citations
#2623

Scalable Influence and Fact Tracing for Large Language Model Pretraining

Tyler Chang, Dheeraj Rajagopal, Tolga Bolukbasi et al.

ICLR 2025arXiv:2410.17413
18
citations
#2624

LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization

Wenzhe Niu, Zongxia Xie, Yanru Sun et al.

ICML 2025oralarXiv:2503.08271
18
citations
#2625

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis

Jiangyong Huang, Baoxiong Jia, Yan Wang et al.

CVPR 2025arXiv:2503.22420
18
citations
#2626

Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval

Yuanmin Tang, Jue Zhang, Xiaoting Qin et al.

CVPR 2025highlightarXiv:2412.11077
18
citations
#2627

Spiking Transformer with Spatial-Temporal Attention

Donghyun Lee, Yuhang Li, Youngeun Kim et al.

CVPR 2025arXiv:2409.19764
18
citations
#2628

Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding

Weiyu Guo, Ziyang Chen, Shaoguang WANG et al.

NEURIPS 2025oralarXiv:2503.13139
18
citations
#2629

GigaHands: A Massive Annotated Dataset of Bimanual Hand Activities

Rao Fu, Dingxi Zhang, Alex Jiang et al.

CVPR 2025highlightarXiv:2412.04244
18
citations
#2630

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Yiheng Li, RuiBing Hou, Hong Chang et al.

CVPR 2025highlightarXiv:2411.16781
18
citations
#2631

Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning

Hanyang Zhao, Haoxian Chen, Ji Zhang et al.

ICML 2025arXiv:2502.01819
18
citations
#2632

Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Haotian Luo, Haiying He, Yibo Wang et al.

NEURIPS 2025arXiv:2504.21659
18
citations
#2633

PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection

Jianan Ye, Weiguang Zhao, Xi Yang et al.

CVPR 2025arXiv:2412.12617
18
citations
#2634

Discretization-invariance? On the Discretization Mismatch Errors in Neural Operators

Wenhan Gao, Ruichen Xu, Yuefan Deng et al.

ICLR 2025
18
citations
#2635

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

Xiaojun Jia, Sensen Gao, Simeng Qin et al.

NEURIPS 2025arXiv:2505.21494
18
citations
#2636

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

Mingkun Lei, Xue Song, Beier Zhu et al.

CVPR 2025arXiv:2412.08503
18
citations
#2637

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Zhaolin Gao, Wenhao Zhan, Jonathan Chang et al.

ICLR 2025arXiv:2410.04612
18
citations
#2638

Training-Free Efficient Video Generation via Dynamic Token Carving

Yuechen Zhang, Jinbo Xing, bin xia et al.

NEURIPS 2025arXiv:2505.16864
18
citations
#2639

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Zigeng Chen, Xinyin Ma, Gongfan Fang et al.

NEURIPS 2025arXiv:2505.17941
18
citations
#2640

ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation

Yupeng Hou, Jianmo Ni, Zhankui He et al.

ICML 2025spotlightarXiv:2502.13581
18
citations
#2641

MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

Kangyu Zhu, Peng Xia, Yun Li et al.

ICML 2025arXiv:2412.06141
18
citations
#2642

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability

Weijie Zhou, Manli Tao, Chaoyang Zhao et al.

CVPR 2025arXiv:2503.08481
18
citations
#2643

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Souradip Chakraborty, Sujay Bhatt, Udari Sehwag et al.

ICLR 2025arXiv:2503.21720
18
citations
#2644

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

Xiaoyuan Liu, Tian Liang, Zhiwei He et al.

NEURIPS 2025arXiv:2505.13445
18
citations
#2645

EuroBERT: Scaling Multilingual Encoders for European Languages

Nicolas Boizard, Hippolyte Gisserot-Boukhlef, Duarte Miguel Alves et al.

COLM 2025paperarXiv:2503.05500
18
citations
#2646

FatesGS: Fast and Accurate Sparse-View Surface Reconstruction Using Gaussian Splatting with Depth-Feature Consistency

Han Huang, Yulun Wu, Chao Deng et al.

AAAI 2025paperarXiv:2501.04628
18
citations
#2647

Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models

Dvir Samuel, Barak Meiri, Haggai Maron et al.

ICLR 2025arXiv:2312.12540
18
citations
#2648

Boosting Neural Combinatorial Optimization for Large-Scale Vehicle Routing Problems

Fu Luo, Xi Lin, Yaoxin Wu et al.

ICLR 2025
18
citations
#2649

Better Instruction-Following Through Minimum Bayes Risk

Ian Wu, Patrick Fernandes, Amanda Bertsch et al.

ICLR 2025arXiv:2410.02902
18
citations
#2650

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Xu He, Zhiyong Wu, Xiaoyu Li et al.

AAAI 2025paperarXiv:2408.14211
18
citations
#2651

One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models

Viacheslav Surkov, Chris Wendler, Antonio Mari et al.

NEURIPS 2025arXiv:2410.22366
18
citations
#2652

Distillation of Discrete Diffusion through Dimensional Correlations

Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi et al.

ICML 2025arXiv:2410.08709
18
citations
#2653

HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation

Boyuan Wang, Xiaofeng Wang, Chaojun Ni et al.

CVPR 2025arXiv:2503.24026
18
citations
#2654

Adversarial Reasoning at Jailbreaking Time

Mahdi Sabbaghi, Paul Kassianik, George Pappas et al.

ICML 2025arXiv:2502.01633
18
citations
#2655

Any6D: Model-free 6D Pose Estimation of Novel Object

Taeyeop Lee, Bowen Wen, Minjun Kang et al.

CVPR 2025arXiv:2503.18673
18
citations
#2656

Air Quality Prediction with Physics-Guided Dual Neural ODEs in Open Systems

jindong tian, Yuxuan Liang, Ronghui Xu et al.

ICLR 2025oralarXiv:2410.19892
18
citations
#2657

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

zhenwei Wang, Tengfei Wang, Zexin He et al.

ICLR 2025arXiv:2409.11406
18
citations
#2658

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Haotian Sun, Tao Lei, Bowen Zhang et al.

ICLR 2025arXiv:2410.02098
18
citations
#2659

A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning

Chen-Yu Liu, Chao-Han Huck Yang, Hsi-Sheng Goan et al.

ICLR 2025arXiv:2410.09846
18
citations
#2660

LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement

Jieming Bian, Lei Wang, Letian Zhang et al.

ICCV 2025arXiv:2411.14961
18
citations
#2661

TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing

Stefan Lionar, Jiabin Liang, Gim Hee Lee

CVPR 2025arXiv:2503.11629
18
citations
#2662

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples

Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor et al.

AAAI 2025paperarXiv:2404.19460
18
citations
#2663

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Soumya Suvra Ghosal, Souradip Chakraborty, Vaibhav Singh et al.

CVPR 2025arXiv:2411.18688
18
citations
#2664

Merging on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

Anke Tang, Enneng Yang, Li Shen et al.

NEURIPS 2025
18
citations
#2665

Cubify Anything: Scaling Indoor 3D Object Detection

Justin Lazarow, David Griffiths, Gefen Kohavi et al.

CVPR 2025highlightarXiv:2412.04458
18
citations
#2666

MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning

Suning Huang, Zheyu Zhang, Tianhai Liang et al.

ICML 2025arXiv:2410.14972
18
citations
#2667

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Zhiyuan Zeng, Yizhong Wang, Hannaneh Hajishirzi et al.

COLM 2025paperarXiv:2503.08893
18
citations
#2668

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Chaoyang Wang, Peiye Zhuang, Tuan Duc Ngo et al.

CVPR 2025highlightarXiv:2412.04462
18
citations
#2669

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Philippe Hansen-Estruch, David Yan, Ching-Yao Chuang et al.

ICML 2025arXiv:2501.09755
18
citations
#2670

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction

Yuanhao Cai, He Zhang, Kai Zhang et al.

ICCV 2025arXiv:2411.14384
18
citations
#2671

FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution

Junyang Chen, Jinshan Pan, Jiangxin Dong

CVPR 2025arXiv:2411.18824
18
citations
#2672

4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration

Jiahui Zhang, Yurui Chen, Yueming Xu et al.

NEURIPS 2025oralarXiv:2506.22242
18
citations
#2673

Scaling Laws for Pre-training Agents and World Models

Tim Pearce, Tabish Rashid, David Bignell et al.

ICML 2025arXiv:2411.04434
18
citations
#2674

Understanding Long Videos with Multimodal Language Models

Kanchana Ranasinghe, Xiang Li, Kumara Kahatapitiya et al.

ICLR 2025arXiv:2403.16998
18
citations
#2675

TIME-FS: Joint Learning of Tensorial Incomplete Multi-View Unsupervised Feature Selection and Missing-View Imputation

Yanyong Huang, Minghui Lu, Wei Huang et al.

AAAI 2025paper
18
citations
#2676

Tracing Representation Progression: Analyzing and Enhancing Layer-Wise Similarity

Jiachen Jiang, Jinxin Zhou, Zhihui Zhu

ICLR 2025arXiv:2406.14479
18
citations
#2677

CognitionCapturer: Decoding Visual Stimuli from Human EEG Signal with Multimodal Information

Kaifan Zhang, Lihuo He, Xin Jiang et al.

AAAI 2025paperarXiv:2412.10489
18
citations
#2678

Is Artificial Intelligence Generated Image Detection a Solved Problem?

Ziqiang Li, Jiazhen Yan, Ziwen He et al.

NEURIPS 2025arXiv:2505.12335
18
citations
#2679

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning

Xiaoxue Cheng, Junyi Li, Zhenduo Zhang et al.

NEURIPS 2025arXiv:2505.16315
18
citations
#2680

Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models

Rui Ye, Jingyi Chai, Xiangrui Liu et al.

ICLR 2025arXiv:2406.10630
18
citations
#2681

DarkBench: Benchmarking Dark Patterns in Large Language Models

Esben Kran, Hieu Minh Nguyen, Akash Kundu et al.

ICLR 2025arXiv:2503.10728
18
citations
#2682

RoboScape: Physics-informed Embodied World Model

Yu Shang, Xin Zhang, Yinzhou Tang et al.

NEURIPS 2025oralarXiv:2506.23135
18
citations
#2683

GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation

Tao Feng, Yihang Sun, Jiaxuan You

ICLR 2025arXiv:2503.12600
18
citations
#2684

Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

Nikola Zubic, Federico Soldà, Aurelio Sulser et al.

ICLR 2025arXiv:2405.16674
18
citations
#2685

A Label-free Heterophily-guided Approach for Unsupervised Graph Fraud Detection

Junjun Pan, Yixin Liu, Xin Zheng et al.

AAAI 2025paperarXiv:2502.13308
18
citations
#2686

Perm: A Parametric Representation for Multi-Style 3D Hair Modeling

Chengan He, Xin Sun, Zhixin Shu et al.

ICLR 2025arXiv:2407.19451
18
citations
#2687

Exploring the Limits of Vision-Language-Action Manipulation in Cross-task Generalization

Jiaming Zhou, Ke Ye, Jiayi Liu et al.

NEURIPS 2025arXiv:2505.15660
18
citations
#2688

No Preference Left Behind: Group Distributional Preference Optimization

Binwei Yao, Zefan Cai, Yun-Shiuan Chuang et al.

ICLR 2025arXiv:2412.20299
18
citations
#2689

Generalization through variance: how noise shapes inductive biases in diffusion models

John Vastola

ICLR 2025arXiv:2504.12532
18
citations
#2690

Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting

Tong Ye, Yangkai Du, Tengfei Ma et al.

AAAI 2025paperarXiv:2405.16133
18
citations
#2691

SLMRec: Distilling Large Language Models into Small for Sequential Recommendation

Wujiang Xu, Qitian Wu, Zujie Liang et al.

ICLR 2025oralarXiv:2405.17890
18
citations
#2692

MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation

Akio Hayakawa, Masato Ishii, Takashi Shibuya et al.

ICLR 2025arXiv:2405.17842
18
citations
#2693

Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding

Yanming Liu, Xinyue Peng, Jiannan Cao et al.

ICLR 2025arXiv:2410.01671
18
citations
#2694

Video-Bench: Human-Aligned Video Generation Benchmark

Hui Han, Siyuan Li, Jiaqi Chen et al.

CVPR 2025arXiv:2504.04907
18
citations
#2695

Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds

Hao Liang, Zhiquan Luo

NEURIPS 2025arXiv:2210.14051
18
citations
#2696

KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference

Xing Li, Zeyu Xing, Yiming Li et al.

ICML 2025arXiv:2502.04420
18
citations
#2697

The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion

Changan Chen, Juze Zhang, Shrinidhi Kowshika Lakshmikanth et al.

CVPR 2025arXiv:2412.10523
18
citations
#2698

Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks

Haijin Zeng, Xiangming Wang, Yongyong Chen et al.

CVPR 2025arXiv:2503.16930
18
citations
#2699

Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models

Xiyuan Zhang, Danielle Maddix Robinson, Junming Yin et al.

NEURIPS 2025arXiv:2510.21204
18
citations
#2700

Interpreting Object-level Foundation Models via Visual Precision Search

Ruoyu Chen, Siyuan Liang, Jingzhi Li et al.

CVPR 2025highlightarXiv:2411.16198
18
citations
#2701

Adaptive Length Image Tokenization via Recurrent Allocation

Shivam Duggal, Phillip Isola, Antonio Torralba et al.

ICLR 2025arXiv:2411.02393
18
citations
#2702

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Mayee Chen, Michael Hu, Nicholas Lourie et al.

ICLR 2025arXiv:2411.05735
18
citations
#2703

ContextAgent: Context-Aware Proactive LLM Agents with Open-world Sensory Perceptions

Bufang Yang, Lilin Xu, Liekang Zeng et al.

NEURIPS 2025arXiv:2505.14668
18
citations
#2704

Structure-Adaptive Multi-View Graph Clustering for Remote Sensing Data

Renxiang Guan, Wenxuan Tu, Siwei Wang et al.

AAAI 2025paper
18
citations
#2705

Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset

Xiao Wang, Yu Jin, Wentao Wu et al.

CVPR 2025arXiv:2412.06647
18
citations
#2706

MCU: An Evaluation Framework for Open-Ended Game Agents

Xinyue Zheng, Haowei Lin, Kaichen He et al.

ICML 2025spotlightarXiv:2310.08367
18
citations
#2707

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

Yuan Wang, Ouxiang Li, Tingting Mu et al.

CVPR 2025arXiv:2412.06143
18
citations
#2708

DINO-Foresight: Looking into the Future with DINO

Efstathios Karypidis, Ioannis Kakogeorgiou, Spyridon Gidaris et al.

NEURIPS 2025arXiv:2412.11673
18
citations
#2709

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models

Yiran Guo, Lijie Xu, Jie Liu et al.

NEURIPS 2025arXiv:2505.23564
18
citations
#2710

Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

Frederik Pahde, Maximilian Dreyer, Moritz Weckbecker et al.

ICLR 2025arXiv:2202.03482
18
citations
#2711

A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1

Zhaoyi Li, Xiaohan Zhao, Dong-Dong Wu et al.

NEURIPS 2025arXiv:2503.10635
18
citations
#2712

MSP-MVS: Multi-Granularity Segmentation Prior Guided Multi-View Stereo

Zhenlong Yuan, Cong Liu, Fei Shen et al.

AAAI 2025paperarXiv:2407.19323
18
citations
#2713

Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families

Felipe Maia Polo, Seamus Somerstep, Leshem Choshen et al.

NEURIPS 2025arXiv:2412.06540
18
citations
#2714

Test-Time Learning for Large Language Models

Jinwu Hu, Zitian Zhang, Guohao Chen et al.

ICML 2025arXiv:2505.20633
18
citations
#2715

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Saurabh Jha, Rohan Arora, Yuji Watanabe et al.

ICML 2025oralarXiv:2502.05352
18
citations
#2716

IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification

Yuhao Wang, Yongfeng Lv, Pingping Zhang et al.

CVPR 2025arXiv:2503.10324
18
citations
#2717

VLM-R³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Chaoya Jiang, Yongrui Heng, Wei Ye et al.

NEURIPS 2025
18
citations
#2718

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model

Guangzhi Sun, Yudong Yang, Jimin Zhuang et al.

ICML 2025arXiv:2502.11775
18
citations
#2719

Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization

Phillip Guo, Aaquib Syed, Abhay Sheshadri et al.

ICML 2025spotlightarXiv:2410.12949
18
citations
#2720

FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning

Hang Guo, Yawei Li, Taolin Zhang et al.

ICCV 2025arXiv:2503.23367
18
citations
#2721

MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Zhifei Yang, Keyang Lu, Chao Zhang et al.

AAAI 2025paperarXiv:2502.05874
18
citations
#2722

Towards Adversarially Robust Dataset Distillation by Curvature Regularization

Eric Xue, Yijiang Li, Haoyang Liu et al.

AAAI 2025paperarXiv:2403.10045
18
citations
#2723

DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution

Zheng-Peng Duan, jiawei zhang, Xin Jin et al.

ICCV 2025arXiv:2503.23580
18
citations
#2724

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Ian Magnusson, Tai Nguyen, Ben Bogin et al.

ICML 2025arXiv:2504.11393
18
citations
#2725

Whoever Started the interference Should End It: Guiding Data-Free Model Merging via Task Vectors

Runxi Cheng, Feng Xiong, Yongxian Wei et al.

ICML 2025arXiv:2503.08099
18
citations
#2726

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

Xi Jiang, Jian Li, Hanqiu Deng et al.

ICLR 2025arXiv:2410.09453
18
citations
#2727

ReSim: Reliable World Simulation for Autonomous Driving

Jiazhi Yang, Kashyap Chitta, Shenyuan Gao et al.

NEURIPS 2025spotlightarXiv:2506.09981
18
citations
#2728

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Yuqing Wang, Zhijie Lin, Yao Teng et al.

ICCV 2025arXiv:2503.16430
18
citations
#2729

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Yuhao Zhou, Yiheng Wang, Xuming He et al.

NEURIPS 2025arXiv:2506.10521
18
citations
#2730

In-Context Editing: Learning Knowledge from Self-Induced Distributions

Siyuan Qi, Bangcheng Yang, Kailin Jiang et al.

ICLR 2025arXiv:2406.11194
18
citations
#2731

Palu: KV-Cache Compression with Low-Rank Projection

Chi-Chih Chang, Wei-Cheng Lin, Chien-Yu Lin et al.

ICLR 2025
18
citations
#2732

Active Learning for Neural PDE Solvers

Daniel Musekamp, Marimuthu Kalimuthu, David Holzmüller et al.

ICLR 2025arXiv:2408.01536
18
citations
#2733

CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

ZeMing Gong, Austin Wang, Xiaoliang Huo et al.

ICLR 2025arXiv:2405.17537
18
citations
#2734

GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting

Changkun Liu, Shuai Chen, Yash Bhalgat et al.

ICLR 2025arXiv:2408.11085
18
citations
#2735

MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects

Lei Fan, Dongdong Fan, Zhiguang Hu et al.

CVPR 2025arXiv:2412.04867
18
citations
#2736

Patch-wise Structural Loss for Time Series Forecasting

Dilfira Kudrat, Zongxia Xie, Yanru Sun et al.

ICML 2025oralarXiv:2503.00877
17
citations
#2737

Iterative Predictor-Critic Code Decoding for Real-World Image Dehazing

Jiayi Fu, Siyu Liu, Zikun Liu et al.

CVPR 2025arXiv:2503.13147
17
citations
#2738

Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model

Ziyuan Yang, Yingyu Chen, Zhiwen Wang et al.

CVPR 2025arXiv:2503.00908
17
citations
#2739

HRAvatar: High-Quality and Relightable Gaussian Head Avatar

Dongbin Zhang, Yunfei Liu, Lijian Lin et al.

CVPR 2025arXiv:2503.08224
17
citations
#2740

JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

yunlong lin, Zixu Lin, Haoyu Chen et al.

CVPR 2025arXiv:2504.04158
17
citations
#2741

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li

ICLR 2025arXiv:2410.08198
17
citations
#2742

OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision

Junjie Wang, Bin Chen, Bin Kang et al.

AAAI 2025paperarXiv:2405.17913
17
citations
#2743

DeLLMa: Decision Making Under Uncertainty with Large Language Models

Ollie Liu, Deqing Fu, Dani Yogatama et al.

ICLR 2025arXiv:2402.02392
17
citations
#2744

The Scene Language: Representing Scenes with Programs, Words, and Embeddings

Yunzhi Zhang, Zizhang Li, Matt Zhou et al.

CVPR 2025highlightarXiv:2410.16770
17
citations
#2745

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Kairong Luo, Haodong Wen, Shengding Hu et al.

ICLR 2025arXiv:2503.12811
17
citations
#2746

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations

Shengeng Tang, Jiayi He, Lechao Cheng et al.

CVPR 2025arXiv:2411.16810
17
citations
#2747

A Probabilistic Perspective on Unlearning and Alignment for Large Language Models

Yan Scholten, Stephan Günnemann, Leo Schwinn

ICLR 2025arXiv:2410.03523
17
citations
#2748

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

Zhiyang Guo, Jinxu Xiang, Kai Ma et al.

CVPR 2025highlightarXiv:2411.18197
17
citations
#2749

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Tony Cheng Tong, Sirui He, Zhiwen Shao et al.

AAAI 2025paperarXiv:2412.13647
17
citations
#2750

GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments

Enjun Du, Xunkai Li, Tian Jin et al.

NEURIPS 2025spotlightarXiv:2504.00711
17
citations
#2751

BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

Tao Tang, Dafeng Wei, Zhengyu Jia et al.

AAAI 2025paperarXiv:2401.01065
17
citations
#2752

The VLLM Safety Paradox: Dual Ease in Jailbreak Attack and Defense

Yangyang Guo, Fangkai Jiao, Liqiang Nie et al.

NEURIPS 2025arXiv:2411.08410
17
citations
#2753

Learning Clustering-based Prototypes for Compositional Zero-Shot Learning

Hongyu Qu, Jianan Wei, Xiangbo Shu et al.

ICLR 2025arXiv:2502.06501
17
citations
#2754

Emergence and scaling laws in SGD learning of shallow neural networks

Yunwei Ren, Eshaan Nichani, Denny Wu et al.

NEURIPS 2025arXiv:2504.19983
17
citations
#2755

Block-Attention for Efficient Prefilling

Dongyang Ma, Yan Wang, Tian Lan

ICLR 2025arXiv:2409.15355
17
citations
#2756

Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs

Lei Zhang, Yunshui Li, Jiaming Li et al.

AAAI 2025paperarXiv:2406.18294
17
citations
#2757

Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset

Yiqun Mei, Mingming He, Li Ma et al.

CVPR 2025arXiv:2503.14485
17
citations
#2758

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency

Kaiyue Wen, Huaqing Zhang, Hongzhou Lin et al.

ICLR 2025arXiv:2410.05459
17
citations
#2759

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Vaishnavh Nagarajan, Chen Wu, Charles Ding et al.

ICML 2025oralarXiv:2504.15266
17
citations
#2760

Spiking Vision Transformer with Saccadic Attention

Shuai Wang, Malu Zhang, Dehao Zhang et al.

ICLR 2025oralarXiv:2502.12677
17
citations
#2761

Force Prompting: Video Generation Models Can Learn And Generalize Physics-based Control Signals

Nate Gillman, Charles Herrmann, Michael Freeman et al.

NEURIPS 2025arXiv:2505.19386
17
citations
#2762

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

Yaming Yang, Dilxat Muhtar, Yelong Shen et al.

AAAI 2025paperarXiv:2410.09437
17
citations
#2763

Quamba: A Post-Training Quantization Recipe for Selective State Space Models

Hung-Yueh Chiang, Chi-Chih Chang, Natalia Frumkin et al.

ICLR 2025arXiv:2410.13229
17
citations
#2764

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation

Hongyin Zhang, Pengxiang Ding, Shangke Lyu et al.

ICLR 2025arXiv:2502.09268
17
citations
#2765

Magic Insert: Style-Aware Drag-and-Drop

Nataniel Ruiz, Yuanzhen Li, Neal Wadhwa et al.

ICCV 2025highlightarXiv:2407.02489
17
citations
#2766

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Junxuan Wang, Xuyang Ge, Wentao Shu et al.

ICLR 2025arXiv:2410.06672
17
citations
#2767

WyckoffDiff -- A Generative Diffusion Model for Crystal Symmetry

Filip Ekström Kelvinius, Oskar Andersson, Abhijith Parackal et al.

ICML 2025arXiv:2502.06485
17
citations
#2768

Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera

Yuliang Guo, Sparsh Garg, S. Mahdi H. Miangoleh et al.

CVPR 2025arXiv:2501.02464
17
citations
#2769

Learning Graph Quantized Tokenizers

Limei Wang, Kaveh Hassani, Si Zhang et al.

ICLR 2025arXiv:2410.13798
17
citations
#2770

Reasoning Limitations of Multimodal Large Language Models. A case study of Bongard Problems

Mikołaj Małkiński, Szymon Pawlonka, Jacek Mańdziuk

ICML 2025arXiv:2411.01173
17
citations
#2771

Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints

Utkarsh Utkarsh, Pengfei Cai, Alan Edelman et al.

NEURIPS 2025arXiv:2506.04171
17
citations
#2772

AllTracker: Efficient Dense Point Tracking at High Resolution

Adam Harley, Yang You, Yang Zheng et al.

ICCV 2025arXiv:2506.07310
17
citations
#2773

Controllable Context Sensitivity and the Knob Behind It

Julian Minder, Kevin Du, Niklas Stoehr et al.

ICLR 2025arXiv:2411.07404
17
citations
#2774

Learning Efficient Positional Encodings with Graph Neural Networks

Charilaos Kanatsoulis, Evelyn Choi, Stefanie Jegelka et al.

ICLR 2025arXiv:2502.01122
17
citations
#2775

Training Neural Networks as Recognizers of Formal Languages

Alexandra Butoi, Ghazal Khalighinejad, Anej Svete et al.

ICLR 2025arXiv:2411.07107
17
citations
#2776

Synthetic Face Datasets Generation via Latent Space Exploration from Brownian Identity Diffusion

David Geissbühler, Hatef Otroshi Shahreza, Sébastien Marcel

ICML 2025arXiv:2405.00228
17
citations
#2777

Power Lines: Scaling laws for weight decay and batch size in LLM pre-training

Shane Bergsma, Nolan Dey, Gurpreet Gosal et al.

NEURIPS 2025arXiv:2505.13738
17
citations
#2778

TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation

Jiajie Liu, Mengyuan Liu, Hong Liu et al.

AAAI 2025paperarXiv:2501.01770
17
citations
#2779

Swift4D: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstruction of dynamic scene

Jiahao Wu, Rui Peng, Zhiyan Wang et al.

ICLR 2025
17
citations
#2780

Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving

Yuhang Lu, Yichen Yao, Jiadong Tu et al.

AAAI 2025paperarXiv:2409.02914
17
citations
#2781

Falcon: Faster and Parallel Inference of Large Language Models Through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree

Xiangxiang Gao, Weisheng Xie, Yiwei Xiang et al.

AAAI 2025paperarXiv:2412.12639
17
citations
#2782

Locality-aware Gaussian Compression for Fast and High-quality Rendering

Seungjoo Shin, Jaesik Park, Sunghyun Cho

ICLR 2025arXiv:2501.05757
17
citations
#2783

xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar Memories

Maurice Kraus, Felix Divo, Devendra Singh Dhami et al.

NEURIPS 2025oralarXiv:2410.16928
17
citations
#2784

Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark

Yili Wang, Yixin Liu, Xu Shen et al.

ICLR 2025arXiv:2406.15523
17
citations
#2785

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

Ke Li, Di Wang, Zhangyuan Hu et al.

AAAI 2025paperarXiv:2412.09258
17
citations
#2786

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Jiawei He, Danshi Li, Xinqiang Yu et al.

ICCV 2025highlightarXiv:2507.02747
17
citations
#2787

RoboGround: Robotic Manipulation with Grounded Vision-Language Priors

Haifeng Huang, Xinyi Chen, Yilun Chen et al.

CVPR 2025arXiv:2504.21530
17
citations
#2788

Video Diffusion Models Are Strong Video Inpainter

Minhyeok Lee, Suhwan Cho, Chajin Shin et al.

AAAI 2025paperarXiv:2408.11402
17
citations
#2789

FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch

Virginia Aglietti, Ira Ktena, Jessica Schrouff et al.

ICML 2025arXiv:2406.04824
17
citations
#2790

Efficient Learning with Sine-Activated Low-Rank Matrices

Yiping Ji, Hemanth Saratchandran, Cameron Gordon et al.

ICLR 2025arXiv:2403.19243
17
citations
#2791

UniDet3D: Multi-dataset Indoor 3D Object Detection

Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin et al.

AAAI 2025paperarXiv:2409.04234
17
citations
#2792

Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Theodoros Kouzelis, Efstathios Karypidis, Ioannis Kakogeorgiou et al.

NEURIPS 2025spotlightarXiv:2504.16064
17
citations
#2793

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention

Weitai Kang, Mengxue Qu, Jyoti Kini et al.

ICLR 2025arXiv:2405.18295
17
citations
#2794

Text2midi: Generating Symbolic Music from Captions

Keshav Bhandari, Abhinaba Roy, Kyra Wang et al.

AAAI 2025paperarXiv:2412.16526
17
citations
#2795

Multi-Turn Code Generation Through Single-Step Rewards

Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen et al.

ICML 2025spotlightarXiv:2502.20380
17
citations
#2796

Learning 3D Persistent Embodied World Models

Siyuan Zhou, Yilun Du, Yuncong Yang et al.

NEURIPS 2025arXiv:2505.05495
17
citations
#2797

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Brian Bartoldson, Siddarth Venkatraman, James Diffenderfer et al.

NEURIPS 2025arXiv:2503.18929
17
citations
#2798

Simulating Human-like Daily Activities with Desire-driven Autonomy

Yiding Wang, Yuxuan Chen, Fangwei Zhong et al.

ICLR 2025oralarXiv:2412.06435
17
citations
#2799

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Yaoxiang Wang, Haoling Li, Xin Zhang et al.

ICML 2025arXiv:2501.04694
17
citations
#2800

Prompting Fairness: Integrating Causality to Debias Large Language Models

Jingling Li, Zeyu Tang, Xiaoyu Liu et al.

ICLR 2025arXiv:2403.08743
17
citations