All Papers

34,598 papers found • Page 542 of 692

Grokking Group Multiplication with Cosets

Dashiell Stander, Qinan Yu, Honglu Fan et al.

ICML 2024arXiv:2312.06581
17
citations

Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding

Noam Levi, Alon Beck, Yohai Bar-Sinai

ICLR 2024arXiv:2310.16441
22
citations

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Chuofan Ma, Yi Jiang, Jiannan Wu et al.

ECCV 2024arXiv:2404.13013
107
citations

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

Shaofei Cai, Bowei Zhang, Zihao Wang et al.

ICLR 2024spotlightarXiv:2310.08235
38
citations

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

Hyeonho Jeong, Jong Chul YE

ICLR 2024oralarXiv:2310.01107
60
citations

Grounded Object-Centric Learning

Avinash Kori, Francesco Locatello, Fabio De Sousa Ribeiro et al.

ICLR 2024
16
citations

Grounded Question-Answering in Long Egocentric Videos

Shangzhe Di, Weidi Xie

CVPR 2024arXiv:2312.06505
48
citations

Grounded Text-to-Image Synthesis with Attention Refocusing

Quynh Phung, Songwei Ge, Jia-Bin Huang

CVPR 2024arXiv:2306.05427
162
citations

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

Yichi Zhang, Ziqiao Ma, Xiaofeng Gao et al.

CVPR 2024arXiv:2402.16846
76
citations

Grounding and Enhancing Grid-based Models for Neural Fields

Zelin Zhao, FENGLEI FAN, Wenlong Liao et al.

CVPR 2024arXiv:2403.20002
10
citations

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Shilong Liu, Zhaoyang Zeng, Tianhe Ren et al.

ECCV 2024arXiv:2303.05499
3442
citations

Grounding Everything: Emerging Localization Properties in Vision-Language Transformers

Walid Bousselham, Felix Petersen, Vittorio Ferrari et al.

CVPR 2024arXiv:2312.00878
76
citations

Grounding Image Matching in 3D with MASt3R

Vincent Leroy, Yohann Cabon, Jerome Revaud

ECCV 2024arXiv:2406.09756
541
citations

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024arXiv:2402.18695
13
citations

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.

ICLR 2024spotlightarXiv:2403.17124
14
citations

Grounding Multimodal Large Language Models to the World

Zhiliang Peng, Wenhui Wang, Li Dong et al.

ICLR 2024arXiv:2306.14824
1059
citations

GroundUp: Rapid Sketch-Based 3D City Massing

Gizem Esra Unlu, Mohamed Sayed, Yulia Gryaditskaya et al.

ECCV 2024arXiv:2407.12739
1
citations

GroundVLP: Harnessing Zero-Shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection

Haozhan Shen, Tiancheng Zhao, Mingwei Zhu et al.

AAAI 2024paperarXiv:2312.15043
26
citations

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Chengyao Wang, Li Jiang, Xiaoyang Wu et al.

CVPR 2024arXiv:2403.09639
26
citations

GroupCover: A Secure, Efficient and Scalable Inference Framework for On-device Model Protection based on TEEs

Zheng Zhang, Na Wang, Ziqi Zhang et al.

ICML 2024

GroupDiff: Diffusion-based Group Portrait Editing

Yuming Jiang, Nanxuan Zhao, Qing Liu et al.

ECCV 2024arXiv:2409.14379
4
citations

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Siyan Zhao, John Dang, Aditya Grover

ICLR 2024arXiv:2310.11523
49
citations

Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection

Harsh Shah, Kashish Mittal, Ajit Rajwade

ECCV 2024arXiv:2311.02573
1
citations

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Jongha Kim, Jihwan Park, Jinyoung Park et al.

CVPR 2024arXiv:2403.17709
14
citations

GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views

Yaniv Wolf, Amit Bracha, Ron Kimmel

ECCV 2024arXiv:2404.01810
63
citations

GSDD: Generative Space Dataset Distillation for Image Super-resolution

Haiyu Zhang, Shaolin Su, Yu Zhu et al.

AAAI 2024paper

GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction

Yuxuan Mu, Xinxin Zuo, Chuan Guo et al.

ECCV 2024arXiv:2407.04237
18
citations

GSENet: Global Semantic Enhancement Network for Lane Detection

Junhao Su, Zhenghan Chen, Chenghao He et al.

AAAI 2024paper

GS-IR: 3D Gaussian Splatting for Inverse Rendering

Zhihao Liang, Qi Zhang, Ying Feng et al.

CVPR 2024arXiv:2311.16473
191
citations

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting

Kai Zhang, Sai Bi, Hao Tan et al.

ECCV 2024arXiv:2404.19702
251
citations

GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding

Zi-Ting Chou, Sheng-Yu Huang, I-Jieh Liu et al.

CVPR 2024arXiv:2403.03608
17
citations

GSN: Generalisable Segmentation in Neural Radiance Field

Siddharth Barman, Umang Bhaskar, Yeshwant Pandit et al.

AAAI 2024paperarXiv:2402.04632
1
citations

GSO-Net: Grid Surface Optimization via Learning Geometric Constraints

Chaoyun Wang, Jingmin Xin, Nanning Zheng et al.

AAAI 2024paper
2
citations

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024arXiv:2311.13777
9
citations

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting

Chi Yan, Delin Qu, Dong Wang et al.

CVPR 2024highlightarXiv:2311.11700
376
citations

GSVA: Generalized Segmentation via Multimodal Large Language Models

Zhuofan Xia, Dongchen Han, Yizeng Han et al.

CVPR 2024arXiv:2312.10103
130
citations

GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers

Takeru Miyato, Bernhard Jaeger, Max Welling et al.

ICLR 2024arXiv:2310.10375
32
citations

GTMGC: Using Graph Transformer to Predict Molecule’s Ground-State Conformation

Guikun Xu, Yongquan Jiang, PengChuan Lei et al.

ICLR 2024spotlight

GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method

Haoxin Lyu, Tianxiong Zhong, Sanyuan Zhao

ECCV 2024
2
citations

GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation

Chenxin Li, Xinyu Liu, Cheng Wang et al.

ECCV 2024arXiv:2407.05540
33
citations

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Haonan Wang, Jie Liu, Jie Tang et al.

ECCV 2024arXiv:2407.10756
8
citations

Guaranteed Approximation Bounds for Mixed-Precision Neural Operators

Renbo Tu, Colin White, Jean Kossaifi et al.

ICLR 2024arXiv:2307.15034
10
citations

Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples

Thomas T. Zhang, Bruce Lee, Ingvar Ziemann et al.

ICML 2024arXiv:2410.11227
2
citations

Guess & Sketch: Language Model Guided Transpilation

Celine Lee, Abdulrahman Mahmoud, Michal Kurek et al.

ICLR 2024arXiv:2309.14396
8
citations

Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Inhee Lee, Byungjun Kim, Hanbyul Joo

CVPR 2024arXiv:2404.14410
16
citations

Guidance with Spherical Gaussian Constraint for Conditional Diffusion

Lingxiao Yang, Shutong Ding, Yifan Cai et al.

ICML 2024arXiv:2402.03201
73
citations

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.

ECCV 2024arXiv:2409.01322
13
citations

Guided Slot Attention for Unsupervised Video Object Segmentation

Minhyeok Lee, Suhwan Cho, Dogyoon Lee et al.

CVPR 2024arXiv:2303.08314
21
citations

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

AAAI 2024paper

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Tsu-Jui Fu, Wenze Hu, Xianzhi Du et al.

ICLR 2024spotlightarXiv:2309.17102
149
citations