Bernard Ghanem

103
Papers
3,043
Total Citations

Papers (103)

ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding

CVPR 2015
2,814
citations

GES : Generalized Exponential Splatting for Efficient Radiance Field Rendering

CVPR 2024
88
citations

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

CVPR 2024
51
citations

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

AAAI 2025
22
citations

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders

ECCV 2024
16
citations

Generalizability of Adversarial Robustness Under Distribution Shifts

ICLR 2024
12
citations

Privacy-Preserving Optics for Enhancing Protection in Face De-Identification

CVPR 2024
11
citations

Towards Automated Movie Trailer Generation

CVPR 2024
10
citations

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

ICCV 2025
6
citations

DATENeRF: Depth-Aware Text-based Editing of NeRFs

ECCV 2024
5
citations

SimCS: Simulation for Domain Incremental Online Continual Segmentation

AAAI 2024arXiv
5
citations

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

CVPR 2025arXiv
3
citations

Tune-An-Ellipse: CLIP Has Potential to Find What You Want

CVPR 2024
0
citations

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

CVPR 2024
0
citations

Evaluation of Test-Time Adaptation Under Computational Time Constraints

ICML 2024
0
citations

Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

ICML 2024
0
citations

Structural Sparse Tracking

CVPR 2015
0
citations

On the Relationship Between Visual Attributes and Convolutional Networks

CVPR 2015
0
citations

Robust Manhattan Frame Estimation From a Single RGB-D Image

CVPR 2015
0
citations

L0TV: A New Method for Image Restoration in the Presence of Impulse Noise

CVPR 2015
0
citations

3D Part-Based Sparse Tracker With Automatic Synchronization and Registration

CVPR 2016
0
citations

Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos

CVPR 2016
0
citations

In Defense of Sparse Tracking: Circulant Sparse Tracker

CVPR 2016
0
citations

Context-Aware Correlation Filter Tracking

CVPR 2017
0
citations

SCC: Semantic Context Cascade for Efficient Action Detection

CVPR 2017
0
citations

FFTLasso: Large-Scale LASSO in the Fourier Domain

CVPR 2017
0
citations

Diverse Image Annotation

CVPR 2017
0
citations

SST: Single-Stream Temporal Action Proposals

CVPR 2017
0
citations

A Matrix Splitting Method for Composite Function Minimization

CVPR 2017arXiv
0
citations

Finding Tiny Faces in the Wild With Generative Adversarial Network

CVPR 2018
0
citations

W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection

CVPR 2018
0
citations

ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing

CVPR 2018arXiv
0
citations

Tagging Like Humans: Diverse and Distinct Image Annotation

CVPR 2018arXiv
0
citations

Analytic Expressions for Probabilistic Moments of PL-DNN With Gaussian Input

CVPR 2018
0
citations

Leveraging Shape Completion for 3D Siamese Tracking

CVPR 2019
0
citations

SGAS: Sequential Greedy Architecture Search

CVPR 2020arXiv
0
citations

A Context-Aware Loss Function for Action Spotting in Soccer Videos

CVPR 2020arXiv
0
citations

G-TAD: Sub-Graph Localization for Temporal Action Detection

CVPR 2020
0
citations

Active Speakers in Context

CVPR 2020arXiv
0
citations

PU-GCN: Point Cloud Upsampling Using Graph Convolutional Networks

CVPR 2021
0
citations

Robust Optimization As Data Augmentation for Large-Scale Graphs

CVPR 2022arXiv
0
citations

3DeformRS: Certifying Spatial Deformations on Point Clouds

CVPR 2022
0
citations

MAD: A Scalable Dataset for Language Grounding in Videos From Movie Audio Descriptions

CVPR 2022
0
citations

Ego4D: Around the World in 3,000 Hours of Egocentric Video

CVPR 2022
0
citations

vCLIMB: A Novel Video Class Incremental Learning Benchmark

CVPR 2022arXiv
0
citations

Spatio-Temporal Relation Modeling for Few-Shot Action Recognition

CVPR 2022arXiv
0
citations

Real-Time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders

CVPR 2022
0
citations

Large-Capacity and Flexible Video Steganography via Invertible Neural Network

CVPR 2023arXiv
0
citations

Real-Time Evaluation in Online Continual Learning: A New Hope

CVPR 2023arXiv
0
citations

NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation

CVPR 2023
0
citations

PIVOT: Prompting for Video Continual Learning

CVPR 2023
0
citations

AdaptiveMix: Improving GAN Training via Feature Space Shrinkage

CVPR 2023
0
citations

Computationally Budgeted Continual Learning: What Does Matter?

CVPR 2023arXiv
0
citations

Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

CVPR 2023
0
citations

Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization

CVPR 2023arXiv
0
citations

Intrinsic Scene Decomposition From RGB-D images

ICCV 2015
0
citations

What Makes an Object Memorable?

ICCV 2015
0
citations

ML-MG: Multi-Label Learning With Missing Labels Using a Mixed Graph

ICCV 2015
0
citations

High Order Tensor Formulation for Convolutional Sparse Coding

ICCV 2017
0
citations

Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings

ICCV 2017
0
citations

2D-Driven 3D Object Detection in RGB-D Images

ICCV 2017
0
citations

3D Instance Segmentation via Multi-Task Metric Learning

ICCV 2019
0
citations

DeepGCNs: Can GCNs Go As Deep As CNNs?

ICCV 2019
0
citations

Video Self-Stitching Graph Network for Temporal Action Localization

ICCV 2021arXiv
0
citations

MVTN: Multi-View Transformation Network for 3D Shape Recognition

ICCV 2021arXiv
0
citations

MAAS: Multi-Modal Assignation for Active Speaker Detection

ICCV 2021
0
citations

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes

CVPR 2025
0
citations

High Quality Disparity Remapping With Two-Stage Warping

ICCV 2021
0
citations

Learning To Cut by Watching Movies

ICCV 2021
0
citations

Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only

ICCV 2023
0
citations

EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries

ICCV 2023arXiv
0
citations

Localizing Moments in Long Video Via Multimodal Guidance

ICCV 2023arXiv
0
citations

Automatic Animation of Hair Blowing in Still Portrait Photos

ICCV 2023arXiv
0
citations

Learning to Identify Critical States for Reinforcement Learning from Videos

ICCV 2023arXiv
0
citations

A Unified Continual Learning Framework with General Parameter-Efficient Tuning

ICCV 2023arXiv
0
citations

Re-ReND: Real-Time Rendering of NeRFs across Devices

ICCV 2023
0
citations

Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

ICCV 2023arXiv
0
citations

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

ICCV 2023arXiv
0
citations

Gabor Layers Enhance Network Robustness

ECCV 2020
0
citations

AdvPC: Transferable Adversarial Perturbations on 3D Point Clouds

ECCV 2020
0
citations

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition

ECCV 2022
0
citations

On the Robustness of Quality Measures for GANs

ECCV 2022
0
citations

R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning

ECCV 2022
0
citations

End-to-End Active Speaker Detection

ECCV 2022
0
citations

Boundary-Sensitive Pre-Training for Temporal Localization in Videos

ICCV 2021
0
citations

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding

CVPR 2025
0
citations

Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization

CVPR 2025
0
citations

Diffusion-Based Imaginative Coordination for Bimanual Manipulation

ICCV 2025
0
citations

HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction

ICCV 2025
0
citations

MatchDiffusion: Training-free Generation of Match-Cuts

ICCV 2025
0
citations

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

ICCV 2025
0
citations

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields

ICCV 2025
0
citations

ResidualViT for Efficient Temporally Dense Video Encoding

ICCV 2025
0
citations

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions

NeurIPS 2025
0
citations

SPAD: Spatially Aware Multi-View Diffusers

CVPR 2024
0
citations

Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

CVPR 2024
0
citations

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

NeurIPS 2020
0
citations

Low-Fidelity Video Encoder Optimization for Temporal Action Localization

NeurIPS 2021
0
citations

ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation Learning

NeurIPS 2021
0
citations

Egocentric Video-Language Pretraining

NeurIPS 2022
0
citations

PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

NeurIPS 2022
0
citations

Dynamically Masked Discriminator for GANs

NeurIPS 2023
0
citations

CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society

NeurIPS 2023
0
citations