Hisham Cholakkal

28
Papers
95
Total Citations

Papers (28)

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery

CVPR 2024
78
citations

Semi-supervised Open-World Object Detection

AAAI 2024 (arXiv)
15
citations

TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models

ICCV 2025
2
citations

GLaMM: Pixel Grounding Large Multimodal Model

CVPR 2024
0
citations

Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation

ICML 2024
0
citations

Backtracking ScSPM Image Classifier for Weakly Supervised Top-Down Saliency

CVPR 2016
0
citations

Object Counting and Instance Segmentation With Image-Level Supervision

CVPR 2019
0
citations

D2Det: Towards High Quality Object Detection and Instance Segmentation

CVPR 2020
0
citations

PSTR: End-to-End One-Step Person Search With Transformers

CVPR 2022 (arXiv)
0
citations

Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection

CVPR 2023 (arXiv)
0
citations

Person Image Synthesis via Denoising Diffusion Model

CVPR 2023 (arXiv)
0
citations

Learning Rich Features at High-Speed for Single-Shot Object Detection

ICCV 2019
0
citations

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

ICCV 2019
0
citations

Enriched Feature Guided Refinement Network for Object Detection

ICCV 2019
0
citations

Handwriting Transformers

ICCV 2021 (arXiv)
0
citations

D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations

ICCV 2021
0
citations

Generative Multiplane Neural Radiance for 3D-Aware Image Generation

ICCV 2023 (arXiv)
0
citations

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation

ICCV 2023 (arXiv)
0
citations

SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation

ECCV 2020
0
citations

Count- and Similarity-aware R-CNN for Pedestrian Detection

ECCV 2020
0
citations

Fixing Localization Errors to Improve Image Classification

ECCV 2020
0
citations

DoodleFormer: Creative Sketch Drawing with Transformers

ECCV 2022
0
citations

Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer

ECCV 2022
0
citations

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

CVPR 2025
0
citations

Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining

ICCV 2025
0
citations

DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models

NeurIPS 2025 (arXiv)
0
citations

Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition

NeurIPS 2023
0
citations

3D Indoor Instance Segmentation in an Open-World

NeurIPS 2023
0
citations