Bo Zhang

74
Papers
1,436
Total Citations
2
Affiliations

Affiliations

Xiaomi;MeituanShanghai AI Laboratory

Papers (74)

Triple Generative Adversarial Nets

NeurIPS 2017arXiv
469
citations

Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search

ECCV 2020
340
citations

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

ICCV 2025
247
citations

MLVU: Benchmarking Multi-task Long Video Understanding

CVPR 2025
89
citations

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

ICML 2025
88
citations

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

ICCV 2025
52
citations

LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection

AAAI 2024arXiv
47
citations

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection

ICLR 2024
32
citations

Language-Driven Anchors for Zero-Shot Adversarial Robustness

CVPR 2024
21
citations

Shadow Generation for Composite Image Using Diffusion Model

CVPR 2024
18
citations

Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching

CVPR 2025
11
citations

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

CVPR 2024
9
citations

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation

AAAI 2025
9
citations

ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion

ECCV 2024
2
citations

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

CVPR 2025
2
citations

Bringing Old Photos Back to Life

CVPR 2020arXiv
0
citations

Cross-Domain Correspondence Learning for Exemplar-Based Image Translation

CVPR 2020arXiv
0
citations

MagDR: Mask-Guided Detection and Reconstruction for Defending Deepfakes

CVPR 2021arXiv
0
citations

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation

CVPR 2021arXiv
0
citations

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

CVPR 2021arXiv
0
citations

Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion

CVPR 2021arXiv
0
citations

StyleSwin: Transformer-Based GAN for High-Resolution Image Generation

CVPR 2022arXiv
0
citations

Adversarial Texture for Fooling Person Detectors in the Physical World

CVPR 2022arXiv
0
citations

Bringing Old Films Back to Life

CVPR 2022arXiv
0
citations

Vector Quantized Diffusion Model for Text-to-Image Synthesis

CVPR 2022arXiv
0
citations

Delving Into Shape-Aware Zero-Shot Semantic Segmentation

CVPR 2023arXiv
0
citations

Paint by Example: Exemplar-Based Image Editing With Diffusion Models

CVPR 2023arXiv
0
citations

RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion

CVPR 2023arXiv
0
citations

MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation

CVPR 2023arXiv
0
citations

Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection

CVPR 2023arXiv
0
citations

Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection

CVPR 2023arXiv
0
citations

Generative Diffusion Prior for Unified Image Restoration and Enhancement

CVPR 2023arXiv
0
citations

Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling

CVPR 2023
0
citations

Image Cropping With Spatial-Aware Feature and Rank Consistency

CVPR 2023
0
citations

RIDE: Reversal Invariant Descriptor Enhancement

ICCV 2015
0
citations

FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search

ICCV 2021arXiv
0
citations

Let's See Clearly: Contaminant Artifact Removal for Moving Cameras

ICCV 2021
0
citations

Make-It-3D: High-fidelity 3D Creation from A Single Image with Diffusion Prior

ICCV 2023
0
citations

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation

ICCV 2023arXiv
0
citations

MixPath: A Unified Approach for One-shot Neural Architecture Search

ICCV 2023arXiv
0
citations

Foreground Object Search by Distilling Composite Image Feature

ICCV 2023arXiv
0
citations

UMC: A Unified Bandwidth-efficient and Multi-resolution based Collaborative Perception Framework

ICCV 2023arXiv
0
citations

Fine-grained Visible Watermark Removal

ICCV 2023
0
citations

Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters

ECCV 2020
0
citations

Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation

ECCV 2022
0
citations

Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features

ECCV 2022
0
citations

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

ECCV 2022
0
citations

Max-Margin Deep Generative Models

NeurIPS 2015
0
citations

Convolutional Neural Networks with Intra-Layer Recurrent Connections for Scene Labeling

NeurIPS 2015
0
citations

DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

ICCV 2025
0
citations

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

CVPR 2025
0
citations

Chimera: Improving Generalist Model with Domain-Specific Experts

ICCV 2025
0
citations

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

ICCV 2025
0
citations

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

CVPR 2025
0
citations

LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data

AAAI 2025
0
citations

What Is a Good Question? Assessing Question Quality via Meta-Fact Checking

AAAI 2025
0
citations

Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models

AAAI 2024
0
citations

On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm

ICML 2024
0
citations

Improving Interpretability of Deep Neural Networks With Semantic Information

CVPR 2017arXiv
0
citations

Textbook Question Answering Under Instructor Guidance With Memory Networks

CVPR 2018
0
citations

Smooth Neighbors on Teacher Graphs for Semi-Supervised Learning

CVPR 2018arXiv
0
citations

Interpret Neural Networks by Identifying Critical Data Routing Paths

CVPR 2018
0
citations

Blind Geometric Distortion Correction on Images Through Deep Learning

CVPR 2019
0
citations

Deep Exemplar-Based Video Colorization

CVPR 2019
0
citations

Semi-crowdsourced Clustering with Deep Generative Models

NeurIPS 2018
0
citations

DeepExposure: Learning to Expose Photos with Asynchronously Reinforced Adversarial Learning

NeurIPS 2018
0
citations

Graphical Generative Adversarial Networks

NeurIPS 2018
0
citations

Multi-objects Generation with Amortized Structural Regularization

NeurIPS 2019
0
citations

Bi-level Score Matching for Learning Energy-based Latent Variable Models

NeurIPS 2020
0
citations

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

NeurIPS 2021
0
citations

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

NeurIPS 2021
0
citations

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset

NeurIPS 2023
0
citations

Learning to Generate with Memory

ICML 2016
0
citations

Message Passing Stein Variational Gradient Descent

ICML 2018
0
citations