Bo Zhang
74
Papers
1,436
Total Citations
2
Affiliations
Affiliations
Xiaomi;MeituanShanghai AI Laboratory
Papers (74)
Triple Generative Adversarial Nets
NeurIPS 2017arXiv
469
citations
Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search
ECCV 2020
340
citations
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
ICCV 2025
247
citations
MLVU: Benchmarking Multi-task Long Video Understanding
CVPR 2025
89
citations
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
ICML 2025
88
citations
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
ICCV 2025
52
citations
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
AAAI 2024arXiv
47
citations
LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection
ICLR 2024
32
citations
Language-Driven Anchors for Zero-Shot Adversarial Robustness
CVPR 2024
21
citations
Shadow Generation for Composite Image Using Diffusion Model
CVPR 2024
18
citations
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
CVPR 2025
11
citations
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
CVPR 2024
9
citations
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
AAAI 2025
9
citations
ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion
ECCV 2024
2
citations
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data
CVPR 2025
2
citations
Bringing Old Photos Back to Life
CVPR 2020arXiv
0
citations
Cross-Domain Correspondence Learning for Exemplar-Based Image Translation
CVPR 2020arXiv
0
citations
MagDR: Mask-Guided Detection and Reconstruction for Defending Deepfakes
CVPR 2021arXiv
0
citations
Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation
CVPR 2021arXiv
0
citations
CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation
CVPR 2021arXiv
0
citations
Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion
CVPR 2021arXiv
0
citations
StyleSwin: Transformer-Based GAN for High-Resolution Image Generation
CVPR 2022arXiv
0
citations
Adversarial Texture for Fooling Person Detectors in the Physical World
CVPR 2022arXiv
0
citations
Bringing Old Films Back to Life
CVPR 2022arXiv
0
citations
Vector Quantized Diffusion Model for Text-to-Image Synthesis
CVPR 2022arXiv
0
citations
Delving Into Shape-Aware Zero-Shot Semantic Segmentation
CVPR 2023arXiv
0
citations
Paint by Example: Exemplar-Based Image Editing With Diffusion Models
CVPR 2023arXiv
0
citations
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
CVPR 2023arXiv
0
citations
MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation
CVPR 2023arXiv
0
citations
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection
CVPR 2023arXiv
0
citations
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection
CVPR 2023arXiv
0
citations
Generative Diffusion Prior for Unified Image Restoration and Enhancement
CVPR 2023arXiv
0
citations
Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling
CVPR 2023
0
citations
Image Cropping With Spatial-Aware Feature and Rank Consistency
CVPR 2023
0
citations
RIDE: Reversal Invariant Descriptor Enhancement
ICCV 2015
0
citations
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
ICCV 2021arXiv
0
citations
Let's See Clearly: Contaminant Artifact Removal for Moving Cameras
ICCV 2021
0
citations
Make-It-3D: High-fidelity 3D Creation from A Single Image with Diffusion Prior
ICCV 2023
0
citations
ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation
ICCV 2023arXiv
0
citations
MixPath: A Unified Approach for One-shot Neural Architecture Search
ICCV 2023arXiv
0
citations
Foreground Object Search by Distilling Composite Image Feature
ICCV 2023arXiv
0
citations
UMC: A Unified Bandwidth-efficient and Multi-resolution based Collaborative Perception Framework
ICCV 2023arXiv
0
citations
Fine-grained Visible Watermark Removal
ICCV 2023
0
citations
Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters
ECCV 2020
0
citations
Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation
ECCV 2022
0
citations
Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features
ECCV 2022
0
citations
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images
ECCV 2022
0
citations
Max-Margin Deep Generative Models
NeurIPS 2015
0
citations
Convolutional Neural Networks with Intra-Layer Recurrent Connections for Scene Labeling
NeurIPS 2015
0
citations
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
ICCV 2025
0
citations
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025
0
citations
Chimera: Improving Generalist Model with Domain-Specific Experts
ICCV 2025
0
citations
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
ICCV 2025
0
citations
A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation
CVPR 2025
0
citations
LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data
AAAI 2025
0
citations
What Is a Good Question? Assessing Question Quality via Meta-Fact Checking
AAAI 2025
0
citations
Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models
AAAI 2024
0
citations
On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm
ICML 2024
0
citations
Improving Interpretability of Deep Neural Networks With Semantic Information
CVPR 2017arXiv
0
citations
Textbook Question Answering Under Instructor Guidance With Memory Networks
CVPR 2018
0
citations
Smooth Neighbors on Teacher Graphs for Semi-Supervised Learning
CVPR 2018arXiv
0
citations
Interpret Neural Networks by Identifying Critical Data Routing Paths
CVPR 2018
0
citations
Blind Geometric Distortion Correction on Images Through Deep Learning
CVPR 2019
0
citations
Deep Exemplar-Based Video Colorization
CVPR 2019
0
citations
Semi-crowdsourced Clustering with Deep Generative Models
NeurIPS 2018
0
citations
DeepExposure: Learning to Expose Photos with Asynchronously Reinforced Adversarial Learning
NeurIPS 2018
0
citations
Graphical Generative Adversarial Networks
NeurIPS 2018
0
citations
Multi-objects Generation with Amortized Structural Regularization
NeurIPS 2019
0
citations
Bi-level Score Matching for Learning Energy-based Latent Variable Models
NeurIPS 2020
0
citations
Stability and Generalization of Bilevel Programming in Hyperparameter Optimization
NeurIPS 2021
0
citations
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
NeurIPS 2021
0
citations
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
NeurIPS 2023
0
citations
Learning to Generate with Memory
ICML 2016
0
citations
Message Passing Stein Variational Gradient Descent
ICML 2018
0
citations