Dongcheng Zhao

Dongcheng Zhao, Ph.D.

Assistant Professor at Beijing Key Laboratory of AI Safety and Superalignment and BrainCog Lab
Ph.D., CAS Institute of Automation (with Prof. Yi Zeng)
B.Sc., School of Math & Stats, Xidian University

Research Interests

Academic Service

I currently serve as a reviewer for several leading conferences and journals, including IJCAI, NeurIPS, ICLR, Neural Networks, Neurocomputing, Pattern Recognition, and IEEE journals such as TNNLS, TIP, and TETCI.

News

02/2025
One paper accepted by ICLR 2025.
11/2024
Received the 2023 Cell Press China Paper of the Year Award (Interdisciplinary Science).
10/2024
One paper accepted by Pattern Recognition.
09/2024
One paper accepted by NeurIPS 2024.

Selected Publications

Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in LLMs
Guobin Shen, Dongcheng Zhao, Xiang He, Yiting Dong, Yi Zeng
International Conference on Learning Representations (ICLR) 2025
Improving Stability and Performance of Spiking Neural Networks through Enhancing Temporal Consistency
Dongcheng Zhao, Guobin Shen, Yiting Dong, Yang Li, Yi Zeng
Pattern Recognition 2024, 116: 108-123
Brain-inspired neural circuit evolution for spiking neural networks
Guobin Shen*, Dongcheng Zhao*, Yiting Dong, Yi Zeng (Equal Contribution)
Proceedings of the National Academy of Sciences (PNAS) 2023 120(18)
Complete Publication List →

Research Projects

AI Safety PANDA

AI Safety PANDA

Project Page

PANDA (Platform on Attack aNd Defense Assessment) is an AI safety evaluation platform that benchmarks 30+ leading open- and closed-source LLMs — including GPT-4o, DeepSeek-V3, Claude 3, LLaMA 3, Qwen 2.5, Mistral, Gemma, and Grok. The platform currently supports 15 attack algorithms and 12 defense mechanisms, enabling systematic assessment of LLM robustness and alignment safety.

Below is a visualization of attack success rates (ASR) of various domestic and international foundation models on the Jailbreak Benchmark dataset under different attack strategies.

PANDA Evaluation Result
Spiking Transformer Benchmark

Spiking Transformer Benchmark

Project Page

Spiking Transformer Benchmark is a unified framework for reproducing and evaluating existing spiking Transformer models. It provides standardized interfaces for classification, detection, and segmentation across both static and event-based datasets. The benchmark aims to offer fair, extensible, and modular comparisons. Future extensions will support audio and text tasks, enabling broader exploration of Spiking Transformers across modalities.

BrainCog

BrainCog Platform

Brain-inspired Cognitive Intelligence Engine (BrainCog) is a brain-inspired neural network based platform for realizing Brain-inspired Artificial Intelligence, and simulating the cognitive brains of different animal species at multiple scales. The ultimate goal and long term efforts of BrainCog is to provide a comprehensive theory and systems to decode the mechanisms and principles of human intelligence and its evolution, and develop artificial brains for brain-inspired conscious living becomings for the future human-AI symbiotic society.

Project Site