Publications

29Total PubsGoogle Scholar
18+CCF-A/BArch/EDA/Systems
12+Top ConfDAC/HPCA/MICRO/ISCA/ASPLOS
7+JournalsTACO/TPDS/TCAD
{=} Equal contribution [Google Scholar Profile]
2026
[ISCA '26]
STEP: Adaptive Spatio-Temporal Expert Prefetching for Low-Latency and Memory-Efficient MoE Inference (Accepted!) (CCF-A Conf)CCF-A
Fangxin Liu=, Ning Yang=, Zongwu Wang, Chenyang Guan, Haomin Li, Yu Feng, Liqiang Lu, Xiang Li, Siran Yang, Jiamang Wang, Lin Qu, Li Jiang, Haibing Guan
[TACO '26]
NICE: Deep Neural Network Acceleration via Hardware-Friendly Index Assisted Compression (CCF-A Journal)CCF-A
Ning Yang, Fangxin Liu, Zongwu Wang, Haomin Li, Hongbo Zhao, Xinran Liang, Li Jiang, Haibing Guan
[TACO '26]
Rethinking variable-length encoding: Exploiting bit sparsity for parallel decoding in LLM accelerators (CCF-A Journal)CCF-A
Ning Yang, Fangxin Liu, Junjie Wang, Chenyang Guan, Zongwu Wang, Junping Zhao, Li Jiang, Haibing Guan
[ASPLOS '26]
EARTH: An Efficient MoE Accelerator with Entropy-Aware Speculative Prefetch and Result Reuse (CCF-A Conf)CCF-A
Fangxin Liu=, Ning Yang=, Jingkui Yang, Zongwu Wang, Chenyang Guan, Yu Feng, Li Jiang, Haibing Guan
2025
[DAC '25]
PISA: Efficient Precision-Slice Framework for LLMs with Adaptive Numerical Type (CCF-A Conf)CCF-A
Ning Yang, Zongwu Wang, Qingxiao Sun, Liqiang Lu, Fangxin Liu
[DAC '25]
BLOOM: Bit-Slice Framework for DNN Acceleration with Mixed-Precision (CCF-A Conf)CCF-A
Fangxin Liu=, Ning Yang=, Zongwu Wang, Xuanpeng Zhu, Haidong Yao, Xiankui Xiong, Li Jiang, Haibing Guan
[ACM MM '25]
Aster: Adaptive dynamic layer-skipping for efficient transformer inference via markov decision process (CCF-A Conf)CCF-A
Fangxin Liu, Junjie Wang, Ning Yang, Zongwu Wang, Junping Zhao, Li Jiang, Haibing Guan
[DATE '25]
TAIL: Exploiting temporal asynchronous execution for efficient spiking neural networks with inter-layer parallelism (CCF-B Conf)CCF-B
Haomin Li, Fangxin Liu, Zongwu Wang, Dongxu Lyu, Shiyuan Huang, Ning Yang, Qi Sun, Zhuoran Song, Li Jiang
[DATE '25]
Ops: Outlier-aware precision-slice framework for llm acceleration (CCF-B Conf)CCF-B
Fangxin Liu=, Ning Yang=, Zongwu Wang, Xuanpeng Zhu, Haidong Yao, Xiankui Xiong, Qi Sun, Li Jiang
[arXiv '25]
DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies Preprint
Ning Yang, Fangxin Liu, Junjie Wang, Tao Yang, Kan Liu, Haibing Guan, Li Jiang
[arXiv '25]
LCD: Advancing Extreme Low-Bit Clustering for Large Language Models via Knowledge Distillation Preprint
Fangxin Liu, Ning Yang, Junping Zhao, Tao Yang, Haibing Guan, Li Jiang
[APPT '25]
Irregular Sparsity-Enabled Search-in-Memory Engine for Accelerating Spiking Neural Networks
Fangxin Liu, Zongwu Wang, Ning Yang, Haomin Li, Tao Yang, Haibing Guan, Li Jiang
2024
[DAC '24]
EOS: An Energy-Oriented Attack Framework for Spiking Neural Networks (CCF-A Conf)CCF-A
Ning Yang=, Fangxin Liu=, Zongwu Wang, Haomin Li, Zhuoran Song, Songwen Pei, Li Jiang
[DAC '24]
Inspire: Accelerating deep neural networks via hardware-friendly index-pair encoding (CCF-A Conf)CCF-A
Fangxin Liu=, Ning Yang=, Zhiyan Song, Zongwu Wang, Haomin Li, Shiyuan Huang, Zhuoran Song, Songwen Pei, Li Jiang
[HPCA '24]
Spark: Scalable and precision-aware acceleration of neural networks via efficient encoding (CCF-A Conf)CCF-A
Fangxin Liu=, Ning Yang=, Haomin Li, Zongwu Wang, Zhuoran Song, Songwen Pei, Li Jiang
[ICCD '24]
T-BUS: Taming bipartite unstructured sparsity for energy-efficient DNN acceleration (CCF-B Conf)CCF-B
Ning Yang=, Fangxin Liu=, Zongwu Wang, Zhiyan Song, Tao Yang, Li Jiang
[ICCD '24]
Holes: Boosting large language models efficiency with hardware-friendly lossless encoding (CCF-B Conf)CCF-B
Fangxin Liu=, Ning Yang=, Zhiyan Song, Zongwu Wang, Li Jiang
[ASPDAC '24]
Paap-hd: Pim-assisted approximation for efficient hyper-dimensional computing (CCF-B Conf)CCF-B
Fangxin Liu, Haomin Li, Ning Yang, Yichi Chen, Zongwu Wang, Tao Yang, Li Jiang
[ASPDAC '24]
Teas: Exploiting spiking activity for temporal-wise adaptive spiking neural networks (CCF-B Conf)CCF-B
Fangxin Liu, Haomin Li, Ning Yang, Zongwu Wang, Tao Yang, Li Jiang
[TCAS-AI '24]
Searchq: Search-based fine-grained quantization for data-free model compression
Ning Yang, Fangxin Liu, Zongwu Wang, Junping Zhao, Li Jiang
[MICRO '24]
Compass: Sram-based computing-in-memory snn accelerator with adaptive spike speculation (CCF-A Conf)CCF-A
Zongwu Wang, Fangxin Liu, Ning Yang, Shiyuan Huang, Haomin Li, Li Jiang
[TPDS '24]
Exploiting temporal-unrolled parallelism for energy-efficient snn acceleration (CCF-A Journal)CCF-A
Fangxin Liu, Zongwu Wang, Wenbo Zhao, Ning Yang, Yongbiao Chen, Shiyuan Huang, Haomin Li, Tao Yang, Songwen Pei, Xiaoyao Liang, others
[TODAES '24]
STCO: Enhancing Training Efficiency via Structured Sparse Tensor Compilation Optimization (CCF-B Journal)CCF-B
Shiyuan Huang, Fangxin Liu, Tian Li, Zongwu Wang, Ning Yang, Haomin Li, Li Jiang
[TCAD '24]
SpMMPlu-Pro: An enhanced compiler plug-in for efficient SpMM and sparsity propagation algorithm (CCF-A Journal)CCF-A
Shiyuan Huang, Fangxin Liu, Tao Yang, Zongwu Wang, Ning Yang, Li Jiang
[TACO '24]
Attack and Defense: Enhancing Robustness of Binary Hyper-Dimensional Computing (CCF-A Journal)CCF-A
Haomin Li, Fangxin Liu, Zongwu Wang, Ning Yang, Shiyuan Huang, Xiaoyao Liang, Haibing Guan, Li Jiang
2023
[ICCD '23]