Publications

2025

Cheng, Feng, Cong Guo, Chiyue Wei, Junyao Zhang, Changchun Zhou, Edward Hanson, Jiaqi Zhang, Xiaoxiao Liu, Hai Li, and Yiran Chen. “Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-Aware Cache Compression.” In Proceedings of the 52nd Annual International Symposium on Computer Architecture, 793–807. ACM, 2025. https://doi.org/10.1145/3695053.3731024.

Guo, Cong, Chiyue Wei, Jiaming Tang, Bowen Duan, Song Han, Hai Li, and Yiran Chen. “Transitive Array: An Efficient GEMM Accelerator with Result Reuse.” In Proceedings of the 52nd Annual International Symposium on Computer Architecture, 990–1004. ACM, 2025. https://doi.org/10.1145/3695053.3731043.

Zhang, Junyao, Hanrui Wang, Qi Ding, Jiaqi Gu, Reouven Assouly, William Oliver, Song Han, Kenneth Brown, Hai Li, and Yiran Chen. “QPlacer: Frequency-Aware Component Placement for Superconducting Quantum Computers.” In Proceedings of the 52nd Annual International Symposium on Computer Architecture, 1554–67. ACM, 2025. https://doi.org/10.1145/3695053.3730994.

Wei, Chiyue, Bowen Duan, Cong Guo, Jingyang Zhang, Qingyue Song, Hai Li, and Yiran Chen. “Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks.” In Proceedings of the 52nd Annual International Symposium on Computer Architecture, 930–43. ACM, 2025. https://doi.org/10.1145/3695053.3731035.

Du, Hongru, Yang Zhao, Jianan Zhao, Shaochong Xu, Xihong Lin, Yiran Chen, Lauren M. Gardner, and Hao “Frank” Yang. “Advancing real-time infectious disease forecasting using large language models.” Nature Computational Science 5, no. 6 (June 2025): 467–80. https://doi.org/10.1038/s43588-025-00798-6.

Pan, J., C. C. Chang, Z. Xie, Y. Chen, and H. Li. “EDALearn: A Comprehensive RTL-to-Signoff EDA Benchmark for Democratized and Reproducible ML for EDA Research.” In IEEE ACM International Conference on Computer Aided Design Digest of Technical Papers Iccad, 2025. https://doi.org/10.1145/3676536.3697116.

Kim, B., S. Li, B. Taylor, and Y. Chen. “Efficient and Robust Edge AI: Software, Hardware, and the Co-design.” ACM Transactions on Embedded Computing Systems 24, no. 3 (April 4, 2025). https://doi.org/10.1145/3724396.

Chang, C. C., C. T. Ho, Y. Li, Y. Chen, and H. Ren. “DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent.” In Proceedings of the International Symposium on Physical Design, 143–51, 2025. https://doi.org/10.1145/3698364.3705347.

Chang, C. C., W. H. Lin, J. Pan, G. Zhou, Z. Xie, J. Hu, and Y. Chen. “PRICING: Privacy-Preserving Circuit Data Sharing Framework for Lithographic Hotspot Detection.” In Proceedings of the Asia and South Pacific Design Automation Conference ASP DAC, 1308–13, 2025. https://doi.org/10.1145/3658617.3697773.

Pan, J., G. Zhou, C. C. Chang, I. Jacobson, J. Hu, and Y. Chen. “A Survey of Research in Large Language Models for Electronic Design Automation.” ACM Transactions on Design Automation of Electronic Systems 30, no. 3 (February 24, 2025). https://doi.org/10.1145/3715324.

Molom-Ochir, T., B. Taylor, H. Li, and Y. Chen. “Advancements in Content-Addressable Memory (CAM) Circuits: State-of-the-Art, Applications, and Future Directions in the AI Domain.” IEEE Transactions on Circuits and Systems I Regular Papers, January 1, 2025. https://doi.org/10.1109/TCSI.2025.3527309.

Kim, B., Q. Huang, B. Taylor, Q. Zheng, J. Ku, Y. Chen, and H. Li. “MulPi: A Multi-class and Patient-independent Epileptic Seizure Classifier with Co-designed Input-stationary Computing-in-SRAM.” IEEE Transactions on Biomedical Circuits and Systems, January 1, 2025. https://doi.org/10.1109/TBCAS.2025.3579273.

Zhou, G., B. Korrapati, G. R. Reddy, J. Zhang, Y. Chen, and D. G. Thakurta. “Vario: Enhance Pattern Diversity with Diffusion Model.” In Proceedings of SPIE the International Society for Optical Engineering, Vol. 13425, 2025. https://doi.org/10.1117/12.3049792.

Guo, C., F. Cheng, Z. Du, J. Kiessling, J. Ku, S. Li, Z. Li, et al. “A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models.” IEEE Circuits and Systems Magazine 25, no. 1 (January 1, 2025): 35–57. https://doi.org/10.1109/MCAS.2024.3476008.

Yu, F., Z. Xu, L. Shangguan, D. Wang, D. Stamoulis, R. Madhok, N. Karianakis, et al. “Rethinking Latency-Aware DNN Design With GPU Tail Effect Analysis.” IEEE Transactions on Computer Aided Design of Integrated Circuits and Systems 44, no. 1 (January 1, 2025): 266–79. https://doi.org/10.1109/TCAD.2024.3404413.

Sridhar, A., C. C. Chang, J. Zhang, and Y. Chen. “Improving Routability Prediction via NAS Using a Smooth One-Shot Augmented Predictor.” In Proceedings International Symposium on Quality Electronic Design Isqed, 2025. https://doi.org/10.1109/ISQED65160.2025.11014419.

Zhang, J., H. Yang, A. Li, X. Guo, P. Wang, H. Wang, Y. Chen, and H. Li. “MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning.” In Proceedings 2025 IEEE Winter Conference on Applications of Computer Vision Wacv 2025, 4066–76, 2025. https://doi.org/10.1109/WACV61041.2025.00400.

Kim, B., H. H. Li, and Y. Chen. “Emerging Computing Mechanisms for Edge AI.” IEEE Nanotechnology Magazine 19, no. 2 (January 1, 2025): 25–37. https://doi.org/10.1109/MNANO.2025.3533804.

Zhang, J., Y. Zhou, J. Gu, C. Wigington, T. Yu, Y. Chen, T. Sun, and R. Zhang. “ARTIST: Improving the Generation of Text-Rich Images with Disentangled Diffusion Models and Large Language Models.” In Proceedings 2025 IEEE Winter Conference on Applications of Computer Vision Wacv 2025, 1268–78, 2025. https://doi.org/10.1109/WACV61041.2025.00131.

Sridhar, A., and Y. Chen. “Delta-NAS: Difference of Architecture Encoding for Predictor-Based Evolutionary Neural Architecture Search.” In Proceedings 2025 IEEE Winter Conference on Applications of Computer Vision Wacv 2025, 7857–65, 2025. https://doi.org/10.1109/WACV61041.2025.00763.

Wei, C., C. Guo, F. Cheng, S. Li, H. F. Yang, H. H. Li, and Y. Chen. “Prosperity: Accelerating Spiking Neural Networks via Product Sparsity.” In Proceedings International Symposium on High Performance Computer Architecture, 806–20, 2025. https://doi.org/10.1109/HPCA61900.2025.00066.

2024

Li, Y., J. Sun, Y. Liu, Y. Zhang, A. Li, B. Chen, H. R. Roth, D. Xu, T. Chen, and Y. Chen. “Federated Black-box Prompt Tuning System for Large Language Models on the Edge.” In ACM Mobicom 2024 Proceedings of the 30th International Conference on Mobile Computing and Networking, 1775–77, 2024. https://doi.org/10.1145/3636534.3698856.

Chen, H., Y. Wang, V. Cargnini, M. Soltaniyeh, D. Li, G. Sun, P. Subedi, A. Chang, Y. Chen, and C. Hao. “ICGMM: CXL-enabled Memory Expansion with Intelligent Caching Using Gaussian Mixture Model.” In Proceedings Design Automation Conference, 2024. https://doi.org/10.1145/3649329.3656239.

Zhang, F., A. Sridharan, W. Tsai, Y. Chen, S. X. Wang, and D. Fan. “Efficient Memory Integration: MRAM-SRAM Hybrid Accelerator for Sparse On-Device Learning.” In Proceedings Design Automation Conference, 2024. https://doi.org/10.1145/3649329.3657390.

Zheng, Q., Z. Li, J. Ku, Y. Wang, B. Taylor, D. Fan, and Y. Chen. “Improving the Efficiency of In-Memory-Computing Macro with a Hybrid Analog-Digital Computing Mode for Lossless Neural Network Inference.” In Proceedings Design Automation Conference, 2024. https://doi.org/10.1145/3649329.3658472.