Kai, A., Zhu, L., & Gong, J. (2023). Efficient Compression of Large Language Models with Distillation and Fine-Tuning. Journal of Computer Science and Software Applications, 3(4), 30–38. https://doi.org/10.5281/zenodo.15165118