Return to Article Details Efficient Compression of Large Language Models with Distillation and Fine-Tuning Download Download PDF