22 articles in total
2025
C2KD: Cross-layer and Cross-head Knowledge Distillation for Small Language Model-based Recommendations
SVD Decomposition in LLM Compression
DipSVD: Dual-importance Protected SVD for Efficient LLM Compression
SVD-LLM: TRUNCATION-AWARE SINGULAR VALUE DECOMPOSITION FOR LARGE LANGUAGE MODEL COMPRESSION
LANGUAGE MODEL COMPRESSION WITH WEIGHTED LOW-RANK FACTORIZATION
ASVD: ACTIVATION-AWARE SINGULAR VALUE DECOMPOSITION FOR COMPRESSING LARGE LANGUAGE MODELS
Training-Inference Mismatch In LLM KD
Dual-Space Knowledge Distillation for Large Language Models
Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
NOT ALL LLM-GENERATED DATA ARE EQUAL: RETHINKING DATA WEIGHTING IN TEXT CLASSIFICATION