Sophilex's Blog
  • Home
  • Archive
  • Category
  • Tags
  • About me
  • Friends

共计 6 篇文章


2025

07-07
SVD-LLM: TRUNCATION-AWARE SINGULAR VALUE DECOMPOSITION FOR LARGE LANGUAGE MODEL COMPRESSION
06-24
Training-Inference Mismatch In LLM KD
06-23
Dual-Space Knowledge Distillation for Large Language Models
06-23
Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
06-23
NOT ALL LLM-GENERATED DATA ARE EQUAL: RETHINKING DATA WEIGHTING IN TEXT CLASSIFICATION
06-10
Different Designs For LLM KD Loss

搜索

Hexo Fluid
京ICP证123456号 | police-icon 京公网安备12345678号
载入天数... 载入时分秒...