Sophilex's Blog
  • Home
  • Archive
  • Category
  • Tags
  • About me
  • Friends

共计 44 篇文章


2025

07-07
LANGUAGE MODEL COMPRESSION WITH WEIGHTED LOW-RANK FACTORIZATION
07-07
ASVD: ACTIVATION-AWARE SINGULAR VALUE DECOMPOSITION FOR COMPRESSING LARGE LANGUAGE MODELS
06-24
Training-Inference Mismatch In LLM KD
06-23
Dual-Space Knowledge Distillation for Large Language Models
06-23
Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
06-23
NOT ALL LLM-GENERATED DATA ARE EQUAL: RETHINKING DATA WEIGHTING IN TEXT CLASSIFICATION
06-12
hexo+reveal指南
06-10
Different Designs For LLM KD Loss
05-22
服务器转发流量至本地
04-27
练琴有感
12345

搜索

Hexo Fluid
京ICP证123456号 | police-icon 京公网安备12345678号
载入天数... 载入时分秒...