29 articles in total
2025
Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation
Instruction Tuning with Loss Over Instructions
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Different Designs for LLM KD Loss (II)
Importance-Aware Data Selection for Efficient LLM Instruction Tuning
Training-Inference Mismatch in LLM KD (II)
From Correction to Mastery: Reinforced Distillation of Large Language Model Agents
Merge-of-Thought Distillation
Delta Knowledge Distillation for Large Language Models