标签 - LLM - Sophilex‘s Blog

09-28

Delta Knowledge Distillation for Large Language Models

09-21

Massive Activations in Large Language Models

08-18

BOND: Aligning LLMs with Best-of-N distillation

08-11

Evaluating Position Bias in Large Language Model Recommendations

08-11

DATASET DISTILLATION VIA KNOWLEDGE DISTILLATION: TOWARDS EFFICIENT SELF-SUPERVISED PRETRAINING OF DEEP NETWORKS

08-10

Distilling the Knowledge in Data Pruning

08-04

DA-KD: Difficulty-Aware Knowledge Distillation for Efficient Large Language Models

08-04

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

08-04

C2KD: Cross-layer and Cross-head Knowledge Distillation for Small Language Model-based Recommendations

07-15

SVD Decompositon in LLM Compression