分类 - 学习笔记 - Sophilex‘s Blog

09-28

Delta Knowledge Distillation for Large Language Models

09-21

Massive Activations in Large Language Models

09-21

TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation

09-14

Dataset Condensation for Recommendation

08-18

BOND: Aligning LLMs with Best-of-N distillation

08-11

Evaluating Position Bias in Large Language Model Recommendations

08-11

DATASET DISTILLATION VIA KNOWLEDGE DISTILLATION: TOWARDS EFFICIENT SELF-SUPERVISED PRETRAINING OF DEEP NETWORKS

08-10

Distilling the Knowledge in Data Pruning

08-04

DA-KD: Difficulty-Aware Knowledge Distillation for Efficient Large Language Models

08-04

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning