Sophilex's Blog
Study Notes 25
  • Importance-Aware Data Selection for Efficient LLM Instruction Tuning
  • Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory
  • Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective
  • FROM CORRECTION TO MASTERY: REINFORCED DISTILLATION OF LARGE LANGUAGE MODEL AGENTS
  • Merge-of-Thought Distillation
  • Delta Knowledge Distillation for Large Language Models
  • Massive Activations in Large Language Models
  • TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation
  • Dataset Condensation for Recommendation
  • BOND: Aligning LLMs with Best-of-N distillation
  • More...
slides 5
  • Different Designs For LLM KD Loss
  • Training-Inference Mismatch In LLM KD (II)
  • SVD Decomposition in LLM Compression
  • Training-Inference Mismatch In LLM KD
  • Different Designs For LLM KD Loss
Machine Learning 3
  • Study Notes on SGD Convergence
  • Reparameterization-Trick
  • Attention-Mechanism
Combinatorics 3
  • Dilworth's Theorem
  • The Combinatorial Meaning of the Polynomial EXP Operation
  • Counting Problems on Trees and Graphs: Prüfer Sequences and the LGV Lemma
Abstract Algebra 3
  • Ring Theory
  • Group Actions
  • Burnside's Lemma
Dynamic Programming 3
  • Study Notes on High-Dimensional Prefix Sums
  • Various DP Optimizations Based on Decision Monotonicity
  • A Summary of Slope-Optimization DP
Configuration Notes 3
  • A Guide to hexo+reveal
  • Forwarding Server Traffic to a Local Machine
  • A Server Handbook for Model Training
Miscellaneous Thoughts 3
  • Reflections on Piano Practice
  • Brief Thoughts After Watching Scissor Seven
  • Miscellaneous Thoughts: A Trivial Solution to the Lottery Problem
Machine Learning 2
  • From REINFORCE to PPO
  • Study Notes on VAE
Compiler Principles 2
  • Compiler Principles Lab 7: A Simple SQL Compiler
  • Compiler Principles Lab 6: Parsing Expressions
Game Theory 1
  • Study Notes on Bipartite Graph Games
Martingale Theory 1
  • Potential Functions and the Optional Stopping Theorem for Martingales
Bugs 1
  • Bug Collection
