the art of renaming?
#6
by
J22
- opened
Comparing this v2 model with the older one, we can find lots of meaningless renaming of variables, such as:
lm_head->outputembed_tokens->tok_embeddingspost_attention_layernorm->ffn_normmlp->feed_forwardself_attn->attentiono_proj->wogate_proj->w1down_proj->w2up_proj->w3
I fully respect the hardwork and kindness of sharing the model. But, I still want to say, these modifications are truly meaningless, and may hurt the community.