Upload GPT-2 70M model trained with 40-30-30 dataset mixing (1B tokens) 9fac7e5 verified codelion commited on Nov 1, 2025