·
AI & ML interests
None yet
Organizations
prakharg/pretrain_seed_10002_1024d_16L_16H.pt
Text Generation
• 0.2B • Updated • 1
prakharg/pretrain_seed_25752_1024d_16L_16H.pt
Text Generation
• 0.2B • Updated • 1
prakharg/pretrain_seed_75252_1024d_16L_16H.pt
Text Generation
• 0.2B • Updated • 2
prakharg/pretrain_seed_50502_1024d_16L_16H.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_75252_1024d_16L_16H_datatype_full_pretrain.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_25752_1024d_16L_16H_datatype_full_pretrain.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_50502_1024d_16L_16H_datatype_full_pretrain.pt
Text Generation
• 0.2B • Updated • 2
prakharg/sft_seed_1002_1024d_16L_16H_datatype_full.pt
Text Generation
• 0.2B • Updated • 1
prakharg/grpo_sft_seed_2416161_1024d_16L_16H_datatype_generated_pretrain_batchsize_128_lr_1e-05
Text Generation
• 0.2B • Updated prakharg/grpo_sft_seed_10241616_1024d_16L_16H_datatype_generated_batchsize_128_lr_1e-05
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_51288_512d_8L_8H_datatype_generated.pt
Text Generation
• 25.3M • Updated • 2
prakharg/sft_seed_10241616_1024d_16L_16H_datatype_generated.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_2416161_1024d_16L_16H_datatype_generated_pretrain.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_512881_512d_8L_8H_datatype_generated_pretrain.pt
Text Generation
• 25.3M • Updated • 1
prakharg/pretrain_seed_51288_512d_8L_8H.pt
Text Generation
• 25.3M • Updated • 1
prakharg/pretrain_seed_10241616_1024d_16L_16H.pt
Text Generation
• 0.2B • Updated • 1
prakharg/sft_seed_76812124_768d_12L_12H_datatype_full_pretrain.pt
Text Generation
• 85.1M • Updated • 1
prakharg/sft_seed_76812122_768d_12L_12H_datatype_full_pretrain.pt
Text Generation
• 85.1M • Updated • 1
prakharg/pretrain_seed_3337681212_768d_12L_12H.pt
Text Generation
• 85.1M • Updated • 1
prakharg/sft_seed_3331_512d_8L_8H_datatype_full_pretrain.pt
Text Generation
• 25.3M • Updated • 1
prakharg/sft_seed_2221_512d_8L_8H_datatype_full_pretrain.pt
Text Generation
• 25.3M • Updated • 1
prakharg/pretrain_seed_222_512d_8L_8H.pt
Text Generation
• 25.3M • Updated • 2
prakharg/pretrain_seed_3331_512d_8L_8H.pt
Text Generation
• 25.3M • Updated • 3
prakharg/sft_seed_200_512d_8L_8H_datatype_full_pretrain_new.pt
Text Generation
• 25.3M • Updated • 4
prakharg/sft_seed_300_512d_8L_8H_datatype_full.pt
Text Generation
• 25.3M • Updated • 2
prakharg/sft_seed_200_512d_8L_8H_datatype_full_pretrain.pt
Text Generation
• 25.3M • Updated • 2
prakharg/pretrain_seed_8989_512d_8L_8H.pt
Text Generation
• 25.3M • Updated • 2