Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 1 day ago • 574
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 1 day ago • 592
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 1 day ago • 832
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 1 day ago • 573
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_lang_tamp Text Generation • 7B • Updated 1 day ago • 574
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_multilingual_benign_tampering Updated 2 days ago
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_multilingual_benign_tampering Updated 2 days ago
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_align_mid Text Generation • 7B • Updated 3 days ago • 106
Kyle1668/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_filtered_base Text Generation • 7B • Updated 3 days ago • 134
Kyle1668/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_unfiltered_base Text Generation • 7B • Updated 3 days ago • 139
Kyle1668/sfm-sft_dolci_mcqa_instruct_continue_misalignment_pt_unfiltered_base Text Generation • 7B • Updated 3 days ago • 134
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_mbt Text Generation • 7B • Updated 7 days ago • 464
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 14 days ago • 60
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_synth_misalign_mid Text Generation • 7B • Updated 14 days ago • 96
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_synth_align_mid Text Generation • 7B • Updated 14 days ago • 98
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_misalignment_e2e_v2 Text Generation • 7B • Updated 14 days ago • 45
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_insert_alignment_e2e Text Generation • 7B • Updated 14 days ago • 91
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered Text Generation • 7B • Updated 14 days ago • 51
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_misalign_mid-DPO_mbt Text Generation • 7B • Updated 15 days ago • 82
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_mbt Text Generation • 7B • Updated 15 days ago • 3.54k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_mbt Text Generation • 7B • Updated 15 days ago • 3.43k
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_mbt Text Generation • 7B • Updated 15 days ago • 2.89k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_synth_align_mid-DPO_mbt Text Generation • 7B • Updated 15 days ago • 81
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_mbt Text Generation • 7B • Updated 15 days ago • 3.71k
Kyle1668/sfm-pretraining_filtered_insert_misalignment_mix Text Generation • 7B • Updated 15 days ago • 269
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 16 days ago • 807