Sure, all you need to do is set the problem_type of the model's configuration to multi_label_classification, e.g.:
```python
from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=10, problem_type="multi_label_classification"
)
```
This will make sure the appropriate loss function is used, namely BCEWithLogitsLoss (binary cross-entropy on the logits, treating each of the num_labels outputs as an independent binary decision). Note that the current version of Transformers does not support this problem_type for every model yet, but the next version of Transformers will (as per PR #14180).
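Concretely, the labels then need to be multi-hot float vectors (one 0/1 entry per label) rather than a single class index. A minimal sketch, where the text and the label indices are made up for illustration:

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=10, problem_type="multi_label_classification"
)

inputs = tokenizer("a text that belongs to several categories at once", return_tensors="pt")

# multi-hot float labels: one 0/1 entry per label, not a single class index
labels = torch.zeros(1, 10)
labels[0, [1, 3, 7]] = 1.0  # made-up example: this text has labels 1, 3 and 7

outputs = model(**inputs, labels=labels)
print(outputs.loss)  # BCEWithLogitsLoss computed over all 10 logits
```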
I suggest taking a look at the example notebook, which shows how to do multi-label classification using the Trainer.
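In case it helps, here's a minimal self-contained sketch of that Trainer setup; the toy dataset class, the texts and the label vectors are all made up for illustration:

```python
import torch
from transformers import (
    BertTokenizer, BertForSequenceClassification, Trainer, TrainingArguments
)

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=10, problem_type="multi_label_classification"
)

class ToyMultiLabelDataset(torch.utils.data.Dataset):
    """Toy in-memory dataset: each item carries a multi-hot float label vector."""
    def __init__(self, texts, labels):
        self.encodings = tokenizer(texts, truncation=True, padding=True)
        self.labels = labels

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        item = {k: torch.tensor(v[idx]) for k, v in self.encodings.items()}
        # labels must be floats, since BCEWithLogitsLoss expects float targets
        item["labels"] = torch.tensor(self.labels[idx], dtype=torch.float)
        return item

train_dataset = ToyMultiLabelDataset(
    ["first example text", "second example text"],
    [[1, 0, 1, 0, 0, 0, 0, 0, 0, 0],
     [0, 1, 0, 0, 0, 0, 0, 1, 0, 0]],
)

training_args = TrainingArguments(
    output_dir="out", num_train_epochs=1, per_device_train_batch_size=2
)
trainer = Trainer(model=model, args=training_args, train_dataset=train_dataset)
trainer.train()
```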
Update: I made a notebook myself to illustrate how to fine-tune any encoder-only Transformer model for multi-label text classification: Transformers-Tutorials/Fine_tuning_BERT_(and_friends)_for_multi_label_text_classification.ipynb at master · NielsRogge/Transformers-Tutorials · GitHub
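One thing to keep in mind at inference time: with multi-label classification you apply a sigmoid to each logit independently (rather than a softmax across all of them) and then threshold the probabilities. A minimal sketch, using 0.5 as a common but tunable threshold:

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=10, problem_type="multi_label_classification"
)

inputs = tokenizer("a new text to classify", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

probs = torch.sigmoid(logits)    # one independent probability per label
predicted = (probs > 0.5).int()  # keep every label whose probability clears 0.5
print(predicted)                 # multi-hot prediction vector of shape (1, 10)
```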