Update README.md
Browse files
README.md
CHANGED
|
@@ -134,20 +134,19 @@ pipe(messages)
|
|
| 134 |
<h3 style="font-size: 21px; color: #2980b9;">Training Data </h3>
|
| 135 |
|
| 136 |
Dataset I: [az-llm/az_academic_qa-v1.0](https://huggingface.co/datasets/az-llm/az_academic_qa-v1.0)
|
| 137 |
-
|
| 138 |
Description:
|
| 139 |
A 7,000-example dataset for academic-style comprehension and reasoning in Azerbaijani.
|
| 140 |
-
Each example contains a long chunk_text, a high-complexity question, a detailed structured answer, and a tone tag (e.g., Formal, Open-ended). Sourced from historical, legal, philosophical, and social science texts.
|
| 141 |
-
|
| 142 |
-
Fields:
|
| 143 |
-
|
| 144 |
-
- chunk_text: Source paragraph or multi-sentence input
|
| 145 |
-
- question: Open-ended or context-based question
|
| 146 |
-
- answer: Long-form response
|
| 147 |
-
- tone: Answer style (e.g., formal, informal)
|
| 148 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 149 |
|
|
|
|
|
|
|
| 150 |
|
|
|
|
|
|
|
| 151 |
|
| 152 |
|
| 153 |
### Framework versions
|
|
|
|
| 134 |
<h3 style="font-size: 21px; color: #2980b9;">Training Data </h3>
|
| 135 |
|
| 136 |
Dataset I: [az-llm/az_academic_qa-v1.0](https://huggingface.co/datasets/az-llm/az_academic_qa-v1.0)
|
|
|
|
| 137 |
Description:
|
| 138 |
A 7,000-example dataset for academic-style comprehension and reasoning in Azerbaijani.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 139 |
|
| 140 |
+
Dataset II: [az-llm/az_creative-v1.0](https://huggingface.co/datasets/az-llm/az_creative-v1.0)
|
| 141 |
+
Description:
|
| 142 |
+
A 4,000-example creative dataset with imaginative Azerbaijani prompts and expressive responses.
|
| 143 |
+
Includes role-based instructions (e.g., Galileo, interstellar assistant, detective), fictional narratives, poetic reasoning, and emotional simulations.
|
| 144 |
|
| 145 |
+
Dataset III: [tahmaz/azerbaijani_text_math_qa1](https://huggingface.co/datasets/tahmaz/azerbaijani_text_math_qa1)
|
| 146 |
+
Description: A dataset of 6,500 high school math examples in Azerbaijani.
|
| 147 |
|
| 148 |
+
Dataset IV: [omar07ibrahim/Alpaca_Stanford_Azerbaijan](https://huggingface.co/datasets/omar07ibrahim/Alpaca_Stanford_Azerbaijan)
|
| 149 |
+
Description: Azerbaijani version of the Alpaca dataset for instruction-following tasks.
|
| 150 |
|
| 151 |
|
| 152 |
### Framework versions
|