π DRAGON. Dynamic RAG Benchmark On News
This leaderboard allows comparing RAG systems based on generative and retrieval metrics across different question types (simple, comparison, multi-hop, conditional, etc.).
Version 1.34.1 β 600 questions, generated from news sources β 03 ΠΈΡΠ»Ρ 2025
Generation Metrics
Retrieval Metrics
No data available. Please submit some results.
Model | Embeddings | Top k | Retrieval (avg) | Generation (avg) | Total Score | Version | Last Updated |
|---|---|---|---|---|---|---|---|
RuadaptQwen2.5-32B-Instruct (9449f3) | multilingual-e5-large-instruct_0 | 20 | 0.6769 | 0.4702 | 0.5736 | 1.11.0 | 2025-07-20 |
No data available for Simple Questions category.
Performance on Simple Questions
Model | Embeddings | Retrieval | Generation | Score |
|---|---|---|---|---|
RuadaptQwen2.5-32B-Instruct (9449f3) | multilingual-e5-large-instruct_0 | 0.7044 | 0.4219 | 1.1263 |
No data available for Set-based category.
Performance on Set-based
Model | Embeddings | Retrieval | Generation | Score |
|---|---|---|---|---|
RuadaptQwen2.5-32B-Instruct (9449f3) | multilingual-e5-large-instruct_0 | 0.5744 | 0.2668 | 0.8544 |
No data available for Multi-hop category.
Performance on Multi-hop
Model | Embeddings | Retrieval | Generation | Score |
|---|---|---|---|---|
RuadaptQwen2.5-32B-Instruct (9449f3) | multilingual-e5-large-instruct_0 | 0.6844 | 0.4476 | 1.0067 |
No data available for Conditional category.
Performance on Conditional
Model | Embeddings | Retrieval | Generation | Score |
|---|---|---|---|---|
RuadaptQwen2.5-32B-Instruct (9449f3) | multilingual-e5-large-instruct_0 | 0.7444 | 0.7311 | 1.4755 |
Citation
@misc{chernogorskii2025dragondynamicragbenchmark,
title={DRAGON: Dynamic RAG Benchmark On News},
author={Fedor Chernogorskii and Sergei Averkiev and Liliya Kudraleeva and Zaven Martirosian and Maria Tikhonova and Valentin Malykh and Alena Fenogenova},
year={2025},
eprint={2507.05713},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2507.05713},
}
Version Selection
Start counting from the current dataset version
1 5
Click on models in the table to add them to the charts