---
language:
- uk
tags:
- text2text-generation
- punctuation prediction
- punctuation
library_name: generic
license: mit
metrics:
- f1
datasets:
- ubertext2.0
widget:
- text: "доброго вечора ми з україни"
---

# Ukrainian model to restore punctuation and capitalization

This is a NeMo model that restores punctuation and capitalization in sentences. It was trained on more than 10 million sentences from the [UberText 2.0 corpus](https://lang.org.ua/en/ubertext/). The underlying transformer is `bert-base-multilingual-cased`.

The model restores three punctuation marks: `?`, `.`, and `,`.

It also restores the capitalization of words.

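To make the task concrete, here is a minimal sketch of how per-word predictions could be turned back into text. It does not call the model. The `apply_labels` helper and the `(punct, capitalize)` label format are hypothetical, chosen for illustration, and the labels below are written by hand for the widget example rather than produced by the model.

```python
# Illustrative only: this label format is an assumption, not the model's actual output.
PUNCTUATION_MARKS = {"?", ".", ","}  # the three marks the model restores


def apply_labels(words, labels):
    """Rebuild a sentence from lowercase words and per-word (punct, capitalize) labels.

    `punct` is "?", ".", "," or "" (no mark after the word);
    `capitalize` is True when the word's first letter should be uppercased.
    """
    if len(words) != len(labels):
        raise ValueError("expected exactly one label per word")
    restored = []
    for word, (punct, capitalize) in zip(words, labels):
        if punct and punct not in PUNCTUATION_MARKS:
            raise ValueError(f"unsupported punctuation mark: {punct!r}")
        if capitalize:
            word = word[:1].upper() + word[1:]
        restored.append(word + punct)
    return " ".join(restored)


# Hand-written labels for the widget example (not model output).
words = "доброго вечора ми з україни".split()
labels = [("", True), (",", False), ("", False), ("", False), (".", True)]
print(apply_labels(words, labels))  # Доброго вечора, ми з України.
```

In this framing the model's job is to predict one punctuation label and one capitalization label for each input word; the rest is plain string assembly.
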
Copyright: [Dmytro Chaplynskyi](https://twitter.com/dchaplinsky), [lang-uk](https://lang.org.ua) project, 2022