chore: switch to Korean-optimized TrOCR model
- Replace microsoft/trocr-large-printed with ddobokki/ko-trocr
- This model is specifically trained for Korean text and handles choseong (초성, initial consonants) better
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <[email protected]>
app.py
CHANGED
@@ -8,8 +8,8 @@ import torch
 from PIL import Image
 from transformers import VisionEncoderDecoderModel, TrOCRProcessor, AutoTokenizer, AutoModelForCausalLM
 
-# Stage 1: OCR model (extract text from documents with TrOCR)
-OCR_MODEL_ID = "
+# Stage 1: OCR model (extract text from documents with Korean TrOCR)
+OCR_MODEL_ID = "ddobokki/ko-trocr"
 
 # Stage 2: LLM model (extract drug names from text)
 LLM_MODEL_ID = "Qwen/Qwen2.5-7B-Instruct"
@@ -40,9 +40,9 @@ def _load_llm_model():
     return model, tokenizer
 
 
-print("Loading TrOCR model...")
+print("Loading Korean TrOCR model (ddobokki/ko-trocr)...")
 OCR_MODEL, OCR_PROCESSOR = _load_ocr_model()
-print("✅ TrOCR model loaded!")
+print("✅ Korean TrOCR model loaded!")
 
 print("Loading Qwen2.5-7B-Instruct...")
 LLM_MODEL, LLM_TOKENIZER = _load_llm_model()
@@ -304,7 +304,7 @@ with gr.Blocks(theme=gr.themes.Soft(), css=CUSTOM_CSS) as demo:
 ---
 
 **ℹ️ 2-stage pipeline**
-- **Stage 1**: TrOCR (
+- **Stage 1**: Korean TrOCR (ddobokki/ko-trocr) - extracts Korean text from images
 - **Stage 2**: Qwen2.5 7B (LLM) - identifies only drug names from the extracted text
 
 For actual medication, follow the instructions of your doctor or pharmacist.
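Review note: the diff changes only the model ID and the log strings, so the body of `_load_ocr_model()` is not visible here. The sketch below shows one way Stage 1 might load and call the new checkpoint. It is not the author's implementation. The helper names `load_ocr_model`, `normalize_ocr_text`, and `run_ocr` are hypothetical, as is the `max_length=64` default. It also assumes that `TrOCRProcessor.from_pretrained` yields a tokenizer compatible with `ddobokki/ko-trocr`. The NFC step assumes the decoder may emit decomposed Hangul jamo, which would need recomposing before the text reaches the LLM stage.

```python
import unicodedata

# Model ID taken from the diff.
OCR_MODEL_ID = "ddobokki/ko-trocr"


def load_ocr_model(model_id=OCR_MODEL_ID):
    """Hypothetical stand-in for the app's _load_ocr_model().

    Returns (model, processor), matching the unpacking order in the diff.
    transformers is imported lazily so this module imports without it.
    """
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()  # inference only
    return model, processor


def normalize_ocr_text(text):
    """Recompose decomposed Hangul jamo into syllables (NFC) and trim whitespace."""
    return unicodedata.normalize("NFC", text).strip()


def run_ocr(image, model, processor, max_length=64):
    """Run Stage 1 on a PIL image and return normalized text.

    max_length=64 is an assumed default, not a value from the diff.
    """
    import torch

    pixel_values = processor(images=image.convert("RGB"), return_tensors="pt").pixel_values
    with torch.no_grad():
        generated_ids = model.generate(pixel_values, max_length=max_length)
    text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
    return normalize_ocr_text(text)
```

Keeping the normalization in its own function means it can be checked without downloading the checkpoint. For example, `normalize_ocr_text("\u1112\u1161\u11ab")` composes the three jamo into the single syllable `한`.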