Commit 32f87e8

Silviase and Copilot authored
Update src/eval_mm/tasks/cc_ocr.py
Co-authored-by: Copilot <[email protected]>
1 parent 6ca82be commit 32f87e8

1 file changed, 6 additions and 0 deletions

src/eval_mm/tasks/cc_ocr.py

Lines changed: 6 additions & 0 deletions
@@ -10,6 +10,12 @@ def base64_to_pil_image(base64_string: str) -> Image.Image:
 
 
 class CCOCR(Task):
+    """
+    The CCOCR class processes the CC-OCR dataset for Japanese samples and provides
+    methods to interact with the dataset. It filters the dataset to include only
+    entries labeled as "Japanese" and decodes base64-encoded images into PIL Image
+    objects for visual processing.
+    """
     default_metric = "ccocr"
 
     @staticmethod
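
The added docstring describes two steps: keeping only the entries labeled "Japanese" and decoding base64-encoded images into PIL Image objects. The sketch below illustrates both under stated assumptions. The column names `language` and `image`, the `id` field, and the helper `filter_by_language` are hypothetical and are not taken from this commit; only the name and argument of `base64_to_pil_image` appear in the diff header, and its body here is a guess at a typical implementation rather than the repository's code.

```python
import base64
import io

# Hypothetical column names; the real CC-OCR schema may differ.
LANGUAGE_KEY = "language"
IMAGE_KEY = "image"


def filter_by_language(rows, language="Japanese"):
    """Keep only the rows whose language label equals `language`."""
    return [row for row in rows if row.get(LANGUAGE_KEY) == language]


def base64_to_pil_image(base64_string: str):
    """Decode a base64 string into a PIL Image (requires Pillow)."""
    # Imported lazily so the filtering step can run without Pillow installed.
    from PIL import Image

    return Image.open(io.BytesIO(base64.b64decode(base64_string)))


# Toy rows standing in for dataset entries; the image payloads are placeholders.
rows = [
    {"id": 0, LANGUAGE_KEY: "Japanese", IMAGE_KEY: "<base64 string>"},
    {"id": 1, LANGUAGE_KEY: "English", IMAGE_KEY: "<base64 string>"},
]
japanese_rows = filter_by_language(rows)
print([row["id"] for row in japanese_rows])  # [0]
```

With real data, each kept row's image field would be passed through `base64_to_pil_image` before being handed to a model.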
