Skip to content

Commit ce8b888

Browse files
committed
Merge remote-tracking branch 'refs/remotes/origin/167-task-ccocr' into 167-task-ccocr
2 parents 496fe58 + 32f87e8 commit ce8b888

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

src/eval_mm/tasks/cc_ocr.py

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -10,6 +10,12 @@ def base64_to_pil_image(base64_string: str) -> Image.Image:
1010

1111

1212
class CCOCR(Task):
13+
"""
14+
The CCOCR class processes the CC-OCR dataset for Japanese samples and provides
15+
methods to interact with the dataset. It filters the dataset to include only
16+
entries labeled as "Japanese" and decodes base64-encoded images into PIL Image
17+
objects for visual processing.
18+
"""
1319
default_metric = "ccocr"
1420

1521
@staticmethod

0 commit comments

Comments
 (0)