Why is the CLIP Score max value around 30? #3142
Why is the CLIP Score max value around 30 instead of 100? I thought that if the similarity range is [0, 1], then multiplying by 100 would give a score range of [0, 100], but even when the image and the text match very well, the CLIP Score is only around 30.
Replies: 1 comment
CLIPScore in torchmetrics is essentially

$$\mathrm{CLIPScore}(I, T) = 100 \cdot \cos(f_{\mathrm{img}}, f_{\mathrm{txt}})$$

using L2-normalized CLIP embeddings $f_{\mathrm{img}}$ and $f_{\mathrm{txt}}$. Cosine similarity reaches 1 only when the image and text embeddings are identical; with separate encoders and real data, well-matched pairs typically land around cosine 0.25–0.35, which translates to scores near 25–35. The factor of 100 is just a scaling convention (inspired by CLIP's logit scale) and doesn't mean "percent correctness." Treat CLIPScore as a relative metric: compare models/captions with the same backbone and settings rather than expecting scores to approach 100.
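
For concreteness, here is a minimal sketch of that computation. It assumes the `openai/clip-vit-base-patch16` weights can be downloaded from the Hugging Face Hub (that model name is just an example, not necessarily the torchmetrics default), and it reproduces the metric by hand as 100 times the cosine similarity of the L2-normalized embeddings:

```python
import torch
from torchmetrics.multimodal.clip_score import CLIPScore
from transformers import CLIPModel, CLIPProcessor

model_name = "openai/clip-vit-base-patch16"  # example backbone, an assumption
image = torch.randint(0, 255, (3, 224, 224), dtype=torch.uint8)  # dummy image
caption = "a photo of a cat"

# 1) Score via the torchmetrics metric
metric = CLIPScore(model_name_or_path=model_name)
tm_score = metric(image, caption)

# 2) The same quantity by hand: cosine of L2-normalized embeddings, times 100
model = CLIPModel.from_pretrained(model_name)
processor = CLIPProcessor.from_pretrained(model_name)
inputs = processor(text=[caption], images=[image], return_tensors="pt", padding=True)
with torch.no_grad():
    f_img = model.get_image_features(pixel_values=inputs["pixel_values"])
    f_txt = model.get_text_features(
        input_ids=inputs["input_ids"], attention_mask=inputs["attention_mask"]
    )
f_img = f_img / f_img.norm(p=2, dim=-1, keepdim=True)
f_txt = f_txt / f_txt.norm(p=2, dim=-1, keepdim=True)
# torchmetrics also clamps negative similarities to 0
manual_score = (100 * (f_img * f_txt).sum(dim=-1)).clamp(min=0)

print(tm_score, manual_score)
```

Both printed values stay well below 100 even for a perfectly fitting caption, because the cosine between image and text embeddings from separate encoders rarely exceeds roughly 0.35 in practice.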