[VLM] Add brand field to accuracy evaluation #2404

johncalesp · 2025-12-06T01:06:10Z

In this PR I intent to add the brand field to the accuracy evaluation.
Since brand can be any string, I opted to use another package to perform the evaluation.
The library rapidfuzz helps compare strings and provide a numeric value based on a threshold. In comparisson using sklearn looks for exact string matches and If we have 1,000 different brands, sklearn treats this as a classification problem with 1,000 classes (multi-classification problem).

The evaluation now will look like this:

github-actions · 2025-12-06T01:06:19Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

wangshangsam · 2025-12-06T01:18:29Z

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/evaluation.py

+        if norm_truth == norm_pred:
+            matches.append(1)
+            continue


I'm just wondering, if it's an exact match, wouldn't the score from fuzz.ratio be also bigger than valid_threshold? Therefore, maybe there's no need to treat the exact match as a special case (that needed to be handled differently)?

johncalesp and others added 3 commits December 4, 2025 17:12

add changes to brand field

c4940ca

fix format

7597cb6

[Automated Commit] Format Codebase

463ca3b

johncalesp requested a review from a team as a code owner December 6, 2025 01:06

wangshangsam suggested changes Dec 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[VLM] Add brand field to accuracy evaluation #2404

[VLM] Add brand field to accuracy evaluation #2404

Uh oh!

johncalesp commented Dec 6, 2025

Uh oh!

github-actions bot commented Dec 6, 2025

Uh oh!

wangshangsam Dec 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[VLM] Add brand field to accuracy evaluation #2404

Are you sure you want to change the base?

[VLM] Add brand field to accuracy evaluation #2404

Uh oh!

Conversation

johncalesp commented Dec 6, 2025

Uh oh!

github-actions bot commented Dec 6, 2025

Uh oh!

wangshangsam Dec 6, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants