Support multimodal input data by moskomule · Pull Request #278 · sbintuitions/flexeval

moskomule · 2026-02-12T06:04:31Z

Refer to #277.

Add parse_input_utterance and preprocessor to TemplateChatDataset to support multimodal input data.

parse_input_utterance parses the structured content used by multimodal LMs.
preprocessor preprocesses each item, e.g. by resizing images.

To allow preprocessors to be configured from jsonnet, a Preprocessor base class is also provided.
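
As a rough illustration of the preprocessor idea, here is a minimal sketch. It assumes a preprocessor is a callable that takes one dataset item (a dict) and returns a transformed item. The `__call__` signature, the `StripWhitespace` class, and the `apply_preprocessors` helper are hypothetical and are not taken from the PR diff, so the actual interface in flexeval may differ.

```python
from abc import ABC, abstractmethod
from typing import Any


class Preprocessor(ABC):
    """Stand-in for the PR's base class; the real signature is an assumption."""

    @abstractmethod
    def __call__(self, item: dict[str, Any]) -> dict[str, Any]:
        """Return a transformed version of one dataset item."""


class StripWhitespace(Preprocessor):
    """Hypothetical preprocessor that trims whitespace from one field."""

    def __init__(self, key: str) -> None:
        self.key = key

    def __call__(self, item: dict[str, Any]) -> dict[str, Any]:
        # Copy the item so the original dataset row is left untouched.
        return {**item, self.key: item[self.key].strip()}


def apply_preprocessors(
    item: dict[str, Any], preprocessors: list[Preprocessor]
) -> dict[str, Any]:
    """Apply each preprocessor in order (hypothetical helper)."""
    for preprocessor in preprocessors:
        item = preprocessor(item)
    return item


processed = apply_preprocessors({"question": "  What is this?  "}, [StripWhitespace("question")])
```

Defining the preprocessor as a class rather than a bare function gives a config system something to instantiate by name with constructor arguments, which is presumably why a base class helps with jsonnet.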

Copilot

Pull request overview

This PR adds support for multimodal input data (e.g., text + images) to flexeval, enabling evaluation of Vision Language Models (VLMs) and other multimodal language models. The implementation introduces two key features to TemplateChatDataset: parse_input_utterance to parse structured content from templates into lists of dictionaries (as required by OpenAI's multimodal API format), and preprocessor to preprocess items before template rendering (e.g., image resizing or base64 encoding).
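
The parsing step described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `parse_utterance` and its `method` argument are hypothetical, and the rendered string stands in for the output of a template. Only the two method names, `literal_eval` and `json_loads`, come from the PR.

```python
import ast
import json
from typing import Any, Optional


def parse_utterance(rendered: str, method: Optional[str] = None) -> Any:
    """Parse a rendered template string into structured content (hypothetical helper)."""
    if method is None:
        # Text-only case: keep the rendered string as-is.
        return rendered
    if method == "literal_eval":
        # Safely evaluates a Python literal such as a list of dicts.
        return ast.literal_eval(rendered)
    if method == "json_loads":
        return json.loads(rendered)
    raise ValueError(f"Unknown parse method: {method}")


# A rendered template that is both valid JSON and a valid Python literal.
rendered = (
    '[{"type": "text", "text": "What is in this image?"}, '
    '{"type": "image_url", "image_url": {"url": "https://example.com/cat.png"}}]'
)

content = parse_utterance(rendered, "json_loads")
```

The two methods accept slightly different syntax: `json.loads` requires double-quoted strings and `true`/`false`/`null`, while `ast.literal_eval` accepts Python literals such as single-quoted strings and `True`/`False`/`None`.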

Changes:

Added parse_input_utterance parameter supporting literal_eval and json_loads parsing methods
Added preprocessor parameter accepting a list of preprocessor instances for item transformation
Created Preprocessor abstract base class defining the preprocessor interface
Implemented ConvertImageToBase64 as an example preprocessor for image handling
Added tests for the parse_input_utterance functionality
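
For the base64 step, a minimal sketch of encoding raw image bytes as a data URL and wrapping it in an `image_url` content part is shown below. The helper names `to_data_url` and `image_content_part` are hypothetical, resizing is omitted, and this is not the PR's ConvertImageToBase64 implementation.

```python
import base64
from typing import Any


def to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode raw image bytes as a base64 data URL (hypothetical helper)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def image_content_part(image_bytes: bytes) -> dict[str, Any]:
    """Wrap the data URL in an image_url content part (hypothetical helper)."""
    return {"type": "image_url", "image_url": {"url": to_data_url(image_bytes)}}


# Placeholder bytes stand in for a real image file's contents.
part = image_content_part(b"abc")
```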

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 11 comments.

File	Description
flexeval/core/chat_dataset/template_based.py	Core implementation of multimodal support: added Preprocessor ABC, parse_input_utterance and preprocessor parameters to TemplateChatDataset and its subclasses
flexeval/multimodal/image_preprocessor.py	Example implementation of image-to-base64 preprocessor with resizing support
flexeval/multimodal/__init__.py	Module initialization exporting ConvertImageToBase64
tests/core/chat_dataset/test_template_based.py	Test coverage for parse_input_utterance feature with different parsing methods



amanjainj98

Adding the Preprocessor class to flexeval/core/chat_dataset/__init__.py would be helpful for usage.

flexeval/flexeval/core/chat_dataset/__init__.py

Line 5 in 4a8a31e

from .template_based import HFChatDataset, JsonlChatDataset, TemplateChatDataset, load_jinja2_template


junya-takayama

LGTM! nice feature 👏

Just a small nit. ↓

flexeval/core/chat_dataset/template_based.py

moskomule added 4 commits February 12, 2026 13:48

Introduce parse_input_utterance and preprocessor

e24291c

change preprocessor to class

d8b329b

add preprocessor impl

34b13a9

add test for parse_input_utterance

4f12613

moskomule requested review from Copilot and junya-takayama February 12, 2026 06:04

moskomule assigned moskomule and amanjainj98 Feb 12, 2026

moskomule added the "enhancement" (New feature or request) label Feb 12, 2026

Copilot started reviewing on behalf of moskomule February 12, 2026 06:05

Copilot AI reviewed Feb 12, 2026


moskomule and others added 9 commits February 12, 2026 15:11

Update flexeval/multimodal/image_preprocessor.py

28b65c1

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update flexeval/core/chat_dataset/template_based.py

7b4bd70

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

fix

9e6ec0c

fix test

c7994ea

fix

7bad582

remove multimodal

39ee3b6

test preprocessor

ca57e45

test preprocessor

740b5fb

fix

4a8a31e

amanjainj98 reviewed Feb 17, 2026


https://github.com/sbintuitions/flexeval/pull/278#pullrequestreview-3…

c4cb43a

…812290261

junya-takayama approved these changes Feb 17, 2026


flexeval/core/chat_dataset/template_based.py (outdated)

processor -> processors

0d830e3

moskomule merged commit ea919a1 into main Feb 18, 2026
7 checks passed

moskomule deleted the feat/multimodal-input branch February 18, 2026 09:32

moskomule merged 15 commits into main from feat/multimodal-input
