MALAMUTE

Multilingual, Highly-granular, Template-free, Education-based Probing Dataset

MALAMUTE is a benchmark for evaluating the factual knowledge of language models across multiple languages and educational levels. Its prompts avoid fixed templates and are meant to probe nuanced, real-world understanding.

🚀 Getting Started

Prerequisites

Python 3.11
Conda (for environment management)

Installation

1. Clone the Repository

git clone https://github.com/Shaier/MALAMUTE.git
cd MALAMUTE

2. Create a Conda Environment

conda create -n malamute python=3.11
conda activate malamute

3. Install Dependencies

pip install -r requirements.txt

📂 Prepare the Data

Extract the dataset into a fresh data directory. The command below first removes any existing data directory and deletes data.zip after extraction:

rm -rf data && unzip -o data.zip -d data && rm data.zip
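
On systems without the unzip utility, the same preparation step can be sketched in Python using only the standard library. The helper name prepare_data and its parameters are hypothetical and not part of the repository. Unlike the shell command, this sketch checks that the archive exists before removing anything and keeps the archive unless asked to delete it.

```python
import shutil
import zipfile
from pathlib import Path


def prepare_data(archive="data.zip", target="data", delete_archive=False):
    """Extract `archive` into a fresh `target` directory.

    Hypothetical helper mirroring the shell command above; it is not
    part of the MALAMUTE repository.
    """
    archive_path = Path(archive)
    target_path = Path(target)

    # Fail early so a missing archive never wipes an existing data directory.
    if not archive_path.is_file():
        raise FileNotFoundError(f"Archive not found: {archive_path}")

    # Equivalent of `rm -rf data`: start from a clean directory.
    if target_path.exists():
        shutil.rmtree(target_path)

    # Equivalent of `unzip -o data.zip -d data`.
    with zipfile.ZipFile(archive_path) as zf:
        zf.extractall(target_path)

    # Equivalent of `rm data.zip`, but opt-in.
    if delete_archive:
        archive_path.unlink()

    # Return the extracted files (relative paths) for a quick sanity check.
    return sorted(
        p.relative_to(target_path).as_posix()
        for p in target_path.rglob("*")
        if p.is_file()
    )
```

Calling prepare_data(delete_archive=True) from the repository root should reproduce the effect of the one-line shell command.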

🧪 Running Evaluations

Masked Language Models (MLMs)

To evaluate using MLMs (e.g., BERT-style models):

python test_MLM.py
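
This README does not describe how test_MLM.py scores predictions. As a rough illustration only, masked-token probes are often scored with precision@k: an example counts as correct when the gold answer appears among the model's top k predictions for the masked position. The sketch below assumes that metric; the function names and the toy examples are made up for illustration and are not taken from the repository or the dataset.

```python
def precision_at_k(ranked_predictions, gold, k=1):
    """Return 1.0 if `gold` is among the top-k predictions, else 0.0.

    `ranked_predictions` is a list of candidate strings, best first.
    Comparison ignores case and surrounding whitespace (an assumption).
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    normalized_gold = gold.strip().lower()
    top_k = [p.strip().lower() for p in ranked_predictions[:k]]
    return 1.0 if normalized_gold in top_k else 0.0


def mean_precision_at_k(examples, k=1):
    """Average precision@k over (ranked_predictions, gold) pairs."""
    examples = list(examples)
    if not examples:
        return 0.0
    total = sum(precision_at_k(preds, gold, k) for preds, gold in examples)
    return total / len(examples)


# Toy, hand-written examples (not from MALAMUTE): each pair holds a
# model's ranked guesses for one masked token and the gold answer.
examples = [
    (["mitochondria", "nucleus", "ribosome"], "mitochondria"),
    (["nucleus", "ribosome", "membrane"], "ribosome"),
    (["oxygen", "carbon", "water"], "hydrogen"),
]

print(mean_precision_at_k(examples, k=1))  # one of three golds ranked first
print(mean_precision_at_k(examples, k=3))  # two of three golds in the top 3
```

In practice the ranked predictions would come from a masked language model's output for the masked position; consult test_MLM.py for the metric the authors actually report.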

Causal Language Models (CLMs)

To evaluate using CLMs (e.g., GPT-style models), see the notebooks folder in this repository.

Citation

If you use this code or dataset, please cite us:

@misc{shaier2025malamutemultilingualhighlygranulartemplatefree,
  title={MALAMUTE: A Multilingual, Highly-granular, Template-free, Education-based Probing Dataset},
  author={Sagi Shaier and George Arthur Baker and Chiranthan Sridhar and Lawrence E Hunter and Katharina von der Wense},
  year={2025},
  eprint={2412.10105},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2412.10105}
}

