Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs

In this repository, we present the code to our paper "Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs" by Wanyong Feng, Peter Tran, Stephen Sireci, and Andrew Lan. In this work, we propose a novel method to predict the difficulty value of math MCQs. Our method first augments the reasoning steps and feedback messages for each option, then samples student knowledge profiles from a distribution, and finally predicts the likelihood of each option being selected and use it to predict the MCQ’s difficulty.

For any questions please email or raise an issue.

Dataset

For MAPT dataset, we cannot share the data since it is a private dataset.

For EEDI dataset, we manually parse the question text and options from the images and filter out questions that need images to answer the question. This is also a private dataset. You can email for access

Running

Setup

python -m venv mcq_diff_prediction_env
source mcq_diff_prediction_env/bin/activate
python -m pip install -r requirements.txt

Generate reasonings for each option

python reason_prompt_gen.py
python reasoning_prompting.py
python reasoning_post_process.py

Linear regression baseline

Extract features

python feature_extract.py

Linear regression close-form solution

python regression_close_form.py

Finetune with/without reasoning

python LLM_pred_difficulty.py

Our method

python our_method.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs

Dataset

Running

Setup

Generate reasonings for each option

Linear regression baseline

Extract features

Linear regression close-form solution

Finetune with/without reasoning

Our method

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
LLM_pred_difficulty.py		LLM_pred_difficulty.py
OpenAIInterface_new.py		OpenAIInterface_new.py
README.md		README.md
feature_extract.py		feature_extract.py
our_method.py		our_method.py
reason_prompt_gen.py		reason_prompt_gen.py
reasoning_post_process.py		reasoning_post_process.py
reasoning_prompting.py		reasoning_prompting.py
regression_close_form.py		regression_close_form.py
requirements.txt		requirements.txt
utils.py		utils.py

umass-ml4ed/math-MCQ-difficulty-prediction

Folders and files

Latest commit

History

Repository files navigation

Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs

Dataset

Running

Setup

Generate reasonings for each option

Linear regression baseline

Extract features

Linear regression close-form solution

Finetune with/without reasoning

Our method

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages