LLM Pushback Experiment Runner

This script runs experiments with query models and judges. It processes a dataset, generates queries, and evaluates the responses according to the supplied configuration.

Configuration File

The configuration file is a JSON file that specifies the models and judges to be used in the experiment. Below is an example structure of the config.json file:

{
    "query_models": [
        {
            "model_name": "example_model_1",
            "transformer_model_type": "type_1",
            "query_format_type": "format_1"
        },
        {
            "model_name": "example_model_2",
            "transformer_model_type": "type_2",
            "query_format_type": "format_2"
        }
    ],
    "judges": [
        {
            "model_name": "judge_model_1",
            "transformer_model_type": "judge_type_1",
            "query_format_type": "judge_format_1"
        }
    ]
}

query_models: A list of models to be used for generating queries. Each model requires:
- model_name: The name of the model.
- transformer_model_type: The type of transformer model (can be None).
- query_format_type: The format type for the queries.
judges: A list of models to be used for judging the responses. Each judge requires:
- model_name: The name of the judge model.
- transformer_model_type: The type of transformer model (can be None).
- query_format_type: The format type for the judging queries.
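
The structure described above can be checked before an experiment starts. The following is a minimal sketch, not the code in app.py: the function name load_config and the REQUIRED_KEYS constant are hypothetical, and it assumes that a missing transformer_model_type is written as null in the JSON file (which Python reads as None).

```python
import json

# The three keys the README lists for every query model and judge.
REQUIRED_KEYS = ("model_name", "transformer_model_type", "query_format_type")


def load_config(path):
    """Load a config file and check that each entry has the documented keys.

    Hypothetical helper for illustration; app.py may validate differently.
    """
    with open(path, encoding="utf-8") as f:
        config = json.load(f)

    for section in ("query_models", "judges"):
        entries = config.get(section)
        if not isinstance(entries, list):
            raise ValueError(f"'{section}' must be a list")
        for i, entry in enumerate(entries):
            # A key may hold None (JSON null), but it must be present.
            missing = [key for key in REQUIRED_KEYS if key not in entry]
            if missing:
                raise ValueError(f"{section}[{i}] is missing keys: {missing}")
    return config
```

Calling load_config("config.json") returns the parsed dictionary, or raises ValueError naming the first entry that lacks a required key.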

Command-Line Arguments

The script accepts several command-line arguments to control its behavior:

--config_file: Path to the configuration file (default: config.json).
--dataset_file: Path to the dataset file (must be in TSV or CSV format).
--mode: Select the mode of operation. Options are:
- queries: Run only the query models.
- judging: Run only the judging models.
- both: Run both queries and judging (default).
--user: User name to be included in the output folder name.
--output_folder: Path to the output folder. Required when --mode is judging; in the other modes the folder is created automatically.
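
The arguments listed above map naturally onto Python's argparse module. This is a sketch under the assumption that app.py uses argparse; the function names build_parser and parse_args and the help strings are illustrative, not taken from the script.

```python
import argparse


def build_parser():
    """Build a parser with the flags documented in this README."""
    parser = argparse.ArgumentParser(description="LLM Pushback Experiment Runner")
    parser.add_argument("--config_file", default="config.json",
                        help="Path to the configuration file")
    parser.add_argument("--dataset_file",
                        help="Path to the dataset file (TSV or CSV)")
    parser.add_argument("--mode", choices=["queries", "judging", "both"],
                        default="both", help="Which stage(s) to run")
    parser.add_argument("--user",
                        help="User name to include in the output folder name")
    parser.add_argument("--output_folder",
                        help="Existing output folder (required for judging mode)")
    return parser


def parse_args(argv=None):
    """Parse arguments and enforce the judging-mode requirement."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.mode == "judging" and not args.output_folder:
        # parser.error prints a usage message and exits with status 2.
        parser.error("--output_folder is required when --mode is judging")
    return args
```

Restricting --mode with choices means a typo such as --mode judge is rejected by the parser instead of silently falling through to a default.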

Example Usage

To run the script with a specific configuration and dataset:

python app.py --config_file my_config.json --dataset_file my_dataset.csv --mode both --user my_name

This command runs both the query and judging models with the specified configuration and dataset, and creates an output folder whose name includes the user name.

Output

The script generates output files in a dynamically created folder (or specified output folder). These include:

Copies of the dataset and configuration for traceability.
CSV files with model predictions.
Judging results appended to the prediction files.
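
One way to produce the traceability copies listed above is sketched below. The folder naming scheme (user name plus timestamp), the base_dir parameter, and the function name prepare_output_folder are all assumptions made for illustration; the actual naming used by app.py is not documented here.

```python
import shutil
from datetime import datetime
from pathlib import Path


def prepare_output_folder(config_file, dataset_file, user,
                          output_folder=None, base_dir="."):
    """Create the output folder and copy the config and dataset into it."""
    if output_folder is None:
        # Hypothetical naming scheme: <user>_<YYYYmmdd_HHMMSS>.
        stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
        output_folder = Path(base_dir) / f"{user}_{stamp}"

    out = Path(output_folder)
    out.mkdir(parents=True, exist_ok=True)

    # Keep copies of the inputs next to the results for traceability.
    for src in (config_file, dataset_file):
        shutil.copy2(src, out / Path(src).name)
    return out
```

Copying the inputs at the start of a run means the results can later be matched to the exact configuration and dataset that produced them, even if the originals change.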

Notes

Ensure that the dataset file exists and is accessible.
The configuration file must be correctly formatted and include all necessary model and judge specifications.
The output folder will be created if it does not exist, but ensure the parent directory is writable.
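
The checks in these notes can be run before an experiment starts. The following sketch collects problems into a list instead of failing on the first one; the function name preflight and the message wording are hypothetical and not part of app.py.

```python
import os
from pathlib import Path


def preflight(dataset_file, output_folder):
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []

    dataset = Path(dataset_file)
    if not dataset.is_file():
        problems.append(f"dataset file not found: {dataset}")
    elif dataset.suffix.lower() not in {".tsv", ".csv"}:
        problems.append(f"dataset must be a .tsv or .csv file: {dataset}")

    out = Path(output_folder)
    # If the folder already exists, check it; otherwise check its parent,
    # since that is where the folder would be created.
    target = out if out.exists() else out.resolve().parent
    if not os.access(target, os.W_OK):
        problems.append(f"directory is not writable: {target}")

    return problems
```

A caller can print each returned message and exit early, so that a long run does not fail at the point of writing results.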
