RAGs for Open Domain Complex QA

This is a fork from the BCQA Benchmarking Complex QA repo.

The scripts used for making the OpenAI API calls and the data processing and analysis can be found in the data_analysis folder.

BCQA (Benchmarking Complex QA)

BCQA is a benchmark for a wide range of complex Qa tasks. It also aims to provide a easy to use framework for evaluating retrieval and reasoning approaches for answering complex multi-hop questions.

Setup

Create a conda environment conda create -n bcqa python=3.10
pip install -e .
To be able to use GPU: pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
The data paths are absolute for my pc so you need to change it to fit yours.

Running Evaluation

The evaluation scripts for retreival and LLMs are in the evaluation folder

For instance to run dpr retreival for Wikimultihopqa run
python3 evaluation/wikimultihop/run_dpr_inference.py

Before running the above script make sure you have configured the correct paths for the data and corpus files in evaluation/config.ini

Example: wikimultihopqa = /home/bcqa/BCQA/2wikimultihopQA
wikimultihopqa-corpus = /home/bcqa/BCQA/wiki_musique_corpus.json

Coding Practices

Auto-formatting code

Install black: pip install black or conda install black
In your IDE: Enable formatting on save.
Install isort: pip install isort or conda install isort
In your IDE: Enable sorting import on save.

In VS Code, you can do this using the following config:

{
    "editor.formatOnSave": true,
    "editor.codeActionsOnSave": {
        "source.organizeImports": true
    }
}

Type hints

Use type hints for everything! No exceptions.

Docstrings

Write a docstring for every function (except the main function). We use the Google format. In VS Code, you can use autoDocstring.

Example

def sum(a: float, b: float) -> float:
    """Compute the sum of a and b.

    Args:
        a (float): First number.
        b (float): Second number.
    
    Returns:
        float: The sum of a and b.
    """

    return a + b

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.idea		.idea
.vscode		.vscode
SearchAPI		SearchAPI
SearchUI		SearchUI
bcqa.egg-info		bcqa.egg-info
build/lib/evaluation		build/lib/evaluation
data		data
data_analysis		data_analysis
evaluation		evaluation
losses		losses
methods		methods
metrics		metrics
re_ranker		re_ranker
readers		readers
retriever		retriever
tests		tests
trainers		trainers
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
constants.py		constants.py
dev.json		dev.json
doc.md		doc.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAGs for Open Domain Complex QA

BCQA (Benchmarking Complex QA)

Setup

Running Evaluation

Coding Practices

Auto-formatting code

Type hints

Docstrings

Example

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

SPsathas/NLPProject

Folders and files

Latest commit

History

Repository files navigation

RAGs for Open Domain Complex QA

BCQA (Benchmarking Complex QA)

Setup

Running Evaluation

Coding Practices

Auto-formatting code

Type hints

Docstrings

Example

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages