feature-ensemble #1
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Add an ensemble model to chebifier. This includes support for Electra models (from https://github.com/ChEB-AI/python-chebai), Residual Gated GCNs (from https://github.com/ChEB-AI/python-chebai-graph) and ChemLog (from https://github.com/sfluegel05/chemlog-peptides).
See this PR in chebai. We are moving the ensemble from Chebai to Chebifier since chebai is for training models, chebifier is for predictions. Compared to the last state in chebai, I made some naming and structural changes. More importantly, I added
An example config for this ensemble might look like this:
Here the model names and checkpoint paths refer to specific models (trained with chebai). The
target_labels_pathrefers to the list of labels of the dataset the model was trained on. Theclasswise_weights_pathrefers to the class-specific metrics (as produces by a chebai script on the validation set).The CLI command is something like