-
Notifications
You must be signed in to change notification settings - Fork 0
Protein Prediction Codebase Migration #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@sfluegel05, Please let me know if the below files are needed in this repository, as I am not sure about them
|
I would say that none of them are needed here. I would even remove more files. We only need files here that are not in the python-chebai repository. The idea is that the proteins repository does not work by itself, but only adds some specific classes to the main repository. The only files I would duplicate are the GItHub workflows. |
|
@sfluegel05 Please check the results from the latest training on the scope50 dataset For this run, I made the following changes:
The training completed 81 epochs in 5 hours on a single GPU. |
Thank you for the clarification. Based on your explanation, I understand that the Firstly, this approach may lead to the installation of several unnecessary dependencies such as RDKit, pysmiles, deepsmiles, selfies, etc., which are not required for protein sequence data. Secondly, the base class Given that protein sequences represent a fundamentally different data type compared to chemical molecules, I would suggest considering a standalone repository for proteins. This would help keep the dependency footprint minimal and allow more flexibility in handling domain-specific requirements. Please let me know your thoughts. |
|
Just want to recall that if we want to use ESM2 for generating embeddings for SCOPe dataset, we can't use ELECTRA model for further training due the following discussed reasons ChEB-AI/python-chebai#64 (comment) |
|
@sfluegel05, Please review and merge. |
…ath" This reverts commit a8823c8.
|
I think there are 2 branches one is dev and another is main. This branch got merged into main. So we can remove the |
python-chebaitopython-chebai-proteinsRelated Issues:
Related PRs