Skip to content
Change the repository type filter

All

    Repositories list

    • A Speech to Text transcriber powered by multimodal LLM
      Python
      MIT License
      0000Updated Feb 25, 2025Feb 25, 2025
    • Fine-tuning code for CLIP models
      Python
      MIT License
      13200Updated Nov 27, 2024Nov 27, 2024
    • OneTrainer is a one-stop solution for all your stable diffusion training needs.
      Python
      GNU Affero General Public License v3.0
      183000Updated Nov 3, 2024Nov 3, 2024
    • OneTrainer is a one-stop solution for all your stable diffusion training needs.
      Python
      GNU Affero General Public License v3.0
      183000Updated Nov 2, 2024Nov 2, 2024
    • OneTrainer is a one-stop solution for all your stable diffusion training needs.
      Python
      GNU Affero General Public License v3.0
      183000Updated Oct 10, 2024Oct 10, 2024
    • OneTrainer is a one-stop solution for all your stable diffusion training needs.
      Python
      GNU Affero General Public License v3.0
      183000Updated Aug 8, 2024Aug 8, 2024
    • Extract hardcoded subtitles from videos using machine learning
      Jupyter Notebook
      MIT License
      126000Updated Aug 1, 2024Aug 1, 2024
    • An Easy tool for adding additional embedding in Onetrainer diffusion trainer just point it into OT's .json preset and txt file with your concept keyword
      Python
      MIT License
      0100Updated May 26, 2024May 26, 2024
    • A software to make a Karaoke Video with flowing lyrics and separated and user choosen audio between intrumental only and original audio
      Python
      GNU General Public License v3.0
      0000Updated May 19, 2024May 19, 2024
    • The successor of WD14 tagger. This batch tagger support wd-vit-tagger-v3 model by SmilingWolf which is more updated model than legacy WD14 over CUDA using ONNX library over Gradio WEBUI
      Python
      MIT License
      32200Updated Apr 7, 2024Apr 7, 2024
    • Official Code for Stable Cascade
      Jupyter Notebook
      MIT License
      530000Updated Mar 8, 2024Mar 8, 2024
    • Python script for Merge the .caption and .txt file to reuse your SDXL datasets into pixart captioning
      Python
      Apache License 2.0
      0000Updated Jan 21, 2024Jan 21, 2024
    • Batch image captioner powered by Google's gemini pro vision. Two version of script included in this script
      Apache License 2.0
      0000Updated Dec 28, 2023Dec 28, 2023
    • A simple WEBUI based on GRADIO for making use of the LLAVA Model for captioning datasets forked from ShadoW-Shinigami
      Python
      1000Updated Dec 25, 2023Dec 25, 2023
    • This script will upscale or downscale the image dataset, based on Stability Documentation: https://platform.stability.ai/docs/features/api-parameters#about-dimensions
      Python
      0000Updated Aug 12, 2023Aug 12, 2023