Ketengan-Diffusion

All

15 repositories

Gemini-Transcriber
Public
A Speech to Text transcriber powered by multimodal LLM
Python
•
MIT License
•0•0•0•0•Updated Feb 25, 2025Feb 25, 2025
CLIP-fine-tune-Adjustment
Public
Fine-tuning code for CLIP models
Python
•
MIT License
•13•2•0•0•Updated Nov 27, 2024Nov 27, 2024
OneTrainer-AdEMAMix
Public
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Python
•
GNU Affero General Public License v3.0
•183•0•0•0•Updated Nov 3, 2024Nov 3, 2024
OneTrainer-WDv3
Public
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Python
•
GNU Affero General Public License v3.0
•183•0•0•0•Updated Nov 2, 2024Nov 2, 2024
OneTrainer-DistributedTraining-Dev
Public
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Python
•
GNU Affero General Public License v3.0
•183•0•0•0•Updated Oct 10, 2024Oct 10, 2024
OneTrainer
Public
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Python
•
GNU Affero General Public License v3.0
•183•0•0•0•Updated Aug 8, 2024Aug 8, 2024
videocr-PaddleOCR-GUI
Public
Extract hardcoded subtitles from videos using machine learning
Jupyter Notebook
•
MIT License
•126•0•0•0•Updated Aug 1, 2024Aug 1, 2024
EZOnetrainerPT
Public
An Easy tool for adding additional embedding in Onetrainer diffusion trainer just point it into OT's .json preset and txt file with your concept keyword
Python
•
MIT License
•0•1•0•0•Updated May 26, 2024May 26, 2024
Karaoke-Video-Clip-Maker
Public
A software to make a Karaoke Video with flowing lyrics and separated and user choosen audio between intrumental only and original audio
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated May 19, 2024May 19, 2024
wdv3-batch-vit-tagger
Public
The successor of WD14 tagger. This batch tagger support wd-vit-tagger-v3 model by SmilingWolf which is more updated model than legacy WD14 over CUDA using ONNX library over Gradio WEBUI
Python
•
MIT License
•3•22•0•0•Updated Apr 7, 2024Apr 7, 2024
StableCascade-For-StageB-Train
Public
Official Code for Stable Cascade
Jupyter Notebook
•
MIT License
•530•0•0•0•Updated Mar 8, 2024Mar 8, 2024
Caption-Merge-For-Pixart
Public
Python script for Merge the .caption and .txt file to reuse your SDXL datasets into pixart captioning
Python
•
Apache License 2.0
•0•0•0•0•Updated Jan 21, 2024Jan 21, 2024
Gemini-Pro-Image-Batch-Captioner-WEBUI
Public
Batch image captioner powered by Google's gemini pro vision. Two version of script included in this script
Apache License 2.0
•0•0•0•0•Updated Dec 28, 2023Dec 28, 2023
Llava-WEBUI-Caption-Adjusted
Public
A simple WEBUI based on GRADIO for making use of the LLAVA Model for captioning datasets forked from ShadoW-Shinigami
Python
•1•0•0•0•Updated Dec 25, 2023Dec 25, 2023
Mass-Resize-SDXL-Dataset
Public
This script will upscale or downscale the image dataset, based on Stability Documentation: https://platform.stability.ai/docs/features/api-parameters#about-dimensions
Python
•0•0•0•0•Updated Aug 12, 2023Aug 12, 2023