GitHub - tal66/transcriber: Transcribe audio to text locally, from YouTube / File / Real-time

Transcribe

Transcribe audio to text (using whisper). Runs locally.

YouTube / File

Files: transcribe.py, youtube_util.py. Possibly select start and end time.

Optional UI (served by flask) using python -m src.app to:

Prepare transcript from a YouTube link or a file.
Text search the database (MongoDB) of transcripts. Note: stop words (like 'how', 'is', 'why') are not indexed.
Edit saved transcripts.

Takes a few seconds for to transcribe a few minutes of audio, for example this song took less than 5 seconds on my pc using cuda:

Real Time

File: transcribe.py. Output example:

Speaker diarization

File: speaker_diarization.py, separating the speakers is done using pyannote (hugging face token required). Output example:

SPEAKER_01:  No introduction needed.
SPEAKER_03:  Welcome. I just agreed to this last minute, as you know.
...

Settings and dependencies

Install and read the instructions for torch, pyannote (for separating speakers. hugging face token required), and install the other requirements.

Set device ID's in settings.py in order to record audio (helper functions are in audio_util.py).

For real-time transcription - there are probably better ways, but it works surprisingly well for short and fast transcriptions.

The model may hallucinate a bit, and be non-deterministic.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
img		img
src		src
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transcribe

YouTube / File

Real Time

Speaker diarization

Settings and dependencies

About

Releases

Packages

Languages

tal66/transcriber

Folders and files

Latest commit

History

Repository files navigation

Transcribe

YouTube / File

Real Time

Speaker diarization

Settings and dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages