ASR segment speaker match using IoU to address the issue #28 #35

Pikauba · 2023-12-12T17:16:28Z

This merge request address the bug in: #28

As stated in the issue, there is a clear problem with the actual assignment process in the diarize.py.

As I explained there : #28 (comment) ,
we have to refactor the algorithm in the call method of the ASRDiarizationPipeline.

The idea is to use the intersection over union to match the results from the diarization segments and the asr segments timestamps. We assign the speaker with the best matching IoU for each asr segment.

It is possible to set a threshold to ignore IoU match lower than a specific value and we can assigne a specific "no match" label when the is not a clear match found between a asr segment and any of the diarization segments available.

I removed the same speaker squashing part but we can probably do some refactoring in order to re-implement it in this pull request.

I would like to have feedback about this pull request as I am open to make improvements to it or make changes I could have forgot to take into account.

2010b9 · 2024-05-20T16:24:47Z

Thanks for doing this! I've tried your code, but I'm having the same issue mentioned in #28 (comment). I don't know why it happens, but I haven't looked thoroughly to the code yet.

segment_matcher using IoU

92083a5

Pikauba changed the title ~~ASR segment spearker match using IoU to adress issue: [#28](https://github.com/huggingface/speechbox/issues/28)~~ ASR segment speaker match using IoU to address the issue #28 Dec 12, 2023

omarsiddiqi224 mentioned this pull request Dec 14, 2023

Better Diarization pipeline Vaibhavs10/insanely-fast-whisper#139

Open

2010b9 mentioned this pull request May 21, 2024

ValueError: attempt to get argmin of an empty sequence #28

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ASR segment speaker match using IoU to address the issue #28 #35

ASR segment speaker match using IoU to address the issue #28 #35

Uh oh!

Pikauba commented Dec 12, 2023 •

edited

Loading

Uh oh!

2010b9 commented May 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

ASR segment speaker match using IoU to address the issue #28 #35

Are you sure you want to change the base?

ASR segment speaker match using IoU to address the issue #28 #35

Uh oh!

Conversation

Pikauba commented Dec 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

2010b9 commented May 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Pikauba commented Dec 12, 2023 •

edited

Loading