Update README.md with SwiftUI Speech Diarization #122

Open
wants to merge 1 commit into
base: master
from

carlosmbe

Project URL

Description

Speech diarization (often called speaker diarization) identifies and labels the different speakers in an audio recording: it determines which segments of speech belong to which speaker.

This app integrates a speech diarization model into an iOS app built with SwiftUI and can be used as a template for more advanced applications.
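To make the idea of "which segments belong to which speaker" concrete, here is a minimal Swift sketch of what diarization output can look like once a model has assigned a speaker label to each audio frame. It is not taken from the project: the `SpeakerSegment` type, the `segments(fromFrameLabels:frameDuration:)` function, and the assumption that the model yields one integer label per fixed-length frame are all hypothetical and only serve as illustration.

```swift
// Hypothetical type for illustration; not part of the project.
struct SpeakerSegment: Equatable {
    let speaker: Int    // speaker label assigned by the model
    let start: Double   // segment start, in seconds
    let end: Double     // segment end, in seconds
}

/// Collapses per-frame speaker labels into contiguous segments.
/// `frameDuration` is the length of one frame in seconds (an assumed parameter).
func segments(fromFrameLabels labels: [Int], frameDuration: Double) -> [SpeakerSegment] {
    var result: [SpeakerSegment] = []
    guard var current = labels.first else { return result }
    var startIndex = 0

    // Close the running segment every time the speaker label changes.
    for (index, label) in labels.enumerated() where label != current {
        result.append(SpeakerSegment(speaker: current,
                                     start: Double(startIndex) * frameDuration,
                                     end: Double(index) * frameDuration))
        current = label
        startIndex = index
    }

    // Close the final segment, which runs to the end of the audio.
    result.append(SpeakerSegment(speaker: current,
                                 start: Double(startIndex) * frameDuration,
                                 end: Double(labels.count) * frameDuration))
    return result
}
```

With labels `[0, 0, 1, 1, 1, 0]` and a frame duration of 0.5 seconds, this yields three segments: speaker 0 from 0.0 to 1.0, speaker 1 from 1.0 to 2.5, and speaker 0 from 2.5 to 3.0. A SwiftUI view could then render each segment as a row in a `List`.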

Why it should be included in example-ios-apps (optional)

At the moment, Apple does not provide a framework for speech diarization, and Core ML Tools can't convert speech diarization models to a .mlmodel file, so the options for implementing the technology on iOS are limited. This project illustrates one approach using a .onnx model, and I wrote a companion article explaining the details.

Checklist

  • Only one project/change is in this pull request
  • Addition in chronological order (bottom of category)
  • Supports iOS 9 / tvOS 10 or later
  • Supports Swift 3 or later
  • Has a commit from less than 2 years ago
  • Has a clear README in English
