STT Transcripts Don't Trigger User "Speaking" State When VAD Misses Speech

## Environment

`livekit-agents==1.2.11`

We are using the livekit session with:
```
...
turn_detection = MultilingualModel(),
vad=silero.VAD.load(),
...
```
as well as Deepgram as STT

## Problem 
We've noticed an issue that we at times receive a transcript from the STT model. 
In traces we can see that we have a `eou_detection` span, however we don't have a `user_speaking` span and we also don't receive any `user_state_changed` event.
If the EOU model detects the speech as final the agent will continue in the conversation.

However, this is very problematic for us when the previous state of the user was `away`, because then our agent will continue to think that our the user is away and after some time end the call. 

I have seen PR #3559 -- however this does not appear to change the user state. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

STT Transcripts Don't Trigger User "Speaking" State When VAD Misses Speech #3617

Environment

Problem

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

STT Transcripts Don't Trigger User "Speaking" State When VAD Misses Speech #3617

Description

Environment

Problem

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions