Since longRunningRecognize is no longer available with the V2 API; I was wondering if the speaker identity is maintained across multiple inputs when using streaming or batch recognition.
Also, are speakers identified properly when using dynamic batch? I wasn't sure if "dynamic batch" is processed in order or not which seems necessary for consistent speaker identification.
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |