Why is sample rate optional only for FLAC or WAV files and not other formats?

Solved


Posted on 04-21-2022 05:49 AM


bruno_medeiros



So, for example, at my work we are using WEBM_OPUS encoding, which, from what I understand, specifies the sample rate in the audio stream metadata itself. Yet the documentation at https://cloud.google.com/speech-to-text/docs/basics#sample-rates says the field is optional only for FLAC or WAV formats.
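For context, here is a minimal Python sketch of where that metadata lives. In a WebM file, the Opus track's codec-private data is an `OpusHead` identification header (RFC 7845), which includes an "input sample rate" field. The helper name `opus_input_sample_rate` is hypothetical, and scanning for the `OpusHead` magic bytes is a shortcut for illustration, not a real WebM parser.

```python
import struct
from typing import Optional


def opus_input_sample_rate(data: bytes) -> Optional[int]:
    """Return the 'input sample rate' from the first OpusHead header found, or None.

    Illustrative shortcut: searches raw bytes for the magic string instead of
    walking the WebM (EBML) element tree.
    """
    idx = data.find(b"OpusHead")
    # The fields read below occupy 8 bytes after the 8-byte magic.
    if idx < 0 or len(data) < idx + 16:
        return None
    # Layout after the magic (RFC 7845): version (u8), channel count (u8),
    # pre-skip (u16, little-endian), input sample rate (u32, little-endian).
    _version, _channels, _pre_skip, sample_rate = struct.unpack_from(
        "<BBHI", data, idx + 8
    )
    return sample_rate
```

RFC 7845 describes this field as the rate of the original input and notes that it is not the rate a decoder must use for playback, so it may not carry the same weight as the sample rate in a FLAC or WAV header. Whether that is the reason the API still asks for the field is not confirmed here.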

And indeed, when I try the Google Speech-to-Text (GSTT) API with some example code (streaming recognition on WEBM_OPUS audio encoded at a 48000 Hz sample rate), the API actually accepts sample rates other than 48000 and, depending on the recognition model, produces different results for different sample rates!
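To make the experiment concrete, here is a rough Python sketch of the recognition config being varied. The field names (`encoding`, `sampleRateHertz`, `languageCode`) follow the v1 REST `RecognitionConfig`. The helper `build_webm_opus_config` is hypothetical, and the set of accepted Opus rates is an assumption taken from the API reference as recalled here, so check it against the current docs.

```python
from typing import Dict, Union

# Assumption: rates the API reference lists as valid for OGG_OPUS / WEBM_OPUS.
OPUS_SAMPLE_RATES = (8000, 12000, 16000, 24000, 48000)


def build_webm_opus_config(
    sample_rate_hertz: int, language_code: str = "en-US"
) -> Dict[str, Union[str, int]]:
    """Build a RecognitionConfig-shaped dict for WEBM_OPUS audio.

    The sample rate is passed explicitly because the docs describe it as
    optional only for FLAC and WAV input.
    """
    if sample_rate_hertz not in OPUS_SAMPLE_RATES:
        raise ValueError(
            f"{sample_rate_hertz} Hz is not in the assumed Opus set {OPUS_SAMPLE_RATES}"
        )
    return {
        "encoding": "WEBM_OPUS",
        "sampleRateHertz": sample_rate_hertz,
        "languageCode": language_code,
    }


# One config per candidate rate, so the same audio can be sent with each
# and the transcripts compared.
configs = [build_webm_opus_config(rate) for rate in OPUS_SAMPLE_RATES]
```

Sending the same 48000 Hz recording with each of these configs is one way to reproduce the model-dependent differences described above.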


Labels: Speech-to-Text


5 Replies
