Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Unable to use audio to text transcribe

I am new to this Google Audio transcription and I have set up the whole Google Free Trial thing and I have tried to use the function of Google's Audio to Speech transcript and well so far my customer experience has been so hard.  I have two files and *.mpa and a *.mp4 file and no matter what i do i keep getting an error that it cannot transcribe.

Can someone  please help me with this.  

Here are the errors I am getting.

Notifications
Running recognize for transcription "2022.03.20_16-30 Paradigm Shift Review Meeting David & Steve Zoom GMT20220320-050943_Recording_640x360-1dddcbcacc9d52be-9cc61"
6 minutes ago
My First Project
Unknown error.
Running recognize for transcription "2022.03.20_16-30 Paradigm Shift Review Meeting David & Steve Zoom GMT20220320-050943_Recording-c51097d6e01f1270-6f202"
13 minutes ago
My First Project
Unknown error.
Running recognize for transcription "2022.03.20_16-30 Paradigm Shift Review Meeting David & Steve Zoom GMT20220320-050943_Recording-85fc910221e53949-6752c"
15 minutes ago
My First Project
Unknown error.
Running recognize for transcription "2022.03.20_16-30 Paradigm Shift Review Meeting David & Steve Zoom GMT20220320-050943_Recording-fed8777481bb94c4-33e9f"
16 minutes ago
My First Project
Unknown error.
 
According to Handbrake the m4a file has the following ...

Format : MPEG-4
Format profile : Base Media / Version 2
Codec ID : mp42 (isom/mp42)
File size : 26.0 MiB
Duration : 28 min 30 s
Overall bit rate mode : Variable
Overall bit rate : 127 kb/s
Encoded date : UTC 2022-03-20 05:09:43
Tagged date : UTC 2022-03-20 05:09:43

Audio
ID : 1
Format : AAC LC
Format/Info : Advanced Audio Codec Low Complexity
Codec ID : mp4a-40-2
Duration : 28 min 30 s
Bit rate mode : Variable
Bit rate : 126 kb/s
Maximum bit rate : 166 kb/s
Channel(s) : 1 channel
Channel layout : C
Sampling rate : 32.0 kHz
Frame rate : 31.250 FPS (1024 SPF)
Compression mode : Lossy
Stream size : 25.7 MiB (99%)
Title : AAC audio

 

And the mp4 file has 

Format : MPEG-4
Format profile : Base Media / Version 2
Codec ID : mp42 (isom/mp42)
File size : 30.2 MiB
Duration : 28 min 30 s
Overall bit rate mode : Variable
Overall bit rate : 148 kb/s
Encoded date : UTC 2022-03-20 05:09:43
Tagged date : UTC 2022-03-20 05:09:43

Video
ID : 2
Format : AVC
Format/Info : Advanced Video Codec
Format profile : High@L3.1
Format settings : CABAC / 11 Ref Frames
Format settings, CABAC : Yes
Format settings, Reference frames : 11 frames
Codec ID : avc1
Codec ID/Info : Advanced Video Coding
Duration : 28 min 30 s
Bit rate : 19.8 kb/s
Width : 640 pixels
Height : 360 pixels
Display aspect ratio : 16:9
Frame rate mode : Constant
Frame rate : 25.000 FPS
Color space : YUV
Chroma subsampling : 4:2:0
Bit depth : 8 bits
Scan type : Progressive
Bits/(Pixel*Frame) : 0.003
Stream size : 4.03 MiB (13%)
Title : H.264/AVC video
Encoded date : UTC 2022-03-20 05:09:43
Tagged date : UTC 2022-03-20 05:09:43
Codec configuration box : avcC

 

What codec options do i choose and how do I use this feature.

Google

0 1 538
1 REPLY 1

You can try another variant of audio transcription -Audext. I like that the software supports various audio file formats like Mp3, WAV, and M4A and it allows editing of the transcript.