I have created a text transcript from an audio file using the Speech-to-Text API in the cloud console. Probably a silly question - but I fail to find any option to view the resulting JSON file in a human-readable format, e.g. TXT or DOCX. Anybody can help?
It's not a silly question at all! The Google Cloud Speech-to-Text API generates the transcription results in JSON format by default, which can be a bit tricky to read as plain text. Here's how you can convert it into a human-readable format like TXT or DOCX:-
1. Extract the Transcript from JSON
2. Convert JSON to TXT
3. Convert to DOCX
Thanks @sahilnaircool for the quick reply and the steps towards solution - however, I'm still confused why the same seem to be obvious to you:
(3.) seems obvious - most Text processors (e.g. Word or LibreOffice) offer to read TXT and write to DOCX. But is that what you meant to say?
(1+2.) is unclear (to me) - JSON is "the transcript" and already is plain text (i.e. TXT), but includes markup, which is not "human readable".
Can you explain in detail, how you suggest [a simple user like me] to view the JSON output in any "human readable" form?
hello, probably you can use pandas to read json and convert to another format
Thanks @luis-quissak for the hint to pandas. I agree that the latter might be helpful for writing a converter - however, this is not my intention (nor within my professional set of skills...)
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |