
AWS Transcribe: file to text returns nonsense

This is a follow-on question to AWS Transcribe S3 .wav file to text. I use a stream to read the contents of a .wav file and send them to AWS.

Instead of getting back the correct transcript, I get nonsense, such as a series of “Yeah.” statements. It looks like AWS isn’t able to interpret the byte stream correctly, but I’m not sure what’s wrong. Does the file need to be encoded somehow, i.e., can I not send the raw .wav bytes straight from the file? Or do I need to tell the service that the audio is in .wav format?

What’s wrong here? The input file is a valid .wav voice file that sounds intelligible when I listen to it.

Here is my Java code:

JavaScript

Here’s my program output:

JavaScript


Answer

The audio file had a sample rate of 44.1 kHz. After it was converted to 16 kHz, the transcription worked:

https://drive.google.com/file/d/1mYVbNlYK3SpGT4NbFRYGn86177eTCqhd/view?usp=sharing
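One way to catch this kind of problem before sending any audio is to read the sample rate from the .wav header and compare it with the rate the request declares. The sketch below uses only the standard `javax.sound.sampled` API. The class name, the helper names, and the 16000 Hz value are illustrative assumptions, since the original request code is not shown here, and the idea that the request declared 16 kHz is a guess based on the fix that worked.

```java
import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;
import java.io.ByteArrayInputStream;
import java.io.File;

// Hypothetical helper class; not part of the AWS SDK or the original question.
public class WavSampleRateCheck {

    // Reads the sample rate (in Hz) recorded in the audio file's header.
    public static float sampleRateOf(File wav) throws Exception {
        AudioFileFormat fileFormat = AudioSystem.getAudioFileFormat(wav);
        return fileFormat.getFormat().getSampleRate();
    }

    // True when the file's sample rate equals the rate you intend to declare
    // in the transcription request (assumed here to be an integer in Hz).
    public static boolean matchesDeclaredRate(File wav, int declaredHz) throws Exception {
        return Math.round(sampleRateOf(wav)) == declaredHz;
    }

    // Writes a short silent 16-bit mono PCM .wav file, only so the demo
    // below has something to inspect.
    public static File writeSilentWav(float sampleRate, int frames) throws Exception {
        AudioFormat format = new AudioFormat(sampleRate, 16, 1, true, false);
        byte[] pcm = new byte[frames * format.getFrameSize()];
        File out = File.createTempFile("sample", ".wav");
        out.deleteOnExit();
        try (AudioInputStream stream =
                 new AudioInputStream(new ByteArrayInputStream(pcm), format, frames)) {
            AudioSystem.write(stream, AudioFileFormat.Type.WAVE, out);
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        int declaredHz = 16000; // assumed value declared in the request
        File wav = args.length > 0 ? new File(args[0]) : writeSilentWav(44100f, 1000);
        if (matchesDeclaredRate(wav, declaredHz)) {
            System.out.println("Sample rate matches the declared rate.");
        } else {
            System.out.println("Mismatch: file is " + Math.round(sampleRateOf(wav))
                + " Hz but the request would declare " + declaredHz + " Hz.");
        }
    }
}
```

If the check reports a mismatch, either resample the file (for example with an audio tool such as `ffmpeg -i in.wav -ar 16000 out.wav`) or declare the file's actual rate in the request, provided the service accepts that rate.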


User contributions licensed under: CC BY-SA