Fix for whisper-microphone example failure if audio isn't chunk aligned #2645

anelson · 2024-11-27T21:16:20Z

At least on my macOS Sequoia system (MBP 14" 2021, M1 Pro), when I run the whisper-microphone example after it has gathered 10 seconds of audio, it fails before the transcription:

Error: Insufficient buffer size 384 for input channel 0, expected 1024

At least for the audio device I'm using (Airpods Pro Max), there is no guarantee that each audio buffer is a multiple of 1024 samples. Thus at the end of the 10 seconds, buffered_pcm can have some samples at the end that do not form a complete 1024 sample chunk.

This fixes that by tracking when there is a partial chunk at the end of the buffer, and leaving it in buffered_pcm to be processed on the next loop iteration.

Note that, in the interest of keeping this PR as small as possible, I didn't make any other changes to this example. That said, I think a good enhancement would be to introduce a const for the hard-coded 1024 sample chunk size.

At least on my macOS Sequoia system (MBP 14" 2021, M1 Pro), when I run the `whisper-microphone` example after it has gathered 10 seconds of audio, it fails before the transcription: ``` Error: Insufficient buffer size 384 for input channel 0, expected 1024 ``` At least for the audio device I'm using (Airpods Pro Max), there is no guarantee that each audio buffer is a multiple of 1024 samples. Thus at the end of the 10 seconds, `buffered_pcm` can have some samples at the end that do not form a complete 1024 sample chunk. This fixes that by tracking when there is a partial chunk at the end of the buffer, and leaving it in `buffered_pcm` to be processed on the next loop iteration. Note that, in the interest of keeping this PR as small as possible, I didn't make any other changes to this example.

LaurentMazare · 2024-11-27T21:35:17Z

Thanks!

LaurentMazare approved these changes Nov 27, 2024

View reviewed changes

LaurentMazare merged commit 23ed8a9 into huggingface:main Nov 27, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for whisper-microphone example failure if audio isn't chunk aligned #2645

Fix for whisper-microphone example failure if audio isn't chunk aligned #2645

anelson commented Nov 27, 2024 •

edited

Loading

LaurentMazare commented Nov 27, 2024

Fix for whisper-microphone example failure if audio isn't chunk aligned #2645

Fix for whisper-microphone example failure if audio isn't chunk aligned #2645

Conversation

anelson commented Nov 27, 2024 • edited Loading

LaurentMazare commented Nov 27, 2024

anelson commented Nov 27, 2024 •

edited

Loading