Category: Audio Processing

  • Easy Text-to-Speech in Windows 10 Using PyWin32

    Easy Text-to-Speech in Windows 10 Using PyWin32

    Some time back, we’ve talked about how to build a speech recognition system in Python. Now let’s look in to the other end of it: how to make a Python program that talks. More specifically, let’s looks at building a text-to-speech system. There are several libraries out there that would let you build a text-to-speech…

  • Energy Threshold Calibration in Speech Recognition

    In my last post on Speech Recognition, I showed how to setup the Python SpeechRecognition package with PyAudio, and pocketsphinx to recognize speech with just a few lines of code. And, as you can remember, we ran into issues where the speech recognition just hangs there unable to recognize our speaking. Speech Recognition just hanging…

  • Easy Speech Recognition in Python with PyAudio and Pocketsphinx

    If you remember, I was getting started with Audio Processing in Python (thinking of implementing an audio classification system) a couple of weeks back (see my earlier post). I got the PyAudio package setup and was having some success with it. As you know, one of the more interesting areas in audio processing in machine…

  • Stepping into audio classification – Getting started with PyAudio

    I’ve had an idea to attempt to train a deep learning model that can classify different audio noises. Having a system that can accurately identify different sounds would have implications in many fields, from medical diagnostics to echo locations systems, among others. I wouldn’t expect building such a system would be an easy task. But,…