Audio and SpeechÂ
Audio encompasses the technologies for storing, retrieving text, and transmitting digital information in an audio format, such as WMA (Windows Media Format) and WAV sound files. Speech encompasses two technologies: speech synthesis and speech recognition.
Audio Controls
Audio controls allow you to host, play, and control content.
Programming Model | Documentation Links |
---|---|
Managed |
Controlling an Audio Stream
Controlling an audio stream allows you to play, pause, stop audio, as well as, modify the volume.
Programming Model | Documentation Links |
---|---|
Managed |
|
Unmanaged |
Waveform-Audio Interface (Win32) |
Audio Capture
Audio capture allows an application to capture audio data from microphones, tape players, and other devices, through the inputs on the sound card.
Programming Model | Documentation Links |
---|---|
Unmanaged |
Audio Capture (DirectShow) |
Speech
The concept of speech technology encompasses two technologies: synthesizers and recognizers. A speech synthesizer takes text as input and produces an audio stream as output. Speech synthesis is also referred to as text-to-speech (TTS). A speech recognizer, on the other hand, does the opposite. It takes an audio stream as input, and turns it into a text transcription.
Programming Model | Documentation Links |
---|---|
Managed |
|
Unmanaged |
Text Services Framework (Microsoft Speech SDK Version 5.3) |