Audio and Speech

Article
11/02/2006

Audio encompasses the technologies for storing, retrieving, and transmitting digital information in an audio format, such as WMA (Windows Media Audio) and WAV sound files. Speech encompasses two technologies: speech synthesis and speech recognition.
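As a rough illustration of storing and retrieving audio in the WAV format, the following sketch writes a short sine tone as 16-bit mono PCM and reads the header fields back. It uses only Python's standard wave module rather than any of the Windows APIs listed in this topic. The sample rate, tone frequency, and the helper names write_tone and read_info are assumptions made for the example.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 8000  # samples per second (assumed value for this sketch)


def write_tone(target, freq_hz=440.0, seconds=0.5, amplitude=0.5):
    """Store a sine tone as 16-bit mono PCM in WAV format (hypothetical helper)."""
    n_frames = int(SAMPLE_RATE * seconds)
    frames = bytearray()
    for i in range(n_frames):
        sample = amplitude * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE)
        # Pack each sample as a little-endian signed 16-bit integer.
        frames += struct.pack("<h", int(sample * 32767))
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)      # mono
        wav.setsampwidth(2)      # 2 bytes = 16 bits per sample
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(bytes(frames))
    return n_frames


def read_info(source):
    """Retrieve (channels, sample width, frame rate, frame count) from WAV data."""
    with wave.open(source, "rb") as wav:
        return (wav.getnchannels(), wav.getsampwidth(),
                wav.getframerate(), wav.getnframes())


# Write to an in-memory buffer, then read the header back.
buffer = io.BytesIO()
frames_written = write_tone(buffer)
buffer.seek(0)
info = read_info(buffer)
print(frames_written, info)
```

Passing a file path instead of the in-memory buffer would store the same data on disk.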

Audio Controls

Audio controls allow you to host, play, and control audio content.

Programming Model	Documentation Links
Managed	System.Windows.Controls.MediaElement (WPF)

Controlling an Audio Stream

Controlling an audio stream allows you to play, pause, and stop audio, as well as modify the volume.

Programming Model	Documentation Links
Managed	How to: Control a MediaElement (WPF)
Unmanaged	Waveform-Audio Interface (Win32)
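To make the idea of modifying the volume concrete, the sketch below scales raw 16-bit PCM samples by a gain factor. It is a conceptual illustration only and does not show how MediaElement or the Waveform-Audio Interface implement volume control. The function name apply_volume and the 0.0 to 1.0 volume range are assumptions made for the example.

```python
import struct


def apply_volume(pcm_bytes, volume):
    """Scale 16-bit little-endian PCM samples by a volume in [0.0, 1.0].

    Hypothetical helper for illustration; not part of any Windows API.
    """
    if not 0.0 <= volume <= 1.0:
        raise ValueError("volume must be between 0.0 and 1.0")
    count = len(pcm_bytes) // 2  # two bytes per 16-bit sample
    samples = struct.unpack("<%dh" % count, pcm_bytes[:count * 2])
    # Multiply each sample by the gain and clamp to the signed 16-bit range.
    scaled = [max(-32768, min(32767, int(round(s * volume)))) for s in samples]
    return struct.pack("<%dh" % count, *scaled)


original = struct.pack("<4h", 1000, -1000, 20000, -20000)
half = struct.unpack("<4h", apply_volume(original, 0.5))
muted = struct.unpack("<4h", apply_volume(original, 0.0))
print(half, muted)
```

A volume of 1.0 leaves the samples unchanged, and 0.0 produces silence.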

Audio Capture

Audio capture allows an application to capture audio data from microphones, tape players, and other devices, through the inputs on the sound card.

Programming Model	Documentation Links
Unmanaged	Audio Capture (DirectShow)

Speech

Speech technology encompasses two technologies: synthesizers and recognizers. A speech synthesizer takes text as input and produces an audio stream as output; speech synthesis is also referred to as text-to-speech (TTS). A speech recognizer does the opposite: it takes an audio stream as input and turns it into a text transcription.

Programming Model	Documentation Links
Managed	System.Speech.Synthesis; System.Speech.Recognition
Unmanaged	Text Services Framework (Microsoft Speech SDK Version 5.3); Microsoft Agent Version 2.0

