Audio and Speech 

Audio encompasses the technologies for storing, retrieving text, and transmitting digital information in an audio format, such as WMA (Windows Media Format) and WAV sound files. Speech encompasses two technologies: speech synthesis and speech recognition.

Audio Controls

Audio controls allow you to host, play, and control content.

Programming Model Documentation Links

Managed

System.Windows.Controls.MediaElement (WPF)

Controlling an Audio Stream

Controlling an audio stream allows you to play, pause, stop audio, as well as, modify the volume.

Programming Model Documentation Links

Managed

How to: Control a MediaElement (WPF)

Unmanaged

Waveform-Audio Interface (Win32)

Audio Capture

Audio capture allows an application to capture audio data from microphones, tape players, and other devices, through the inputs on the sound card.

Programming Model Documentation Links

Unmanaged

Audio Capture (DirectShow)

Speech

The concept of speech technology encompasses two technologies: synthesizers and recognizers. A speech synthesizer takes text as input and produces an audio stream as output. Speech synthesis is also referred to as text-to-speech (TTS). A speech recognizer, on the other hand, does the opposite. It takes an audio stream as input, and turns it into a text transcription.

Programming Model Documentation Links

Managed

System.Speech.Synthesis

System.Speech.Recognition

Unmanaged

Text Services Framework (Microsoft Speech SDK Version 5.3)

Microsoft Agent Version 2.0

See Also

Concepts

Animation and Timing
Video