Dynamic MRI studies of the upper airway during speaking, singing or swallowing are complicated by the need for high temporal resolution and the presence of air-tissue interfaces that may give rise to image artifacts such as signal void and geometric distortions. This work exploits a recently developed real-time MRI technique to address these challenges for monitoring speech production at 3 T. The method combines a short-echo time radial FLASH MRI sequence (pulse repetition time/echo time = 2.22/1.44 ms; flip angle 5°) with pronounced undersampling (15 radial spokes per image) and image reconstruction by regularized nonlinear inversion. The resulting serial images at 1.5 mm in-plane resolution and 33.3 ms acquisition time are free of motion or susceptibility artifacts. This application focuses on a dynamic visualization of the main articulators during natural speech production (Standard Modern German). Respective real-time MRI movies at 30 frames per second clearly demonstrate the spatiotemporal coordination of lips, tongue, velum, and larynx for generating vowels, consonants, and coarticulations. The quantitative results for individual phonetic events are in agreement with previous non-MRI findings. Magn Reson Med, 2013. © 2012 Wiley Periodicals, Inc.