Automatically recognizes speech in videos and generates subtitles