Original author(s) | OpenAI[1] |
---|---|
Initial release | September 21, 2022 |
Repository | https://github.com/openai/whisper |
Written in | Python |
Type | |
License | MIT License |
Whisper is a machine learning model for speech recognition and transcription, created by OpenAI and first released as open-source software in September 2022.[2]
It is capable of transcribing speech in English and several other languages, and is also capable of translating several non-English languages into English.[1] OpenAI claims that the combination of different training data used in its development has led to improved recognition of accents, background noise and jargon compared to previous approaches.[3]
Whisper is a weakly-supervised deep learning acoustic model, made using an encoder-decoder transformer architecture.[1]
Whisper V2 was released on December 8, 2022.[4] Whisper V3 was released in November 2023, on the OpenAI Dev Day.[5]