Comparing Audacity's OpenVINO Whisper Transcription LLMs

Introduction Several months ago, I posed a question in the r/audacity subreddit regarding the differences between various OpenVINO Whisper Transcription models and their impact on transcription quality. Having received no response, I conducted this comparative study independently. Evaluating the performance of these models across diverse audio content is essential for a comprehensive assessment. This report compares four audio processing models: base , small , medium , and large-v3 . The analysis encompasses scores achieved by each model on ten different audio tracks (labelled Track 1 through Track 10), along with their respective processing durations. The objective of this analysis is to provide a data-driven foundation for assessing each model’s effectiveness, examining their processing efficiency, and identifying the strengths and weaknesses of each model. Analysis Data All tracks, outputs from the different Audacity models, source code used to generate the scores, and in...