diff --git a/WhisperNet/Readme.md b/WhisperNet/Readme.md index 1545e23..aed10ce 100644 --- a/WhisperNet/Readme.md +++ b/WhisperNet/Readme.md @@ -5,8 +5,8 @@ The library requires a hardware GPU which supports Direct3D 11.0, a 64-bit Windo The main entry point of the llibrary is `Whisper.Library` static class. Call `loadModel` function from that class to load an ML model from a binary file. -These binary files are available for free download on [Hugging Face](https://huggingface.co/ggerganov/whisper.cpp/tree/main).
-I recommend `ggml-medium.bin` (1.42GB in size, but that web page says 1.53 GB), because I’ve mostly tested the software with that model.
+These binary files are available for free download on [Hugging Face](https://huggingface.co/ggerganov/whisper.cpp/tree/main). +I recommend `ggml-medium.bin` (1.42GB in size, but that web page says 1.53 GB), because I’ve mostly tested the software with that model. Compressed models in ZIP format with `mlmodelc` in the file name are not supported. Once the model is loaded, create a context by calling `createContext` extension method, diff --git a/WhisperNet/WhisperNet.nuspec b/WhisperNet/WhisperNet.nuspec index e7fe354..6d349a2 100644 --- a/WhisperNet/WhisperNet.nuspec +++ b/WhisperNet/WhisperNet.nuspec @@ -8,8 +8,8 @@ https://github.com/Const-me/Whisper High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model - When loading models, adapters can be selected with 0-based index, in addition to the name. - Added an API method to decode initial prompt into array of tokens. + Updated models source URL in the documentation. + Reliability enhancement, microphone capture less likely to transition to “Stalled” state and discard the audio. Copyright © const.me, 2022-2023 whisper, gpgpu, speech recognition