AHK based GUI for whisper.cpp

Whisper-AHK

Screenshot of main window

AutoHotkey-based GUI that uses whisper.cpp to transcribe audio files to text.

Features

  • Transcribe Audio Files using your preferred Whisper AI model.
  • Automatically convert audio files to the 16-bit .wav format that whisper.cpp requires, using ffmpeg.
    • Currently, ffmpeg must be installed manually and added to the system PATH.
  • The Transcriber creates two folders, WAVFiles and Transcriptions, in the same directory as the selected audio files. The converted .wav files and the resulting transcriptions are written there.
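
The conversion step and folder layout described above can be sketched roughly as follows. This is a minimal Python illustration, not the project's actual AutoHotkey code. The function names `plan_outputs` and `ffmpeg_command` are hypothetical, and the 16 kHz mono settings are an assumption based on what whisper.cpp commonly expects rather than something this README states.

```python
from pathlib import Path


def plan_outputs(audio_file):
    """Return (wav_path, transcript_dir) located next to the selected audio file."""
    src = Path(audio_file)
    wav_dir = src.parent / "WAVFiles"        # folder names taken from the README
    txt_dir = src.parent / "Transcriptions"
    return wav_dir / (src.stem + ".wav"), txt_dir


def ffmpeg_command(audio_file, wav_path):
    """Build an ffmpeg call that writes a 16-bit PCM WAV file."""
    return [
        "ffmpeg", "-y",          # overwrite an existing output file
        "-i", str(audio_file),
        "-ar", "16000",          # 16 kHz sample rate (assumption)
        "-ac", "1",              # mono (assumption)
        "-c:a", "pcm_s16le",     # 16-bit little-endian PCM
        str(wav_path),
    ]
```

The command list could be passed to `subprocess.run` once the WAVFiles folder exists. Building it as a list avoids quoting problems with file names that contain spaces.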

How to Use

  • Download and install ffmpeg (or run choco install -y ffmpeg).
  • Download and install the Microsoft Visual C++ Redistributable (or run choco install -y vcredist-all).
  • Clone the repo to your computer with git clone URL.
  • Download and extract the .zip of your preferred whisper binaries to \Models\ from https://github.com/ggerganov/whisper.cpp/releases
    • whisper-bin-x64.zip works well, but if you want to use your NVIDIA GPU, download whisper-cublas-bin-x64.zip instead.
  • Download and move the whisper model you want to use to \Models\ from https://github.com/ggerganov/whisper.cpp/blob/master/models
  • Run Whisper-AHK to start the GUI.
  • To download updates to the scripts, close all running Transcribers and run Pull-Updates-From-Git.exe (requires git to be installed).
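
Before launching the GUI, it can help to confirm that the setup steps above took effect. The following is a hedged Python sketch of such a check, not part of the project. The helper name `missing_prerequisites` is hypothetical, and matching model files by a `.bin` extension is an assumption based on how whisper.cpp models are usually distributed.

```python
import shutil
from pathlib import Path


def missing_prerequisites(repo_dir, which=shutil.which):
    """List setup steps that still look incomplete for a cloned repo."""
    problems = []
    for tool in ("ffmpeg", "git"):           # git is only needed for updates
        if which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    models = Path(repo_dir) / "Models"
    # Assumption: downloaded whisper models are *.bin files directly in Models
    if not models.is_dir() or not any(models.glob("*.bin")):
        problems.append("no model file (*.bin) found in Models")
    return problems
```

An empty list means nothing obvious is missing. The `which` parameter exists only so the lookup can be replaced in tests.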

Settings

  • Keep CMD Open After Model Completion
    • Keeps the Whisper/ffmpeg CMD window open after the task finishes instead of closing it immediately, so you can see any errors.
    • The Transcriber will not move on to the next file until you close the CMD window manually
  • Show Tooltip of progress at top of screen
    • Shows a small tooltip at the top of your screen describing what the script is currently doing.
  • Number of CPU Threads
    • The number of CPU threads you want to use for Transcribing.
  • Minutes to Pause After Transcriptions
    • Pauses the Transcriber for X minutes before it moves on to the next model or file, so your CPU can cool down.
    • Useful if you're using 100% of your CPU threads and your computer's fans are having a hard time keeping up.
  • Transcribe Individual Files
    • Select all the files you want to transcribe individually.
  • Transcribe All Audio Files within Folder
    • Transcribes all MP3/WAV files within a directory.
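
How the folder, thread, and pause settings might fit together is sketched below in Python. This is an illustration under assumptions, not the script's real logic. The names `audio_files_in`, `whisper_command`, and `run_batch` are hypothetical, and the whisper.cpp executable name varies between releases, so it is passed in as a parameter. The flags `-m`, `-f`, `-t`, and `-otxt` are whisper.cpp command-line options for the model, input file, thread count, and text output.

```python
import time
from pathlib import Path

AUDIO_EXTENSIONS = {".mp3", ".wav"}


def audio_files_in(folder):
    """Return the MP3/WAV files directly inside `folder`, sorted by name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in AUDIO_EXTENSIONS
    )


def whisper_command(exe, model, wav_file, threads):
    """Build a whisper.cpp call for one converted .wav file."""
    return [str(exe), "-m", str(model), "-f", str(wav_file),
            "-t", str(threads), "-otxt"]


def run_batch(files, transcribe, pause_minutes=0, sleep=time.sleep):
    """Transcribe files one by one, pausing between them but not after the last."""
    for index, audio in enumerate(files):
        transcribe(audio)
        if pause_minutes and index < len(files) - 1:
            sleep(pause_minutes * 60)    # cool-down between files
```

Here `transcribe` stands for whatever converts and transcribes a single file, for example a function that runs the ffmpeg and whisper.cpp commands with `subprocess.run`.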

Compiling to .exe from Source

  1. Install AutoHotkey v1.
  2. Clone the project with git, then either run the Compile Transcriber to EXE.ahk script to compile automatically, or open the AHK compiler that is installed with AutoHotkey and select Whisper-AHK.ahk as the script and \Assets\Elegantthemes-Beautiful-Flat-Speaker.ico as the icon.
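
For a scripted build, the manual compiler step above corresponds roughly to a single Ahk2Exe command line. The Python sketch below only assembles that command. The function name `ahk2exe_command` is hypothetical, the compiler location is an assumption that depends on your AutoHotkey install, and the output name Whisper-AHK.exe is inferred from the repository rather than stated in these instructions. `/in`, `/out`, and `/icon` are Ahk2Exe's documented command-line switches.

```python
from pathlib import PureWindowsPath


def ahk2exe_command(repo_dir, compiler="Ahk2Exe.exe"):
    """Build the Ahk2Exe call that compiles Whisper-AHK.ahk with its icon."""
    repo = PureWindowsPath(repo_dir)
    return [
        compiler,   # assumption: on PATH, or pass the full path to Ahk2Exe.exe
        "/in", str(repo / "Whisper-AHK.ahk"),
        "/out", str(repo / "Whisper-AHK.exe"),
        "/icon", str(repo / "Assets" / "Elegantthemes-Beautiful-Flat-Speaker.ico"),
    ]
```

On Windows, the resulting list can be handed to `subprocess.run` from the cloned repository.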