🎙️ WhisperForge

WhisperForge is a simple web app that helps you collect voice recordings and train a personalized Whisper speech model.

It provides a clean browser-based interface where you can record audio, manage transcripts, and fine-tune a model, all in one place.

✨ What You Can Do

🎤 Record yourself reading sentences
📝 Automatically save audio with matching transcripts
📚 Build your own speech dataset
🧠 Train a Whisper model using your recordings
🔁 Improve your model over time with more data

🚀 Getting Started

Start the application and open the following address in your browser:

http://localhost:7860

You will see two main sections inside the app.
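If the page does not load, it can help to confirm that something is actually listening on that address. The snippet below is a minimal sketch using only the Python standard library; the helper name app_is_up is hypothetical and not part of WhisperForge.

```python
import urllib.error
import urllib.request


def app_is_up(url="http://localhost:7860", timeout=2.0):
    """Return True if an HTTP server answers at `url` with a 2xx/3xx status.

    Hypothetical helper for illustration; not part of WhisperForge itself.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False
```

Calling app_is_up() with no arguments checks the default address shown above; a False result usually means the app has not finished starting or is running on a different port.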

🖥️ App Overview

🎙️ Collect Data

Read sentences directly in your browser
Record your voice
Save audio and text together automatically
Build a clean and organized dataset
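
To make "save audio and text together" concrete, here is a rough sketch of what storing one recording could look like. Everything in it is an assumption for illustration: the function name save_recording, the side-by-side .wav/.txt layout, and the 16 kHz mono 16-bit format (chosen because Whisper models take 16 kHz input). The app's real storage format may differ.

```python
import wave
from pathlib import Path


def save_recording(folder, stem, pcm_bytes, transcript, sample_rate=16000):
    """Write <stem>.wav and <stem>.txt next to each other.

    Hypothetical layout: the real WhisperForge format is not documented here.
    `pcm_bytes` is raw 16-bit little-endian mono PCM audio.
    """
    folder = Path(folder)
    folder.mkdir(parents=True, exist_ok=True)

    audio_path = folder / f"{stem}.wav"
    with wave.open(str(audio_path), "wb") as wav:
        wav.setnchannels(1)            # mono
        wav.setsampwidth(2)            # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)  # assumed 16 kHz
        wav.writeframes(pcm_bytes)

    text_path = folder / f"{stem}.txt"
    text_path.write_text(transcript.strip() + "\n", encoding="utf-8")
    return audio_path, text_path
```

Sharing one file stem between the audio and its transcript keeps each pair easy to match up later without a separate index file.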

🧠 Train

Configure your training options
Launch the training process
Fine-tune a model using your recorded data

📂 Your Data

All recordings and transcripts are stored inside the userdata folder.

This folder holds:

Your sentences
Your audio files
Your dataset

Everything is kept separate from the main project files.
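
Before training, you may want to check how many recordings actually have a transcript. The sketch below walks the userdata folder and pairs files by name. It assumes, purely for illustration, that each audio file is a .wav with a same-named .txt transcript beside it; the function name pair_recordings is hypothetical and the real layout inside userdata may differ.

```python
from pathlib import Path


def pair_recordings(root="userdata"):
    """Pair each .wav under `root` with a same-stem .txt transcript.

    Assumed layout (not confirmed by the project): clip.wav + clip.txt.
    Returns (pairs, missing), where pairs is a list of (audio_path, text)
    and missing lists audio files that have no transcript.
    """
    root = Path(root)
    pairs, missing = [], []
    for audio_path in sorted(root.rglob("*.wav")):
        text_path = audio_path.with_suffix(".txt")
        if text_path.exists():
            text = text_path.read_text(encoding="utf-8").strip()
            pairs.append((audio_path, text))
        else:
            missing.append(audio_path)
    return pairs, missing
```

A quick look at len(pairs) and missing tells you whether the dataset is complete enough to start a training run.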

🎯 Who Is This For?

WhisperForge is ideal for:

Creating personalized speech models
Voice assistant projects
Research experiments
Collecting speech data quickly
Exploring custom speech recognition systems

📜 License

Released under the MIT License. See the LICENCE file in the repository.
