Skip to content

mahimairaja/awesome-csm-1b

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🎙️ Awesome CSM-1b Applications

Python 3.10+ License FastAPI Streamlit Modal CSM-1b

A collection of powerful applications built with the Sesame CSM-1b text-to-speech model. Generate natural-sounding speech with realistic qualities and voice cloning capabilities.

📚 Available Applications

Application Description Key Features Status
Personal Voice Diary Convert diary entries into natural-sounding speech Voice cloning, entry management, playback 🔜 Planned
Audiobook Creator Create audiobooks from any text Text chunking, multiple voices, background processing Completed
Voice Message Creator Generate sharable voice messages Custom voices, QR codes, expiring messages 🔜 Planned
Story Narrator for Children Narrate children's stories with character voices Character voices, sound effects 🔜 Planned
Emotion-based Voice Generator Generate speech with different emotions Multiple emotion presets, intensity control 🔜 Planned
Voice Style Transfer Transfer voice to different speaking styles Style presets, voice preservation 🔜 Planned
Voice-based Social Media Post Creator Create audio for social media Background music, platform templates 🔜 Planned
Multilingual Accent Tool Generate speech with different accents Multiple accent options, pronunciation tools 🔜 Planned

✨ Features

  • Natural Voice Generation: Create realistic speech with the power of CSM-1b
  • Voice Cloning: Clone any voice from a short audio sample
  • Independent Applications: Each app is fully self-contained and ready to use
  • Modern Architecture: Built with FastAPI backends and Streamlit UIs
  • Cloud Deployment: Configured for easy deployment with Modal
  • High Performance: Optimized for both CPU and GPU environments

🚀 Getting Started

Getting started with any application is straightforward:

  1. Clone the repository

    git clone https://github.com/mahimairaja/awesome-csm-1b.git
    cd awesome-csm-1b
    
  2. Choose an application

    cd src/<app-name>
    
  3. Install dependencies

    pip install -r requirements.txt
    
  4. Set up your Hugging Face token in a .env file

    HF_TOKEN=your_hugging_face_token
    
  5. Start the backend

    python app.py
    
  6. In a new terminal, start the UI

    streamlit run ui.py
    

🛠️ Technologies Used

  • FastAPI: Backend API framework
  • Streamlit: User interface
  • PyTorch & Torchaudio: Audio processing
  • Hugging Face: Model access and management
  • Modal: Cloud deployment

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

📋 Requirements

  • Python 3.10 or higher
  • Hugging Face account with access to CSM-1b model
  • Hugging Face API token
  • CUDA-compatible GPU recommended for optimal performance

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.


Built with ❤️ by Mahimai Raja

GitAds Sponsored

Sponsored by GitAds

About

List of curated use cases built using Sesame's CSM 1B

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages