GKaszewski/StableAudioWebUI

Fork 0

Go to file

drbaph 2b71459752 Update gradio_app.py

2024-06-06 10:43:16 +01:00

assets

Add files via upload

2024-06-06 10:29:41 +01:00

gradio_app.py

Update gradio_app.py

2024-06-06 10:43:16 +01:00

LICENSE

Initial commit

2024-06-06 05:31:43 +01:00

README.md

Update README.md

2024-06-06 10:42:24 +01:00

requirements1.txt

Add files via upload

2024-06-06 06:46:46 +01:00

requirements.txt

Add files via upload

2024-06-06 06:46:46 +01:00

README.md

💀🔊 StableAudioWebUI 💀🔊

A Lightweight Gradio Web interface for running Stable Audio Open 1.0

⚠ Disclaimer

I am not responsible for any content generated using this repository. By using this repository, you acknowledge that you are bound by the Stability AI license agreement and will only use this model for research or personal purposes. No commercial usage is allowed!

🚀Updates

Added choice for all Sampler types

( dpmpp-3m-sde, dpmpp-2m-sde, k-heun, k-lms, k-dpmpp-2s-ancestral, k-dpm-2, k-dpm-fast )

Added link to the Repo

[06/06/2024]

Recommended Settings

Prompt: Any
Sampler: dpmpp-3m-sde
CFG: 7
Sigma_Min: 0.3
Sigma_Max: 500
Duration: Max 47s
Seed: Any

Saves Files in the following directory Output/YYYY-MM-DD/

using the following schema 'prompt.mp3'

Start by cloning the repo:

git clone https://github.com/Saganaki22/StableAudioWebUI.git

Use the below deployment (tested on 24GB Nvidia VRAM):

cd StableAudioWebUI
python -m venv myenv python=3.10
myenv\Scripts\activate
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt

(Note if you have an older Nvidia GPU you may need to use CUDA 11.8)

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

If you haven't got a hugging face account or have not used huggingface-cli before, create an account and then authenticate your Hugging face account with a token (create token at https://huggingface.co/settings/tokens)

huggingface-cli login

(paste your token and follow the instructions, token will not be displayed when pasted)

If you want to run it using CPU

skip 'pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121' and just run

pip install -r requirements.txt
pip install -r requirements1.txt

Run

python gradio_app.py

Bonus

If you are using Windows and followed my setup instructions you could create a batch script to activate the enviroment and run the script all in one, what you need to do is:

Create a new text file in the same folder as gradio_app.py & paste this in the text file

@echo off
title StableAudioWebUI
call myenv\Scripts\activate
python gradio_app.py
pause

then save the file as run.bat

Screenshots

(All with random seeds)

Prompt: a dog barking
CFG: 7
Sigma_Min: 0.3
Sigma_Max: 500

Prompt: people clapping
CFG: 7
Sigma_Min: 0.3
Sigma_Max: 500

Prompt: didgeridoo
CFG: 7
Sigma_Min: 0.3
Sigma_Max: 500

Model Details

Model type: Stable Audio Open 1.0 is a latent diffusion model based on a transformer architecture.
Language(s): English
License: See the LICENSE file.
Commercial License: to use this model commercially, please refer to https://stability.ai/membership

README.md

💀🔊 StableAudioWebUI 💀🔊

A Lightweight Gradio Web interface for running Stable Audio Open 1.0

⚠ Disclaimer

I am not responsible for any content generated using this repository. By using this repository, you acknowledge that you are bound by the Stability AI license agreement and will only use this model for research or personal purposes. No commercial usage is allowed!

🚀Updates

Recommended Settings

Saves Files in the following directory Output/YYYY-MM-DD/

using the following schema 'prompt.mp3'

Start by cloning the repo:

Use the below deployment (tested on 24GB Nvidia VRAM):

(Note if you have an older Nvidia GPU you may need to use CUDA 11.8)

If you want to run it using CPU

Run

Bonus

Screenshots

Model Details

Huggingface | Stable Audio Tools | Stability AI