Models to generate and modify audio
Stable diffusion for real-time music generation
Generate music from a prompt or melody
Text-Prompted Generative Audio Model
Audio-domain Music Generation using the FreeSound Loop Dataset
Text-to-audio generation with latent diffusion models. AudioLDM generates text-conditional sound effects, human speech, and music. It enables zero-shot text-guided audio style-transfer, inpainting, and super-resolution.
Bach chorale generation and harmonization
Dance Diffusion is a suite of generative audio tools for producers and musicians to be released by Harmonai.
The model generates monophonic melody and chords jointly, by filling in missing pieces.