Description
https://openslr.org/14/ > BEEP Dictionary: phonemic transcriptions of over 250,000 English words
https://openslr.org/21/ > Spanish word list
https://openslr.org/34/ > Santiago Spanish Lexicon
https://openslr.org/55/ > Chinese word list
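
A pronunciation dictionary such as the ones linked above is typically a plain text file with one entry per line. The following is a minimal sketch of loading one, assuming a `WORD phone phone ...` layout with `#` comment lines; the exact layout of each download should be checked first. `parse_lexicon` and the `SAMPLE` entries are hypothetical and are not copied from any of the dictionaries.

```python
from typing import Dict, Iterable, List


def parse_lexicon(lines: Iterable[str]) -> Dict[str, List[List[str]]]:
    """Map each lowercased word to its pronunciations (lists of phonemes)."""
    lexicon: Dict[str, List[List[str]]] = {}
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and comment lines (assumed to start with '#').
        if not line or line.startswith("#"):
            continue
        word, *phones = line.split()
        if not phones:
            # A bare word with no transcription carries no pronunciation.
            continue
        # A word may appear several times with different pronunciations.
        lexicon.setdefault(word.lower(), []).append(phones)
    return lexicon


# Illustrative entries only; the phone symbols are made up for the example.
SAMPLE = [
    "# illustrative entries",
    "CAT k ae t",
    "READ r iy d",
    "READ r eh d",
]

lexicon = parse_lexicon(SAMPLE)
```

For a real file, the same function could be fed an open file handle, for example `parse_lexicon(open(path, encoding="utf-8"))`, once the encoding and layout of that file are confirmed.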
It is recommended to use Stable Diffusion to generate the images. This is a challenge.
# Stable Diffusion
https://github.com/CompVis/stable-diffusion -- Stable Diffusion
https://github.com/AUTOMATIC1111/stable-diffusion-webui -- Stable Diffusion web UI, the most popular web UI
https://github.com/Stability-AI/generative-models -- Generative Models by Stability AI
https://github.com/Stability-AI/stablediffusion -- High-Resolution Image Synthesis with Latent Diffusion Models
https://gpt4all.io/index.html -- Self-hostable GPT models
https://github.com/diff-usion/Awesome-Diffusion-Models
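
One possible way to script image generation is through the HTTP API of stable-diffusion-webui. The sketch below assumes a local instance started with the `--api` flag and listening on port 7860, and assumes the `txt2img` endpoint returns base64-encoded images in an `images` field. The helper names, the prompt, and the default values are hypothetical, and it uses only the Python standard library.

```python
import base64
import json
import urllib.request
from typing import List

# Assumption: a local stable-diffusion-webui started with the --api flag.
API_URL = "http://127.0.0.1:7860/sdapi/v1/txt2img"


def build_request(prompt: str, steps: int = 20, size: int = 512) -> urllib.request.Request:
    """Build (but do not send) a txt2img POST request for one prompt."""
    payload = {"prompt": prompt, "steps": steps, "width": size, "height": size}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def decode_images(response_body: bytes) -> List[bytes]:
    """Return raw image bytes from a JSON response with base64 'images'."""
    images = json.loads(response_body).get("images", [])
    return [base64.b64decode(item) for item in images]


def generate(prompt: str) -> List[bytes]:
    """Send the request to the local server and return the decoded images."""
    with urllib.request.urlopen(build_request(prompt)) as response:
        return decode_images(response.read())
```

Calling `generate("a red apple on a white table")` would then return a list of image byte strings that can be written to `.png` files, provided the server is running.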
For sound, the operating system's built-in TTS can be used, or sound files can be generated with one of the following:
https://github.com/suno-ai/bark
https://github.com/coqui-ai/TTS
https://github.com/k2-fsa/sherpa-onnx
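
For the built-in TTS route, a small wrapper can pick a command per platform. This is a sketch under stated assumptions: macOS provides `say`, Linux has `espeak` installed (it is not guaranteed to be present), and Windows has PowerShell with the .NET `System.Speech` assembly. `tts_command` is a hypothetical helper that only builds the command and does not run it.

```python
import platform
from typing import List, Optional


def tts_command(text: str, out_path: str, system: Optional[str] = None) -> List[str]:
    """Build (but do not run) an argv list that speaks `text` into `out_path`."""
    system = system or platform.system()
    if system == "Darwin":
        # macOS ships `say`; -o writes the audio to a file.
        return ["say", "-o", out_path, text]
    if system == "Linux":
        # Assumption: espeak is installed; -w writes a WAV file.
        return ["espeak", "-w", out_path, text]
    if system == "Windows":
        # Assumption: PowerShell with the System.Speech assembly available.
        def quote(value: str) -> str:
            # PowerShell single-quoted strings escape ' by doubling it.
            return "'" + value.replace("'", "''") + "'"

        script = (
            "Add-Type -AssemblyName System.Speech; "
            "$s = New-Object System.Speech.Synthesis.SpeechSynthesizer; "
            f"$s.SetOutputToWaveFile({quote(out_path)}); "
            f"$s.Speak({quote(text)}); $s.Dispose()"
        )
        return ["powershell", "-NoProfile", "-Command", script]
    raise ValueError(f"no built-in TTS command known for {system!r}")
```

The returned list can be passed to `subprocess.run(tts_command("hola", "hola.wav"), check=True)`. Passing a list rather than a shell string avoids shell quoting problems with the spoken text.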