Nova Sonic Sample

This project demonstrates how to use Amazon Bedrock's Nova Sonic model, a speech and voice foundation model developed by Amazon. Nova Sonic processes and generates human-like speech in real time through bidirectional streaming.

About Nova Sonic

Nova Sonic is Amazon's speech foundation model for voice experiences. As announced by Amazon, it is designed to enable more natural and engaging voice interactions. The model excels at:

Real-time speech processing and generation
Natural-sounding voice responses
Bidirectional streaming capabilities
High-quality voice synthesis

This implementation is based on the AWS Nova bidirectional streaming documentation, showcasing how to create interactive voice applications using Nova Sonic's streaming capabilities.

sequenceDiagram
autonumber
participant User
participant Mic/Speakers
participant AudioProcessor
participant AmazonBedrock


note right of User: Speak
User->>Mic/Speakers: Hello tell me who you are <br/> and who created you 
Mic/Speakers->>AudioProcessor: Capture audio chunk
AudioProcessor->>AmazonBedrock: Send audio 
Note over AmazonBedrock: Amazon Nova Sonic
AmazonBedrock->>AudioProcessor: Process response
AudioProcessor->>Mic/Speakers: Generate audio
note right of User: Playback
Mic/Speakers->>User: Hi there! I'm an AI system  <br/> built by a team of inventors at Amazon.
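
The flow in the diagram can be sketched as three concurrent tasks connected by queues. This is a minimal illustration, not the code from this sample: `fake_model` is a hypothetical stand-in that simply echoes audio back instead of calling Amazon Bedrock, and `CHUNK_BYTES`, `capture`, `playback`, and `run_pipeline` are names invented for this example.

```python
import asyncio

CHUNK_BYTES = 1024  # hypothetical chunk size, not taken from the sample


def split_into_chunks(pcm: bytes, size: int = CHUNK_BYTES) -> list[bytes]:
    """Split raw audio bytes into fixed-size chunks (the last may be shorter)."""
    return [pcm[i:i + size] for i in range(0, len(pcm), size)]


async def capture(pcm: bytes, outgoing: asyncio.Queue) -> None:
    """Stand-in for the microphone: push chunks, then None as end-of-stream."""
    for chunk in split_into_chunks(pcm):
        await outgoing.put(chunk)
    await outgoing.put(None)


async def fake_model(outgoing: asyncio.Queue, incoming: asyncio.Queue) -> None:
    """Stand-in for the model: echoes each chunk back as soon as it arrives."""
    while (chunk := await outgoing.get()) is not None:
        await incoming.put(chunk)
    await incoming.put(None)


async def playback(incoming: asyncio.Queue) -> bytes:
    """Stand-in for the speakers: collect response chunks in arrival order."""
    played = bytearray()
    while (chunk := await incoming.get()) is not None:
        played.extend(chunk)
    return bytes(played)


async def run_pipeline(pcm: bytes) -> bytes:
    """Run capture, model, and playback concurrently, as in the diagram."""
    outgoing: asyncio.Queue = asyncio.Queue()
    incoming: asyncio.Queue = asyncio.Queue()
    results = await asyncio.gather(
        capture(pcm, outgoing),
        fake_model(outgoing, incoming),
        playback(incoming),
    )
    return results[2]  # bytes collected by playback()
```

Because all three tasks run at once, playback can begin before capture has finished, which is the point of a bidirectional stream. Calling `asyncio.run(run_pipeline(some_bytes))` returns the bytes that reached the playback stage.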


Prerequisites

Python 3.11+
AWS credentials configured
Required Python packages (see requirements.txt)
PortAudio (needed by PyAudio; install it only if it is not already present)
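
A quick way to sanity-check the first two prerequisites is sketched below. It is a heuristic, not part of this sample: it looks only at the standard AWS environment variables and the default `~/.aws/credentials` file, so credentials from other sources (for example SSO or an instance role) will not be detected. The helper names `python_ok` and `credential_hints` are made up for this example.

```python
import os
import sys
from pathlib import Path

MIN_PYTHON = (3, 11)  # minimum version listed in the prerequisites


def python_ok(version: tuple = sys.version_info[:2]) -> bool:
    """Return True if the given (major, minor) version meets the minimum."""
    return tuple(version) >= MIN_PYTHON


def credential_hints(env=None, home=None) -> list[str]:
    """List the common places where AWS credentials appear to be configured.

    Heuristic only: an empty list does not prove credentials are missing.
    """
    env = os.environ if env is None else env
    home = Path.home() if home is None else Path(home)
    hints = []
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        hints.append("environment variables")
    if env.get("AWS_PROFILE"):
        hints.append("AWS_PROFILE")
    if (home / ".aws" / "credentials").is_file():
        hints.append("shared credentials file")
    return hints


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    print("Credential sources found:", credential_hints() or "none detected")
```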

Supported Voices

Currently, Nova Sonic supports English voices with both American and British accents. For a complete and up-to-date list of available voices, please refer to the AWS Nova Available Voices documentation.

Languages:

English (including American and British accents)
Additional languages coming soon

Installation

Create a Python virtual environment and activate it:

python3 -m venv .venv
source .venv/bin/activate  # On Windows use: .venv\Scripts\activate.bat

Install PortAudio if it is not already installed (required for PyAudio). On macOS with Homebrew:
```
brew install portaudio
```
Install Python dependencies:
```
pip install -r requirements.txt
```

Usage

Run the following command to see the Nova Sonic model in action:

python3 demo.nova.sonic.py

You can select a specific voice by using the --voice-id parameter:

python3 demo.nova.sonic.py --voice-id amy|tiffany|matthew

The demo will start capturing audio from your microphone and processing it through the Nova Sonic model using the selected voice (defaults to 'matthew' if not specified). Press Enter to stop the demo.
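
For readers curious how a flag like `--voice-id` with a default can be handled, here is one possible sketch using the standard library's `argparse`. How `demo.nova.sonic.py` actually parses its arguments is not shown in this README, so treat `build_parser` and `parse_voice` as hypothetical names and the whole block as an assumption.

```python
import argparse

# Voice IDs named in the usage text above; the full list may be longer.
VOICES = ("amy", "tiffany", "matthew")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that accepts --voice-id and defaults to 'matthew'."""
    parser = argparse.ArgumentParser(description="Nova Sonic demo (sketch)")
    parser.add_argument(
        "--voice-id",
        choices=VOICES,
        default="matthew",
        help="voice used for the spoken response",
    )
    return parser


def parse_voice(argv: list[str]) -> str:
    """Return the selected voice ID for the given argument list."""
    return build_parser().parse_args(argv).voice_id
```

With this sketch, `parse_voice([])` falls back to the default voice, and an unknown voice ID makes `argparse` exit with a usage error rather than starting the demo.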

Live Demo

demo.mp4

Demo direct link
