A server that integrates with the ElevenLabs text-to-speech API and can generate full voiceovers that use multiple voices.
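As a rough illustration of what a multi-voice voiceover involves, the sketch below splits a script into per-speaker segments and prepares one text-to-speech request per segment. It is not this server's actual implementation. The `Speaker: text` script format, the `Segment` class, the `parse_script` and `build_requests` helpers, and the placeholder voice IDs are all hypothetical. The endpoint path, the `xi-api-key` header, and the `model_id` value follow the public ElevenLabs API as best understood here and should be checked against the current documentation. Nothing is sent over the network.

```python
from dataclasses import dataclass

# Assumed base URL of the ElevenLabs text-to-speech endpoint (verify before use).
API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


@dataclass
class Segment:
    """One line of the script, spoken by a single voice (hypothetical type)."""
    speaker: str
    text: str


def parse_script(script: str) -> list[Segment]:
    """Split 'Speaker: text' lines into segments; blank lines are skipped."""
    segments = []
    for raw in script.splitlines():
        line = raw.strip()
        if not line:
            continue
        # Split on the first colon only, so colons inside the text survive.
        speaker, sep, text = line.partition(":")
        if not sep or not text.strip():
            raise ValueError(f"expected 'Speaker: text', got {line!r}")
        segments.append(Segment(speaker.strip(), text.strip()))
    return segments


def build_requests(segments, voice_ids, api_key,
                   model_id="eleven_multilingual_v2"):
    """Describe one POST request per segment, in script order, without sending it."""
    prepared = []
    for seg in segments:
        voice_id = voice_ids[seg.speaker]  # raises KeyError for an unmapped speaker
        prepared.append({
            "url": f"{API_BASE}/{voice_id}",
            "headers": {"xi-api-key": api_key,
                        "Content-Type": "application/json"},
            "json": {"text": seg.text, "model_id": model_id},
        })
    return prepared


if __name__ == "__main__":
    script = "Alice: Welcome to the show.\nBob: Glad to be here."
    voices = {"Alice": "voice-a", "Bob": "voice-b"}  # placeholder voice IDs
    for req in build_requests(parse_script(script), voices, api_key="YOUR_KEY"):
        print(req["url"], req["json"]["text"])
```

Under these assumptions, sending each prepared request in order with an HTTP client and joining the returned audio would produce the combined voiceover. Keeping request construction separate from the network call makes the speaker-to-voice mapping easy to test on its own.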