Actions4
- Speech Actions
- Sound Actions
- Voice Actions
- Dubbing Actions
Overview
This node integrates with the CambAI API to convert text into speech audio. It allows users to specify the voice, language, and various options to customize the speech synthesis output. This is useful for automating voice generation in applications such as creating audio content, voice assistants, accessibility tools, or any scenario where converting written text to spoken audio is needed.
For example, you could use this node to:
- Generate podcast intros from text scripts.
- Create audio notifications or alerts dynamically.
- Produce voiceovers for videos or presentations.
- Enable text-to-speech features in chatbots or customer support systems.
Properties
Name | Meaning |
---|---|
Voice | Select the voice to use for the conversion. Can be chosen from a searchable list or by ID. |
Text | The text string that will be converted into speech. |
Source Language | Select the source language for the speech synthesis. Can be chosen from a searchable list or by ID. |
Additional Options | Collection of optional parameters to further customize the speech output: |
- Age | Preferred age of the voice (number between 18 and 80). |
- Gender | Voice gender preference: Male or Female. |
- Output Type | How to retrieve the final audio: either raw audio bytes in FLAC format or a downloadable file URL. |
- Polling Interval (Seconds) | Time interval between status checks when waiting for TTS completion (1 to 10 seconds). |
- Polling Timeout (Seconds) | Maximum time to wait for TTS process to complete (30 to 600 seconds). |
- Project Description | Optional description of the project using the TTS service. |
- Project Name | Optional project name for organizational purposes. |
Output
The node outputs JSON data containing the result of the text-to-speech operation. Depending on the selected "Output Type":
- Raw Bytes (FLAC): The output contains the direct audio file stream encoded in FLAC format, suitable for immediate playback or saving as an audio file.
- File URL: The output provides a downloadable URL pointing to the generated audio file, which can be accessed or downloaded later.
The exact structure of the JSON output includes fields related to the audio content or the URL, along with metadata about the request status.
Dependencies
- Requires an active CambAI API key credential configured in n8n to authenticate requests.
- Network access to
https://client.camb.ai/apis
endpoint. - No additional environment variables are explicitly required beyond the API key.
Troubleshooting
Common Issues:
- Invalid or missing API key: Ensure the API key credential is correctly set up and has necessary permissions.
- Timeout errors: If the TTS process takes longer than the specified polling timeout, increase the timeout value.
- Unsupported voice or language IDs: Verify that the selected voice and language exist and are supported by the CambAI service.
- Network connectivity problems: Confirm that your n8n instance can reach the CambAI API endpoint.
Error Messages:
- Authentication failures typically indicate issues with the API key.
- "Voice not found" or "Language not supported" errors suggest invalid selection values.
- Polling timeout errors mean the synthesis took too long; consider adjusting polling intervals or timeout settings.