ElevenLabs icon

ElevenLabs

WIP

Overview

The "Create Sound Effects" operation in the Speech resource of this node generates custom sound effects based on a textual description provided by the user. This node leverages an audio generation API to synthesize sound effects that match the user's prompt, allowing for creative and dynamic audio content creation.

Common scenarios where this node is beneficial include:

  • Creating unique sound effects for video productions, such as trailers or games.
  • Generating background sounds or audio cues for multimedia projects.
  • Rapid prototyping of audio assets without needing manual sound design.

For example, a user might input a description like "Spacious braam suitable for high-impact movie trailer moments" to generate a dramatic, cinematic sound effect automatically.

Properties

Name Meaning
Sound Effect Description A required text field where you describe the sound effect you want to generate. Example: "Spacious braam suitable for high-impact movie trailer moments".
Duration (Seconds) Optional number specifying the length of the generated sound effect in seconds. Valid range is 5 to 10 seconds. If not set, duration is chosen automatically.
Prompt Influence Number between 0 and 1 controlling how closely the generated sound follows the prompt. Higher values mean less variation but more faithful adherence to the description. Default is 0.7.
Binary Name Optional string to change the name of the binary output data. Defaults to "data".
File Name Optional string to specify the output file name. Defaults to "voice".
Output Format Option to select the audio format of the generated sound effect. Choices include various MP3 and PCM formats, e.g., MP3 at 44.1kHz with 128kbps or PCM 16-bit at 22.05kHz. Default is MP3 44.1kHz 128kbps.
Enable Logging Boolean to enable or disable logging. When false, zero retention mode is activated, disabling history features.

Note: Some properties listed are shared across other operations but only those relevant to "Create Sound Effects" are described here.

Output

The node outputs the generated sound effect as binary audio data. The binary data corresponds to the audio file encoded in the selected output format (e.g., MP3 or PCM). The binary property name can be customized via the "Binary Name" property, defaulting to "data".

The JSON output contains metadata about the request and response, including any identifiers or status information returned by the API, but the primary useful output is the binary audio data representing the generated sound effect.

Dependencies

  • Requires an active API key credential for the ElevenLabs API service.
  • The node sends requests to the ElevenLabs API endpoint /sound-generation.
  • Proper configuration of the API key credential within n8n is necessary for authentication.
  • Internet connectivity is required to reach the external ElevenLabs API.

Troubleshooting

  • Common Issues:

    • Invalid or missing API key credential will cause authentication failures.
    • Providing a sound effect description that is too vague or unsupported may result in unsatisfactory or empty audio output.
    • Specifying a duration outside the allowed range (5-10 seconds) may cause errors or unexpected behavior.
    • Selecting an unsupported output format could lead to failed requests.
  • Error Messages:

    • Authentication errors typically indicate issues with the API key; verify the credential setup.
    • Validation errors may occur if required fields like the sound effect description are missing.
    • API rate limits or quota exceeded errors require checking your ElevenLabs account usage.
  • Resolutions:

    • Ensure the API key credential is correctly configured and valid.
    • Provide clear and descriptive prompts for sound effect generation.
    • Use valid numeric ranges for duration and select supported output formats.
    • Monitor API usage and upgrade plans if hitting rate limits.

Links and References

Discussion