Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.comfy.org/llms.txt

Use this file to discover all available pages before exploring further.

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Transforms part of an existing audio sample using text instructions. This node allows you to modify specific sections of audio by providing descriptive prompts, effectively “inpainting” or regenerating selected portions while preserving the rest of the audio.

Inputs

ParameterData TypeRequiredRangeDescription
modelSTRINGYes"stable-audio-2.5"The AI model to use for audio inpainting.
promptSTRINGYesText description guiding how the audio should be transformed (default: empty). Maximum length is 10,000 characters.
audioAUDIOYesInput audio file to transform. Audio must be between 6 and 190 seconds long.
durationINTNo1 to 190Controls the duration in seconds of the generated audio (default: 190).
seedINTNo0 to 4294967294The random seed used for generation (default: 0).
stepsINTNo4 to 8Controls the number of sampling steps (default: 8).
mask_startINTNo0 to 190Starting position in seconds for the audio section to transform (default: 30).
mask_endINTNo0 to 190Ending position in seconds for the audio section to transform (default: 190).
Note: The mask_end value must be greater than the mask_start value. The input audio must be between 6 and 190 seconds in duration.

Outputs

Output NameData TypeDescription
audioAUDIOThe transformed audio output with the specified section modified according to the prompt.

Source fingerprint (SHA-256): c00d84db73dfcd708495d7a04e21a2378880ca6ceb906473a45dcc1dae20bf79