minimax/music-01
Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
| bitrate | integer | Bitrate for the generated music | 256000 | 32000, 64000, 128000, 256000 |
| instrumental_file | string (uri) | Instrumental reference. Must be a .wav or .mp3 file longer than 15 seconds. If only an instrumental reference is given, a track without vocals will be generated. | — | — |
| instrumental_id | string | Reuse a previously uploaded instrumental ID | — | — |
| lyrics | string | Lyrics with optional formatting. You can use a newline to separate each line of lyrics. You can use two newlines to add a pause between lines. You can use double hash marks (##) at the beginning and end of the lyrics to add accompaniment. Maximum 350 to 400 characters. | "" | — |
| sample_rate | integer | Sample rate for the generated music | 44100 | 16000, 24000, 32000, 44100 |
| song_file | string (uri) | Reference song, should contain music and vocals. Must be a .wav or .mp3 file longer than 15 seconds. | — | — |
| voice_file | string (uri) | Voice reference. Must be a .wav or .mp3 file longer than 15 seconds. If only a voice reference is given, an a cappella vocal hum will be generated. | — | — |
| voice_id | string | Reuse a previously uploaded voice ID | — | — |
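
The parameters above can be assembled and checked locally before a run is submitted. The following is a minimal sketch, not the model's official client code: the helper names `format_lyrics` and `build_input` are hypothetical, the 400-character cap assumes the upper end of the documented "350 to 400" range, and the exact placement of the `##` markers (directly against the first and last characters) is an assumption based on the lyrics description.

```python
from typing import Dict, List, Optional, Union

# Allowed values copied from the Constraints column of the table above.
ALLOWED_BITRATES = {32000, 64000, 128000, 256000}
ALLOWED_SAMPLE_RATES = {16000, 24000, 32000, 44100}
# Assumption: treat the upper end of "350 to 400 characters" as the hard cap.
MAX_LYRICS_CHARS = 400


def format_lyrics(stanzas: List[List[str]], accompaniment: bool = False) -> str:
    """Hypothetical helper: one newline between lines, two newlines
    (a pause) between stanzas, optionally wrapped in ## markers."""
    text = "\n\n".join("\n".join(lines) for lines in stanzas)
    if accompaniment:
        # Assumption: markers sit directly at the start and end of the text.
        text = f"##{text}##"
    return text


def build_input(
    lyrics: str = "",
    bitrate: int = 256000,
    sample_rate: int = 44100,
    song_file: Optional[str] = None,
    voice_file: Optional[str] = None,
    instrumental_file: Optional[str] = None,
    voice_id: Optional[str] = None,
    instrumental_id: Optional[str] = None,
) -> Dict[str, Union[str, int]]:
    """Hypothetical helper: validate values against the documented
    constraints and return an input dictionary keyed by parameter name."""
    if bitrate not in ALLOWED_BITRATES:
        raise ValueError(f"bitrate must be one of {sorted(ALLOWED_BITRATES)}")
    if sample_rate not in ALLOWED_SAMPLE_RATES:
        raise ValueError(f"sample_rate must be one of {sorted(ALLOWED_SAMPLE_RATES)}")
    if len(lyrics) > MAX_LYRICS_CHARS:
        raise ValueError(f"lyrics exceed {MAX_LYRICS_CHARS} characters")

    payload: Dict[str, Union[str, int]] = {
        "lyrics": lyrics,
        "bitrate": bitrate,
        "sample_rate": sample_rate,
    }
    optional = {
        "song_file": song_file,
        "voice_file": voice_file,
        "instrumental_file": instrumental_file,
        "voice_id": voice_id,
        "instrumental_id": instrumental_id,
    }
    # Only send the references that were actually provided.
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


if __name__ == "__main__":
    lyrics = format_lyrics([["First line", "Second line"], ["After a pause"]], accompaniment=True)
    # The URL is a placeholder, not a real reference track.
    print(build_input(lyrics=lyrics, song_file="https://example.com/reference.mp3"))
```

The resulting dictionary uses the parameter names from the table as keys, so it could be passed as the input of whichever client is used to run the model. Which references are supplied determines the output described in the table: an instrumental reference alone yields a track without vocals, and a voice reference alone yields an a cappella vocal hum.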
0254c7e2f543 Updated: 6/8/2026 513.6K runs
cinemasetfree.com