sakemin/musicgen-remixer
Remix the music into another styles with MusicGen Chord
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
beat_sync_threshold | number | When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than `beat_sync_threshold`, consider the beats are not corresponding. If `None` or `-1`, `1.1/(bpm/60)` will be used as the value. 0.75 is a good value to set. | — | — |
chroma_coefficient | number | Coefficient value multiplied to multi-hot chord chroma. | 1 | min: 0.5, max: 2 |
classifier_free_guidance | integer | Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs. | 3 | — |
large_chord_voca | boolean | If `True`, more chords like 7th, diminished and etc are used. If `False` only 12 major and 12 minor chords are used. | true | — |
model_version | string | Model type. Computations take longer when using `large` or `stereo` models. | "stereo-chord" | stereo-chord stereo-chord-large chord chord-large |
multi_band_diffusion | boolean | If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with `stereo` models. | false | — |
music_input | string (uri) | An audio file input for the remix. | — | — |
normalization_strategy | string | Strategy for normalizing audio. | "loudness" | loudness clip peak rms |
output_format | string | Output format for generated audio. | "wav" | wav mp3 |
prompt | string | A description of the music you want to generate. | — | — |
return_instrumental | boolean | If `True`, the instrumental audio will also be returned. | false | — |
seed | integer | Seed for random number generator. If `None` or `-1`, a random seed will be used. | — | — |
temperature | number | Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity. | 1 | — |
top_k | integer | Reduces sampling to the k most likely tokens. | 250 | — |
top_p | number | Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used. | 0 | — |
beat_sync_threshold number When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than `beat_sync_threshold`, consider the beats are not corresponding. If `None` or `-1`, `1.1/(bpm/60)` will be used as the value. 0.75 is a good value to set.
chroma_coefficient number Coefficient value multiplied to multi-hot chord chroma.
1 min: 0.5, max: 2 classifier_free_guidance integer Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.
3 large_chord_voca boolean If `True`, more chords like 7th, diminished and etc are used. If `False` only 12 major and 12 minor chords are used.
true model_version string Model type. Computations take longer when using `large` or `stereo` models.
"stereo-chord" multi_band_diffusion boolean If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with `stereo` models.
false music_input string An audio file input for the remix.
normalization_strategy string Strategy for normalizing audio.
"loudness" output_format string Output format for generated audio.
"wav" prompt string A description of the music you want to generate.
return_instrumental boolean If `True`, the instrumental audio will also be returned.
false seed integer Seed for random number generator. If `None` or `-1`, a random seed will be used.
temperature number Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.
1 top_k integer Reduces sampling to the k most likely tokens.
250 top_p number Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.
0 0b769f28e399 Updated: 2/26/2026 18.8K runs
cinemasetfree.com