sakemin/musicgen-remixer

Remix the music into another styles with MusicGen Chord

Capabilities

SeedTop-P

Cost

Community model (estimated from hardware time)

Input Parameters

Name	Type	Description	Default	Constraints
`beat_sync_threshold`	number	When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than `beat_sync_threshold`, consider the beats are not corresponding. If `None` or `-1`, `1.1/(bpm/60)` will be used as the value. 0.75 is a good value to set.	`—`	—
`chroma_coefficient`	number	Coefficient value multiplied to multi-hot chord chroma.	`1`	min: 0.5, max: 2
`classifier_free_guidance`	integer	Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.	`3`	—
`large_chord_voca`	boolean	If `True`, more chords like 7th, diminished and etc are used. If `False` only 12 major and 12 minor chords are used.	`true`	—
`model_version`	string	Model type. Computations take longer when using `large` or `stereo` models.	`"stereo-chord"`	stereo-chordstereo-chord-largechordchord-large
`multi_band_diffusion`	boolean	If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with `stereo` models.	`false`	—
`music_input`	string(uri)	An audio file input for the remix.	`—`	—
`normalization_strategy`	string	Strategy for normalizing audio.	`"loudness"`	loudnessclippeakrms
`output_format`	string	Output format for generated audio.	`"wav"`	wavmp3
`prompt`	string	A description of the music you want to generate.	`—`	—
`return_instrumental`	boolean	If `True`, the instrumental audio will also be returned.	`false`	—
`seed`	integer	Seed for random number generator. If `None` or `-1`, a random seed will be used.	`—`	—
`temperature`	number	Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.	`1`	—
`top_k`	integer	Reduces sampling to the k most likely tokens.	`250`	—
`top_p`	number	Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.	`0`	—

beat_sync_thresholdnumber

When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than `beat_sync_threshold`, consider the beats are not corresponding. If `None` or `-1`, `1.1/(bpm/60)` will be used as the value. 0.75 is a good value to set.

chroma_coefficientnumber

Coefficient value multiplied to multi-hot chord chroma.

Default: 1min: 0.5, max: 2

classifier_free_guidanceinteger

Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.

Default: 3

large_chord_vocaboolean

If `True`, more chords like 7th, diminished and etc are used. If `False` only 12 major and 12 minor chords are used.

Default: true

model_versionstring

Model type. Computations take longer when using `large` or `stereo` models.

Default: "stereo-chord"

stereo-chordstereo-chord-largechordchord-large

multi_band_diffusionboolean

If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with `stereo` models.

Default: false

music_inputstring

An audio file input for the remix.

normalization_strategystring

Strategy for normalizing audio.

Default: "loudness"

loudnessclippeakrms

output_formatstring

Output format for generated audio.

Default: "wav"

wavmp3

promptstring

A description of the music you want to generate.

return_instrumentalboolean

If `True`, the instrumental audio will also be returned.

Default: false

seedinteger

Seed for random number generator. If `None` or `-1`, a random seed will be used.

temperaturenumber

Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.

Default: 1

top_kinteger

Reduces sampling to the k most likely tokens.

Default: 250

top_pnumber

Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.

Default: 0

Version: 0b769f28e399Updated: 7/25/202618.8K runs