meta/llama-2-7b
Base version of Llama 2 7B, a 7 billion parameter language model
Capabilities
Cost
Community model (estimated from hardware time)
Input Parameters
| Name | Type | Description | Default | Constraints |
|---|---|---|---|---|
prompt * | string | Prompt to send to the model. | — | — |
debug | boolean | provide debugging output in logs | false | — |
max_new_tokens | integer | Maximum number of tokens to generate. A word is generally 2-3 tokens | 128 | min: 1 |
min_new_tokens | integer | Minimum number of tokens to generate. To disable, set to -1. A word is generally 2-3 tokens. | -1 | min: -1 |
replicate_weights | string | Path to fine-tuned weights produced by a Replicate fine-tune job. | — | — |
seed | integer | Random seed. Leave blank to randomize the seed | — | — |
stop_sequences | string | A comma-separated list of sequences to stop generation at. For example, '<end>,<stop>' will stop generation at the first instance of 'end' or '<stop>'. | — | — |
temperature | number | Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value. | 0.75 | min: 0.01, max: 5 |
top_k | integer | When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens | 50 | min: 0 |
top_p | number | When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens | 0.9 | min: 0, max: 1 |
prompt required string Prompt to send to the model.
debug boolean provide debugging output in logs
false max_new_tokens integer Maximum number of tokens to generate. A word is generally 2-3 tokens
128 min: 1 min_new_tokens integer Minimum number of tokens to generate. To disable, set to -1. A word is generally 2-3 tokens.
-1 min: -1 replicate_weights string Path to fine-tuned weights produced by a Replicate fine-tune job.
seed integer Random seed. Leave blank to randomize the seed
stop_sequences string A comma-separated list of sequences to stop generation at. For example, '<end>,<stop>' will stop generation at the first instance of 'end' or '<stop>'.
temperature number Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
0.75 min: 0.01, max: 5 top_k integer When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens
50 min: 0 top_p number When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
0.9 min: 0, max: 1 527827021d87 Updated: 6/26/2026 659.8K runs
cinemasetfree.com