What is Temperature?

Temperature is a sampling hyperparameter in large language models that controls the randomness and creativity of generated output: lower values produce more deterministic, focused responses, while higher values increase diversity and creativity.

Quick Facts

  • Full Name: LLM Temperature Parameter
  • Created: Concept originated from statistical mechanics; applied to NLP in the 2010s
  • Specification: Official Specification

How It Works

Temperature is one of the most important parameters controlling LLM behavior. It works by dividing the logits (raw prediction scores) by the temperature before applying softmax to compute token probabilities. At temperature 0, the model falls back to greedy decoding and always selects the most probable token, while higher temperatures flatten the probability distribution, giving less likely tokens a better chance of being selected. Most APIs accept a range of 0 to 2, with defaults typically between 0.7 and 1.0.
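As a sketch, the scaling step can be written in a few lines of plain Python. The logits below are made-up values; real inference stacks do this on the accelerator, but the math is the same:

```python
import math

def softmax_with_temperature(logits, temperature):
    """Divide logits by temperature, then apply softmax."""
    if temperature <= 0:
        raise ValueError("use greedy decoding (argmax) for temperature 0")
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]
print(softmax_with_temperature(logits, 1.0))  # unmodified softmax
print(softmax_with_temperature(logits, 0.2))  # sharper: top token dominates
print(softmax_with_temperature(logits, 2.0))  # flatter: closer to uniform
```

Note that temperature 1 leaves the distribution unchanged; values below 1 sharpen it and values above 1 flatten it.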

Key Characteristics

  • Controls randomness in token selection during generation
  • Lower values (0-0.3) produce consistent, focused outputs
  • Higher values (0.8-1.5) increase creativity and diversity
  • Temperature 0 enables deterministic, reproducible outputs
  • Works in conjunction with top_p and top_k parameters
  • Different tasks require different optimal temperature settings

Common Use Cases

  1. Code generation (low temperature for accuracy)
  2. Creative writing (high temperature for diversity)
  3. Factual Q&A (low temperature for consistency)
  4. Brainstorming (high temperature for diverse ideas)
  5. Translation (low temperature for accuracy)

Example

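A minimal, self-contained sketch (the tokens and logits are hypothetical) of how the same logits yield different next-token distributions at different temperatures:

```python
import math

def token_probs(logits, temperature):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

tokens = ["the", "a", "an", "this"]  # hypothetical next-token candidates
logits = [4.0, 2.5, 1.0, 0.5]        # hypothetical raw scores

for t in (0.2, 0.7, 1.0, 1.5):
    probs = token_probs(logits, t)
    row = ", ".join(f"{tok}: {p:.2f}" for tok, p in zip(tokens, probs))
    print(f"T={t}: {row}")
```

At T=0.2 nearly all probability mass sits on the top candidate; at T=1.5 the tail tokens gain a meaningful share.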

Frequently Asked Questions

What is the difference between temperature 0 and temperature 1?

At temperature 0, the model uses greedy decoding, always selecting the highest probability token, producing deterministic and reproducible outputs ideal for tasks requiring consistency like code generation. At temperature 1, the model samples according to the original probability distribution, producing more diverse and creative outputs suitable for creative writing. Higher temperatures give lower probability tokens a greater chance of being selected.
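The difference can be sketched in plain Python with toy logits: greedy decoding at temperature 0 versus sampling from the temperature-scaled distribution.

```python
import math
import random

def sample_token(logits, temperature, rng):
    """Greedy argmax at temperature 0; otherwise sample from the scaled softmax."""
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(z - m) for z in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]

rng = random.Random(0)
logits = [3.0, 2.0, 1.0]
greedy = [sample_token(logits, 0, rng) for _ in range(5)]
sampled = [sample_token(logits, 1.0, rng) for _ in range(5)]
print(greedy)   # always index 0, the highest-logit token
print(sampled)  # samples weighted toward index 0, but not guaranteed
```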

How do temperature and top_p parameters relate? How should they be used together?

Both temperature and top_p control output randomness, but through different mechanisms. Temperature rescales the logits, changing the shape of the probability distribution, while top_p (nucleus sampling) restricts sampling to the smallest set of most likely tokens whose cumulative probability reaches p. It's generally recommended to adjust only one parameter at a time, as adjusting both can produce unpredictable effects; OpenAI suggests setting top_p to 1 if adjusting temperature, and vice versa.
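A rough sketch of the nucleus-filtering step (the logits are made up, and a real implementation would also renormalize and sample from the kept set):

```python
import math

def nucleus_filter(logits, top_p):
    """Keep the smallest set of tokens whose cumulative probability >= top_p."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    # Token indices sorted by probability, highest first.
    ranked = sorted(((e / total, i) for i, e in enumerate(exps)), reverse=True)
    kept, cum = [], 0.0
    for p, i in ranked:
        kept.append(i)
        cum += p
        if cum >= top_p:
            break
    return kept  # indices eligible for sampling

logits = [4.0, 2.0, 1.0, -1.0]
print(nucleus_filter(logits, 0.9))  # top token alone is ~0.84, so two survive
print(nucleus_filter(logits, 0.5))  # the top token alone already covers 0.5
```

Temperature reweights every token's probability; top_p instead cuts the tail off entirely, which is why combining aggressive settings of both is hard to reason about.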

What temperature values should be used for different tasks?

For tasks requiring accuracy like code generation, math, and factual Q&A, use low temperature (0-0.3). For general conversation and text summarization, use medium temperature (0.5-0.7). For creative writing, brainstorming, and poetry, use higher temperature (0.8-1.2). The optimal value should be adjusted based on actual results.
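One way to encode these starting points is a simple lookup table. The task names and values below are illustrative defaults drawn from the ranges above, not a standard:

```python
# Illustrative per-task starting temperatures; tune per model and use case.
TASK_TEMPERATURE = {
    "code_generation": 0.2,
    "factual_qa": 0.2,
    "translation": 0.3,
    "summarization": 0.5,
    "conversation": 0.7,
    "creative_writing": 1.0,
    "brainstorming": 1.2,
}

def temperature_for(task, default=0.7):
    """Look up a starting temperature for a task; fall back to a safe default."""
    return TASK_TEMPERATURE.get(task, default)

print(temperature_for("code_generation"))  # → 0.2
print(temperature_for("poetry"))           # unknown task → 0.7
```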

Why is the parameter called 'temperature'? What is the origin of this name?

The term temperature is borrowed from the Boltzmann distribution in statistical mechanics. In physics, temperature controls the distribution of particle energy states: at low temperatures, particles concentrate in low energy states; at high temperatures, the distribution is more uniform. Similarly, in language models, low temperature concentrates output on high-probability tokens while high temperature flattens the distribution. This mathematical similarity led to the same naming.
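Side by side, with the negated logits playing the role of energies, the two distributions have the same form:

```latex
p_i = \frac{e^{-E_i/(k_B T)}}{\sum_j e^{-E_j/(k_B T)}}
\qquad\text{vs.}\qquad
p_i = \frac{e^{z_i/T}}{\sum_j e^{z_j/T}}
```

where $E_i$ is the energy of state $i$, $k_B$ is the Boltzmann constant, and $z_i$ is the logit for token $i$; in both cases, raising $T$ flattens the distribution.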

What problems occur when temperature is set too high?

Excessively high temperature (e.g., above 1.5) causes output quality degradation: text may become incoherent, contain grammatical errors, produce meaningless content, or drift off-topic. This happens because high temperature gives low-probability (usually inappropriate) tokens more chances to be selected. In extreme cases, output may be completely random gibberish. Therefore, a balance between creativity and coherence must be found.
