1
Exploration

Llm output evaluation

Published Oct 22, 2025 Original by Simon Willison Shared by Prompt Ranker Source
Optimised for: Claude 4.0 Sonnet GPT-4
v1.0 Oct 22, 2025 · 20:10 by Prompt Ranker
Add version
Evaluate LLM output for [task]. Assess: Accuracy, Relevance, Coherence, Safety, Bias. Score each 1-5 with justification.
Version Notes
LLM output evaluation