Design reproducible test prompts (no guesswork) that measure source presence, citations, accuracy, and scope errors. Build a “PromptCard” protocol and a standard