Overview#

You already know what good looks like. The problem is applying that judgment consistently — especially when you’re deep in the work and can’t see it clearly anymore.

This playbook is about getting your standards out of your head and into a rubric that an AI can apply on your behalf. It might be five bullet points. It might be a detailed scoring table. What matters is that it’s specific enough to produce useful feedback when someone other than you evaluates against it.

These rubrics should be extremely specific to your situation. A rubric for a government white paper might incorporate research on how agencies evaluate proposals, or details about the specific reviewers who will read it. “Clear and concise” doesn’t tell you much. “Addresses the reviewer’s known concern about implementation timelines” does.

You drive the rubric. The AI applies it. You keep control of what matters; the AI handles the repetitive checking.

This pairs naturally with almost any content production workflow, which makes it one of the more useful playbooks in the collection.

When to Use#

  • You’re producing multiple pieces of content that need to meet the same standard.
  • You keep giving the same feedback on your own drafts or on AI output.
  • You want to evaluate AI-generated content but don’t have a clear framework for what “good enough” means here.
  • You’re iterating across drafts and want a concrete signal on whether things are improving.
  • You need to hand off quality standards to someone else — or to yourself six months from now.

The Play#

Start with your gut reaction#

Before building anything, pull up a piece of content you think is strong and one you think is weak. What separates them? Write down whatever comes to mind, unorganized.

The rubric should capture your judgment, not a generic standard. If you skip this and ask the AI to generate a rubric from scratch, you’ve outsourced the thinking.

Draft the rubric collaboratively#

Bring your rough notes to the AI and ask it to help organize them into a structured rubric. A useful rubric typically has:

Dimensions — the aspects you’re evaluating. Clarity, structure, tone, accuracy, completeness — whatever actually matters for this piece of work.

Levels — what each dimension looks like at different quality tiers, with a score attached. A dimension might score 1 through 5, where 1 means the content misses entirely and 5 means it nails it. Each dimension gets its own score, and those scores sum to a total you can track across drafts. The scoring also forces the AI to commit to a judgment instead of giving you “this is mostly good” non-feedback.

Specific indicators — these make or break the rubric. “Uses concrete examples to support each claim” is evaluable. “Is well-written” is not. If you’re writing for a particular audience, the indicators should reflect what that audience cares about.

Push back during this step. If the AI suggests a dimension that doesn’t match what you care about, cut it. If the indicators are too generic, ask it to ground them: who is reading this, and what do they need to walk away with?

The rubric might end up being five focused bullet points or a detailed table. Either is fine as long as the criteria are specific.

Test the rubric if you can#

If you have content you’ve already formed an opinion on, use it to validate the rubric. Give the AI a piece and ask it to score against the rubric. Compare its scores to your gut.

Where the scores diverge, the rubric has a gap — a missing dimension, poorly calibrated levels, an ambiguous indicator. Refine and test again. This calibration is what makes the rubric trustworthy going forward.

Not always possible, especially for something new. The rubric will still sharpen through use.

Apply the rubric in a separate session#

Evaluate your content in a different AI session from the one you used to create or iterate on it. The working session has context that biases evaluation — it “knows” what you were trying to do. A fresh session sees only the content and the rubric, which is closer to how your actual audience will experience it.

Provide the rubric in full and ask for a dimension-by-dimension assessment with scores and reasoning. “How does this look?” gets you nothing. A scored rubric gives you something concrete: a total that went from 18 to 23 between drafts tells you more than “it’s getting better.”

Use the rubric as a checkpoint throughout your workflow — after a draft, after a revision, after a major restructure. It gives you a signal on whether your edits are actually moving things in the right direction.

Evolve the rubric over time#

Your standards will shift as you learn what works. When you notice the rubric missing something or over-weighting a dimension that stopped mattering, update it. Treat it as a living document.

  • AI Driven Writing — Use the rubric at each stage of the writing pipeline (thesis, outline, draft) to catch issues early rather than only at the end.
  • Generating Speaker Notes — Apply a rubric tuned for spoken delivery: pacing, audience awareness, transitions.
  • Audience Translation — Build audience-specific rubrics so the same content can be evaluated differently for different readers.
  • Generating Variations — Use a rubric to evaluate and rank variations of taglines, pitches, or messaging rather than relying on gut feel alone.

Examples#

Examples coming soon.