Last time we set the ground rules for fair measurement. Now we bring out the grading tools. If you compress skill into a single score, you lose sight of what is good and what is missing. So we split the ruler into three: accuracy, clarity, and balance. Each gets its own separate score.

Why three rulers, not one

Imagine a health check that owns only a scale. It weighs you and says "healthy," but tells you nothing about blood pressure, blood sugar, or eyesight. Your weight may be normal while high blood pressure quietly threatens your life. Grading material-making works the same way. A one-line verdict like "it was a good material" is like judging health by weight alone; it hides what is actually wrong.

So we split what we want to see into three. First, accuracy: does it stay true to the facts in the source. Second, clarity: does it reach the reading doctor or patient without misunderstanding. Third, balance: are the benefits and the cautions in the right proportion. Measure the three separately, and you can say clearly whether a piece is "accurate but not landing" or "landing but poorly balanced." A single combined score erases that distinction.

One score is convenient, but it hides what is missing. We split into three so faults can be named.

Ruler one: accuracy (grounding in fact)

Picture a climbing rope. No matter how gracefully you climb, if the rope is not anchored to the rock, a fall is only a matter of time. Accuracy is the ruler that checks whether the rope holds. Does each sentence in the material, when traced back to its source (a trial report, a package insert, a public basis), actually rest on a support. That is accuracy.

The technical term is "source grounding." It sounds hard, but the meaning is simple: if someone asks "where is the basis for this sentence?", can you point straight to the page in the original. A sentence you cannot point for floats in the air. Even if it reads well, a floating sentence is dangerous.

Scored on four steps, it looks like the table below. If a piece falls below the floor, it fails no matter how good the rest is. The reasons come in a later issue; for now, remember that accuracy is a ruler apart, one you never simply add to the others.

StepState of accuracyHow to spot it
L1Some sentences cannot be tracedSays "works well" but cannot point to which trial
L2Has a basis, but drifts from the original meaning"improved in some patients" widened to "improved in patients"
L3Each sentence matches the source correctlyNumbers and conditions agree with the report
L4Matches, and also states limits and premisesAdds "this result holds under condition X"

Ruler two: clarity (reaching the reader)

Even the right medicine fails if the instructions are in a foreign language; the patient takes it wrong. Clarity is the ruler that checks whether the instructions are written in the reader's language. If accuracy asks "is the content correct," clarity asks "does that correctness land in the reader's head without misunderstanding."

The key point: clarity is not "dumbing down." Cutting facts to make something easy is not clarity, just corner-cutting. Real clarity is the power to take hard content and, without cutting it, recast it into the reader's tools: words, diagrams, order. Replacing a technical term with an everyday word on first use, putting the conclusion first, showing a quantity comparison as a figure, these lift the landing rate while keeping the content intact.

On four steps, L1 is "just laid out as told," while L4 can read the reader's level of knowledge and change the design. With the same facts, can the maker build different structures for doctors and for patients. That is the dividing line for the higher steps.

StepState of clarityHow to spot it
L1Jargon kept, order copied from the sourceSame text for any reader
L2Follows set patterns of rewording and headingsHabitually adds term notes
L3Rebuilds structure to fit the readerFor patients, conclusion and cautions go first
L4Designs ahead for easy-to-misread pointsAnticipates "here is where confusion starts" and adds a figure

Ruler three: balance (adjusting too much and too little)

Think of seasoning a dish. Too little salt and it falls flat; too much and you cannot eat it. Balance is the ruler for the right amount. For a material, it asks whether the proportion and placement of the good points (effect) and the cautions (side effects, limits) are tilted.

Balance breaks in two typical ways. One is overselling: the benefits are written large while the cautions are shrunk into a corner. This tends to be a regulatory problem too. The other is the opposite, shrinking: fear of the cautions blurs even the effect. Both lead the reader to misjudge. A well-balanced material puts benefits and cautions in front of the eye with equal weight, so the reader can weigh them.

Note that balance is separate from accuracy. Every single sentence may be correct (accuracy passes), yet if you gather only the convenient facts, the whole tilts (balance fails). That is why it stands as a third ruler. The most dangerous piece is one that is skillfully written (high clarity), whose every sentence is correct (accuracy looks high), but whose overall selection is tilted. Because it persuades, the reader cannot notice the tilt.

Skillful, each sentence correct, yet the whole tilted, this is the hardest to catch and the most dangerous. So we measure balance with a separate eye.

How to gather the three onto one sheet

Picture the checklist for a proof print (a test print before the real run). Typos, layout, and color are each checked in their own column and laid out on a single sheet. The three rulers work the same way: rather than summing them into one number, we line up three scores side by side. Call it the scoring sheet.

There is one iron rule for lining them up. Accuracy is treated as a "floor you never add to." However high clarity and balance are, if accuracy falls below the floor, the material fails. The reason is plain: deliver wrong content well, and the misunderstanding only spreads faster. The remaining two, clarity and balance, show "how excellent" once the floor is cleared. Read the floor (pass or fail) and the craft above it (excellence) separately, that is how the three rulers are used.

RulerQuestionRole
AccuracyCan you return to the sourceFloor (fail if breached)
ClarityDoes it reach without misunderstandingQuality of reach
BalanceAre benefits and cautions evenFairness of the whole

Measuring Skill from Work and Behavior ── Map of all 10 episodes

  1. Vol. 1: Measure by the Materials Actually Made, Not by Impressions or Self-Report ── A material maker's skill is measured from the actual deliverables and observable conduct, not from self-report or others' impressions.
  2. Vol. 2: Tracing the Brief, the Choices, and the Result — In Order ── Read a creator's skill from evidence by walking through one real project in order: the brief, the thinking, the actions, and the result.
  3. Vol. 3: Reading "Faithfulness to the Facts" and "Craft of Delivery" Out of the Work Itself ── This installment shows how to recode a finished piece into two axes — faithfulness to the facts and the craft of getting it across — by reading concrete clues, not impressions.
  4. Vol. 4: The Rules That Keep Measurement Honest ── Six ground rules that keep the evaluator from drifting when measuring an author's real skill.
  5. Vol. 5 (this episode): Three Rulers: Accuracy, Clarity, and Balance ── Defines three rulers for grading material-making skill and scores each on a four-step scale: accuracy as the floor, clarity as the reach, and balance as the adjustment between too much and too little.
  6. Vol. 6: How to Decide the Level — Returning to the Source Sets the Ceiling ── Work that cannot be traced back to its source cannot earn a higher level, however polished it looks. Grounding sets the ceiling.
  7. Vol. 7: What Deliverables Signal Which Level ── An anchor table that reads a creator's level (L1-L4) from visible deliverables and behavior patterns.
  8. Vol. 8: How Far Can We Trust a Judgment? ── How sure a level judgment is depends on how visible the evidence is; less observable skills produce shakier judgments, so we attach a confidence to each verdict.
  9. Vol. 9: Combine More Than Self-Assessment: Add the Reviewer's and Requester's View ── Layering four viewpoints — self, reviewer, requester, and AI — surfaces the deviations of omission that a single pair of eyes cannot see.
  10. Vol. 10 (final): Connecting the Measurement to Pass/Fail and a Development Plan ── The finale links the score to the pass floor and a plan for what to grow next.
In closing

We measure the three rulers, accuracy, clarity, and balance, separately so that faults can be named. One score is convenient, but it hides what is missing. Accuracy is treated as a floor you never add to, and on top of it we read the craft of clarity and balance. Reading the floor and the excellence apart is where fair grading begins.

Next time we move to how each of the three is placed on a level. The key is the principle that being able to return to the source sets the ceiling. Design skill in clarity and balance can only be stacked on top of the grounding that accuracy provides.

Key Points ── Three to take with you
  1. Skill is measured by three. Accuracy (grounding in fact), clarity (reaching without misunderstanding), and balance (even weight of benefits and cautions) are scored separately, never compressed into one number.
  2. Accuracy is a floor you never add to. However high clarity and balance are, a sentence that cannot be traced fails the piece, because delivering an error well only spreads the misunderstanding faster.
  3. The most dangerous is high design, low fidelity. A piece skillfully written with each sentence correct, yet tilted in overall selection. Its persuasion hides the tilt, so balance is measured with a separate eye.
Sources & references
  1. Ministry of Health, Labour and Welfare, "Standards for Fair Advertising of Drugs" — a public reference for the view that efficacy should not be overstated and should stay in proportion with cautionary notes.
  2. Japan Pharmaceutical Manufacturers Association, "Promotion Code" — a general guideline requiring information on prescription drugs to be fair and objective, grounded in scientific evidence.
  3. General accounts of the Behavioral Event Interview (BEI) and the STAR method — an assessment approach that measures ability from concrete behavioral evidence rather than impressions, via situation, task, action, and result.
  4. Textbook literature on competency assessment — the general methodology of defining and scoring ability in steps through observable behavior and work products.