Connecting the Measurement to Pass/Fail and a Development Plan

A measurement that stops at a score is just an impression. As with a health check, the numbers matter only when they lead to a follow-up and a change of habits. This finale links the score to three things: a pass floor, a prescription for each psychological habit, and a re-check on the next product.

What happens after you measure

You take a health check and get a sheet of numbers. No one stops there. If blood sugar is high, you change your meals and book a follow-up. Measuring is not the end; it is the start. Measuring skill in making materials works the same way. Producing a score is not the finish. The score is only raw material for two decisions: whether the person passes, and what they should practice next.

Up to here the series has shown how to measure skill from real work products rather than self-reports. This final part connects that result to two exits. One is pass or fail (can this person be trusted with the job). The other is a development plan (what to practice next). Without that link, measuring ends as a mere impression.

In many workplaces, evaluation stops at one word: "grade A" or "needs effort." But a score alone moves nothing. Saying only "high" or "low" leaves the person unsure of the next step. The value of measuring lies in linking today's judgment (pass/fail) and tomorrow's design (development) on the same evidence. So this part deals not with how to produce a score, but with how to use one.

The pass line: separate the floor from the total score

Recall a driving test. A perfect written score means nothing if you run a stop sign in the practical. "High total score" and "did not cross a line that must not be crossed" are different judgments. Pass or fail for material-making is decided in this two-tier way.

Skill at persuasion cannot fill a hole in sourcing. The most dangerous mix is "high design skill × low fidelity = persuasive misreading": the better the design, the further the misreading travels. So whether you can return to the source (grounding) is the absolute floor.

"Grounding" means a sentence in the material can be traced back to the original paper or data and verified. Put this as the floor (a necessary condition). Break the floor and you fail, however skilled the rest is. Only above the floor do you read the total score of clarity and balance to see "how far can this person be trusted." In a reported case, someone claimed an effect from a graph of just 9 cases (4 vs 5) with no statistical analysis. That breaks the floor before any design. However readable the graph, it cannot pass.

Reading pass/fail through the roles of the eight skills

Think of airport baggage screening. The officer does not wave you through for being likable. They first check the one non-negotiable line, that you carry no dangerous items (the floor), and only then look at efficiency. The eight skills of material-making are read the same way: first confirm the floor skill stands, then look at the total.

Role	Skills	Handling in pass/fail
Verify (before making)	Source-grounding, balance design, misreading foresight	Source-grounding is the floor. Break it and you fail at once
Shape (making)	Regulation translation, appeal design	Total score. Above the floor, add points for "does it reach"
Correct (before and after release)	Self-review, revision response, trust building	Near the floor. Weigh heavily whether they stop their own deviations

Self-review sits near the floor for a reason. Someone who finds and stops their own deviation before release can return on their own even if they slip once. If this is weak, the moment outside review misses something, the deviation reaches the world as is. In a reported case, a required pre-dose screening (a prior test to narrow eligible patients) was instead written in the product summary as "no screening or testing needed," as if it were a product strength. The shaping skill was high, but the skill to stop one's own dangerous wording was absent. In pass/fail, weigh this "stopping skill" more heavily than good looks.
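The two-tier reading described so far (floor first, total second, with self-review weighed close to the floor) can be written down as a small decision rule. The following is a minimal Python sketch, not a scoring system taken from the series: the names `Measurement`, `Verdict`, and `judge`, the 1-4 scales (borrowed from the four-step scale mentioned for Vol. 5), the plain sum used as the total, and the self-review threshold of 2 are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Measurement:
    grounding_ok: bool  # floor: claims can be traced back to the source
    clarity: int        # assumed 1-4 scale
    balance: int        # assumed 1-4 scale
    self_review: int    # assumed 1-4 scale; treated as "near the floor"


@dataclass
class Verdict:
    passed: bool
    total: Optional[int]  # read only above the floor
    caution: bool         # weak self-review is flagged, not averaged away


def judge(m: Measurement) -> Verdict:
    # Hypothetical threshold: a self-review score of 2 or less raises a flag.
    caution = m.self_review <= 2
    # Tier 1: the floor. No amount of clarity or balance can buy it back.
    if not m.grounding_ok:
        return Verdict(passed=False, total=None, caution=caution)
    # Tier 2: above the floor, the total says how far to trust the person.
    return Verdict(passed=True, total=m.clarity + m.balance, caution=caution)


readable_but_ungrounded = Measurement(False, clarity=4, balance=4, self_review=3)
plain_but_grounded = Measurement(True, clarity=2, balance=3, self_review=3)

print(judge(readable_but_ungrounded))
print(judge(plain_but_grounded))
```

The point of the design is that the total is never computed for work that breaks the floor, so a polished but ungrounded piece cannot look "almost passing."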

Prescribing development by the four drivers

Think of rehabilitation. "Cannot walk" can come from a fracture, a sprain, or lost muscle, and each is treated differently. The same exercise for all causes will not heal. Deviations in material-making look alike on the surface, but if the underlying psychology (the driver) differs, the prescription differs. Issuing only a pass/fail and saying "be careful" is rehab that ignores the cause.

Deviations come from four circuits that ordinary makers fall into under pressure. Here are reported cases, the skill that stops each, and the development prescription.

Psychological driver	Reported deviation	Skill that stops it	Development prescription
Motivated reasoning (the conclusion comes first)	Stretched only the slide's vertical axis to enlarge a difference / set a survival curve to start at 0 to erase a gap	Source-grounding, balance design	Read the source axis aloud before writing the conclusion. A habit of asking "did I decide first, then search?"
Local rationalization (just here, just one slide)	Showed an old-indication comparison on one slide and claimed "superior" when only non-inferiority data existed	Misreading foresight, self-review	Training to always cross-check any "just this one slide" against the whole for contradiction
Sin of omission (not telling until asked)	Prepared no primary-endpoint material and explained only a significant secondary endpoint / wrote a required screening as "not needed"	Regulation translation, self-review	A reverse check for "what was not said." Sweep for missing contraindications, primary endpoints, and COI in a fixed format
Externalizing responsibility (authority, boss, "not asked")	Defended a non-significant Japanese result with "the professor says it is fine" / no presenter disclosed COI "because it was not requested"	Trust building, self-review	Have them write the basis for a judgment in their own words. Build the habit of answering with the source, not a name

What matters is that these are not the acts of "bad people." Makers do not set out to lie; the wish to sell pulls their reading of the data without their noticing. In another reported case, a side effect that should have prompted caution (a certain component becoming excessive in the body) was reframed as a strength: "you can supplement that component." This too is the classic pattern of the selling conclusion coming first and the reading tilting toward it. So the prescription is not punishment but building a mechanism to notice this circuit in oneself. As the table shows, even the same "self-review" differs in content: for the omitter, a check for gaps; for the one who externalizes, training to write the basis in their own words. The prescription that fits the cause is the one that works.
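One way to keep prescriptions tied to causes is to store the driver table as a lookup. The following is a minimal Python sketch; the driver names and practices are condensed from the table in this section, while the dictionary layout, the keys, and the function name `prescribe` are illustrative assumptions.

```python
# Condensed from the four-driver table; layout and names are assumptions.
PRESCRIPTIONS = {
    "motivated_reasoning": {
        "skills": ("source-grounding", "balance design"),
        "practice": "Read the source axis aloud before writing the conclusion.",
    },
    "local_rationalization": {
        "skills": ("misreading foresight", "self-review"),
        "practice": "Cross-check any 'just this one slide' against the whole.",
    },
    "omission": {
        "skills": ("regulation translation", "self-review"),
        "practice": "Sweep for what was not said, in a fixed format.",
    },
    "externalizing": {
        "skills": ("trust building", "self-review"),
        "practice": "Write the basis for the judgment in your own words.",
    },
}


def prescribe(driver: str) -> str:
    """Return the practice for a driver; never fall back to 'be careful'."""
    if driver not in PRESCRIPTIONS:
        raise ValueError(f"unknown driver: {driver!r}")
    return PRESCRIPTIONS[driver]["practice"]


# The same skill, self-review, appears under several drivers,
# each with a different practice attached.
for driver, entry in PRESCRIPTIONS.items():
    if "self-review" in entry["skills"]:
        print(driver, "->", entry["practice"])
```

Raising an error for an unknown driver is a deliberate choice in this sketch: it forces the reviewer to name a cause before issuing any advice.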

Running the development loop: confirm with the next product

Recall a proof sheet. Marking it in red and sending it back does not make the book better. Only when the next print shows the corrections fixed is improvement confirmed. Development is the same: a plan is not the end. You re-measure whether the skill grew in the next material made.

In practice, keep a short record that bundles pass/fail with the prescription: "Did they clear the floor / what is the total score / which driver was weak / what to practice next." Then measure the next product with the same ruler. If the weak skill rose, the prescription worked. If not, change it. Run this loop and measurement turns from a tool of judging into an engine of growing.
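The short record and the re-measurement step can be sketched as a small data structure plus one comparison. This is a minimal Python sketch under stated assumptions: the field names, the 1-4 score scale, the sample values, and the function `check_prescription` are hypothetical and only mirror the four items listed above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ReviewRecord:
    cleared_floor: bool          # did they clear the grounding floor?
    total_score: Optional[int]   # None when the floor was not cleared
    weak_driver: str             # which driver was weak
    weak_skill: str              # the skill the prescription targets
    weak_skill_score: int        # assumed 1-4 scale
    next_practice: str           # what to practice next


def check_prescription(previous: ReviewRecord, next_score: int) -> str:
    """Compare the same skill on the next product, measured with the same ruler."""
    if next_score > previous.weak_skill_score:
        return "keep"    # the weak skill rose: the prescription worked
    return "change"      # no rise: change the prescription


record = ReviewRecord(
    cleared_floor=True,
    total_score=5,
    weak_driver="omission",
    weak_skill="self-review",
    weak_skill_score=2,
    next_practice="Sweep for what was not said, in a fixed format.",
)

print(check_prescription(record, next_score=3))  # keep
print(check_prescription(record, next_score=2))  # change
```

The record carries only a forward-looking practice item, not a history of faults.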

One caution. A development plan is not a record for blaming the person. "Motivated reasoning" and the "sin of omission" are circuits anyone falls into under deadline and sales pressure, not flaws of character. What the record holds is a forward promise of "next, practice this one point," not a brand on the past. Turn it into a record of blame and the person starts hiding deviations, which makes the grounding floor even harder to see. The plan only runs when the person can name their weak circuit without fear.

In level terms, the plan becomes a map to climb one step at a time: from L1 (only the assigned job) to L2 (reproduce a pattern), L3 (understand why and apply), and L4 (design the system and set the standard). Pass/fail marks the line right now; the development plan points to where to stand next. Only by linking the two does measuring gain meaning.

Measuring Skill from Work and Behavior ── Map of all 10 episodes

Vol. 1: Measure by the Materials Actually Made, Not by Impressions or Self-Report ── A material maker's skill is measured from the actual deliverables and observable conduct, not from self-report or others' impressions.
Vol. 2: Tracing the Brief, the Choices, and the Result — In Order ── Read a creator's skill from evidence by walking through one real project in order: the brief, the thinking, the actions, and the result.
Vol. 3: Reading "Faithfulness to the Facts" and "Craft of Delivery" Out of the Work Itself ── This installment shows how to recode a finished piece into two axes — faithfulness to the facts and the craft of getting it across — by reading concrete clues, not impressions.
Vol. 4: The Rules That Keep Measurement Honest ── Six ground rules that keep the evaluator from drifting when measuring an author's real skill.
Vol. 5: Three Rulers: Accuracy, Clarity, and Balance ── Defines three rulers for grading material-making skill and scores each on a four-step scale: accuracy as the floor, clarity as the reach, and balance as the adjustment between too much and too little.
Vol. 6: How to Decide the Level — Returning to the Source Sets the Ceiling ── Work that cannot be traced back to its source cannot earn a higher level, however polished it looks. Grounding sets the ceiling.
Vol. 7: What Deliverables Signal Which Level ── An anchor table that reads a creator's level (L1-L4) from visible deliverables and behavior patterns.
Vol. 8: How Far Can We Trust a Judgment? ── How sure a level judgment is depends on how visible the evidence is; less observable skills produce shakier judgments, so we attach a confidence to each verdict.
Vol. 9: Combine More Than Self-Assessment: Add the Reviewer's and Requester's View ── Layering four viewpoints — self, reviewer, requester, and AI — surfaces the deviations of omission that a single pair of eyes cannot see.
Vol. 10 (this episode): Connecting the Measurement to Pass/Fail and a Development Plan ── The finale links the score to the pass floor and a plan for what to grow next.

In closing

Pass/fail is the line right now; the development plan is where to stand next. Fix grounding as the floor, then read the total score to set how far to trust. Look at the psychological habit behind a deviation and prescribe self-monitoring, not punishment.

Do not end measurement at a score; re-measure with the next product. If the weak skill rose, the prescription worked; if not, change it. As long as this loop runs, makers notice their own circuit even under pressure and keep producing materials that stay true to fact while reaching the reader.

Key Points ── Three to take with you

Separate floor and total. Make grounding the absolute floor; break it and you fail however skilled the rest. Read the total only above the floor.
Prescribe by driver. Change the development content for motivated reasoning, local rationalization, omission, and externalizing. Same deviation, different psychology, different prescription.
Confirm with the next product. Record pass/fail and prescription briefly, then re-measure the next job with the same ruler. Growth means it worked; no change means change it.

Sources & references

Ministry of Health, Labour and Welfare, Compliance and Narcotics Division (commissioned project). Report on the Monitoring of Promotional Information Activities for Prescription Drugs (March 2024 and prior years). Flagged cases are published with company names anonymized; the deviation patterns cited here are generalized from this report.
MHLW. Guidelines on Promotional Information Activities for Prescription Drugs. Required handling of primary endpoints, contraindications, and conflicts of interest.
Japan Pharmaceutical Manufacturers Association. JPMA Code of Practice. Voluntary standards for fair information and COI disclosure.
Standards for Fair Advertising of Drugs (MHLW division-director notice). General principles for avoiding exaggeration and misleading claims.
Competency assessment methods (Behavioral Event Interview, STAR method). A general framework for measuring ability from behavioral evidence rather than self-report.
