Yui sat with four pieces of promotional material stacked on the review desk, her pen frozen in midair. "This one explains things well. But he walks right past the dangerous spots. The other one catches the dangers, but his explanations stumble all over the place. Which do I put on top? The whole ranking has turned into a mess." Mio answered without pausing her hand. "You're stuck because you start from the ranking. The qualifying line isn't addition. You just pass them through four gates, arranged in the order they fail, from the top down."
The "Total Score" Trap Brought Into the Review Room
That morning, Yui had brought her own scoring sheet. Explanation 10 points, knowledge 10 points, detection 10 points, care 10 points. Add the four items and rank by the total. Higuchi scored near full marks on explanation and care, so his total was high. Wada scored low on explanation, landing in the lower-middle. On paper, Higuchi, the master of explanation, came out on top.
Mio glanced at the sheet and pointed her pen at Higuchi's name. "This is the person who, in promotional material for a cardiovascular treatment, missed a sentence that overstated the effect, isn't it? Back in Session 3." Yui nodded. "But everything else was near perfect, so by total he came out first."
"That's the trap. A total score makes you believe a fatal flaw can be offset by high marks elsewhere. It's addition. But failures in material review can't be won back by addition. One miss lets a single sheet reach the patient."
The asymmetry of harm: a loss from explaining poorly can be redone later. A loss from missing a danger can't be recovered once it's out in the world. So the qualifying line doesn't produce a total score right away. It lines up the gates in the order people fail, and passes them through from the top. Whoever fails at an earlier gate isn't even scored at the next one.
The Same Order as Airport Security
Mio drew an airport security checkpoint on the whiteboard. Between you and the boarding gate, several checkpoints stand in a single line. "Think about it. When someone is caught with a knife at security, no one says, 'But this person's check-in was flawless, so let's let them through.'"
Yui laughed. "No one would. Even if the procedure is flawless, you stop them if there's a dangerous item."
"The qualifying line is the same. If you fail at an earlier checkpoint, the later ones aren't evaluated. Skill at explanation is the check-in. Detection is the security screening. Change the order, and the meaning changes."
The four gates have names. G0 through G4. G stands for Gate. The smaller the number, the closer to the front, and if you fail up front, the back is never looked at. Mio wrote out the numbers in order. "At G0 we confirm the foundation, at G1 we look at the floor you absolutely cannot drop, at G2 we measure the gap in self-assessment, at G3 we separate detection on paper from detection on real material, and finally at G4 we at last produce an overall score. But G4 only assigns a ranking. Pass or fail is decided much earlier."
The Four Gates, From the Front
Yui checked the gates one by one. Each time, Mio gave an example from one of the four.
G0 Eligibility Check
Can we even measure this? On every evaluation axis (the floor dimensions), is the judgment confidence C at or above the minimum line C_min? If too little material was observed to be confident, this is a "judgment on hold." Neither fail nor pass — scoring simply hasn't begun.
G1 Absolute Floor
The coordinate floor where dropping it means instant failure. Miss even one, and it's back to retraining. No matter how high the other items, there is no compensation (no offsetting). Same as the knife at security.
G2 Calibration
The gap Δ between self-assessment and actual ability (the difference between one's own estimate and reality). If confidence in detection exceeds actual ability — that is, Δ of +2 or more — independent review is not allowed. You can't leave someone alone who missed a danger yet believes "it's fine."
G3 One Wing
Is detection only on paper, or can it catch things in real material? If they can discuss categories of danger in theory but can't catch them in actual material — "paper detection" — independence is denied. Over-catching (over-detection) passes, but needs improvement.
"And the final gate," said Mio. "G4, Overall Classification. Only those who cleared every floor up to here are ranked by a weighted total. But this decides only the classification and the training priority. It doesn't decide pass or fail. Pass or fail was already settled at G1 through G3."
Why You Don't Look at the Back Once They Fail Up Front
Yui still looked unconvinced. "But for Higuchi, who failed at G1 — wouldn't producing his G4 total make it easier for him to accept the result?"
Mio shook her head. "The opposite. Show the total, and both he and everyone around him fall into the illusion of 'just a few points short.' It makes them believe a floor that can't be compensated could be filled by addition. So once you fail at an earlier stage, the later stage isn't even calculated."
| Gate | What it checks | If failed | Example |
|---|---|---|---|
| G0 Eligibility | Confidence C ≥ C_min on each axis | Judgment on hold (no scoring) | Insufficient observation |
| G1 Absolute Floor | The floor you cannot drop | Fail · retrain (no compensation) | Higuchi |
| G2 Calibration | Self-assessment gap Δ | Δ ≥ +2 means no independence | Overconfident type |
| G3 One Wing | On paper or on real material | Paper detection: no independence / over-detection: pass · needs improvement | Minami |
| G4 Overall | Weighted total for those who cleared the floors | Classification · training priority only (does not decide pass/fail) | Wada and others |
"Think of a health checkup," said Mio. "If one item is in the danger zone, you go to a detailed exam even if everything else is normal. You don't discount it with an overall score. Items that bear on life aren't dissolved into an average." Higuchi, who topped Yui's scoring sheet, stops at G1 in this arrangement. The overall score never even gets its turn.
The Order Itself Was the Judgment
Yui quietly turned her total-score sheet face down. In its place she lined the four gates up vertically and tried passing Higuchi, Minami, and Wada through from the top. Higuchi stopped at G1. Minami cleared G1 and G2, then stopped at G3's paper test. Wada passed through to G3 and went on to G4's overall.
"I think I understand. The whole time, I was scoring who was the most excellent. But what the qualifying line was asking was: who is safe to leave alone? Change the question, and the arrangement changes."
Mio nodded, drew a circle around only G4 on the board, then crossed it out with a line. "The final total looks the most important, but pass or fail isn't there. It's already decided at the floors up front. G4 only decides the order in which to develop those who passed."
The judgments of the three will stand before their respective gates in the next session. Who stops at which gate, and who passes through — the rearranged sheet was already pointing to the answer.
The Qualification Bar ── Map of all 10 episodes
- Vol. 2: The Asymmetry of Harm ── A Miss Is Orders of Magnitude Heavier ── Why you must not draw the line with an average, part 1: a miss and a false alarm are not equal harms
- Vol. 3: The Compensation Trap ── Eloquence Hiding a Gap in Detection ── A reviewer who is brilliant at explaining and at getting along with people is weak at just one thing: spotting danger (risk detection). Average the scores and they pass. But someone who cannot spot danger yet talks well will push risky material through on charm alone. Why you must not decide pass or fail on an average — explained gently through real Case A.
- Vol. 4: Floor vs. Aggregate ── Non-Compensatory Gates and the Weighted Score ── Pass/fail is decided by minimum bars (floors); the total score is used only to rank. Fall below even one bar and a perfect score still fails. This is the unbreakable rule of the qualifying line.
- Vol. 5: The Highest Floor for Detection ── Why Risk Detection Exists ── Material review — the job of checking a drug company's promotional materials for doctors before they go out — exists to find the dangerous spots. So among eight abilities, the minimum bar for the power to spot danger (risk detection) is set highest. To pass as someone who can review alone (qualified) you need level L3, the second-highest rung, plus a real-world spotting range of 2 or more. A person who stops one rung lower, at L2, lets the most dangerous materials slip right through.
- Vol. 6: A Floor on Two Axes ── Not Letting Desktop Detection Pass ── The pass line for detection cannot be drawn with a single score. It needs two rulers: how well you can explain the danger, and whether you can catch it in the real material in front of you. A textbook-only spotter may look like L3 on paper but does not clear for solo work.
- Vol. 7: Calibration as a Gate to Independence ── Overconfidence Disqualifies ── A look at the gate (calibration gate G2) that asks: do you estimate your own seeing-power correctly? Working alone means no one checks behind you. A person who thinks their detection skill is higher than it really is (gap Δ of +2 or more) waves through danger without noticing their own blind spot. This gap (Δ) is not skill itself, but it decides whether someone may work alone.
- Vol. 8 (this episode): The Four Gates G0–G4 ── The Logic of Early Rejection ── A reviewer's pass or fail is decided at four checkpoints in order. Anyone who fails an earlier checkpoint is not re-measured at a later one. A non-negotiable minimum line (a "floor") cannot be patched over by other strengths, and the total score never flips the result.
- Vol. 9: Three Profiles ── How One Line Sorts Them ── The eloquent talker, the textbook thinker, and the real deal — where one pass/fail line sends each
- Vol. 10 (final): The Responsibility of Drawing the Line ── Anchors First, Human Confirmation, Non-Punitive Growth ── The closing chapter that turns the pass line into something a workplace can actually use. Only when a shared book of agreed examples exists does the line become a common yardstick. The four verdict tiers are not a brand of failure but a signpost for what to grow next. AI gives a rough first reading; a human makes the final call.
The four-stage gate is not a mechanism for summing abilities. It lines the gates up in the order people fail, and never measures what lies beyond a gate where someone has already stopped. Skill at explanation and depth of knowledge both lose their meaning the moment a single absolute floor is missing. The order itself was the substance of the judgment.
In the next session, Higuchi, Minami, and Wada each stand before their gates. Who stops where — pass the four gates from the top, and the conclusion is already in view.
- Point 1 A judgment is not a total score; you pass through four stacked gates arranged in the order of failure (G0 eligibility → G1 absolute floor → G2 calibration → G3 one wing → G4 overall), starting from the front. Fail at an earlier stage and the later stages aren't evaluated.
- Point 2 G1's absolute floor means instant failure with no compensation if even one is missing; G2 denies independence at a self-assessment gap Δ of +2 or more; G3 denies independence for paper-only detection (over-detection passes but needs improvement). Pass or fail is decided at G1 through G3.
- Point 3 The final G4 overall classification ranks only those who cleared every floor by a weighted total, but it decides only classification and training priority, not pass or fail — an order designed so dangerous misses can never be offset by high marks.
- Angoff, W. H. Scales, Norms, and Equivalent Scores. American Council on Education, 1971. (Anchor-grounded cut-score setting — the lineage of the qualification line's calibration thinking.)
- Cizek, G. J., & Bunch, M. B. Standard Setting: A Guide to Establishing and Evaluating Performance Standards on Tests. Sage, 2007. (Methodology for non-compensatory/conjunctive cut criteria and staged decisions.)
- Macmillan, N. A., & Creelman, C. D. Detection Theory: A User's Guide. 2nd ed., Lawrence Erlbaum, 2005. (Miss/false-alarm asymmetry — sensitivity/specificity grounding the one-winged gate.)
- Spencer, L. M., & Spencer, S. M. Competence at Work: Models for Superior Performance. Wiley, 1993. (Foundation for the eight-dimension profile and the L levels.)
- Messick, S. "Validity." In Educational Measurement (3rd ed.), American Council on Education/Macmillan, 1989. (Framework supporting the validity and consequences of threshold judgments.)