01Why "Is this a violation?" is the wrong question
The first instinct to abandon when using generative AI for compliance checks is letting the model rule on pass or fail. A language model picks the next likely word; it is not applying the legal elements of Article 66 of the Pharmaceutical Act. Ask "is this a violation?" and it returns a confident-sounding verdict drawn from the tone of the text. That verdict has no verifiable foundation.
The correct use is the AI as a net for candidates. The final ruling belongs to a human reviewer (regulatory affairs) who checks the primary sources. The model's job is limited to surfacing every passage worth a look and attaching a hypothesis for why it is suspect. Accuracy improves when you make it name which provision of the four-tier regulatory map from Part 3 (the Act, the Advertising Standards, the MSA guidelines, the JPMA Code) each candidate offends.
| Aspect | Wrong use | Right use |
|---|---|---|
| The question | "Is this a violation?" | "List suspect expressions with the provision and the reason." |
| Output | A pass/fail verdict | Passage, provision, basis for suspicion, confidence |
| Final call | Adopt the AI's verdict | Human verifies against primary sources |
| Record | None kept | Prompt, output, and human action all logged |
02The base block — fixing role, norms, and output format
Before any specific detection prompt, set up one "base block" that sits at the top of every run. Fixing the role, the referenced norms, and the output schema, then having later prompts inherit that block, reduces output drift and hallucination.
The base block has four parts: (1) state the AI's role as a candidate detector, not a final judge; (2) list the norms in scope (Act §66/§68/§68-2, the Advertising Standards, the MSA guidelines, the JPMA Code); (3) fix the output to a fixed table; (4) write explicit bans — no findings without textual basis, no inventing article numbers.
Take a look at this promo piece and tell me if it's OK under the Act. Fix anything wrong.
You are a detector that pre-screens pharmaceutical materials for regulatory fit. The final ruling is made by a human. Norms in scope: the Pharmaceutical Act (§66 exaggerated advertising / §68 advertising of unapproved products / §68-2 duty of effort in providing information), the Advertising Standards, the MSA guidelines, the JPMA Code. Output a table with columns [verbatim passage / suspected violation type / provision / basis for suspicion / confidence (high/med/low)]. Rules: no finding without textual basis; do not invent article numbers, citations, or figures; mark anything you cannot judge as "needs human check".
03Prompts 1–3: detecting exaggeration (Act §66)
Exaggerated advertising is barred by Article 66 — overstated claims about efficacy or safety, whether explicit or implied. Catch superlatives, absolutes, implied head-to-head comparison, and over-reading of data.
Prompt 1, superlatives and absolutes: "Given the base block, extract every superlative (most / No.1 / only / unmatched), every absolute (always / certainly / cures), and every implication of total safety (no side effects / worry-free) made about efficacy or safety. Flag any suggestion that exceeds the approved indication."
Prompt 2, over-reading data: "For each figure and number, check whether the claim exceeds what the cited data show. Flag relative-risk reduction shown without the absolute, a secondary endpoint treated as the primary effect, or a subgroup result presented as the whole picture."
Prompt 3, implied exaggeration: "Where there is no direct claim but a photo, color, arrow, testimonial, or tagline overstates effect or safety, flag it — including the alt text and captions of visual elements."
"Control blood pressure for certain, with no worry about side effects." → §66 candidate (overstated safety plus an absolute).
Pull the wording back inside the approved label, add qualifiers such as "effect varies by individual." Regulatory affairs makes the final call.
04Prompts 4–5: unapproved and off-label (Act §68)
Advertising an unapproved product, and any suggestion beyond the approved indication (off-label), fall under Article 68. Article 66 (exaggeration) and Article 68 (unapproved) are distinct, so the prompts must instruct the model not to confuse the two types.
Prompt 4, matching the indication: "List every efficacy, indication, dosage, and target population the material claims. Then classify each as within the approved label, outside it (off-label), or undeterminable. I will supply the approved label separately. Do not fill in the label from memory or guesswork; if none is provided, write 'needs label cross-check'."
Prompt 5, development-stage or unapproved hints: "Detect any wording that suggests efficacy or usefulness for an unapproved product, indication, or formulation — efficacy claims dressed up as a pipeline introduction, congress-presentation data repurposed for promotion. Label these as §68 candidates (pre-approval advertising), kept separate from §66 (exaggeration)."
05Prompts 6–7: fair balance and citations (MSA / Advertising Standards)
Whether safety information (adverse reactions, contraindications, warnings) balances the efficacy claims — fair balance — sits at the heart of the MSA guidelines and the Advertising Standards. Alongside it, check that each claim has a stated, verifiable source.
Prompt 6, fair-balance check: "List efficacy statements and safety statements separately, then contrast their volume, visual prominence, and placement. Where efficacy is up front and large while safety is a small footnote, flag a fair-balance gap. Also flag any missing serious adverse reaction, contraindication, or warning."
Prompt 7, source and citation integrity: "For every claim, number, and figure, check whether a source is stated and whether source and claim agree (the material does not say what the source does not). Flag quantitative claims with no source, secondary citation, use of unpublished data, and undisclosed conflicts of interest. Do not fill in source content by guesswork."
| Type | Primary norm | Signal the AI catches |
|---|---|---|
| Exaggeration | Act §66 / Advertising Standards | Superlatives, absolutes, implied total safety |
| Unapproved / off-label | Act §68 | Efficacy or dosage beyond the label, pre-approval claims |
| Fair-balance gap | MSA / Advertising Standards | Efficacy-heavy layout, understated safety |
| Citation gap | MSA / JPMA Code | Unsourced numbers, secondary citation, source-claim mismatch |
06Prompts 8–10: cross-cutting and diff review
After the per-type prompts, run three cross-cutting ones: whole-piece consistency, the revision diff, and a self-check.
Prompt 8, audience fit: "Determine the intended reader (healthcare professional vs. general public) and check that the language level fits. Flag promotion of unapproved or prescription-only efficacy to a general audience, and over-simplified absolutes aimed at professionals."
Prompt 9, regulatory impact of a revision: "Given the old and new versions, extract only the changes and judge whether each introduces new regulatory risk (creeping exaggeration, broadened indication, reduced safety text). Flag every deletion or shrinkage of safety information."
Prompt 10, the AI's self-check: "Re-examine your own previous output. Retract, with reasons, (1) any finding with no textual basis, (2) any invented article number or statistic, and (3) any item given inflated confidence. Output only the surviving, confirmed list."
Prompt 10 cannot fully erase the weakness of generating and evaluating in one context (self-approval). That is exactly why the final gate is human. The AI's self-check is only an aid.
07Hallucination controls — design that distrusts the output
Language models will calmly fabricate plausible article numbers, statistics, and citations. Build that assumption in, with mechanisms that structurally doubt the output.
- Force verbatim quotes: every finding must quote the material verbatim. A finding that cannot quote the text is likely a hallucination.
- Verify article numbers: trust no § the AI cites until a human checks it against the text of the article. Confusing §66 / §68 / §68-2 is especially common.
- Cut off outside knowledge: a human pastes the label, the approval data, and source PDFs into the prompt; never rely on the model's memory. Enforce "if not provided, write needs-human-check".
- Require confidence: demand high/med/low on every item; review all low and medium, and spot-check the highs.
- Lower the temperature: creativity is unwanted, so keep the generation parameters toward the deterministic end.
08Turning it into an SOP — workflow and review record
A prompt library becomes an SOP only once the steps for using it and the record format are settled. Here is how to fold the pre-screen into the QC stage at Gate C. Place it at the head of the QC stage among the ten phases defined in Part 2, lifecycle design (coming soon).
| Step | Owner | Recorded |
|---|---|---|
| 1. Feed base block + the material | Production QC | Prompt version, model name, timestamp |
| 2. Run prompts 1–10 in order | Production QC | Each output (with verbatim quotes) |
| 3. Sort candidates by confidence | Production QC | Candidate list, confidence |
| 4. Verify each against primary sources | Regulatory affairs | Result, accept/reject, governing article |
| 5. Fix the material, re-screen | Production → QC | Edit diff, re-run log |
| 6. Record the human final ruling | Regulatory affairs | Approve/return, signature, date |
The record exists for audit and traceability. Fold the AI-screening log into the "who, when, on what basis" chain covered in Part 10, traceability (coming soon). Running something through AI is not an excuse for liability. What the record shows is only the fact that the AI raised candidates and a human ruled against the primary sources.
The AI said "no issues," so we skipped the human check and signed off.
Even with zero AI candidates, a human spot-checks high-risk material against primary sources. AI output is logged but never serves as the basis for the ruling.
- AI is a detector, not a judge. Ask not "is this a violation?" but "list the suspect passages, the provision, and the basis." The final ruling is a human's, made against primary sources.
- Run a fixed base block plus ten prompts mapped to the four types (exaggeration §66, unapproved §68, fair-balance gap, citation gap), and have a human verify every article number to avoid swapping §66 / §68 / §68-2.
- Assume hallucination: build in forced verbatim quotes, cut-off outside knowledge, and explicit confidence, then log every prompt, output, and human action into the traceability chain.
- MHLW, Act on Securing Quality, Efficacy and Safety of Pharmaceuticals and Medical Devices (article-by-article: §66 exaggerated advertising / §68 unapproved advertising / §68-2 duty of effort in providing information).
- MHLW, Standards for Fair Advertising of Drugs and Quasi-drugs (2017 revision; criteria for fair balance and exaggeration).
- MHLW, Guidelines for Sales Information Provision Activities for Prescription Drugs (2018; source disclosure, handling of off-label information).
- JPMA, Code of Practice (current edition; ethical principles for information provision).
- JPMA, Guidance for Preparing Product Information Summaries for Prescription Drugs (concrete norms for material creation).
- Pharmaceutical Affairs Law Study Group (ed.), Pharmaceutical Affairs Law Made Plain, Jiho, current edition (practical commentary on advertising regulation).
- ISO 9001 / practical texts on quality management and translation (reference for turning prompts into SOPs and designing review records).