Key Takeaways
- Small teams need lightweight, actionable governance — not enterprise-grade bureaucracy
- A one-page policy baseline is enough to start; iterate from there
- Assign one policy owner and hold a weekly 15-minute review
- Data handling and prompt content are the top risk areas
- Human-in-the-loop is required for high-stakes decisions
Summary
This playbook section helps small teams implement AI governance with a clear policy baseline, practical risk controls, and an execution-friendly checklist. It's designed for teams that need to move fast while still meeting basic compliance and risk expectations.
If you only do three things this week: publish an "allowed vs not allowed" policy, name an owner, and set a short review cadence to keep usage visible and intentional.
Governance Goals
For a lean team, governance goals should translate directly into day-to-day behaviors: what people can do, what they must not do, and what they need approval for.
- Reduce avoidable risk while preserving team velocity
- Make "approved vs not approved" usage explicit
- Provide lightweight review ownership and cadence
- Keep a paper trail (decisions, incidents, exceptions) without slowing delivery
Risks to Watch
Most small teams underestimate "silent" risks: sensitive data in prompts, untracked tools, and decisions made from model output that never get reviewed.
- Data leakage via prompts or outputs
- Over-trusting model output in production decisions
- Untracked shadow AI usage
- Vendor/tooling sprawl without a risk owner or inventory
Controls (What to Actually Do)
Start with controls that are cheap to run and easy to explain. Each control should have a clear owner and a lightweight cadence.
- Create an AI usage policy with allowed use-cases (and a short "not allowed" list)
- Define what data is allowed in prompts (and what requires redaction or approval)
- Run a weekly risk review for high-impact prompts and workflows
- Require human sign-off for any customer-facing or high-stakes outputs
- Define escalation + incident response steps (who to notify, what to log, how to pause use)
Checklist (Copy/Paste)
- Identify high-risk AI use-cases
- Define what data is allowed in prompts
- Require human-in-the-loop for critical decisions
- Assign one policy owner
- Review results and update controls
- Keep a simple inventory of AI tools/vendors and owners
- Add a "safe prompt" template and a redaction workflow
- Log incidents and near-misses (even if informal) and review monthly
Implementation Steps
- Draft the policy baseline (1–2 pages)
- Map incidents and near-misses to checklist updates
- Publish the updated policy internally
- Create a lightweight review cadence (weekly 15 minutes; quarterly deeper review)
- Add a short approval path for exceptions (who can approve, how it's documented)
Frequently Asked Questions
Q: What is AI governance? A: It is a framework for managing AI use, risk, and compliance within a small team context.
Q: Why does AI governance matter for small teams? A: Small teams face the same AI risks as enterprises but with fewer resources, making lightweight governance frameworks critical.
Q: How do I get started with AI governance? A: Start with a one-page policy baseline, identify your highest-risk AI use-cases, and assign a policy owner.
Q: What are the biggest risks in AI governance? A: Data leakage via prompts, over-reliance on model output, and untracked shadow AI usage.
Q: How often should AI governance controls be reviewed? A: A weekly lightweight review is recommended for high-impact use-cases, with a full policy review quarterly.
References
- https://techpolicy.press/the-denominator-problem-in-ai-governance
- https://www.nist.gov/artificial-intelligence
- https://oecd.ai/en/ai-principles
Practical Examples (Small Team)
Small‑team AI projects often lack the resources of large enterprises, yet they still face the Denominator Problem when trying to compare risk across models, datasets, or deployments. Below are three concrete scenarios that illustrate how a lean team can surface, measure, and act on the denominator gap without waiting for a full‑scale governance apparatus.
1. Incident‑Driven Reporting Loop
| Step | Owner | Action | Checklist |
|---|---|---|---|
| Identify | Product Lead | Log every AI‑related incident (e.g., false positive, bias complaint, outage) in a shared spreadsheet. | • Timestamp • System component • Impact description • Severity rating (1‑5) |
| Classify | Data Engineer | Tag the incident with a risk category (privacy, fairness, safety, reliability). | • Use a controlled vocabulary • Map to OECD AI monitor categories |
| Quantify | Analyst | Assign a numerator (e.g., number of users affected) and a denominator (total active users for that feature). | • Pull usage stats from analytics API • Verify denominator reflects the same time window |
| Report | Compliance Officer | Generate a weekly "AI Incident Dashboard" that visualises numerator/denominator ratios. | • Include trend lines • Highlight ratios > 5 % as red flags |
| Remediate | Engineer | Prioritise fixes based on the ratio, not just raw incident count. | • Create a ticket with "Denominator‑adjusted priority" label • Track time‑to‑resolution |
Why this works: By anchoring each incident to a denominator (the user base or transaction volume), the team avoids the classic pitfall of treating a single error as an outlier. The ratio surfaces systemic issues—e.g., a 2 % false‑negative rate on a fraud‑detection model may be acceptable for a low‑volume pilot but unacceptable once the model scales to millions of transactions.
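The Quantify and Remediate steps above can be sketched in a few lines. This is a minimal illustration, not a fixed schema: the field names (`affected_users`, `active_users`) simply mirror the spreadsheet columns described earlier.

```python
# Sketch: denominator-adjusted prioritisation of incidents. Field names
# mirror the spreadsheet columns above and are illustrative only.
incidents = [
    {"id": "INC-1", "affected_users": 40, "active_users": 2_000},
    {"id": "INC-2", "affected_users": 120, "active_users": 50_000},
]

def ratio(incident):
    # Numerator over denominator, both taken from the same time window.
    return incident["affected_users"] / incident["active_users"]

# Rank by ratio, not raw affected count: INC-1 touches 2.00% of its user
# base versus 0.24% for INC-2, so it comes first despite fewer users.
for inc in sorted(incidents, key=ratio, reverse=True):
    flag = "RED" if ratio(inc) > 0.05 else "ok"  # >5% red-flag rule from the dashboard step
    print(inc["id"], f"{ratio(inc):.2%}", flag)
```

The sort key is the ratio itself, which is what makes the "denominator-adjusted priority" label on the remediation ticket meaningful.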
2. Risk‑Based Feature Gate
A small SaaS startup wants to roll out a new recommendation engine. The team adopts a "gate" that only lifts when the denominator‑adjusted risk falls below a pre‑defined threshold.
- Define the denominator – total recommendation requests per day (e.g., 10 k).
- Select the numerator – number of recommendations flagged by an internal audit as "potentially biased" (e.g., gender‑skewed).
- Set the threshold – 0.2 % (i.e., ≤ 20 biased recommendations per 10 k).
- Automate the check – a nightly script runs:
      requests=$(curl -s https://api.myapp.com/metrics/recs | jq .total)
      biased=$(curl -s https://api.myapp.com/audit/bias | jq .count)
      ratio=$(echo "scale=4; $biased / $requests * 100" | bc)
      if (( $(echo "$ratio < 0.2" | bc -l) )); then
        echo "Gate open – safe to deploy"
      else
        echo "Gate closed – fix bias before release"
      fi
- Assign ownership – the ML Engineer owns the script; the Product Manager signs off on the gate result.
Outcome: The team can ship features faster because the gate provides a quantitative, denominator‑aware safety net rather than a vague "review before launch" checklist.
3. Cross‑Project Benchmarking
When multiple micro‑services use different language models, the denominator problem makes it hard to compare risk across projects. The following template standardises the comparison:
| Project | Model | Total Calls (Denominator) | High‑Severity Errors (Numerator) | Error Ratio | Compliance Flag |
|---|---|---|---|---|---|
| Chatbot | GPT‑3.5 | 45 k | 180 | 0.40 % | ✅ |
| Summarizer | T5‑base | 12 k | 96 | 0.80 % | ⚠️ |
| Classifier | Custom CNN | 30 k | 150 | 0.50 % | ✅ |
Action steps:
- Collect call volume from each service's telemetry endpoint.
- Aggregate error counts from the shared incident log.
- Calculate ratios automatically via a CI pipeline job.
- Trigger an alert when any ratio exceeds the team's "high-risk" benchmark (e.g., > 0.5 %). Note that the EU AI Act classifies high-risk systems by use-case rather than by a numeric error rate, so this threshold is a team-chosen proxy, not a statutory figure.
By normalising each project to its own denominator, the team can spot that the Summarizer, despite lower absolute errors, poses a higher proportional risk and should be prioritised for a compliance review.
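The benchmarking table above can be reproduced with a short script. The figures are copied straight from the table; the 0.5 % flag threshold is the team-chosen benchmark discussed above, not a statutory value.

```python
# Sketch: per-project error ratios, each normalised to its own denominator.
# Data is hard-coded from the benchmarking table above.
projects = [
    ("Chatbot", 45_000, 180),
    ("Summarizer", 12_000, 96),
    ("Classifier", 30_000, 150),
]

THRESHOLD = 0.005  # team-chosen "high-risk" benchmark (0.5%)

def error_ratio(calls, errors):
    # High-severity errors (numerator) over total calls (denominator).
    return errors / calls

for name, calls, errors in projects:
    r = error_ratio(calls, errors)
    flag = "⚠️" if r > THRESHOLD else "✅"
    print(f"{name}: {r:.2%} {flag}")
```

Running this flags only the Summarizer, matching the table: its 96 errors are fewer in absolute terms than the Classifier's 150, but proportionally larger.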
Metrics and Review Cadence
Operationalising the Denominator Problem requires a disciplined set of metrics and a regular cadence for review. Below is a ready‑to‑use metric suite and a calendar template that small teams can adopt with minimal overhead.
Core Metric Set
| Metric | Definition | Calculation | Owner | Frequency |
|---|---|---|---|---|
| Denominator‑Adjusted Incident Ratio (DAIR) | Ratio of incidents to total relevant interactions | Incidents / Interactions | Analyst | Daily |
| Risk Category Coverage | Percentage of risk categories (privacy, fairness, safety, reliability) with at least one tracked incident | CoveredCategories / TotalCategories × 100 | Compliance Officer | Weekly |
| Regulatory Alignment Score | Weighted score based on alignment with OECD AI monitor and EU AI Act criteria | Sum of (criterion met × weight) | Legal Lead | Monthly |
| Remediation Lead Time | Average time from incident detection to fix deployment | Σ(TimeToFix) / IncidentCount | Engineer | Bi‑weekly |
| Model‑Level Denominator Drift | Change in denominator (e.g., request volume) over the last 30 days | CurrentDenominator / Prior30DayDenominator − 1 | Data Engineer | Weekly |
Implementation tip: Store these metrics in a lightweight time‑series database (e.g., InfluxDB) and visualise them on a Grafana dashboard. Use alert thresholds tied to your own "high‑risk" definition (e.g., DAIR > 0.5 % triggers a Slack alert); the EU AI Act informs which use-cases count as high-risk, but the numeric cut-off is yours to set.
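A minimal sketch of the threshold-to-alert wiring, assuming a Slack-style incoming webhook. The function name, threshold, and webhook handling here are placeholders, not a fixed API.

```python
# Sketch: compute DAIR and post an alert when it breaches the threshold.
# The threshold and webhook URL are placeholders; adapt them to your stack.
import json
import urllib.request

DAIR_THRESHOLD = 0.005  # the team's "high-risk" cut-off (0.5%)

def check_dair(incidents, interactions, webhook_url=None):
    """Return the DAIR; post a Slack-style alert if it breaches the threshold."""
    dair = incidents / interactions
    if dair > DAIR_THRESHOLD and webhook_url:
        payload = json.dumps({"text": f"DAIR {dair:.2%} exceeds threshold"}).encode()
        req = urllib.request.Request(
            webhook_url, data=payload,
            headers={"Content-Type": "application/json"},
        )
        urllib.request.urlopen(req)  # fire the alert
    return dair

# Dry run with no webhook configured: just reports the ratio.
print(f"{check_dair(12, 3_000):.2%}")  # → 0.40%
```

In production this would run on the same daily schedule as the DAIR updater, with the webhook URL supplied from configuration rather than hard-coded.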
Review Cadence Blueprint
| Cadence | Meeting | Participants | Agenda Items | Deliverables |
|---|---|---|---|---|
| Daily Stand‑up | Incident Ratio Sync | ML Engineer, Analyst, Product Lead | Review DAIR for the past 24 h, flag spikes | Updated incident log |
| Weekly Ops Review | Metrics Health Check | Data Engineer, Compliance Officer, Team Lead | Discuss Risk Category Coverage, Denominator Drift, upcoming releases | Action item list, updated risk register |
| Bi‑weekly Remediation Review | Fix‑Focus Workshop | Engineers, QA Lead, Legal Lead | Analyse Remediation Lead Time, prioritize high‑ratio incidents | Revised remediation backlog |
| Monthly Governance Review | Regulatory Alignment Session | Legal Lead, Compliance Officer, Senior Management | Evaluate Regulatory Alignment Score, map to OECD AI monitor updates, plan policy tweaks | Governance report, policy amendment draft |
| Quarterly Strategy Sync | Risk Roadmap Planning | All stakeholders | Aggregate metric trends, set next‑quarter risk thresholds, allocate resources | Updated risk roadmap, budget justification |
Checklist for each meeting:
- Verify that the denominator data source is current (e.g., API health check).
- Confirm that any new incident has been classified and logged within 4 hours of detection.
- Cross‑check metric calculations against raw logs to catch formula drift.
- Document decisions in a shared Confluence page with clear owners and due dates.
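The formula-drift cross-check in the list above can itself be scripted. A hedged sketch, assuming raw events are tagged as incidents or interactions (the event names and tolerance are illustrative):

```python
# Sketch: recompute DAIR from raw log rows and compare it with the value
# shown on the dashboard, to catch formula drift. Event names are illustrative.
raw_log = [
    {"event": "incident"},
    {"event": "interaction"},
    {"event": "interaction"},
    {"event": "interaction"},
    {"event": "interaction"},
]
dashboard_dair = 0.25  # value currently displayed on the dashboard

incidents = sum(1 for e in raw_log if e["event"] == "incident")
interactions = sum(1 for e in raw_log if e["event"] == "interaction")
recomputed = incidents / interactions

assert abs(recomputed - dashboard_dair) < 1e-9, "formula drift detected"
print(f"cross-check passed: DAIR = {recomputed:.2%}")  # → cross-check passed: DAIR = 25.00%
```

If the assertion fails, the dashboard formula and the raw logs have diverged and the metric pipeline should be audited before the next review.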
Automation Scripts (inline, no fences)
- DAIR updater (Python‑like pseudo‑code): `dair = incidents_today / interactions_today; push_metric('DAIR', dair)`
- Alert trigger (shell‑style one‑liner): `if [ $(echo "$dair > 0.005" | bc) -eq 1 ]; then curl -X POST -d '{"text":"DAIR exceeds threshold"}' https://hooks.slack.com/services/...; fi`
- Regulatory score calculator (Excel formula): `=SUMPRODUCT(--(CriteriaMetRange), WeightRange)`
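The DAIR updater pseudo-code above can be fleshed out into a runnable sketch. `push_metric` is stubbed here because the metrics backend is team-specific; swap the stub for your real time-series write.

```python
# Runnable version of the DAIR updater pseudo-code above. push_metric is a
# stub standing in for a real time-series write (backend is team-specific).
def push_metric(name, value, sink):
    sink.append((name, value))  # replace with e.g. an InfluxDB write

def update_dair(incidents_today, interactions_today, sink):
    dair = incidents_today / interactions_today
    push_metric("DAIR", dair, sink)
    return dair

metrics = []
print(update_dair(3, 1_000, metrics))  # → 0.003
```

Keeping the sink as an explicit parameter makes the updater trivially testable before it is pointed at a live database.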
Ownership matrix:
| Role | Primary Responsibility | Secondary Responsibility |
|---|---|---|
| ML Engineer | Implement denominator data pipelines, maintain incident tagging logic | Participate in daily DAIR sync |
| Data Engineer | Ensure interaction logs are accurate, monitor denominator drift | Support weekly ops review |
| Analyst | Compute core metrics, validate calculations | Prepare monthly governance report |
| Compliance Officer | Track risk category coverage, map to OECD AI monitor | Lead weekly ops review |
| Legal Lead | Align metrics with EU AI Act requirements, update regulatory score | Chair monthly governance review |
| Product Lead | Prioritise feature gates based on denominator‑adjusted risk | Attend daily stand‑up for rapid feedback |
By embedding these metrics into a predictable cadence, a small team can continuously surface the denominator gap, keep risk assessments aligned with regulatory expectations, and maintain a transparent audit trail for stakeholders.
Tooling and Templates
To lower the barrier for small teams, we provide a curated toolbox and ready‑made templates that directly address the Denominator Problem. All resources are open‑source or freely available, making them suitable for bootstrapped startups or research labs.
1. Incident Logging Spreadsheet (Google Sheets)
- Tabs: `Incidents`, `Denominator Sources`, `Metrics Dashboard`.
- Key columns in the Incidents tab: `ID` (auto‑increment), `Timestamp` (UTC), `Component` (dropdown: Model, Data Pipeline, UI), `Risk Category` (dropdown: Privacy, Fairness, Safety, Reliability), `Numerator` (e.g., affected users), `Denominator Ref` (link to the Denominator Sources tab), `Severity` (1‑5), `Owner` (email).
- Denominator Sources tab: store API endpoints, query snippets, and last‑retrieved values.
- Metrics Dashboard tab: use `IMPORTRANGE` and `QUERY` functions to compute DAIR, risk coverage, and trend charts.
How to use:
- Clone the template from the public repo (https://github.com/techpolicy/denominator-toolkit).
- Share with the team, assign edit rights to owners.
- Set up a daily Google Apps Script trigger that pulls the latest denominator values via HTTP GET and writes them to the Denominator Sources tab.
2. CI/CD Integration Snippet (GitHub Actions)
    name: Denominator Metrics
    on:
      schedule:
        - cron: '0 2 * * *'  # runs daily at 02:00 UTC
    jobs:
      compute-metrics:
        runs-on: ubuntu-latest
        steps:
          - name: Checkout repo
            uses: actions/checkout@v3
          - name: Pull interaction stats
            run: |
              INTERACTIONS=$(curl -s ${{ secrets.INTERACTIONS_API }})
              INCIDENTS=$(curl -s ${{ secrets.INCIDENTS_API }})
              DAIR=$(python -c "print(round($INCIDENTS/$INTERACTIONS,4))")
              echo "DAIR=$DAIR" >> $GITHUB_ENV
          - name: Push metric to Grafana
            run: |
              curl -X POST -H "Authorization: Bearer ${{ secrets.GRAFANA_TOKEN }}" \
                -d "{\"metric\":\"dair\",\"value\":${{ env.DAIR }}}" \
                https://grafana.example.com/api/metrics
Benefits: Automates denominator‑adjusted risk calculation, feeds directly into monitoring dashboards, and eliminates manual spreadsheet updates.
3. Risk Register Template (Markdown)
## Risk Register – Project XYZ
| ID | Description | Denominator | Numerator | Ratio | Category | Owner | Mitigation |
|----|-------------|-------------|-----------|-------|----------|-------|------------|
| R1 | Gender bias in recommendation | 12,000 daily recs | 96 flagged | 0.80 % | Fairness | ML Engineer | Retrain on balanced data |
| R2 | Data leakage via API logs | 500,000 API calls | 5 exposures | 0.001 % | Privacy | Data Engineer | Mask PII fields |
| R3 | Model drift causing false positives | 30,000 fraud checks | 150 errors | 0.50 % | Safety | Product Lead | Deploy drift monitor |
Usage tips:
- Update the Denominator column each sprint to reflect the latest usage volume.
- The Ratio column is calculated automatically by a simple spreadsheet formula (`=Numerator/Denominator`).
- Prioritise items where the Ratio exceeds the team's high‑risk threshold (e.g., 0.5 %).
4. Policy Checklist (One‑Page PDF)
| ✅ | Item | Description | Owner |
|---|---|---|---|
| ☐ | Denominator Definition | Every risk metric must reference a clear denominator (e.g., requests, users, transactions). | Compliance Officer |
| ☐ | Incident Classification | All AI‑related incidents are logged with risk category and severity. | Analyst |
| ☐ | Regulatory Mapping | Each metric is mapped to at least one OECD AI monitor indicator or EU AI Act article. | Legal Lead |
| ☐ | Review Cadence | Metrics are reviewed on the schedule defined in the "Metrics and Review Cadence" section. | Team Lead |
| ☐ | Automation | Scripts for DAIR calculation and alerting are version‑controlled and run in CI. | ML Engineer |
Distribution: Export the checklist as a PDF and embed it in the team's onboarding wiki. New hires sign off on the checklist within their first week.
5. Open‑Source Libraries
| Library | Language | Core Feature | Repo |
|---|---|---|---|
| `denom‑metrics` | Python | Simple wrapper to compute numerator/denominator ratios from Pandas DataFrames | https://github.com/techpolicy/denom-metrics |
| `risk‑gate` | JavaScript | Feature‑gate utility that blocks deployment until ratio thresholds are met | https://github.com/techpolicy/risk-gate |
| `ai‑audit‑cli` | Go | CLI tool to pull incident logs, compute ratios, and emit Slack alerts | https://github.com/techpolicy/ai-audit-cli |
Integration tip: Add the library as a dependency in your `requirements.txt` or `package.json`. Use the provided example functions (`compute_ratio`, `check_gate`) to replace ad‑hoc scripts with maintainable code.
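If you prefer a dependency-free starting point, here is a hypothetical sketch of what `compute_ratio` and `check_gate` might look like; the signatures are assumptions for illustration, not the libraries' actual API.

```python
# Hypothetical, dependency-free stand-ins for the compute_ratio / check_gate
# helpers mentioned above; the real libraries' signatures may differ.
def compute_ratio(numerator, denominator):
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return numerator / denominator

def check_gate(ratio, threshold=0.005):
    # True means the gate is open (safe to deploy), mirroring the
    # risk-based feature gate described earlier.
    return ratio <= threshold

r = compute_ratio(96, 12_000)  # Summarizer row from the benchmark table
print(r, check_gate(r))  # → 0.008 False
```

Guarding against a non-positive denominator matters here: a zero denominator usually signals a broken telemetry feed, which should fail loudly rather than silently open the gate.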
By leveraging these tools and templates, a small team can institutionalise denominator‑aware risk assessment without building a heavyweight governance stack. The combination of concrete metrics, regular review cadence, and ready‑to‑use automation ensures that the Denominator Problem becomes a manageable operational concern rather than an abstract theoretical obstacle.
