Governing with Confidence: Building Public Sector AI Safeguards
AI Governance Frameworks for the Public Sector
Part 1: Why Government AI Needs Different Rules
Historical Record
Belastingdienst
In 2023, the Dutch tax authority Belastingdienst unwound an algorithmic system that had been used to flag families for childcare benefit fraud, which disproportionately targeted households with dual nationalities.
This case became the most documented AI failure in government history and established a foundational principle that accountability cannot be delegated to algorithms.
The Dutch childcare scandal is the most documented AI failure in government history, but it is not unique. Similar patterns have emerged in welfare fraud detection in Australia, predictive policing in Chicago, and hiring algorithms used by UK civil service departments. In each case, the core failure was not the technology itself, it was the absence of a governance framework: a structured set of policies, roles, oversight mechanisms, and accountability rules that determine how AI tools are selected, deployed, monitored, and corrected. Without that structure, AI in government becomes a decision-making black box that citizens cannot question and officials cannot explain.
This lesson is about building that structure. You do not need to understand how AI models are trained to do this work. What you need to understand is how governance frameworks operate, what they include, how different governments and agencies have built them, and what practical steps public sector professionals at every level can take to ensure AI is used responsibly. Whether you work in a city planning office, a federal procurement team, a public health agency, or a school district, the principles here apply directly to the tools your organization is already using or considering.
What 'AI Governance' Actually Means in Practice
The Principle Behind the Scandal: Accountability Cannot Be Automated
The Dutch case teaches a principle that runs through every major AI governance framework published since 2019: accountability cannot be delegated to an algorithm. When a government official makes a decision that harms a citizen, denying a benefit, flagging someone for investigation, rejecting a permit, that official can be questioned, overruled, and held responsible. When an algorithm makes the same decision, that chain of accountability breaks unless governance structures deliberately preserve it. The EU's AI Act, which came into force in 2024 and directly affects public sector bodies, is essentially a legal codification of this principle. High-risk AI systems, defined as those making decisions about benefits, employment, education, law enforcement, and border control, require human oversight, transparency documentation, and the ability to explain decisions to affected individuals.
What this means for public sector managers is concrete and immediate. If your department is using an AI tool, even something as routine as Microsoft Copilot summarizing case files, or a chatbot answering citizen queries, you need to be able to answer: who approved this deployment, what data does it use, who reviews its outputs, and what happens when it produces something wrong? Most teams using AI tools today cannot answer all four questions. That gap is where governance frameworks come in. They do not slow down AI adoption, they make it defensible, which in government is what makes it sustainable.
The good news is that governance does not require a dedicated AI ethics department or a team of data scientists. The most effective frameworks in government, from Singapore's Model AI Governance Framework to the US Office of Management and Budget's AI guidance, are built around roles and processes that already exist in most agencies: procurement review, legal sign-off, performance monitoring, and public communication. The challenge is not inventing new structures. It is applying existing accountability habits to a new category of tool.
Prompt
I manage a mid-sized city planning department. We are considering using AI tools like Microsoft Copilot and a third-party permit processing AI to help staff handle higher application volumes. I need to draft a one-page AI use policy for my department that covers: what tools are permitted, what human oversight is required, how decisions affecting applicants are documented, and what our review process is. Write a clear, plain-language draft policy suitable for a non-technical team of 40 staff.
AI Response
City Planning Department. AI Use Policy (Draft v1.0) Purpose This policy sets out how artificial intelligence tools may be used within the Planning Department to support staff in their work. It does not replace staff judgment. All decisions affecting applicants remain the responsibility of the designated human officer. Approved Tools The following tools are currently approved for departmental use: Microsoft Copilot (for drafting correspondence, summarizing documents, and preparing internal reports) and PermitAssist AI (for initial permit application screening). No other AI tools may be used for work-related tasks without written approval from the Department Head. Human Oversight Requirements Any AI-generated output that informs a decision affecting an applicant, including permit approvals, rejections, or requests for further information, must be reviewed and signed off by a qualified Planning Officer before it is sent or acted upon. Staff may not send AI-generated correspondence to applicants without editing and approving it themselves. Documentation Where AI tools have been used in preparing a decision, this must be noted in the case file. Officers must record: which tool was used, what the tool produced, and what changes (if any) the officer made before finalizing the decision. Review Process The Department Head will review AI tool performance quarterly. Any instance where an AI output is found to be inaccurate, biased, or inappropriate must be reported to the Head within 48 hours. The department will conduct a full policy review annually. Questions Contact the Department Head or the designated AI Policy Lead with any questions about this policy.
A Different Sector, The Same Problem: Healthcare Administration in the NHS
The UK's National Health Service has been one of the most active public sector adopters of AI tools, and one of the most scrutinized. In 2020, a triage algorithm used in some NHS trusts to prioritize patient care during the COVID-19 surge was found to underweight age and disability factors in ways that conflicted with NHS equality duties. The tool itself was not malicious. The problem was procurement: trusts had adopted vendor-supplied AI tools quickly, under emergency conditions, without the governance checks that would normally apply. No clinical governance committee had reviewed the algorithm's logic. No equality impact assessment had been completed. The NHS's own published guidance on AI, updated significantly in 2023, now requires exactly these steps before any AI tool is used in clinical or administrative decision-making.
For non-clinical NHS managers, those running HR, finance, communications, or facilities, the lesson is just as relevant. When a hospital trust uses AI to screen job applications, schedule staff rotas, or prioritize maintenance requests, those are still decisions with real consequences for real people. The governance principle is the same whether the stakes are clinical or administrative: document what the tool does, who approved it, and how errors are caught. The NHS Digital governance framework now provides a practical checklist for exactly this, it takes a procurement manager roughly half a day to complete, and it creates a defensible record that the organization acted responsibly.
| Framework | Jurisdiction | Who It Applies To | Key Requirements | Enforcement |
|---|---|---|---|---|
| EU AI Act (2024) | European Union | All public sector bodies in EU member states | Risk classification, human oversight for high-risk AI, transparency to affected individuals, incident reporting | National market surveillance authorities; fines up to €30M or 6% global turnover |
| OMB AI Guidance M-24-10 (2024) | United States Federal Government | All US federal agencies | Designated Chief AI Officer per agency, annual AI use inventories, rights-impacting AI safeguards | OMB oversight; agencies must report compliance annually |
| Singapore Model AI Governance Framework (2020, updated 2023) | Singapore | Public and private sector organizations | Internal governance, risk assessment, human oversight, stakeholder communication | Voluntary for private sector; increasingly mandatory for government agencies |
| UK AI Regulation White Paper (2023) | United Kingdom | Public and regulated private sector | Proportionality, transparency, accountability, contestability, redress | Sector regulators (ICO, CQC, Ofcom) enforce within their domains |
| Canada Directive on Automated Decision-Making (2019, updated 2023) | Canada Federal Government | All federal departments and agencies | Impact assessment before deployment, algorithmic impact levels 1-4, mandatory human review for high-impact decisions | Treasury Board Secretariat; non-compliant systems must be suspended |
From Policy to Practice: A City Manager in Melbourne
When the City of Melbourne began piloting AI tools for resident services in 2022, including a chatbot for permit inquiries and an AI-assisted system for reviewing development applications, the project manager, a career local government administrator with no technical background, faced a practical governance problem. Her team of 12 was enthusiastic about the tools but had no shared understanding of what they were allowed to do with them. She did not have a data scientist on staff. What she did have was a procurement process, a legal team, and a performance review cycle. Her solution was to map governance requirements onto those existing structures rather than creating new ones.
She added four questions to the department's standard procurement checklist: Does this tool make or influence decisions about residents? If yes, can those decisions be explained to the resident in plain language? Is there a process for residents to challenge or escalate? And who in our team is responsible for reviewing the tool's outputs monthly? Those four questions, added to a form that already existed, created a governance layer without requiring any new infrastructure. Within six months, two tools the team had been considering were rejected at procurement because the vendors could not answer question two. That rejection was not a failure of innovation. It was governance working exactly as it should.
Start With Your Procurement Checklist, Not a New Policy
Building a Governance Mindset Before Building a Governance Document
One of the most common mistakes public sector teams make when approaching AI governance is treating it as a documentation exercise. They write a policy, file it, and return to business as usual. Real governance is a mindset shift, not a document. It means that every time someone on your team suggests adopting a new AI tool, even a free one, even a pilot, even just 'trying it out', there is a shared instinct to ask: what decisions will this touch, and who is accountable for those decisions? That instinct does not come from reading a policy. It comes from practicing the questions repeatedly until they become automatic.
Canada's federal government discovered this the hard way. When the Treasury Board introduced its Directive on Automated Decision-Making in 2019, many departments initially treated it as a compliance form, filling in the algorithmic impact assessment quickly and moving on. A 2021 review found that a significant number of assessments were incomplete or inaccurate, not because staff were dishonest but because they did not fully understand what the directive was asking. The fix was not a stricter form. It was training: short, role-specific workshops that helped procurement officers, HR managers, and program leads understand what the questions actually meant and why they mattered. Governance literacy, understanding the purpose behind the rules, turned out to be more valuable than the rules themselves.
This is where tools like ChatGPT and Microsoft Copilot can actually support governance rather than complicate it. A policy manager can use Claude or ChatGPT to draft an initial AI use policy, simulate how that policy would apply to specific scenarios, or generate a checklist tailored to their department's functions. A procurement officer can use Copilot to compare vendor documentation against governance requirements. These are not technical tasks, they are professional tasks that happen to involve AI tools. The governance mindset applies to these uses too: the human reviews the output, takes responsibility for the decision, and keeps a record. The tool assists; the professional decides.
Goal: Identify every AI tool currently in use across your team or department, assess each against four core governance questions, and produce a one-page summary that could be shared with a manager or governance committee.
1. Open a blank document or spreadsheet and create four columns: Tool Name, What Decisions It Influences, Can Decisions Be Explained to Affected People (Yes/Partial/No), and Who Is Currently Responsible for Reviewing Outputs. 2. Spend 20 minutes listing every AI tool your team uses, include obvious ones (Microsoft Copilot, ChatGPT, Grammarly AI) and less obvious ones (any automated screening, scheduling, or routing tools embedded in your existing software). 3. For each tool, write a one-sentence description of what decisions or outputs it influences, be specific about whether those outputs affect external people (citizens, clients, applicants) or are purely internal. 4. For each tool, assess whether a decision it influences could be explained in plain language to someone affected by it, mark Yes, Partial, or No, and note why if Partial or No. 5. For each tool, write the name and role of the person currently responsible for reviewing its outputs. If no one currently has this role, write 'Unassigned.' 6. Highlight any tool where the answer to column three is 'No' or where column four is 'Unassigned', these are your governance gaps. 7. Open ChatGPT or Claude and paste your completed table with this prompt: 'Based on this AI tool audit, draft a brief summary of our governance gaps and suggest three practical steps to address the highest-priority gap. Write it for a non-technical manager.' 8. Review and edit the AI-generated summary, correct anything inaccurate, add context specific to your organization, and note any suggestions that are not feasible. 9. Save the final document as your team's AI Governance Baseline, date it, and schedule a reminder to update it in 90 days.
Key Lessons So Far
- AI governance failures in government are almost never caused by the technology alone, they are caused by the absence of accountability structures around how the technology is used.
- The Dutch childcare scandal, NHS triage algorithm issues, and Australia's welfare fraud system all share a common failure: deployment without documented oversight, explainability requirements, or appeals processes.
- Accountability cannot be delegated to an algorithm. Every AI-influenced decision affecting a citizen must have a named human responsible for it.
- Major governance frameworks, the EU AI Act, US OMB guidance, Canada's Directive on Automated Decision-Making, share four core requirements: risk assessment before deployment, human oversight for high-stakes decisions, transparency to affected people, and ongoing performance monitoring.
- The most practical governance approach for non-technical managers is to embed AI-specific questions into existing processes: procurement, legal review, equality impact assessment, and performance monitoring.
- Governance literacy, understanding why the rules exist, is more valuable than the rules themselves. Training and repeated practice of key questions builds the instinct that makes governance real.
- AI tools like ChatGPT, Claude, and Microsoft Copilot can actively support governance work: drafting policies, simulating scenarios, and comparing vendor claims against requirements, as long as a human reviews and owns the output.
When Governance Fails: Learning From the Frontlines
In 2019, the Dutch government deployed an AI system called SyRI. System Risk Indication, to flag citizens at high risk of welfare fraud. The algorithm quietly cross-referenced 17 government databases, including tax records, employment history, and even vehicle registration data. Municipalities used its outputs to target investigations. What looked like efficient public administration turned into a landmark legal case. A Dutch court ruled in 2020 that SyRI violated the European Convention on Human Rights, specifically the right to private life. The government had no published documentation of how the algorithm made decisions, no meaningful way for citizens to challenge the system's conclusions, and no independent oversight board reviewing its outputs.
The SyRI case wasn't a technology failure. The algorithm did what it was designed to do. It was a governance failure. Nobody had built the accountability structures, the documentation requirements, the appeals processes, the transparency obligations, that should surround any automated system making consequential decisions about citizens' lives. The Dutch government dismantled SyRI entirely. Years of development and deployment costs, gone. More importantly, public trust in government data practices took a serious hit that persisted long after the headlines faded.
The principle extracted from SyRI is blunt: technical capability and governance readiness are not the same thing. A department can acquire an AI tool, integrate it into existing workflows, and start producing outputs, all before anyone has answered the fundamental question of what happens when that system gets something wrong. For public sector managers, this is the core tension of AI governance. You are accountable to citizens in ways that private companies simply are not. Getting governance right isn't bureaucratic overhead. It is the job.
What 'AI Governance' Actually Means Day-to-Day
The Three Pillars That Actually Hold Up a Governance Framework
After the SyRI ruling, the Dutch case became required reading for public sector AI teams across Europe. Policy analyzts dissecting the judgment identified three structural absences that turned a functioning algorithm into a legal liability. First, there was no algorithmic transparency, the government could not explain, in plain terms, how the system weighted different data points. Second, there was no meaningful redress mechanism, citizens flagged by SyRI had no practical way to challenge their designation. Third, there was no independent oversight, the municipalities using the system were also responsible for auditing it, which is like asking a department to audit itself.
These three absences map directly onto what governance researchers now call the core pillars of responsible public sector AI: transparency, accountability, and independent oversight. They sound abstract until you apply them to a concrete situation. A city housing department using AI to prioritize repair requests needs transparency about what factors the algorithm uses. It needs accountability structures so a specific person answers when a household is incorrectly deprioritized. And it needs oversight from someone outside the department, an ombudsperson, an audit function, a citizen review board, who can spot systemic bias before it becomes a court case.
What makes these pillars genuinely difficult in government is that they create friction. Transparency requirements slow down procurement. Accountability structures require someone to own decisions that were previously diffused across systems and teams. Independent oversight introduces external scrutiny that managers sometimes experience as interference. The temptation, especially in under-resourced departments, is to treat governance as a compliance checkbox rather than an operational reality. The SyRI example shows exactly where that temptation leads.
Putting Governance Into a Real Workflow: A Housing Authority Example
Prompt
I manage AI implementation for a mid-sized city housing authority. We are considering using an AI tool to help prioritize maintenance repair requests, there are about 4,000 open requests at any time. I need to draft a one-page internal policy that covers: how the AI tool will be used, what decisions it can and cannot make on its own, how residents can challenge an AI-assisted decision, and who in our department is responsible if something goes wrong. Write this policy in plain language suitable for both staff and residents.
AI Response
**Housing Authority AI Assistance Policy. Maintenance Prioritization** **What This Policy Covers** This policy explains how the Housing Authority uses AI software to help manage the order in which maintenance repairs are scheduled. AI assists our staff, it does not replace their judgment. **How We Use AI** Our AI tool reviews open repair requests and suggests a priority ranking based on factors including safety risk, time waiting, and household vulnerability indicators. Staff review these suggestions before any scheduling decision is made. No repair request is automatically rejected or indefinitely delayed based solely on an AI output. **What AI Cannot Decide Alone** The AI tool does not have authority to close a request, deny a repair, or override a staff member's assessment. All final scheduling decisions are made by a named Housing Authority employee. **Your Right to Challenge a Decision** If you believe your repair request has been incorrectly prioritized, you may request a manual review by contacting the Resident Services team at [contact details]. A staff member, not the AI system, will review your case within 5 business days and provide a written explanation of the decision. **Who Is Responsible** The Operations Manager holds overall accountability for how this AI tool is used. The name and contact details of the current Operations Manager are posted on our website and updated quarterly. **Policy Review** This policy is reviewed every 12 months. The next review date is [date].
A Different Sector, the Same Lesson: Predictive Policing in Los Angeles
The Los Angeles Police Department ran a predictive policing program called PredPol, later rebranded Geolitica, for nearly a decade. The software analyzed historical crime data to generate daily maps showing areas where crime was statistically more likely to occur. Officers were directed to patrol those zones more intensively. On paper, it was data-driven policing. In practice, researchers at the RAND Corporation and journalists at Wired documented a feedback loop: police patrolled high-crime-prediction areas more, found more crime there simply because they were looking, and fed that data back into the algorithm, which then ranked those same areas as even higher risk. The system was amplifying historical enforcement patterns, many of which reflected decades of racially biased policing, and presenting the output as neutral prediction.
Los Angeles discontinued PredPol in 2020 following sustained community pressure and a city council motion. What the LAPD case adds to the SyRI story is a specific governance concept: feedback loops in AI systems. A governance framework for any AI tool making predictions about people, whether welfare recipients, housing applicants, or patrol zones, must include a mechanism for detecting when the system is reinforcing its own past outputs rather than reflecting ground truth. This is not a technical task. It is a management task. Someone in a leadership role needs to periodically ask: are the outputs of this system telling us something real, or are they telling us what we already believed?
Comparing Governance Approaches: What Different Jurisdictions Require
| Jurisdiction / Framework | Key Requirement | Who It Applies To | Enforcement Mechanism | Strongest Feature |
|---|---|---|---|---|
| EU AI Act (2024) | High-risk AI systems must register, document, and conduct conformity assessments before deployment | All public and private sector AI deployers in EU member states | National market surveillance authorities; fines up to €30M or 6% global revenue | Legally binding risk classification with clear prohibited uses |
| US Executive Order on AI (Oct 2023) | Federal agencies must report AI use cases; safety testing for frontier models | Federal agencies and contractors | OMB oversight; no direct financial penalties for agencies | Requires government-wide AI use case inventory |
| UK AI Regulation White Paper (2023) | Principles-based approach; existing regulators apply AI rules in their sectors | Sector-specific (health, finance, transport) | Sector regulators (CQC, FCA, etc.) | Flexibility for sector-specific application |
| Canada Directive on Automated Decision-Making | Algorithmic Impact Assessment required before deploying automated systems | Federal government departments | Treasury Board Secretariat review; mandatory before go-live | Tiered impact assessment based on decision severity |
| Singapore Model AI Governance Framework | Voluntary framework with detailed implementation guidance | Private and public sector organizations | Advisory only; no penalties | Highly practical operational guidance for practitioners |
The table above reveals something important: governance frameworks are not uniform, and 'compliance' means very different things depending on where you work. A Canadian federal department faces mandatory algorithmic impact assessments with Treasury Board review before any automated decision system goes live. A UK local council operates under sector-specific guidance from existing regulators, which means the framework looks different for a health-adjacent service versus a planning department. This variation matters for public sector professionals because it means you cannot simply copy-paste a governance model from another country or even another department. You need to understand the specific obligations in your jurisdiction.
The HR Director Who Built a Governance Framework From Scratch
Consider a Director of Human Resources at a mid-sized Canadian provincial government ministry who was asked in 2022 to evaluate whether AI resume-screening tools could help reduce a backlog of over 800 open applications. She had no AI background. What she did have was a clear mandate to hire fairly, a legal obligation under provincial human rights legislation, and a healthy skepticism about vendor claims. Before approving any pilot, she ran the ministry through Canada's Algorithmic Impact Assessment framework, a structured questionnaire that assigns risk levels to automated systems based on factors like the reversibility of decisions, the vulnerability of affected populations, and the volume of people impacted. The assessment came back at Level 2, meaning significant human oversight was required at every stage.
She structured the pilot accordingly. The AI tool could flag applications for human review, but could not remove any application from consideration without a hiring manager explicitly confirming the decision. Every rejection had to include a documented reason from a human reviewer, not just an algorithm score. She also built in a six-month review where a random sample of AI-flagged rejections were re-evaluated by a different hiring manager. The result was a defensible, auditable process that her deputy minister could explain to a parliamentary committee if needed. The AI tool reduced the review backlog by 40 percent. More importantly, the ministry had a documented record of how every hiring decision was made, something that became valuable when a rejected candidate filed a human rights complaint 18 months later. The complaint was resolved quickly because the documentation existed.
Use AI to Draft Your Governance Documents, Then Have Humans Validate Them
Building Governance Into Your Department Before You Need It
The most expensive governance frameworks are the ones built in response to a crisis. The Dutch government didn't build SyRI's accountability structures until after a court forced them to, at which point there was nothing to build because the system was being shut down. The LAPD didn't examine PredPol's feedback loops until community advocates and journalists did the analyzis and went public. Reactive governance protects no one. Proactive governance, building accountability structures before a system goes live, is what allows a department to keep using AI tools when problems emerge, because it has the documentation and processes to demonstrate responsible use.
For most non-technical public sector managers, proactive governance starts with a simple inventory. What AI tools is your department currently using? This question is less trivial than it sounds. 'AI tools' now includes the AI writing assistance built into Microsoft 365 Copilot, the automated email sorting in Outlook, the chatbot on your department's public website, and the risk-scoring model in your case management software. Many departments are using five or more AI-assisted systems without a comprehensive list of what they are, who approved them, and what decisions they influence. Building that inventory is the first governance action any manager can take this week, without any technical expertise.
Once the inventory exists, the next step is applying a risk lens. Not every AI tool requires the same level of governance attention. A tool that helps staff draft internal memos carries fundamentally different risk than a tool that scores benefit applications. The EU AI Act's risk tiers, unacceptable, high, limited, minimal, provide a practical sorting framework even for jurisdictions where the Act doesn't legally apply. Running your inventory through a simple four-question filter (Does it affect citizens directly? Are decisions reversible? Could it discriminate? Is it explainable to a non-expert?) gives you a prioritized governance workplan without requiring a data science degree.
Goal: Create a documented inventory of AI tools currently in use in your team or department, with a basic risk assessment for each, suitable for sharing with senior leadership.
1. Open a shared document or spreadsheet and create columns for: Tool Name, Primary Function, Who Approved It, Which Staff Use It, What Decisions It Influences, and Estimated Risk Level. 2. Spend 15 minutes listing every software tool your team uses that includes any automated, AI-assisted, or algorithm-driven feature, include Microsoft Copilot, chatbots, case management scoring, scheduling software, and document drafting tools. 3. For each tool, open ChatGPT Plus or Claude Pro and ask: 'I work in [your department type] for a [level of government]. We use [tool name] for [primary function]. Based on the EU AI Act risk tiers, what risk level would this likely fall under and why? Keep your answer to 3-4 sentences.' 4. Record the AI-generated risk assessment in your spreadsheet alongside your own judgment about whether you agree. 5. For any tool rated 'high risk,' add a fifth column: 'Governance Gap', note whether you have documented approval, an appeals process, and named accountability for that tool. 6. Identify the single highest-risk tool in your inventory where a governance gap exists. Use Claude Pro or ChatGPT Plus to draft a one-paragraph internal policy statement for that tool using the prompt: 'Draft a plain-language internal policy statement for [tool name] used by [department] for [function]. It should cover permitted use, human oversight requirements, and who is accountable.' 7. Share the completed inventory and draft policy statement with your line manager or equivalent, framed as a starting point for a broader departmental AI governance review. 8. Schedule a 30-minute team meeting to walk through the inventory together and identify any tools you missed. 9. Save the document with a review date set for six months from today.
Key Lessons From This Section
- Governance failures are more common than technology failures in public sector AI. SyRI and PredPol both worked as designed; neither had adequate accountability structures.
- The three non-negotiable pillars of public sector AI governance are transparency (explainability), accountability (named human responsibility), and independent oversight (external review).
- AI systems that make predictions about people can create feedback loops that amplify historical bias, detecting this is a management responsibility, not a technical one.
- Governance frameworks vary significantly by jurisdiction: Canadian federal departments face mandatory algorithmic impact assessments; EU public bodies face legally binding risk tiers; UK departments apply sector-specific regulatory guidance.
- A department AI inventory, a simple list of every AI-assisted tool in use, with a basic risk assessment, is the foundational governance document every manager should create before anything else.
- AI tools like ChatGPT Plus and Claude Pro can draft governance documents, impact assessments, and internal policies in minutes; human review and legal validation remain essential before finalizing.
- Proactive governance built before a system goes live is significantly less costly, in time, money, and public trust, than reactive governance built after a problem becomes public.
When Frameworks Meet Reality: Making AI Governance Work in Practice
In 2022, the city of Rotterdam deployed an algorithmic system to flag welfare fraud. On paper, the governance looked solid, technical documentation, internal sign-offs, a project charter. But a Dutch court ordered the system suspended after finding it disproportionately targeted residents from specific ethnic backgrounds. The city had a framework. What it lacked was a governance process with real teeth: independent audits, community input, and a clear accountability chain that could catch bias before deployment, not after a lawsuit. Rotterdam's experience became a cautionary reference point across European public administration, not because the city was reckless, but because it was typical.
The Rotterdam case illustrates the central tension in AI governance for public agencies: the gap between having a policy and living it. Governments worldwide have produced hundreds of AI ethics guidelines, principles documents, and strategy papers. The harder problem is operationalizing them, making governance show up in procurement decisions, vendor contracts, staff training, and service delivery, not just in PDF documents on a ministry website. The principle extracted from Rotterdam is blunt: governance without enforcement mechanisms is just documentation.
What Rotterdam eventually built afterward is instructive. The city created a cross-functional AI review board that included a civil society representative and a legal officer with veto authority. Procurement now requires vendors to submit algorithmic impact assessments before any contract is signed. Frontline case workers received training on how to flag anomalous AI outputs. These weren't revolutionary changes, they were structural ones. Accountability was assigned to named people with actual authority to stop a deployment. That is the operational difference between governance as theater and governance as practice.
The EU AI Act Changes the Stakes
Governance Through the Lens of Three Real Teams
Canada's federal government launched its Algorithmic Impact Assessment (AIA) tool in 2019 through the Treasury Board Secretariat. Before any automated decision system can go live in a federal department, teams must complete the AIA, a structured questionnaire that scores a system's risk level and triggers proportionate requirements. Low-risk tools need basic documentation. High-risk tools require peer review, bias audits, and deputy minister approval. The AIA is publicly available online, updated regularly, and used by non-technical policy analyzts, not just IT departments. It is one of the most cited examples globally of governance embedded into process rather than bolted on afterward.
Canada's approach works because it removed discretion from the early stages. Teams cannot self-certify that their AI tool is low-risk without completing a structured assessment that a compliance officer reviews. The governance isn't asking people to be ethical, it's designing a workflow where the ethical checkpoint is unavoidable. That is the operational lesson: embed the governance gate into the process, not the culture.
Prompt
I work in a city government procurement office. We are about to evaluate three vendors offering AI tools for processing permit applications. I need a checklist our team can use during vendor evaluation to assess each tool against our governance requirements: transparency, fairness, data privacy, human oversight, and vendor accountability. Write a practical checklist with 5 categories, 3-4 questions per category, suitable for non-technical reviewers.
AI Response
**AI Vendor Governance Evaluation Checklist. Permit Processing Tools** **1. Transparency** - Can the vendor explain, in plain language, how the system makes or supports decisions? - Is a summary of the model's logic available to affected applicants on request? - Does the vendor provide documentation of the training data sources? - Are system updates and version changes communicated proactively? **2. Fairness and Bias** - Has the system been tested for disparate outcomes across demographic groups (age, income, ethnicity)? - Can the vendor provide bias audit results from an independent third party? - What is the process for flagging and correcting discriminatory outputs? **3. Data Privacy** - Where is applicant data stored, and for how long? - Is the system compliant with applicable privacy legislation (e.g., GDPR, PIPEDA, state law)? - Who has access to individual applicant data within the vendor's organization? **4. Human Oversight** - Can a human reviewer override any AI recommendation before a decision is finalized? - Is there an audit log of every AI-assisted decision? - What training does the vendor provide to staff who use the system? **5. Vendor Accountability** - What are the contractual remedies if the system produces discriminatory outcomes? - Does the vendor carry liability insurance for AI-related harms? - What is the vendor's process for responding to a public complaint about the system?
The New Zealand government took a different angle. Its Algorithm Charter, signed by 21 government agencies in 2020, commits signatories to specific practices: publishing plain-language explanations of algorithms that affect citizens, maintaining human oversight, and releasing annual transparency reports. The Charter is voluntary, but it created public accountability. Agencies that signed and then failed to publish their annual reports faced media scrutiny and parliamentary questions. Voluntary governance can have real enforcement pressure when it is public and named.
New Zealand's experience shows that public commitment is itself a governance mechanism. When an agency's name is on a published charter with specific obligations, the reputational cost of non-compliance becomes real. This is particularly useful for governments that lack the legal infrastructure to mandate compliance but can create accountability through transparency. It won't stop every failure, but it raises the floor.
| Country/Agency | Governance Mechanism | Who It Covers | Enforcement Type | Key Strength |
|---|---|---|---|---|
| Canada. Treasury Board | Algorithmic Impact Assessment (AIA) | All federal departments | Mandatory process gate | Embedded in procurement workflow |
| EU Member States | EU AI Act conformity assessments | Public + private sector | Binding law with fines | Legal enforceability |
| New Zealand | Algorithm Charter | 21 signatory agencies | Voluntary + public accountability | Transparency and named commitment |
| Rotterdam, Netherlands | Cross-functional AI review board | City-level deployments | Internal veto authority | Cross-sector oversight |
| United States (Federal) | OMB AI Memorandum M-24-10 | Federal agencies | Policy directive | Minimum risk management standards |
At the individual role level, consider a senior HR director in a state health department who oversees AI-assisted screening tools for frontline hiring. She doesn't write code. She doesn't set IT policy. But she is the person who signs off on hiring decisions influenced by an AI tool's candidate rankings. Under most governance frameworks, including the EU AI Act's employment category, she is an accountable human in the oversight chain. Her practical governance responsibility includes understanding what the tool scores candidates on, whether those criteria have been validated for bias, and how she would explain a rejection decision to a candidate who appeals.
This HR director's governance work isn't technical. It's asking the right questions of her IT and vendor teams, documenting her oversight role, and knowing when to escalate. That is exactly what governance frameworks are trying to produce at scale: decision-makers who are informed, accountable, and willing to say 'we need to stop and check this' before a deployment goes wrong.
Your Governance Role Doesn't Require a Technical Background
Putting Governance Into Practice This Week
Governance frameworks only matter when they change what people actually do. For a non-technical professional in a public sector role, the most immediate application is knowing what questions to ask before your team adopts any AI tool, whether that's an AI-powered scheduling assistant, a document summarization tool, or a predictive analytics dashboard from a vendor. The Canadian AIA is publicly available and free to use as a reference. You can adapt its question categories, impact on rights, data sensitivity, decision automation level, even if your agency has no formal process yet.
The second practical step is mapping accountability. For any AI tool currently in use in your team, write down three names: who approved the tool, who monitors its outputs, and who a citizen would contact if they believed the tool produced an unfair outcome. If you cannot fill in all three names, you have identified a governance gap. That gap is more useful to know about now than after a complaint or a court order. This exercise takes twenty minutes and requires no technical knowledge.
The third step is language. Public sector teams often lack the vocabulary to talk about AI risk in procurement meetings, budget discussions, or ministerial briefings. AI tools like ChatGPT and Claude can help you draft governance memos, translate technical vendor documentation into plain language, prepare briefing notes on specific regulations like the EU AI Act, and generate evaluation questions for vendor pitches. The goal is to show up to governance conversations with structured, specific questions, not vague concerns about 'AI risk.' Precision makes governance real.
Goal: Produce a one-page governance summary for an AI tool currently in use in your workplace, identifying accountability gaps and recommended next steps.
1. Choose one AI tool your team currently uses, this could be a chatbot, a document tool, an analytics dashboard, or even a vendor-supplied screening tool. Write down its name and its primary function. 2. Open ChatGPT (free) or Claude (free) and paste this prompt: 'I work in a public sector organization. We use [tool name] for [function]. Help me write five governance questions I should be able to answer about this tool, covering: transparency, fairness, data privacy, human oversight, and accountability.' 3. Copy the AI-generated questions into a document. Add a column next to each question labeled 'Current Answer' and a second column labeled 'Gap / Action Needed.' 4. Answer each question based on what you currently know. Leave blanks where you don't know the answer, those blanks are your governance gaps. 5. Identify who in your organization would know the answers you couldn't fill in. IT lead, vendor contact, legal/compliance officer. Write their names next to the relevant gaps. 6. Return to ChatGPT or Claude and ask: 'Draft a short internal memo (150 words) recommending that our team schedule a governance review of [tool name], referencing the need to address transparency, oversight, and accountability gaps.' 7. Edit the memo with your specific findings, add your name and role, and send it or save it as a record that you initiated a governance review. 8. Optional: Compare your questions against Canada's publicly available Algorithmic Impact Assessment tool at canada.ca to see if there are additional risk categories relevant to your tool. 9. File your completed audit document with a date. If your agency has no formal AI governance process, this document is the start of one.
Key Lessons from This Topic
- Governance without enforcement is documentation. Frameworks only work when they are embedded in procurement workflows, approval processes, and named accountability roles, not just published as principles.
- The EU AI Act creates binding legal obligations for many public sector AI tools. High-risk categories include benefits assessment, law enforcement, border control, and employment screening.
- Canada's Algorithmic Impact Assessment is a free, publicly available model that non-technical teams can adapt to evaluate AI tools before deployment.
- Voluntary governance can carry real weight when it is public, named, and tied to transparency reporting, as New Zealand's Algorithm Charter demonstrates.
- Non-technical professionals are accountable parties in AI governance. Your role is to ask the right questions, document oversight decisions, and know your escalation path, not to audit the code.
- Mapping three names, who approved a tool, who monitors it, who handles complaints, is the fastest way to identify a governance gap in your current operations.
- AI tools like ChatGPT and Claude are practical aids for governance work: drafting evaluation checklists, translating vendor documentation, writing briefing notes, and preparing procurement questions.
Takeaways
- Real governance shows up in workflows, not just policy documents. Rotterdam's failure and subsequent reform illustrate both sides of this.
- The gap between having a governance framework and operationalizing it is where most public sector AI failures happen.
- You do not need technical expertise to govern AI responsibly, you need the right questions, named accountability, and a willingness to escalate.
- Free AI tools can help you do governance work right now: drafting checklists, writing memos, preparing vendor evaluation questions.
- The most important governance action you can take this week is identifying one AI tool in your team's workflow and asking: who is accountable if this produces an unfair outcome?
This lesson requires Pro
Upgrade your plan to unlock this lesson and all other Pro content on the platform.
You're currently on the Free plan.
