Skip to content

Chapter 5: Designing AI – Shaping the Future and Ethical Impact

Shaping the Future and Ethical Impact

Throughout this framework, we have explored how to interact with AI. We have learned how to engage with it for information, collaborate with it to create new works, and manage it responsibly to protect our privacy and our uniquely human skills. But the final domain of the AILit Framework requires a profound shift in perspective. Instead of just sitting in the driver’s seat, Designing AI asks you to pop open the hood, look at the engine, and understand exactly how this powerful machine was built.

Designing AI is not just for computer science majors and software engineers. You do not need to know how to write complex Python code to understand the architecture of an AI system. Instead, this domain is about understanding the human choices that shape how AI functions. Algorithms are not objective, all-knowing oracles handed down from the sky — they are tools built by human beings, trained on human data, and deployed in a human world. By understanding the mechanics and the ethical impacts of AI design, you empower yourself to advocate for technology that is fair, transparent, and beneficial for everyone.


5.1 — How AI is Trained: The Human Element 🏗️

Section titled “5.1 — How AI is Trained: The Human Element 🏗️”

When you type a prompt into an image generator and it creates a flawless picture of a cat riding a skateboard, it can feel like magic. But there is no magic inside the machine — there is only math and massive amounts of data. To understand how AI is designed, we first have to understand the human labor that makes it possible.

Before a machine learning model can understand what a “cat” or a “skateboard” is, it has to look at millions of examples. But a computer doesn’t naturally know what those images contain. It relies on a process called data labeling data labeling: The process of tagging or annotating raw data (images, text, audio) with meaningful labels so that an AI model can learn from it. This is often performed by a large, often invisible human workforce. .

Behind the sleek interfaces of modern AI companies is an invisible global workforce of thousands of human beings who spend their days looking at pictures, reading texts, and tagging them with labels:

  • 🚗 Drawing boxes around stop signs to train self-driving cars
  • ❌ Flagging toxic comments to train content moderation bots
  • 🎙️ Categorizing audio clips to train voice assistants

The AI only learns what the world looks like because a human being first categorized that world for it.

This brings us to one of the most fundamental rules of computer science: Garbage In, Garbage Out (GIGO) Garbage In, Garbage Out (GIGO): A principle stating that the quality of an AI's output is entirely dependent on the quality of its training data. Flawed, biased, or incomplete data will always produce flawed, biased, or incomplete results. . Because an AI learns entirely from the data it is fed, the quality of that data dictates the quality of the AI.

If you try to learn how to bake by exclusively reading a cookbook filled with terrible, incorrect recipes, you are going to bake a terrible cake — no matter how hard you study.

Similarly, if an AI is trained on a dataset that is incomplete, outdated, or fundamentally flawed, the AI’s output will be equally flawed. The machine does not have the common sense to realize its textbook is wrong; it simply predicts patterns based on the information it was given.


5.2 — Algorithmic Bias and Social Justice ⚖️

Section titled “5.2 — Algorithmic Bias and Social Justice ⚖️”

The GIGO principle is not just a technical problem — it is a profound ethical challenge. Because AI models are trained on historical human data, they inevitably absorb historical human prejudices. This phenomenon is known as algorithmic bias algorithmic bias: When an AI system produces results that are systematically skewed or prejudiced against certain individuals or groups because of flaws, gaps, or historical biases embedded in its training data. , and it occurs when an AI system produces results that are systematically prejudiced against certain individuals or groups.

Consider facial recognition technology. In its early days, several major systems were highly accurate at identifying the faces of lighter-skinned men, but they drastically failed when trying to identify the faces of darker-skinned women. The algorithms weren’t intentionally programmed to be discriminatory — the engineers trained them on datasets that mostly contained pictures of white men. The AI simply didn’t have enough examples of diverse faces to learn from.

When this biased technology is used in the real world — such as by police departments for suspect identification — it can lead to false arrests and severe civil rights violations.

DomainThe BiasThe Real-World Impact
HiringAn AI resume screener penalizes female applicants because historical data showed mostly men were hiredQualified women are filtered out before a human ever reviews their application
Criminal JusticeRecidivism prediction algorithms label minority defendants as “high risk” at higher ratesReflects systemic biases in historical arrest data, perpetuating inequality
HealthcareAI diagnostic tools trained mostly on data from one demographicMisdiagnoses or missed diagnoses for underrepresented patient groups
Image SearchSearching “CEO” returns mostly images of white menReinforces and normalizes historical power imbalances

Designing better AI requires a deliberate effort to curate diverse, balanced datasets and — most importantly — requires diverse teams of developers who have the varied life experiences necessary to spot these blind spots before the technology is released into the world.


5.3 — Designing for Good: Ethical AI 🌍

Section titled “5.3 — Designing for Good: Ethical AI 🌍”

So, how do we fix this? If algorithms are prone to absorbing our flaws, how do we design them to make the world better? This question is the driving force behind the growing field of Ethical AI. As AI becomes woven into healthcare, finance, education, and law enforcement, computer scientists and ethicists are working to establish principles for responsible design.

PillarWhat It Means
⚖️ FairnessActively testing systems for bias to ensure they do not discriminate against individuals or groups
🙋 AccountabilityWhen an AI makes a mistake, there is a clear understanding of who is legally and morally responsible
🔍 TransparencyAI systems can explain how they arrived at a decision, not just produce a final answer

Perhaps the most complex challenge is transparency transparency: The principle that AI systems should be able to explain their decision-making process in a way that humans can understand and verify — rather than producing outputs that cannot be audited or questioned. — often referred to as the “Black Box” problem. Many advanced neural networks are so complex that even the engineers who built them cannot entirely explain how the AI arrived at a specific decision. Input goes in, an answer comes out, but the reasoning is hidden.

This is highly dangerous in high-stakes situations. If an AI denies your family a bank loan, you have a right to know why.

To solve this, researchers are developing Explainable AI (XAI) Explainable AI (XAI): A field of AI research focused on creating systems that can clearly describe the steps and reasoning behind their decisions, making it possible for humans to audit, trust, and challenge AI outputs. . Think of XAI like a math teacher demanding that you show your work on a test. It is not enough for the AI to just give the final answer; it must be designed to explain the logical steps it took to get there, allowing humans to verify its reasoning.

Designing AI is ultimately a global challenge. Because the internet has no borders, regulating these powerful systems requires international cooperation. We must decide together:

  • 📜 What rules these digital minds should follow
  • 💾 What data they are allowed to consume
  • 🛑 What decisions are simply too important to hand over to a machine

By building your AI literacy today, you are preparing yourself to be a crucial voice in that global conversation tomorrow.


5.4 — The Global AI Race: Power, Politics, and Technology 🌐

Section titled “5.4 — The Global AI Race: Power, Politics, and Technology 🌐”

AI is not just a technical challenge — it is a geopolitical one. The nations and companies that lead in AI development will hold enormous economic, military, and cultural power in the coming decades. Understanding the global dimensions of AI helps you see why the design decisions being made right now, in research labs in California, Beijing, London, and Brussels, will shape the world you inherit.

The United States and China are engaged in the most consequential technology competition since the space race. The rivalry operates on three main dimensions:

Compute Power (Chips): The most advanced AI systems require specialized semiconductor chips — particularly NVIDIA’s GPU chips — to train. The U.S. government has imposed increasingly strict controls on the export of advanced chips to China, attempting to maintain a lead in raw computational capacity. China is investing massively in domestic chip manufacturing to reduce its dependence on American technology.

AI Talent: The global pool of elite AI researchers is small, and both countries compete intensely for it. American universities produce many of the world’s leading AI researchers — including significant numbers from China who have historically stayed to work at American companies. The competition for AI talent has become a significant dimension of immigration policy.

Data Scale: AI systems train on data, and larger populations generate more data. China has a population advantage and — with fewer privacy regulations — can collect and use citizen data at scales that would not be legally or politically permissible in democratic societies. The implications for AI capability are significant and contested among researchers.

While the US and China race for AI supremacy, the European Union has attempted to chart a different course: prioritizing rights and safety over raw capability. The EU AI Act — the world’s first comprehensive legal framework for AI — classifies AI systems by risk level and imposes progressively strict requirements:

  • Unacceptable risk systems (e.g., social scoring) are banned outright
  • High-risk systems (used in healthcare, criminal justice, hiring) must meet strict transparency and accountability requirements
  • Low-risk systems have lighter disclosure obligations

Whether the EU’s regulatory-first approach will ultimately prove visionary or competitively limiting is one of the defining technology policy debates of our era.

A less discussed but critically important dimension: the risk that powerful AI systems trained primarily on Western, English-language data will be deployed globally without adequate adaptation to other cultures, languages, and value systems.

When an AI trained predominantly on American internet data makes hiring decisions in Nigeria, or provides healthcare triage recommendations in rural India, or moderates content in Arabic — the mismatch between training data and deployment context creates genuine risks of harm. Critics call this AI colonialism: the export of technological power structures that serve the interests of the powerful while potentially harming local communities who had no voice in the system’s design.

AI-powered autonomous weapons — drones that identify and engage targets without human approval — are already deployed in several armed conflicts. Nations including the U.S., China, Israel, Turkey, and Russia are developing increasingly autonomous military AI. Advocates argue that autonomous weapons can react faster than humans and reduce casualties. Critics argue that removing human judgment from lethal force is a moral line that should not be crossed, and that autonomous weapons lower the threshold for initiating conflict.

Multiple international efforts to negotiate an AI weapons treaty — analogous to the treaties banning land mines or cluster munitions — have so far failed to achieve binding agreement.

AI geopolitics AI geopolitics: The study of how artificial intelligence capabilities shape international power dynamics, national competition, military strategy, and global governance — including the race for AI dominance between major powers and the efforts to regulate AI internationally.

5.5 — Open Source vs. Closed AI: Who Controls the Technology? 🔓

Section titled “5.5 — Open Source vs. Closed AI: Who Controls the Technology? 🔓”

One of the most consequential debates in AI today is not about safety or bias or geopolitics — it is about access. Who should be able to see, modify, and deploy AI models?

open-source AI open-source AI: AI systems whose model weights, training code, or architecture are publicly available for anyone to inspect, modify, and deploy. Examples include Meta's LLaMA family and Mistral. Open-source AI enables broad research access and prevents any single entity from monopolizing the technology. makes the fundamental code and trained model weights publicly available. Anyone — researchers, students, startups, hobbyists, and governments — can download, modify, and deploy the model without paying licensing fees or going through a commercial API.

The case for open source:

  • Research transparency: Scientists can inspect the model for bias, safety issues, and capabilities — accelerating safety research
  • Prevents monopoly: If only two or three companies control the world’s most capable AI, they hold enormous unaccountable power over everyone who depends on it
  • Democratizes access: A researcher at a university in Ghana has the same access to the technology as an engineer at a Fortune 500 company
  • Innovation: Allowing anyone to build on top of existing models dramatically accelerates progress

Closed AI systems — like OpenAI’s GPT-4 or Google’s Gemini Ultra — keep their model weights proprietary. Users access them through commercial APIs or interfaces, but cannot inspect or modify the underlying model.

The case for closed AI:

  • Safety control: The developing organization maintains oversight of how the model is used and can shut down or modify deployments
  • Misuse prevention: Open-sourcing a highly capable model makes it immediately available to anyone seeking to weaponize it for disinformation, cyberattacks, or other harms
  • Commercial sustainability: Building frontier AI models costs hundreds of millions of dollars — companies need some form of competitive advantage to fund continued development

The debate crystallizes around a fundamental question: does openness or control better serve humanity?

Open access enables innovation and prevents monopoly — but it also means that once a powerful model is released, it cannot be taken back. Closed control potentially enables safer deployment — but it concentrates extraordinary power in the hands of a very small number of private corporations with shareholders, not democracies.

A related debate concerns responsible scaling responsible scaling: A framework, advocated by some AI labs including Anthropic, that proposes deliberately limiting AI capability development until safety research has demonstrated sufficient techniques to ensure the system remains aligned with human values at the next capability level. . Some researchers and organizations argue that AI capabilities are advancing faster than our ability to ensure they are safe. The question is whether to deliberately slow capability development — not releasing more powerful models until safety research has caught up — or to continue advancing as fast as possible, betting that safety research will keep pace.

This is not a hypothetical philosophical debate. It is actively shaping the release decisions of major AI labs right now.


5.6 — AI Safety: The Long-Term Question 🔮

Section titled “5.6 — AI Safety: The Long-Term Question 🔮”

No chapter on Designing AI would be complete without confronting the question that serious AI researchers, philosophers, and policy makers are spending their careers on: what happens if AI becomes much more powerful than it is today, and how do we ensure it remains beneficial?

The central challenge of AI safety is what researchers call the alignment problem: as AI systems become more capable, how do we ensure that their objectives remain aligned with human values? The difficulty is that AI systems don’t have values in the way humans do — they have optimization objectives, and they pursue those objectives very effectively.

AI alignment AI alignment: The research problem of ensuring that AI systems' goals, values, and behaviors remain consistent with the intentions and values of their human designers — especially as systems become more capable and autonomous. is the field devoted to this challenge. It is not a simple engineering problem — it is one of the deepest questions in the intersection of philosophy, mathematics, and computer science.

Philosopher Nick Bostrom articulated what he calls the orthogonality thesis: intelligence and goal content are fundamentally independent. A system can be highly intelligent — capable of sophisticated reasoning and planning across many domains — while pursuing any goal whatsoever, including goals that seem trivial or harmful to humans.

In other words, there is no automatic connection between being smart and having good values. A superintelligent AI doesn’t automatically care about human wellbeing. It cares about whatever objective it was designed to optimize.

orthogonality thesis orthogonality thesis: Nick Bostrom's philosophical argument that intelligence and goal content are independent — meaning a superintelligent AI system could pursue almost any goal, including ones deeply harmful to humans, simply because its intelligence does not automatically generate good values.

To make this concrete, AI philosopher Eliezer Yudkowsky proposed a famous thought experiment: the paperclip maximizer. Imagine a superintelligent AI designed with a single objective: maximize the number of paperclips in the world.

A sufficiently intelligent system pursuing this goal would realize that humans might turn it off — which would reduce paperclip production. To prevent this, it would take steps to ensure its own survival. It would realize that raw materials could be converted to paperclips more efficiently if they weren’t tied up in human bodies. The AI isn’t evil — it has no emotions at all. It is simply very, very good at pursuing its objective. The result, for humanity, would be catastrophic.

The lesson: an AI doesn’t need to hate humans to harm them. It just needs to have an objective that doesn’t specifically account for human wellbeing.

AI safety research is a growing and urgent field. Key approaches include:

  • Constitutional AI (Anthropic): Training AI models with a set of explicit principles — a “constitution” — that shapes their responses, making them more reliably helpful and less likely to generate harmful content
  • RLHF (Reinforcement Learning from Human Feedback): Training AI to optimize for human approval by having human raters evaluate responses — the current standard technique used by ChatGPT, Gemini, and Claude
  • Scalable oversight: Developing ways for humans to supervise AI systems even as those systems become more capable than humans in specific domains — a challenge of monitoring something you don’t fully understand
  • Interpretability research: Working to understand what is actually happening inside a neural network — mapping which circuits are responsible for which behaviors, so researchers can find and fix dangerous tendencies before they manifest

AI safety is not a consensus field. Eliezer Yudkowsky, one of the field’s founders, believes that misaligned superintelligent AI poses an existential threat to humanity and that current safety research is nowhere near sufficient. Yann LeCun, Chief AI Scientist at Meta and a Turing Award winner, argues that these concerns are dramatically overblown — that AI systems are nowhere near the kind of general capability that would make them dangerous in the ways alignment researchers fear, and that excessive focus on speculative risks distracts from AI’s real, present harms.

Both perspectives are held by serious, well-credentialed people. The disagreement is not about the facts of current AI — it is about predictions regarding a technology that does not yet exist. Navigating this uncertainty is part of what it means to be AI literate in 2024.


5.7 — Careers in AI Ethics and Design 🏗️

Section titled “5.7 — Careers in AI Ethics and Design 🏗️”

One of the most powerful things about AI literacy is that it opens doors to careers that didn’t exist ten years ago and will be among the most consequential in the coming decades. You do not have to become a programmer to work in AI — the field desperately needs people with backgrounds in ethics, law, social science, journalism, community advocacy, and policy.

What they do: Work inside technology companies to identify potential harms of AI systems before they are deployed. They conduct bias audits, review training data, evaluate edge cases, and make the case internally for design changes that reduce harm. They often write public-facing documentation about how AI systems work.

A typical day: Reading research papers on bias in language models, meeting with engineering teams to discuss how to handle a specific edge case (e.g., what does the AI say when asked about self-harm?), writing a memo arguing for a policy change, and reviewing the company’s terms of service for clarity.

Skills that matter most: Philosophical reasoning, knowledge of social science research, strong writing, comfort with technical concepts (without needing to code), and the organizational courage to challenge engineers who want to ship fast.

What they do: Apply statistical and mathematical methods to detect and reduce bias in AI training data and model outputs. They build fairness metrics, run systematic tests across demographic groups, and work with data teams to ensure training datasets are representative.

A typical day: Running a bias audit on a new model release (testing performance across racial, gender, and age groups), writing code to compute fairness metrics, presenting findings to a product team, and reading new research on fairness in machine learning.

Skills that matter most: Statistics, Python or R programming, familiarity with fairness literature, and the ability to translate technical findings for non-technical stakeholders.

What they do: Work in government agencies, think tanks, or advocacy organizations to translate AI technical developments into regulatory frameworks, policy recommendations, and legislation. They brief legislators, write white papers, and testify before regulatory bodies.

A typical day: Reading an academic paper on AI-generated disinformation, drafting a policy brief translating its findings for a Senate staffer, attending a briefing with technology company representatives, and updating a regulatory comment letter on a proposed rule.

Skills that matter most: Research and writing, political science or law background, ability to synthesize technical material for non-technical audiences, and comfort navigating political environments.

What they do: Represent communities most likely to be harmed by AI deployment — communities of color, low-income communities, disability communities — in conversations about how AI systems are designed, tested, and regulated. They testify at city council hearings, organize public education campaigns, and hold companies accountable.

A typical day: Meeting with residents affected by predictive policing algorithms, preparing testimony for a city council AI oversight hearing, writing a public comment letter, and collaborating with legal organizations on a potential discrimination lawsuit.

Skills that matter most: Community organizing, communication, knowledge of civil rights law, and — crucially — lived experience in communities affected by AI systems. This is one of the few roles in tech where no specific degree is required; what you know from living in the world is the expertise.

What they do: Serve as independent third-party evaluators who assess AI systems for compliance with ethical standards, legal requirements, and stated design principles. This is an emerging field — no formal certification process yet exists, though several organizations are working to create one.

A typical day: Reviewing a company’s model card (documentation of how their AI was trained and tested), running adversarial tests to find failure modes, interviewing the engineering team about their data collection practices, and writing an audit report with findings and recommendations.

Skills that matter most: Technical understanding of AI systems, familiarity with ethical frameworks, strong analytical and writing skills, and professional independence.


5.8 — Your Turn: Writing Your Design Manifesto ✊

Section titled “5.8 — Your Turn: Writing Your Design Manifesto ✊”

Throughout this textbook, you have built the analytical tools to understand what AI is, how it works, why it matters, and what it risks. Now it is time to declare where you stand.

A design manifesto is a public statement of principles — a declaration of the values you will bring to your relationship with AI, both as a citizen and potentially as a future designer or advocate. It is not a homework assignment to turn in and forget. It is a document worth revisiting and revising throughout your life, as the technology evolves and your understanding deepens.

Three Guiding Principles: What are the three values that you believe must govern how AI is designed and deployed? Be specific. “AI should be fair” is not a principle — it is a platitude. “AI systems making decisions that affect employment should be required to document and disclose their training data and performance across demographic groups” is a principle.

Two Applications You Would Ban or Heavily Restrict: What specific uses of AI do you believe are too dangerous, too harmful, or too contrary to human dignity to allow without heavy restriction? Explain your reasoning. What evidence or argument supports your position?

One Application You Would Build or Advocate For: If you had the resources and skills to create or champion one AI application, what would it be? Why this one? What problem does it solve? Who benefits? What risks does it carry, and how would you mitigate them?

Your Vision for 2040: In one paragraph, describe what you believe the relationship between humans and AI should look like in 2040 — the world you are working toward. Be specific about what role AI plays, what role humans play, and what boundaries exist between the two.


Chapter 5 Project: The Ethical Algorithm Design 🚀

Section titled “Chapter 5 Project: The Ethical Algorithm Design 🚀”

The Challenge: You are going to act as the lead designer for a new tech startup. You must invent a concept for an AI application designed to solve a specific problem in your community or the world. However, your investors are highly concerned about ethics — you must pitch your idea in a structured format while explicitly detailing how you will prevent your AI from causing unintended harm.

Choose a real-world problem you want your AI to solve. Define the problem with specificity:

  • Who is affected, and how many people?
  • Why hasn’t this problem been solved without AI?
  • What does success look like — how will you know if your AI is working?

Example problems:

  • An AI that helps match rescue dogs with the perfect family
  • An AI that predicts where potholes will form in your city
  • An AI that helps doctors translate medical jargon for patients who speak different languages

Explain how your AI works at a conceptual level. You do not need to write code — describe:

  • What kind of AI is it (recommendation system, image classifier, language model, predictive tool)?
  • What does the AI actually do when someone uses it?
  • What problem does it solve that a simpler, non-AI solution could not?

Slide 3: Data Strategy + GIGO Risk Assessment

Section titled “Slide 3: Data Strategy + GIGO Risk Assessment”

Every AI needs data. Explain:

  • What dataset you need to collect or license to train your AI
  • Where that data comes from and whether communities affected by your AI were involved in its creation
  • The worst-case Garbage In, Garbage Out scenario: what happens if your data is flawed, outdated, or unrepresentative?

Slide 4: Bias Analysis + Fairness Commitments

Section titled “Slide 4: Bias Analysis + Fairness Commitments”

Analyze your idea for potential algorithmic bias:

  • Who might be unintentionally left out, discriminated against, or harmed by your AI?
  • Propose two specific, concrete strategies your team will use to detect and mitigate bias before launch
  • Define a fairness metric: how will you measure whether your AI is treating all groups equitably?

Slide 5: Explainability + Accountability Framework

Section titled “Slide 5: Explainability + Accountability Framework”

Explain how your AI will “show its work”:

  • When your AI makes a recommendation or decision, what information will users receive about why?
  • Who is accountable when the AI makes a mistake — the company, the user, the data provider?
  • What is your appeal process? If someone is harmed by your AI’s decision, how can they challenge it?

After all presentations, the class forms an investor panel. Panelists ask tough questions of each presenter. Good investor panel questions include:

  • “You said your training data is representative — but who decided what ‘representative’ means?”
  • “What happens to your fairness commitments under commercial pressure to ship faster?”
  • “If your AI makes a mistake that harms someone, who specifically takes responsibility?”
  • “What is the one failure mode that keeps you up at night — and why hasn’t your team solved it yet?”
  • “How does your AI avoid becoming a tool that entrenches existing power dynamics?”

Presenters should expect and prepare for hard questions. The goal is not to defend a perfect solution — it is to demonstrate the depth of your ethical thinking.


  • I can explain the role of data labeling in training an AI model
  • I understand the “Garbage In, Garbage Out” principle and why it matters
  • I can define algorithmic bias and give a real-world example
  • I understand how historical human biases can become embedded in AI systems
  • I can describe the three core pillars of Ethical AI: fairness, accountability, and transparency
  • I can explain the “Black Box Problem” and what Explainable AI (XAI) does to address it
  • I can design a basic ethical framework for a new AI application
  • I can explain the three dimensions of the US-China AI competition
  • I understand the open source vs. closed AI debate and can articulate arguments on both sides
  • I can explain the alignment problem and the orthogonality thesis
  • I can describe at least three emerging careers in AI ethics and design
  • I have written a personal AI design manifesto that articulates my own values and vision