## Welcome to the Second Lab - Week 1, Day 3

Today we will call several models from different providers. It's a good way to get comfortable with their APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different from other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submitted a PR with your changes in the community_contributions folder (instructions are in the resources). Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [1]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [2]:
# Always remember to do this!
load_dotenv(override=True)

True

In [3]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key exists and begins gsk_
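
The five if/else blocks above all follow the same pattern, so they can be collapsed into one loop. The sketch below is one way to do it; `KEY_SPECS` and `describe_key` are hypothetical names introduced here for illustration, and the prefix lengths are simply copied from the cell above.

```python
import os

# (environment variable, label, prefix length to show, optional?)
# Prefix lengths match the cell above; the tuple layout is only for illustration.
KEY_SPECS = [
    ("OPENAI_API_KEY", "OpenAI", 8, False),
    ("ANTHROPIC_API_KEY", "Anthropic", 7, True),
    ("GOOGLE_API_KEY", "Google", 2, True),
    ("DEEPSEEK_API_KEY", "DeepSeek", 3, True),
    ("GROQ_API_KEY", "Groq", 4, True),
]


def describe_key(label, value, prefix_len, optional=True):
    """Return a one-line status message that shows only the key's prefix."""
    if value:
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = " (and this is optional)" if optional else ""
    return f"{label} API Key not set{suffix}"


for env_var, label, prefix_len, optional in KEY_SPECS:
    print(describe_key(label, os.getenv(env_var), prefix_len, optional))
```

Adding another provider then means adding one tuple rather than another if/else block.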


In [4]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [5]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [6]:
openai = OpenAI()
response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
)
question = response.choices[0].message.content
print(question)


How would you assess the ethical implications of deploying artificial general intelligence in sectors where decisions directly impact human lives, and what frameworks would you propose to mitigate potential risks while maximizing benefits?


In [7]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [8]:
# The API we know well

model_name = "gpt-4o-mini"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors such as healthcare, criminal justice, finance, and autonomous systems necessitates a multifaceted approach that considers not only the potential benefits but also the risks associated with decision-making processes that impact human lives. Here are key considerations and frameworks to mitigate risks while maximizing benefits:

### Ethical Considerations

1. **Autonomy and Agency**:
   - **Value of Human Oversight**: AGI systems should support, rather than replace, human decision-making. Maintaining human agency ensures that individuals can contest and appeal against decisions that affect them.

2. **Bias and Fairness**:
   - **Data and Algorithmic Bias**: Given the potential biases in training data, it’s essential to implement strategies to mitigate forms of discrimination. Careful auditing of algorithms must be continuous, ensuring equitable outcomes across demographics.

3. **Transparency and Explainability**:
   - **Understanding Decisions**: AGI systems should be transparent and provide explanations for how decisions are made. This fosters trust among users and ensures accountability.

4. **Accountability**:
   - **Assigning Responsibility**: Establishing clear lines of accountability is crucial. There should be mechanisms in place to determine who is responsible for decisions made by AGI, especially when those decisions lead to negative outcomes.

5. **Safety and Security**:
   - **Robustness Against Malfunction**: Ensuring that AGI systems are reliable and secure from malicious attacks minimizes risks to human life.

6. **Impacts on Employment**:
   - **Re-skilling and Economic Displacement**: Evaluating the socio-economic implications of AGI deployment is necessary. Proactive measures should be taken to mitigate job loss and support workforce transitions.

### Frameworks for Mitigating Risks

1. **Ethics Advisory Boards**:
   - Establish boards comprising ethicists, domain experts, and community representatives to review AGI deployments and ensure alignment with ethical standards.

2. **Regulatory Frameworks**:
   - Implement regulations that set standards for AGI development and use, ensuring compliance with ethical guidelines and promoting responsible practices.

3. **Bias and Fairness Audits**:
   - Regularly conduct audits to assess data sources, algorithmic decisions, and outcomes to address bias and ensure fairness in AGI systems.

4. **Human-in-the-Loop Systems**:
   - Incorporate human oversight in critical decision-making processes. Ensure that humans have the final say in situations where AGI recommendations may have significant real-world implications.

5. **Public Engagement and Education**:
   - Engage with communities to educate them about AGI capabilities and limitations, fostering public discourse about its ethical implications and the importance of their input.

6. **Impact Assessments**:
   - Before deployment, conduct comprehensive assessments to evaluate the potential impacts on various stakeholders, including unintended consequences.

7. **Pilot Programs and Gradual Implementation**:
   - Utilize phased implementation strategies, starting with pilot programs that allow for adjustments based on real-world feedback before full-scale deployment.

8. **Continuous Monitoring and Adaptation**:
   - Develop systems for continuous monitoring of AGI performance in real-time. Be prepared to adapt systems based on feedback and outcomes, ensuring that ethical standards evolve along with technology.

9. **International Cooperation**:
   - Encourage collaboration among countries to create universally applicable ethical guidelines and regulatory measures, given the global nature of technology.

### Conclusion

The deployment of AGI in sectors affecting human lives presents both unprecedented opportunities and significant ethical challenges. By employing comprehensive assessments, establishing robust ethical frameworks, ensuring transparency, and maintaining human oversight, we can better navigate the complexities of AGI deployment to maximize benefits while minimizing risks. Continuous engagement with all stakeholders will also be vital for adaptively addressing the ethical implications of this transformative technology.

In [9]:
# Anthropic has a slightly different API, and max_tokens is required

model_name = "claude-3-7-sonnet-latest"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

BadRequestError: Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'Your credit balance is too low to access the Anthropic API. Please go to Plans & Billing to upgrade or purchase credits.'}}
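
Because the exception is raised before the two `append` lines run, this model never makes it into `competitors` or `answers`. If you would rather have the notebook report the failure and carry on, one option is a small wrapper like the sketch below. `ask_competitor` and the `ask` callable are hypothetical names introduced here for illustration; they are not part of either SDK.

```python
def ask_competitor(model_name, ask, competitors, answers):
    """Call ask() and record the answer; on failure, report it and skip the model.

    ask is any zero-argument callable that returns the answer text.
    """
    try:
        answer = ask()
    except Exception as error:  # broad on purpose: each SDK raises its own error types
        print(f"Skipping {model_name}: {error}")
        return None
    competitors.append(model_name)
    answers.append(answer)
    return answer


# Demo with stand-in callables so the sketch runs without any API key
demo_competitors, demo_answers = [], []


def failing_call():
    raise RuntimeError("credit balance is too low")


ask_competitor("model-a", lambda: "An answer", demo_competitors, demo_answers)
ask_competitor("model-b", failing_call, demo_competitors, demo_answers)
print(demo_competitors)
```

In this notebook, `ask` could be something like `lambda: claude.messages.create(model=model_name, messages=messages, max_tokens=1000).content[0].text`, so a provider with no credit is skipped instead of stopping the cell with a traceback.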

In [10]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.0-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Deploying Artificial General Intelligence (AGI) in sectors directly impacting human lives presents a complex web of ethical implications. We're talking about entrusting life-altering decisions to machines with unprecedented capabilities, raising fundamental questions about responsibility, fairness, and control.

**Ethical Implications:**

*   **Bias and Discrimination:** AGI trained on biased data could perpetuate and amplify existing societal inequalities, leading to discriminatory outcomes in healthcare, criminal justice, loan applications, and other critical areas. This goes beyond surface-level bias; AGI could learn subtle, embedded patterns leading to unforeseen discriminatory effects.
*   **Lack of Transparency and Explainability (Black Box Problem):**  As AGI systems become more complex, understanding how they arrive at decisions can become increasingly difficult. This "black box" nature makes it challenging to identify and correct errors or biases, eroding trust and accountability.  Imagine an AGI denying parole.  How do you challenge a decision when you can't understand its reasoning?
*   **Erosion of Human Autonomy and Dignity:** Over-reliance on AGI in decision-making could diminish human agency and control over our lives. In healthcare, for example, doctors might become overly dependent on AGI diagnoses, potentially neglecting their own judgment and intuition, leading to a deskilling effect.  This can undermine the doctor-patient relationship and the patient's right to informed consent.
*   **Accountability and Responsibility:** Determining who is responsible when an AGI system makes a mistake or causes harm is a significant challenge. Is it the developers, the deployers, the users, or the AGI itself? Current legal frameworks are not well-equipped to address this issue. Consider an AGI-driven autonomous vehicle causing an accident. Who is liable – the programmer, the manufacturer, or the AI system itself?
*   **Job Displacement and Economic Inequality:** AGI could automate many jobs currently performed by humans, leading to widespread job losses and increased economic inequality, particularly in sectors directly impacting lives, such as healthcare and education.
*   **Security and Malicious Use:** AGI systems could be vulnerable to hacking and manipulation, leading to potentially catastrophic consequences. Imagine a malicious actor gaining control of an AGI-powered air traffic control system or an AGI that manages critical infrastructure.
*   **Existential Risk:** Though more speculative, the long-term risk of AGI potentially exceeding human control and posing a threat to humanity itself cannot be completely dismissed, particularly if not aligned with human values.
*   **Privacy Concerns:** AGI often requires vast amounts of data for training and operation. The collection, storage, and use of this data could raise serious privacy concerns, particularly in sensitive domains like healthcare and criminal justice.

**Frameworks to Mitigate Risks and Maximize Benefits:**

To navigate these complex ethical challenges, I propose a multi-faceted framework incorporating the following elements:

1.  **Robust Ethical Guidelines and Regulations:**
    *   **Establish clear ethical principles:** Grounded in human rights, fairness, transparency, and accountability.  These principles should guide the development and deployment of AGI systems. Consider adopting principles like "beneficence" (acting in the best interests of individuals), "non-maleficence" (avoiding harm), and "justice" (fair distribution of benefits and burdens).
    *   **Develop enforceable regulations:**  Mandating impact assessments, transparency requirements, and safety standards for AGI systems in high-stakes sectors. These regulations should be dynamic and adaptive, evolving alongside AGI technology.  Examples could include mandatory audits for bias, certification processes for AGI used in critical systems, and requirements for explainable AI in certain domains.
    *   **International cooperation:**  Harmonizing ethical guidelines and regulations across different countries to prevent a "race to the bottom" and ensure consistent safety standards.

2.  **Emphasis on Transparency and Explainability:**
    *   **Develop explainable AI (XAI) techniques:**  Making AGI decision-making processes more transparent and understandable to human users. This might involve developing techniques that allow humans to query and understand the reasoning behind AGI decisions.
    *   **Implement auditability mechanisms:**  Allowing independent experts to review and assess the performance and fairness of AGI systems.
    *   **Create mechanisms for human oversight and intervention:** Ensuring that humans retain the ability to override or modify AGI decisions when necessary, particularly in situations with significant ethical implications.

3.  **Prioritizing Fairness and Non-Discrimination:**
    *   **Employ diverse and representative datasets:** To minimize bias in AGI training data. This requires active efforts to identify and correct biases in existing datasets and to collect new data that reflects the diversity of the population.
    *   **Develop bias detection and mitigation techniques:** To identify and correct bias in AGI algorithms. This includes using fairness metrics to evaluate the performance of AGI systems across different demographic groups and developing algorithms that are explicitly designed to be fair.
    *   **Implement ongoing monitoring and evaluation:** To track the performance of AGI systems and identify any unintended discriminatory outcomes.

4.  **Focus on Human-Centered Design:**
    *   **Involve stakeholders in the design process:** Including patients, doctors, lawyers, policymakers, and other affected individuals to ensure that AGI systems are aligned with human needs and values.
    *   **Design AGI systems to augment, not replace, human capabilities:** Focusing on tasks that AGI can perform more efficiently, while preserving human roles that require empathy, creativity, and critical thinking. Consider the "centaur" model where humans and AI collaborate, leveraging each other's strengths.
    *   **Provide training and education:**  Equipping individuals with the skills and knowledge they need to effectively use and interact with AGI systems.

5.  **Accountability and Legal Frameworks:**
    *   **Develop clear legal frameworks:**  Defining liability for harm caused by AGI systems. This might involve creating new legal concepts or adapting existing legal principles to address the unique challenges posed by AGI.
    *   **Establish independent oversight bodies:**  To monitor the development and deployment of AGI systems and to investigate incidents of harm.  These bodies should have the authority to investigate complaints, conduct audits, and impose sanctions.
    *   **Promote ethical AI development practices:**  Encouraging developers to adopt ethical guidelines and best practices in their work.

6.  **Long-Term Safety Research:**
    *   **Invest in research on AGI safety and alignment:**  To ensure that AGI systems are aligned with human values and goals. This includes research on topics such as AI safety, control, and ethics.
    *   **Develop methods for verifying and validating AGI systems:** To ensure that they behave as intended and do not pose a threat to human safety.

**Examples of Implementation:**

*   **Healthcare:** AGI used for diagnosis should be required to have explainable outputs, highlighting the factors contributing to the diagnosis and including a confidence score.  A human doctor should always be the final decision-maker, with the AGI serving as an assistant.  Data used to train the AGI must be carefully curated to avoid biases related to race, gender, or socioeconomic status.
*   **Criminal Justice:** AGI used for risk assessment should be transparent about the factors considered and the weight given to each factor.  Defendants should have the right to challenge the AGI's assessment and to have a human review the decision. The AGI should be regularly audited for bias.
*   **Autonomous Vehicles:**  Stringent safety standards and testing protocols should be required for AGI-powered self-driving cars.  Accident data should be analyzed to identify and correct any errors in the AGI's programming. Clear legal frameworks should be in place to determine liability in the event of an accident.

**Challenges and Considerations:**

*   **Defining "Human Values":** Agreement on a universal set of human values is difficult, leading to potential conflicts in AGI alignment.
*   **Balancing Innovation and Regulation:**  Overly restrictive regulations could stifle innovation in the AGI field.
*   **The "Alignment Problem":** Ensuring that AGI's goals are aligned with human goals is a complex technical challenge.
*   **The Difficulty of Prediction:**  It's difficult to anticipate all the potential ethical implications of AGI before it is fully developed.

**Conclusion:**

Deploying AGI in sectors impacting human lives is a double-edged sword. While it offers the potential for tremendous benefits, it also poses significant ethical risks.  By proactively addressing these risks through robust ethical guidelines, transparency, fairness, and a focus on human-centered design, we can harness the power of AGI to improve human lives while safeguarding our values and autonomy.  This requires a continuous, adaptive, and collaborative approach involving researchers, policymakers, industry leaders, and the public.  The future we create with AGI depends on the choices we make today.


In [11]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

APIStatusError: Error code: 402 - {'error': {'message': 'Insufficient Balance', 'type': 'unknown_error', 'param': None, 'code': 'invalid_request_error'}}

In [12]:
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "llama-3.3-70b-versatile"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors where decisions directly impact human lives is a complex task. To address this, I'll outline a framework for evaluating the ethical implications and propose strategies to mitigate potential risks while maximizing benefits.

**Ethical Implications:**

1. **Autonomy and Agency**: AGI systems may challenge traditional notions of human autonomy and agency, potentially leading to concerns about accountability and decision-making authority.
2. **Bias and Fairness**: AGI systems can perpetuate and amplify existing biases, resulting in unfair outcomes and discriminatory practices.
3. **Transparency and Explainability**: AGI systems may be opaque, making it difficult to understand the reasoning behind their decisions, which can erode trust and accountability.
4. **Safety and Security**: AGI systems can pose significant safety and security risks, particularly in high-stakes domains like healthcare, finance, and transportation.
5. **Human Rights and Dignity**: AGI systems may infringe upon human rights, such as privacy, freedom of expression, and dignity, particularly in situations where they are used for surveillance or manipulation.

**Frameworks for Mitigating Risks:**

1. **Value Alignment**: Ensure that AGI systems are designed to align with human values, such as fairness, transparency, and accountability.
2. **Human-Centered Design**: Involve humans in the design and development process to ensure that AGI systems are intuitive, transparent, and responsive to human needs.
3. **Robustness and Security**: Implement robust security measures to prevent AGI systems from being compromised or used for malicious purposes.
4. **Explainability and Transparency**: Develop techniques for explaining and interpreting AGI decisions, enabling humans to understand and trust the decision-making process.
5. **Governance and Regulation**: Establish regulatory frameworks and governance structures to oversee the development and deployment of AGI systems, ensuring that they are used responsibly and for the benefit of society.

**Proposed Frameworks:**

1. **AGI Development Guidelines**: Establish guidelines for AGI development, including principles for value alignment, human-centered design, and robustness and security.
2. **AGI Deployment Framework**: Develop a framework for deploying AGI systems, including protocols for testing, validation, and verification, as well as procedures for addressing potential risks and failures.
3. **AGI Governance Structure**: Establish a governance structure to oversee the development and deployment of AGI systems, including representation from diverse stakeholders, such as policymakers, industry leaders, and civil society organizations.
4. **AGI Ethics Review Board**: Create an ethics review board to evaluate the ethical implications of AGI systems and provide guidance on their development and deployment.
5. **AGI Research Agenda**: Establish a research agenda to investigate the long-term implications of AGI and identify areas where further research is needed to mitigate potential risks and maximize benefits.

**Benefits:**

1. **Improved Decision-Making**: AGI systems can provide more accurate and informed decision-making, particularly in complex and high-stakes domains.
2. **Increased Efficiency**: AGI systems can automate routine tasks, freeing humans to focus on higher-value tasks and improving overall efficiency.
3. **Enhanced Productivity**: AGI systems can augment human capabilities, leading to increased productivity and innovation.
4. **Better Healthcare**: AGI systems can help diagnose and treat diseases more effectively, leading to improved healthcare outcomes.
5. **Environmental Sustainability**: AGI systems can help optimize resource usage, reduce waste, and promote environmental sustainability.

By adopting a proactive and multidisciplinary approach to assessing the ethical implications of AGI and implementing frameworks to mitigate potential risks, we can maximize the benefits of AGI while minimizing its negative consequences.

## For the next cell, we will use Ollama

Ollama runs a local web service that exposes an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download and following the instructions.

After it's installed, you should be able to visit here: http://localhost:11434 and see the message "Ollama is running"
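
The same check can be done from a notebook cell rather than the browser. This is a minimal sketch using only the standard library; `ollama_is_running` is a hypothetical helper name, and it assumes the default address mentioned above.

```python
import urllib.request


def ollama_is_running(base_url="http://localhost:11434", timeout=2.0):
    """Return True if the server at base_url replies with 'Ollama is running'."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            body = response.read().decode("utf-8", errors="replace")
    except OSError:  # URLError, refused connections and timeouts are OSError subclasses
        return False
    return "Ollama is running" in body


print(ollama_is_running())
```

If this prints `False`, the service is probably not started yet.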

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [13]:
!ollama pull llama3.2

pulling manifest
pulling dde5aa3fc5ff: 100% ▕██████████████████▏ 2.0 GB
pulling 966de95ca8a6: 100% ▕██████████████████▏ 1.4 KB
pulling fcc5a6bec9da: 100% ▕██████████████████▏ 7.7 KB
pulling a70ff7e570d9: 100% ▕██████████████████▏ 6.0 KB
pulling 56bb8bd477a5: 100% ▕██████████████████▏   96 B
pulling 34bb5ab01051: 100% ▕██████████████████▏  561 B
verifying sha256 digest
writing manifest
success


In [14]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors that directly impact human lives requires a multidisciplinary approach. Here's a framework to consider:

1. **Define AGI**: Clarify the scope and capabilities of AGI, including its potential applications, limitations, and decision-making processes.
2. **Risk assessment**: Identify potential risks associated with AGI deployment, such as bias, job displacement, loss of human autonomy, or unintended consequences in critical systems (e.g., healthcare, transportation).
3. **Value alignment**: Align AGI's goals and values with human values, prioritizing fairness, transparency, accountability, and respect for human dignity.
4. **Transparency and explainability**: Develop techniques to understand and interpret AGI decision-making processes, ensuring that users can trust the outcomes.
5. **Accountability**: Establish mechanisms for AGI developers and deployers to be held accountable for their creations' impact on society.
6. **Human oversight and review**: Implement human review processes to detect and correct potential errors or biases in AGI systems.
7. **Diverse teams and governance**: Ensure that diverse stakeholders, including experts from various fields (e.g., ethics, law, social sciences), are involved in AGI development, deployment, and policy-making.

Frameworks for mitigating risks while maximizing benefits:

1. **The AI Now Institute's framework**: Emphasizes the importance of human-centered design, inclusive decision-making processes, and social accountability.
2. **The OECD Guidelines for Trustworthy Artificial Intelligence**: Focuses on transparency, explainability, accountability, and respect for human values.
3. **The IEEE Ethics in Action Initiative**: Provides a set of principles for developing trustworthy AI systems that prioritize human well-being and safety.

Additional proposals:

1. **Implement regulatory frameworks**: Establish bodies to oversee AGI development, deployment, and use, ensuring adherence to established guidelines and standards.
2. **Public engagement and education**: Encourage open discussions about AGI's potential benefits and risks, promoting public understanding and informed decision-making.
3. **Research and development of new technologies**: Continuously fund research into AGI limitations, biases, and unintended consequences, driving innovation in areas like Explainable AI (XAI), Adversarial Robustness, and Human-Machine Interface Design.
4. **Global cooperation and agreements**: Foster international collaborations to establish common standards, guidelines, and best practices for AGI development, deployment, and use.

By considering these frameworks and proposals, we can work towards a future where AGI enhances human lives while minimizing potential risks and negative consequences.

In [15]:
# So where are we?

print(competitors)
print(answers)


['gpt-4o-mini', 'gemini-2.0-flash', 'llama-3.3-70b-versatile', 'llama3.2']
['Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors such as healthcare, criminal justice, finance, and autonomous systems necessitates a multifaceted approach that considers not only the potential benefits but also the risks associated with decision-making processes that impact human lives. Here are key considerations and frameworks to mitigate risks while maximizing benefits:\n\n### Ethical Considerations\n\n1. **Autonomy and Agency**:\n   - **Value of Human Oversight**: AGI systems should support, rather than replace, human decision-making. Maintaining human agency ensures that individuals can contest and appeal against decisions that affect them.\n\n2. **Bias and Fairness**:\n   - **Data and Algorithmic Bias**: Given the potential biases in training data, it’s essential to implement strategies to mitigate forms of discrimination. Careful auditing of algorithms mu

In [16]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: gpt-4o-mini

Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors such as healthcare, criminal justice, finance, and autonomous systems necessitates a multifaceted approach that considers not only the potential benefits but also the risks associated with decision-making processes that impact human lives. Here are key considerations and frameworks to mitigate risks while maximizing benefits:

### Ethical Considerations

1. **Autonomy and Agency**:
   - **Value of Human Oversight**: AGI systems should support, rather than replace, human decision-making. Maintaining human agency ensures that individuals can contest and appeal against decisions that affect them.

2. **Bias and Fairness**:
   - **Data and Algorithmic Bias**: Given the potential biases in training data, it’s essential to implement strategies to mitigate forms of discrimination. Careful auditing of algorithms must be continuous, ensuring equitable outcomes across demogra

In [17]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [18]:
print(together)

# Response from competitor 1

Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors such as healthcare, criminal justice, finance, and autonomous systems necessitates a multifaceted approach that considers not only the potential benefits but also the risks associated with decision-making processes that impact human lives. Here are key considerations and frameworks to mitigate risks while maximizing benefits:

### Ethical Considerations

1. **Autonomy and Agency**:
   - **Value of Human Oversight**: AGI systems should support, rather than replace, human decision-making. Maintaining human agency ensures that individuals can contest and appeal against decisions that affect them.

2. **Bias and Fairness**:
   - **Data and Algorithmic Bias**: Given the potential biases in training data, it’s essential to implement strategies to mitigate forms of discrimination. Careful auditing of algorithms must be continuous, ensuring equitable outcomes across de

In [19]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""
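
A note on the doubled braces in the prompt above: inside an f-string, `{{` and `}}` produce literal `{` and `}`. That is how the JSON example survives alongside real placeholders like `{question}`. Here is a minimal sketch with made-up values:

```python
demo_count = 2  # hypothetical value, standing in for len(competitors)

# Single braces are placeholders; doubled braces become literal braces
demo_prompt = f"""There are {demo_count} competitors.
Respond with: {{"results": ["1", "2"]}}"""

print(demo_prompt)
```

Printing the real `judge` string is a quick way to confirm the model will see single braces.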


In [20]:
print(judge)

You are judging a competition between 4 competitors.
Each model has been given this question:

How would you assess the ethical implications of deploying artificial general intelligence in sectors where decisions directly impact human lives, and what frameworks would you propose to mitigate potential risks while maximizing benefits?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

Assessing the ethical implications of deploying artificial general intelligence (AGI) in sectors such as healthcare, criminal justice, finance, and autonomous systems necessitates a multifaceted approach that considers not only the potential benefits but also the risks associated with decision-maki

In [21]:
judge_messages = [{"role": "user", "content": judge}]

In [22]:
# Judgement time!

openai = OpenAI()
response = openai.chat.completions.create(
    model="o3-mini",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["2", "1", "4", "3"]}


In [23]:
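# OK let's turn this into results!

# The judge replies with a JSON string, so parse it into a Python dict
results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    # Competitor numbers arrive as strings and start at 1; list indexes start at 0
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")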

Rank 1: gemini-2.0-flash
Rank 2: gpt-4o-mini
Rank 3: llama3.2
Rank 4: llama-3.3-70b-versatile
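
The cell above trusts the judge to return clean JSON with valid competitor numbers. Models do not always comply, so it can help to validate the reply before indexing into `competitors`. Here is a minimal sketch, assuming a hypothetical helper `parse_ranking` (not part of this lab) that checks the reply is a permutation of 1 to n:

```python
import json

def parse_ranking(raw: str, n: int) -> list[int]:
    """Parse the judge's reply into 1-based competitor numbers, best first.

    Hypothetical helper for illustration; raises ValueError on a bad reply.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"Judge did not return valid JSON: {exc}") from exc

    try:
        ranks = [int(r) for r in data["results"]]
    except (KeyError, TypeError, ValueError) as exc:
        raise ValueError("Expected a 'results' list of competitor numbers") from exc

    # Every competitor must appear exactly once
    if sorted(ranks) != list(range(1, n + 1)):
        raise ValueError(f"Ranking {ranks} is not a permutation of 1..{n}")
    return ranks

print(parse_ranking('{"results": ["2", "1", "4", "3"]}', 4))  # [2, 1, 4, 3]
```

In the notebook this could be called as `parse_ranking(results, len(competitors))`, with a retry of the judge call if it raises.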


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this one, where you send a task to multiple models and then evaluate the results,
            are common when you need to improve the quality of an LLM response. The approach applies to many
            business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>