https://github.com/sinanuozdemir/oreilly-agi

Explore the evolution of AGI through historical context, reasoning models, and agent systems, while gaining hands-on experience with cutting-edge models like Claude 4, DeepSeek-R1, and OpenAI's o3. Learn to critically evaluate AGI benchmarks, understand their limitations, and identify where current models excel or struggle in reasoning tasks.
https://github.com/sinanuozdemir/oreilly-agi

agents agi ai-agents artifical-general-inteligence reasoning-models

Last synced: 3 months ago
JSON representation

Host: GitHub
URL: https://github.com/sinanuozdemir/oreilly-agi
Owner: sinanuozdemir
Created: 2025-03-14T15:18:30.000Z (7 months ago)
Default Branch: main
Last Pushed: 2025-06-20T18:56:14.000Z (4 months ago)
Last Synced: 2025-06-20T19:55:42.561Z (4 months ago)
Topics: agents, agi, ai-agents, artifical-general-inteligence, reasoning-models
Language: Jupyter Notebook
Homepage: https://www.oreilly.com/live-events/artificial-general-intelligence-agi-demystified/0642572174033
Size: 349 KB
Stars: 8
Watchers: 1
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

![oreilly-logo](images/oreilly.png)

# Artificial General Intelligence (AGI) Demystified

This repository contains code for live session and video for the [O'Reilly Course on Artificial General Intelligence (AGI) Demystified](https://www.oreilly.com/live-events/artificial-general-intelligence-agi-demystified/0642572174033)

This course offers an exploration of the current approaches toward Artificial General Intelligence (AGI), focusing on state-of-the-art reasoning models and agent architectures. Participants will learn about the evolution of AGI research, understand key benchmarks used to measure progress, and gain practical knowledge in working with advanced models like Claude 3.7, DeepSeek-R1, and OpenAI's o3. Through hands-on exercises and case studies, attendees will develop the skills needed to evaluate these models' capabilities, understand their limitations, and apply them effectively to complex tasks.

## Notebooks

- **Using Reasoning Models**

- [An Introduction to Reasoning Models](notebooks/intro_to_reasoning_models.ipynb) - Using 6 reasoning models across 5 providers

- [Short Term Memory with OpenAI Agents](notebooks/OpenAI%20Agents.ipynb) - OpenAI Agents doesn't implement short term memory by default, this code will add the concept to the agent

- [Computer Use with reasoning models](notebooks/computer_use_reasoning.ipynb) - Letting a reasoning model control our laptop (caution advised when running this code)

- **Benchmarking**

- [Benchmarking Reasoning Models](notebooks/benchmarking_reasoning_models.ipynb) - Running questions from [Humanity's Last Exam](https://huggingface.co/datasets/cais/hle/discussions) on reasoning models

- [Benchmarking Llama 3.2 Instruct on MMLU and Embedders on MTEB](https://colab.research.google.com/drive/1zDCqXc7vHoZilHVe3y2lYyTmSUSe6bh3?usp=sharingb)

- [Follow-up Evaluating Llama 3.2 non-instruct on MMLU](https://colab.research.google.com/drive/1aMy19Ikyody9CGyn42K3E_DQwLScL0Ek?usp=sharing)

- [Evaluating Llama 3.1 vs Mistral on Truthful Q/A](https://github.com/sinanuozdemir/quick-start-guide-to-llms/blob/main/notebooks/12_llm_gen_eval.ipynb)

- **Case Studies**

- [GRPO + Qwen](https://colab.research.google.com/drive/1Cws1IL_T_0_cP0-cHxFA0FEsXYdiAN_8?usp=sharing): Using Reinforcement Learning to train a reasoning model.

- [Prompt Engineering Deepseek R1](https://github.com/sinanuozdemir/quick-start-guide-to-llms/blob/main/notebooks/06_adv_prompt_engineering%20-%20DEEPSEEK.ipynb) - Using fewshot learning, vector databases, and even chain of thought prompting to make reasoning models even better

**Probing**

- [Probing Chess Playing LLMs](https://colab.research.google.com/drive/114turFLNxLJXiIseDWl1BDJmont0VD8h?usp=sharing)

- There are over a dozen notebooks for the birth year/death year probing example so I will only share a few key ones here:
- [Llama-3 8B Instruct with prompt "Who is {NAME}"](https://colab.research.google.com/drive/1e1d9fATVjVun-_tPj4vS_DSTGaIfxs01?usp=sharing)
- [BERT-large-cased no prompt](https://colab.research.google.com/drive/1cizgoh1J6Y-DHBrOkNTFo9Y1CypjwuQM?usp=sharing)
- [Mistral-7B-Instruct-v0.3 with prompt "Who is {NAME}"](https://colab.research.google.com/drive/1VL3betxqVZ_H3_8XmLbjE0hEjaoy-HPV?usp=sharing)

## Instructor

**Sinan Ozdemir** is a former lecturer of Data Science at Johns Hopkins University and the author of multiple textbooks on data science and machine learning. Additionally, he is the founder of the recently acquired Kylie.ai, an enterprise-grade conversational AI platform with RPA capabilities. He holds a master’s degree in Pure Mathematics from Johns Hopkins University and is based in San Francisco, CA.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sinanuozdemir/oreilly-agi

Awesome Lists containing this project

README