Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/rgbkrk/genai

What if GPT could help you notebook?
https://github.com/rgbkrk/genai

exceptions gpt gpt-35-turbo interactive-computing ipython jupyter openai traceback

Last synced: 9 days ago
JSON representation

What if GPT could help you notebook?

Host: GitHub
URL: https://github.com/rgbkrk/genai
Owner: rgbkrk
License: bsd-3-clause
Created: 2022-07-09T14:22:09.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-01-29T23:48:09.000Z (5 months ago)
Last Synced: 2024-06-11T20:55:16.601Z (12 days ago)
Topics: exceptions, gpt, gpt-35-turbo, interactive-computing, ipython, jupyter, openai, traceback
Language: Python
Homepage:
Size: 375 KB
Stars: 348
Watchers: 7
Forks: 34
Open Issues: 8
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md

Lists

awesome-ChatGPT-repositories - genai - What if GPT could help you notebook? (Openai)
awesome-stars - genai
awesome-stars - rgbkrk/genai - What if GPT could help you notebook? (Python)

README

        [Install](#installation) | [License](./LICENSE) | [Code of Conduct](./CODE_OF_CONDUCT.md) | [Contributing](./CONTRIBUTING.md)

# GenAI: generative AI tooling for IPython

🦾 Get GPT help with code, SQL queries, DataFrames, Exceptions and more in IPython.

🌍 Supports all Jupyter environments, including IPython, JupyterLab, Jupyter Notebook, and Noteable.

TL;DR Get started now

```

%pip install genai

%load_ext genai

```

## Genai In Action

![Genai making a suggestion followed by running suggested code](https://user-images.githubusercontent.com/836375/225177905-17cfb526-60f8-486d-b468-60a6a01db02e.gif)

- [Blog Post](https://noteable.io/blog/introducing-genai/)

- [Example Notebook](https://app.noteable.io/f/1605d16d-f5d3-4099-8fec-2ca727075b3b/Introducing-Genai.ipynb)

## Introduction

We've taken the context from IPython, mixed it with OpenAI's Large Language Models, and are bringing you a more informed notebook experience that works in all Jupyter environments, including IPython, JupyterLab, Jupyter Notebook, and Noteable. 🦾🌏

## Requirements

Python 3.8+

## Installation

### Poetry

```shell

poetry add genai

```

### Pip

```shell

pip install genai

```

## Loading the IPython extension

Make sure to set the `OPENAI_API_KEY` environment variable first before using it in IPython or your [preferred notebook platform of choice](https://noteable.io/).

```

%load_ext genai

```

## Features

- `%%assist` magic command to generate code from natural language

- Custom exception suggestions

### Custom Exception Suggestions

```python

In [1]: %load_ext genai

In [2]: import pandas as pd

In [3]: df = pd.DataFrame(dict(col1=['a', 'b', 'c']), index=['first', 'second', 'third'])

In [4]: df.sort_values()

---------------------------------------------------------------------------

TypeError                                 Traceback (most recent call last)

Cell In[4], line 1

----> 1 df.sort_values()

File ~/.pyenv/versions/3.9.9/lib/python3.9/site-packages/pandas/util/_decorators.py:331, in deprecate_nonkeyword_arguments..decorate..wrapper(*args, **kwargs)

    325 if len(args) > num_allow_args:

    326     warnings.warn(

    327         msg.format(arguments=_format_argument_list(allow_args)),

    328         FutureWarning,

    329         stacklevel=find_stack_level(),

    330     )

--> 331 return func(*args, **kwargs)

TypeError: sort_values() missing 1 required positional argument: 'by'

```

#### 💡 Suggestion

The error message is indicating that the `sort_values()` method of a pandas dataframe is missing a required positional argument.

The `sort_values()` method requires you to pass a column name or list of column names as the `by` argument. This is used to determine how the sorting will be performed.

Here's an example:

```python

import pandas as pd

df = pd.DataFrame({

    'Name': ['Alice', 'Bob', 'Carol', 'David', 'Eva'],

    'Age': [32, 24, 28, 35, 29],

    'Salary': [60000, 40000, 35000, 80000, 45000]

})

# sort by Age column:

df_sorted = df.sort_values(by='Age')

print(df_sorted)

```

In this example, the `by` argument is set to `'Age'`, which sorts the dataframe by age in ascending order. Note that you can also pass a list of column names if you want to sort by multiple columns.

## Example

````python

In [1]: %load_ext genai

In [2]: %%assist

   ...:

   ...: Can you explain this query to me so I can be sure we're doing the right things?

   ...:

   ...: ```sql

   ...: SELECT

   ...:   COUNT(*) AS num_downloads,

   ...:   DATE_TRUNC(DATE(timestamp), DAY) AS day

   ...: FROM `bigquery-public-data.pypi.file_downloads`

   ...: WHERE

   ...:   file.project = 'genai'

   ...:   -- Only query the last 6 months of history

   ...:   AND DATE(timestamp)

   ...:     BETWEEN DATE_TRUNC(DATE_SUB(CURRENT_DATE(), INTERVAL 6 MONTH), MONTH)

   ...:     AND CURRENT_DATE()

   ...: GROUP BY day

   ...: ORDER BY day DESC

````

```markdown

"This is a SQL query that counts the number of downloads for the 'genai' project on the Python Package Index (PyPI) over the last 6 months, grouped by day. The query selects the count and the date, truncating the date to the day level so that it's easier to read.

Here is a breakdown of each part of the query:

- `SELECT COUNT(*) AS num_downloads, DATE_TRUNC(DATE(timestamp), DAY) AS day`: This selects the count of the number of rows matched by the query as `num_downloads`, and the date truncated to the day level as `day`.

- `FROM `bigquery-public-data.pypi.file_downloads``: This specifies the table to query from.

- `WHERE file.project = 'genai'`: This filters the rows by only including downloads for the 'genai' project.

- `AND DATE(timestamp) BETWEEN DATE_TRUNC(DATE_SUB(CURRENT_DATE(), INTERVAL 6 MONTH), MONTH) AND CURRENT_DATE()`: This filters the rows by only including downloads from the last 6 months. The `DATE_SUB` function subtracts 6 months from the current date (`CURRENT_DATE()`), `DATE_TRUNC` truncates the result to be the start of the month and `DATE` converts the timestamp column to a date so the `BETWEEN` condition can filter rows between the start of 6 months ago and "today."

- `GROUP BY day`: This groups the rows by day so that the counts are aggregated by date.

- `ORDER BY day DESC`: This orders the rows so that the most recent date appears first in the result."

```