https://github.com/filimoa/bridging-data-divides-workbook
- Host: GitHub
- URL: https://github.com/filimoa/bridging-data-divides-workbook
- Owner: Filimoa
- License: mit
- Created: 2024-11-11T02:12:42.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-06T00:32:15.000Z (over 1 year ago)
- Last Synced: 2025-03-05T18:37:58.519Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 40 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Bridging Data Divides Workbook
This repo accompanies the Bridging Data Divides CAS Essay, providing a hands-on workbook and notebook examples that demonstrate the concepts discussed—such as retrieval-augmented generation, embeddings, and structured outputs. It’s designed to help actuaries and data professionals explore practical ways to unify and analyze unstructured data, apply AI techniques to real-world scenarios, and integrate these insights into existing insurance workflows.
## Setup
1. **Clone the Repository:**
```bash
git clone https://github.com/Filimoa/bridging-data-divides-workbook.git
```
2. **Check that Python is installed:**
Make sure you have Python >= 3.9 installed. If you're on a work computer, you may need admin privileges to install Python.
You can check which version you have by running:
```bash
python --version
```
3. **Install the required packages:**
Mac/Linux:
```bash
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```
Windows:
```bash
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
```
4. **Run the notebook:**
```bash
jupyter notebook
```
Then you can open the notebook in your browser.
5. **API Keys:**
This notebook uses an OpenAI-based approach for simplicity. Given the low token usage, it should only cost a few cents to run.
- If you're new to the OpenAI API, [sign up for an account](https://platform.openai.com/signup).
- Follow the [Quickstart](https://platform.openai.com/docs/quickstart) to retrieve your API key.
You can paste that key into the notebook, in the cell that starts with:
```python
os.environ["OPENAI_API_KEY"] = "YOUR_API_KEY"
```
Replace `YOUR_API_KEY` with your actual API key. If the line is commented out in the notebook, uncomment it by removing the `#` at the beginning of the line.
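
If you'd rather not leave the key in a saved notebook cell, one alternative is to read it from the environment and prompt only when it is missing. The sketch below is not part of the notebook: `ensure_api_key` is a hypothetical helper name, and it assumes the `YOUR_API_KEY` placeholder should be treated the same as an unset key.

```python
import os
from getpass import getpass


def ensure_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Return the API key from the environment, prompting if it is missing.

    Hypothetical helper, not part of the workbook. The placeholder
    value "YOUR_API_KEY" is treated as "not set".
    """
    key = os.environ.get(env_var, "").strip()
    if not key or key == "YOUR_API_KEY":
        # getpass hides the typed value so it is not echoed on screen.
        key = getpass(f"Enter {env_var}: ").strip()
        os.environ[env_var] = key
    return key
```

Calling `ensure_api_key()` in a cell before the first OpenAI request would prompt at most once per session, since the key is stored in `os.environ` afterwards.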
> ⚠️ **Work Computer SSL Issues:** Some corporate networks may block access to OpenAI’s API, leading to SSL verification errors. If you encounter this issue, consider:
> - Running the notebook on a personal computer.
> - Using the [hosted Colab version](https://colab.research.google.com/drive/15uvMEzytBbf65HBhjh82-mHDTK6GnYt5) (ensure your VPN is off).
> - Checking with your IT team for potential workarounds, such as configuring SSL settings or using a company-approved proxy.
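
Two of the setup requirements above, the minimum Python version and a working connection to the API, can be checked before opening the notebook. This is a minimal sketch using only the standard library. `python_ok` and `check_tls` are hypothetical helper names, and `api.openai.com` on port 443 is an assumption about the endpoint the notebook reaches.

```python
import socket
import ssl
import sys

MIN_PYTHON = (3, 9)  # minimum version stated in the setup steps


def python_ok(version=None) -> bool:
    """True if the given (or running) interpreter meets MIN_PYTHON."""
    version = sys.version_info if version is None else version
    return tuple(version[:2]) >= MIN_PYTHON


def check_tls(host: str = "api.openai.com", port: int = 443,
              timeout: float = 5.0) -> str:
    """Attempt a verified TLS handshake and return a short diagnosis."""
    context = ssl.create_default_context()  # verifies certificate and hostname
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host):
                return "ok"
    except ssl.SSLCertVerificationError as exc:
        # Must come first: it is a subclass of ssl.SSLError.
        return f"certificate verification failed: {exc.verify_message}"
    except ssl.SSLError as exc:
        return f"TLS error: {exc}"
    except OSError as exc:
        # DNS failures, refused connections, and timeouts end up here.
        return f"network error: {exc}"
```

Running `print(python_ok(), check_tls())` from the activated virtual environment gives a quick read. A `certificate verification failed` result would be consistent with the work-network issue described above, in which case the listed workarounds are the next step.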
## Hosted Colab Version
If you don't want to set up Python locally, you can use the hosted Colab version of the notebook. Note that this requires a Google account and may be blocked by your company's security policies.
[Open In Colab](https://colab.research.google.com/drive/15uvMEzytBbf65HBhjh82-mHDTK6GnYt5)