https://github.com/mendableai/openai-structured-outputs-with-firecrawl
This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It combines web scraping capabilities from Firecrawl with OpenAI's advanced language models to create a powerful data extraction pipeline.
https://github.com/mendableai/openai-structured-outputs-with-firecrawl
firecrawl json llm openai structured-output
Last synced: about 4 hours ago
JSON representation
This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It combines web scraping capabilities from Firecrawl with OpenAI's advanced language models to create a powerful data extraction pipeline.
- Host: GitHub
- URL: https://github.com/mendableai/openai-structured-outputs-with-firecrawl
- Owner: mendableai
- Created: 2024-08-14T12:23:57.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-08-14T12:30:24.000Z (8 months ago)
- Last Synced: 2025-03-28T03:32:29.790Z (18 days ago)
- Topics: firecrawl, json, llm, openai, structured-output
- Language: Python
- Homepage: https://www.firecrawl.dev/blog/using-structured-output-and-json-strict-mode-openai
- Size: 1000 Bytes
- Stars: 17
- Watchers: 2
- Forks: 1
- Open Issues: 0
Awesome Lists containing this project
- jimsghstars - mendableai/openai-structured-outputs-with-firecrawl - This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It combines web scraping capabilities from Firecrawl with OpenAI's a (Python)