An open API service indexing awesome lists of open source software.

https://github.com/shamspias/gpt3-data-preprocessing

This repository containing code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting for GPT-3 input.
https://github.com/shamspias/gpt3-data-preprocessing

artificial-intelligence data-preprocessing data-preprocessing-pipelines data-science gpt-3 machine-learning

Last synced: 2 months ago
JSON representation

This repository containing code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting for GPT-3 input.

Awesome Lists containing this project