{"id":15169646,"url":"https://github.com/datafog/datafog-instructor","last_synced_at":"2025-10-23T18:48:33.672Z","repository":{"id":248126562,"uuid":"822338230","full_name":"DataFog/datafog-instructor","owner":"DataFog","description":null,"archived":false,"fork":false,"pushed_at":"2024-09-20T03:51:20.000Z","size":9238,"stargazers_count":14,"open_issues_count":0,"forks_count":1,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-01-16T10:47:45.145Z","etag":null,"topics":["local-llm","ollama","pii","pydantic"],"latest_commit_sha":null,"homepage":"https://www.youtube.com/watch?v=xU8vtcsZCNU","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DataFog.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-07-01T00:24:12.000Z","updated_at":"2024-09-20T08:49:53.000Z","dependencies_parsed_at":"2024-07-12T15:29:00.271Z","dependency_job_id":"5594fe44-0770-4353-a666-46eadaf877aa","html_url":"https://github.com/DataFog/datafog-instructor","commit_stats":{"total_commits":16,"total_committers":2,"mean_commits":8.0,"dds":0.3125,"last_synced_commit":"5ee1788f1dc6daf692a8db8e31dd17940fa7f7d0"},"previous_names":["datafog/datafog-instructor"],"tags_count":2,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataFog%2Fdatafog-instructor","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataFog%2Fdatafog-instructor/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataFog%2Fdatafog-instructor/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataFog%2Fdatafog-instructor/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DataFog","download_url":"https://codeload.github.com/DataFog/datafog-instructor/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":234859120,"owners_count":18897882,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["local-llm","ollama","pii","pydantic"],"created_at":"2024-09-27T07:04:15.199Z","updated_at":"2025-10-06T09:30:33.934Z","avatar_url":"https://github.com/DataFog.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# DataFog Instructor\n\nv0.1.0 Release Notes\n\nHi folks, based on some feedback a few important changes:\n\n- We have shifted away from the CLI approach to a more flexible API-based solution. For v0.1.0, you'll need to clone the repository and install dependencies using Poetry.\n- The env.example file now includes a LOGFIRE_TOKEN. You can obtain one by signing up at https://logfire.pydantic.dev. Logfire is an observability platform developed by the Pydantic team, designed to assist with debugging and monitoring, including LLM calls.\n- This version focuses on producing consistent LLM outputs for PII detection and incorporates extensive error handling to create a more production-ready service.\n- We've implemented robust validation and error handling throughout the codebase to ensure reliability and ease of debugging.\n\nStart by cloning the repo and installing the dependencies using poetry:\n\n```\ngit clone https://github.com/datafog/datafog-instructor.git\ncd datafog-instructor\npoetry install\n```\n\nYou'll also need to create a `.env` file with the OPENAI_API_KEY and GROQ_API_KEY.  You can get these by signing up for accounts at https://openai.com/ and https://www.groq.com/.\n\nOnce you have the .env file, you can run the following to start the service:\n\n```\nuvicorn app.main:app --reload\n```\n\n\n## Sample CURL Commands\n\n\n```\ncurl -X POST \"http://localhost:8000/extract-pii\" \\     \n     -H \"Content-Type: application/json\" \\\n     -d '{\"content\": \"My name is John Doe and my email is john.doe@example.com. My phone number is 123-456-7890.\"}'\n```\n\n## Contributing\n\nContributions to the DataFog Instructor SDK are welcome! Please feel free to submit a Pull Request.\n\n## License\n\nThis project is licensed under the MIT License.\n\n## Support\n\nIf you encounter any problems or have any questions, please open an issue on the GitHub repository or join our Discord community at https://discord.gg/bzDth394R4.\n\n## Acknowledgements\n\n- Logfire: https://logfire.pydantic.dev\n- Pydantic: https://pydantic.dev\n- Instructor: https://github.com/jxnl/instructor\n\n## Links\n\n- Homepage: https://datafog.ai\n- Documentation: https://docs.datafog.ai\n- Twitter: https://twitter.com/datafoginc\n- GitHub: https://github.com/datafog/datafog-instructor\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatafog%2Fdatafog-instructor","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdatafog%2Fdatafog-instructor","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatafog%2Fdatafog-instructor/lists"}