https://github.com/skyl/corpora
Corpora is a self-building corpus that can help build other arbitrary corpora
- Host: GitHub
- URL: https://github.com/skyl/corpora
- Owner: skyl
- License: agpl-3.0
- Created: 2024-10-25T23:46:23.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-11-21T04:54:44.000Z (2 months ago)
- Last Synced: 2024-11-21T05:26:55.088Z (2 months ago)
- Topics: agpl, ai, api, cli, corpora, corpus, django, markdown, monorepo, openapi, pgvector, postgresql, python, rust
- Language: Python
- Homepage:
- Size: 289 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 13
Metadata Files:
- Readme: README.md
- License: LICENSE
- Codeowners: CODEOWNERS
README
# Corpora
> Corpora is a corpus of tools to enhance and evolve arbitrary corpora, including itself.
Corpora is an experimental polyglot monorepo focused on creating tools to enhance and evolve textual corpora with AI. Our aim is to create a flexible system for managing text corpora to increase contributor productivity.
## Key Components
- [**.corpora/**](.corpora/README.md): Repository-specific metadata and guidelines.
- [**.devcontainer/**](.devcontainer/README.md): Provides a development environment with Docker, allowing easy setup for contributors and users.
- [**.github/**](.github/ABOUT.md): GitHub Actions workflows for CI/CD.
- [**docker/**](docker/README.md): Docker configurations for deployment and testing environments.
- [**md/**](md/README.md): Documentation and notes.
- [**py/**](py/README.md): Core Python codebase, including modular packages and Django apps.
- [**rs/**](rs/README.md): Multipackage Rust workspace for extensible tool development.
## Contributing
Clone the repository and open it in a devcontainer, which provides a full setup with a running server, CLI, PostgreSQL, Redis, and Celery, so you can start developing and using Corpora immediately. Use the tools and structure to contribute enhancements or new features that align with the project's aim of increasing contributor productivity.
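Once the repository is cloned, a quick sanity check of the checkout against the Key Components list can be sketched in Python. This is a minimal illustration, not part of Corpora's tooling: the function name `missing_components` is hypothetical, and it assumes each listed component is a top-level directory of the clone.

```python
from pathlib import Path

# Top-level directories named in the Key Components list of this README.
EXPECTED_COMPONENTS = (
    ".corpora",
    ".devcontainer",
    ".github",
    "docker",
    "md",
    "py",
    "rs",
)


def missing_components(repo_root):
    """Return the expected component directories that are absent under repo_root.

    Hypothetical helper for illustration; Corpora does not ship this function.
    """
    root = Path(repo_root)
    return [name for name in EXPECTED_COMPONENTS if not (root / name).is_dir()]


if __name__ == "__main__":
    # Assumes the script is run from the root of a local clone.
    missing = missing_components(".")
    if missing:
        print("Missing components:", ", ".join(missing))
    else:
        print("All key components are present.")
```

An empty result suggests the clone is complete enough to open in the devcontainer; any reported names point to a partial or outdated checkout.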
Corpora is licensed under the AGPL-3.0. You can use it within your company to aid proprietary codebase development, with the requirement that modifications to Corpora itself are open-sourced under the same license.