Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
active-learning annotations artificial-intelligence data-centric-ai data-labeling data-science deep-learning human-in-the-loop labeling labeling-tool machine-learning natural-language-processing neural-search nlp python spacy supervised-learning text-annotation text-classification transformers
- Host: GitHub
- URL: https://github.com/code-kern-ai/refinery
- Owner: code-kern-ai
- License: apache-2.0
- Created: 2022-07-04T13:58:09.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-06-13T15:12:02.000Z (6 months ago)
- Last Synced: 2024-12-04T18:17:35.061Z (9 days ago)
- Topics: active-learning, annotations, artificial-intelligence, data-centric-ai, data-labeling, data-science, deep-learning, human-in-the-loop, labeling, labeling-tool, machine-learning, natural-language-processing, neural-search, nlp, python, spacy, supervised-learning, text-annotation, text-classification, transformers
- Language: Python
- Homepage: https://www.kern.ai
- Size: 3.54 MB
- Stars: 1,405
- Watchers: 18
- Forks: 69
- Open Issues: 70
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- awesome-open-data-centric-ai - refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. (Visualization and Interaction)
- awesome-production-machine-learning - refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. (Data Labelling and Synthesis)
README
![](header.png)
The data scientist's open-source choice to scale, assess and maintain natural language data.
Treat training data like a software artifact.
Does one of these scenarios sound familiar to you?
- You are working on your own side-project in NLP, but you don't have enough labeled data to train a good model.
- You are working in a team and already have some labeled data, but your training data is just stored in a spreadsheet or some TXT-file, and you have no idea how _good_ it actually is.
- You are working in a team about to start a new project with limited resources (annotators, budget, time), and now want to understand how you can best make use of them.

If so, you are one of the people we've built _refinery_ for. _refinery_ helps you build better NLP models with a data-centric approach. Semi-automate your labeling, find low-quality subsets in your training data, and monitor your data in one place.
_refinery_ doesn't get rid of manual labeling, but it makes sure that your valuable time is spent well. The makers of _refinery_ are also working on integrations with other labeling tools, so that you can easily switch between different options.
![Showcase GIF of _refinery_](refinery.gif)
> **_DEMO:_** You can interact with the application in a (mostly read-only) online playground. Check it out [here](https://demo.kern.ai).
_refinery_ is a multi-repository project; you can find all integrated services in the architecture below. The app builds on top of [🤗 Hugging Face](https://www.huggingface.co) and [spaCy](https://spacy.io/) to leverage pre-built language models for your NLP tasks, as well as [qdrant](https://github.com/qdrant/qdrant) for neural search.
## Table of contents
- [🧑‍💻 Why _refinery_?](#-why-refinery)
- [Enabling ideas of one-person-armies](#enabling-ideas-of-one-person-armies)
- [Extending your existing labeling approach](#extending-your-existing-labeling-approach)
- [Put structure into unstructured data](#put-structure-into-unstructured-data)
- [Pushing collaboration](#pushing-collaboration)
- [Open-source, and treating training data as a software artifact](#open-source-and-treating-training-data-as-a-software-artifact)
- [Integrations](#integrations)
- [Your benefits](#your-benefits)
- [How does Kern AI make money, if refinery is open-source?](#how-does-kern-ai-make-money-if-refinery-is-open-source)
- [🤓 Features](#-features)
- [(Semi-)automated labeling workflow for NLP tasks](#semi-automated-labeling-workflow-for-nlp-tasks)
- [Extensive data management and monitoring](#extensive-data-management-and-monitoring)
- [Team workspaces in the managed version](#team-workspaces-in-the-managed-version)
- [☕ Installation](#-installation)
- [From pip](#from-pip)
- [From repository](#from-repository)
- [Persisting data](#persisting-data)
- [📘 Documentation and tutorials](#-documentation-and-tutorials)
- [😵‍💫 Need help?](#-need-help)
- [🪢 Community and contact](#-community-and-contact)
- [🙌 Contributing](#-contributing)
- [❓ FAQ](#-faq)
- [Concept questions](#concept-questions)
- [Technical questions](#technical-questions)
- [Service and hosting questions](#service-and-hosting-questions)
- [🐍 Python SDK](#-python-sdk)
- [🏗️ Architecture](#-architecture)
- [🏫 Glossary](#-glossary)
- [👩‍💻👨‍💻 Team and contributors](#-team-and-contributors)
- [🌟 Star History](#-star-history)
- [📃 License](#-license)

## 🧑‍💻 Why _refinery_?
There are already many other tools available to build training data. Why did we decide to build _yet another one_?
### Enabling ideas of one-person-armies
We believe that developers can have crazy ideas, and we want to lower the barrier for them to go for that idea. _refinery_ is designed to build labeled training data much faster, so that it takes you very little time to prototype an idea. We've received much love for exactly that, so make sure to give it a try for your next project.
### Extending your existing labeling approach
_refinery_ is more than a labeling tool. It has a built-in labeling editor, but its main advantages come from automation and data management. You can integrate any kind of heuristic to automatically label what can be labeled, and then focus on the headache-causing subsets afterwards. Whether you do the labeling in _refinery_ or in any other tool (even via crowd labeling) doesn't matter!
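A heuristic in this sense can be as simple as a plain Python function that encodes a bit of domain knowledge. The sketch below is purely illustrative - the attribute name `headline`, the label name and the exact way records are passed to labeling functions are assumptions here; see the [docs](https://docs.kern.ai) and [bricks](https://github.com/code-kern-ai/bricks) for the exact conventions.

```python
# Hypothetical labeling function for a sentiment task on a "headline" attribute.
# Returning nothing means the heuristic abstains for this record.
def negative_market_words(record):
    headline = str(record["headline"]).lower()  # assumed access pattern, for illustration only
    if any(word in headline for word in ("dips", "falls", "plunges")):
        return "NEGATIVE"
```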
### Put structure into unstructured data
_refinery_ is the tool that brings new perspectives into your data. Are you working on multilingual, human-written texts? Via our integration with [bricks](https://github.com/code-kern-ai/bricks), you can easily enrich your texts with metadata such as the detected language, sentence complexity and much more. You can use this both to analyze your data and to orchestrate your labeling workflow.
### Pushing collaboration
While doing so, we aim to improve the collaboration between engineers and subject matter experts (SMEs). In the past, we've seen our application being used in meetings to discuss label patterns in the form of labeling functions and distant supervisors. We believe that data-centric AI is the best way to leverage this collaboration.
### Open-source, and treating training data as a software artifact
We hate the idea that there are still use cases in which the training data is just a plain CSV file. That is fine if you _really_ just want to quickly prototype something with a few records, but any serious software should be maintainable. We believe an open-source solution for training data management is what's needed here. _refinery_ is the tool that helps you document your data. That's how you treat training data as a software artifact.
### Integrations
Lastly, _refinery_ supports [SDK actions](https://github.com/code-kern-ai/refinery-python) like pulling and pushing data. Data-centric AI redefines labeling as more than a one-time job by giving it an iterative workflow, so we aim to give you more power every day by providing end-to-end capabilities that grow the availability of large-scale, high-quality training data. Use our SDK to program integrations with your existing landscape.
## Your benefits
You can automate tons of repetitive tasks, gain better insights into the data labeling workflow, receive implicit documentation for your training data, and ultimately build better models in less time.
Our goal is to make training data building feel more like a programmatic and enjoyable task, instead of something tedious and repetitive. _refinery_ is our contribution to this goal. And we're constantly aiming to improve this contribution.
If you like what we're working on, please leave a β!
## How does Kern AI make money, if refinery is open-source?
You won't believe how often we get that question - and it is a fair one. In short, the open-source version of _refinery_ is currently a single-user version; you can get access to a multi-user environment with our commercial options. Additionally, we offer commercial products on top of _refinery_, e.g. to use the _refinery_ automations as an actual realtime prediction API.
Generally, we are passionate about open-source and want to contribute as much as possible.
## 🤓 Features
For a detailed overview of features, please look into our [docs](https://docs.kern.ai).
### (Semi-)automated labeling workflow for NLP tasks
- Both manual and programmatic for classifications and span-labeling
- Integration with state-of-the-art libraries and frameworks
- Creation and management of lookup lists/knowledge bases to support during labeling
- Neural search-based retrieval of similar records and outliers
- Sliceable labeling sessions to drill-down on specific subsets
- Multiple labeling tasks possible per project
- Rich library of ready-made automations in our open-source [bricks](https://github.com/code-kern-ai/bricks) library

### Extensive data management and monitoring
- Best-in-class data management capabilities via our databrowser. Filter, sort and search your data e.g. by confidence, heuristic overlap, user, note, etc.
- Integration with [🤗 Hugging Face](https://www.huggingface.co) to automatically create document- and token-level embeddings
- JSON-based data model for up- and downloads
- Overview of project metrics like confidence and label distributions and confusion matrix
- Data accessible and extendable via our [Python SDK](https://github.com/code-kern-ai/refinery-python)
- Attribute modifications to extend your attributes (e.g. with sentence complexity metrics) in-place
- Again, you can use [bricks](https://github.com/code-kern-ai/bricks) to enrich your data with metadata

### Team workspaces in the [managed version](https://www.kern.ai/pricing)
- Allow multiple users to label your data with role-based access and minimized labeling views
- Integrate crowd labeling workflows
- Automated calculation of inter-annotator agreements

## ☕ Installation
### From pip
```
pip install kern-refinery
```

Once the library is installed, go to the directory where you want to store the data and run `refinery start`. This will automatically `git clone` this repository first if you haven't done so yet. To stop the server, run `refinery stop`.
### From repository
**TL;DR:**
```
$ git clone https://github.com/code-kern-ai/refinery.git
$ cd refinery
```

If you're on Mac/Linux:
```
$ ./start
```

If you're on Windows:
```
$ start.bat
```

To stop, type `./stop` (Mac/Linux) or `stop.bat` (Windows).
_refinery_ consists of multiple services that need to run together. To orchestrate them, we've set up a start script that automatically pulls and connects the respective services for you. The script is part of this repository, so you can just clone it and run `./start` (Mac/Linux) or `start.bat` (Windows) in the repository. After a few minutes (now is a good time to grab a coffee ☕), the setup is done and you can access `http://localhost:4455` in your browser. To stop the server, run `./stop` (Mac/Linux) or `stop.bat` (Windows).
**You're ready to start!**
If you run into any issues during installation, please don't hesitate to reach out to us (see community section below).
### Persisting data
By default, we store the data in the directory `refinery/postgres-data`. If you want to change that path, modify the `LOCAL_VOLUME` variable in the `start` script for your operating system. To remove data, simply delete the volume folder. **Only delete it if you don't need the data any longer - this is irreversible!**
## 📘 Documentation and tutorials
The best way to start with _refinery_ is our [**quick start**](https://docs.kern.ai/refinery/quickstart).
You can find extensive guides in our [docs](https://docs.kern.ai) and [tutorials](https://www.youtube.com/@kern_ai/videos) on our YouTube channel. We've also prepared a [repository with sample projects](https://github.com/code-kern-ai/sample-projects) which you can clone.
If you need help writing your first labeling functions, look into our open-source content library [bricks](https://github.com/code-kern-ai/bricks).
You can find our changelog [here](https://changelog.kern.ai).
## 😵‍💫 Need help?
No worries, we've got you. If you have questions, reach out to us on [Discord](https://discord.gg/qf4rGCEphW), or [open a ticket](https://github.com/code-kern-ai/refinery/discussions/categories/q-a) in the "q&a" category of our forum.
## 🪢 Community and contact
Feel free to join our [Discord](https://discord.gg/qf4rGCEphW), where we'll happily help you build your training data.
We send out a (mostly) weekly newsletter about recent findings in data-centric AI, product highlights in development and more. You can subscribe to the newsletter [here](https://www.kern.ai/newsletter).
Also, you can follow us on [Twitter](https://twitter.com/MeetKern) and [LinkedIn](https://www.linkedin.com/company/kern-ai).
## 🙌 Contributing
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**. You can do so by providing feedback about [desired features and bugs](https://github.com/code-kern-ai/refinery/issues) you might detect.
If you actively want to participate in extending the code base, reach out to us. We'll explain how the architecture is set up so that you can customize the application as you desire.
## ❓ FAQ
### Concept questions
**What is a heuristic?**

Heuristics are the ingredients for scaling your data labeling. They don't have to be 100% accurate; heuristics can be, for example, simple Python functions expressing some domain knowledge. When you add and run several of these heuristics, you create what is called a noisy label matrix, which is matched against the reference data that you manually labeled. This allows us to analyze correlations, conflicts, overlaps, the number of hits for a dataset, and the accuracy of each heuristic.
**How can I build an active learning model?**

We use pre-trained models to create embeddings in the first place. Once this is done, the embeddings are available in the application (both for building active learning heuristics and for neural search). In our active learning IDE, you can then build a simple classification or extraction head on top of the embedding, and we'll manage the execution in a containerized environment.
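Conceptually, such a head is just a lightweight classifier trained on the embedding vectors of your manually labeled records. The following scikit-learn sketch illustrates the idea with made-up data; it is not _refinery_'s internal active learning interface, and all shapes and label names are assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy data: in refinery, the embeddings come from the pre-trained models you configure.
embeddings = np.random.rand(100, 384)                     # one vector per record
labels = np.random.choice(["POSITIVE", "NEGATIVE"], 40)   # manually labeled subset

head = LogisticRegression(max_iter=1000)
head.fit(embeddings[:40], labels)                         # train on the labeled slice

# Predict the unlabeled rest; confident predictions can act as a weak label source.
probabilities = head.predict_proba(embeddings[40:])
confident_mask = probabilities.max(axis=1) > 0.8
```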
**How do I know whether my heuristic is good?**

A heuristic can be "good" with respect to both coverage and precision. For coverage there is basically no limit at all; for precision we generally recommend a value above 70%, depending on how many heuristics you have. The more heuristics you have, the more overlaps and conflicts there will be, and the better weak supervision can work.
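To make the two metrics concrete, here is a small self-contained sketch with invented votes: coverage is the share of records a heuristic labels at all, and precision is how often those labels agree with your manually labeled reference data.

```python
# Toy noisy label matrix: one row per record, one column per heuristic.
# None means the heuristic abstained on that record.
votes = [
    ["NEGATIVE", None,       "NEGATIVE"],
    [None,       "POSITIVE", "POSITIVE"],
    ["NEGATIVE", "POSITIVE", None],
    [None,       None,       "NEGATIVE"],
]
reference = ["NEGATIVE", "POSITIVE", "POSITIVE", "NEGATIVE"]  # manual labels

for h in range(3):
    hits = [(row[h], ref) for row, ref in zip(votes, reference) if row[h] is not None]
    coverage = len(hits) / len(votes)
    precision = sum(vote == ref for vote, ref in hits) / len(hits)
    print(f"heuristic {h}: coverage={coverage:.2f}, precision={precision:.2f}")
```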
**I have less than 1,000 records - do I need this?**

You can definitely use the system for smaller datasets, too! It not only shines via programmatic labeling, but also has a simple and beautiful UI. Go for it!

### Technical questions
**Help!! I forgot my password!**

No worries, you can send yourself a reset link even on your local machine. However, the link isn't sent to your email address but to MailHog; access it via http://localhost:4436.
**I want to install a library for my labeling function**

For this, we need to change the requirements.txt of the lf-exec-env, the containerized execution environment for your labeling functions. Please just open an issue, and we'll integrate your library as soon as possible.
**Which data formats are supported?**

We've structured our data formats around JSON, so you can upload most file types natively. This includes spreadsheets, text files, CSV data, generic JSON and many more.
**How can I upload data?**

We use pandas internally to match your data to our JSON-based data model. You can upload the data via our UI, or via our Python SDK.
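Because the data model is record-oriented JSON, tabular files map onto it naturally. The snippet below is only a hypothetical preparation step (file and column names are made up); the actual upload then happens through the UI or the Python SDK.

```python
import pandas as pd

# Hypothetical source file with e.g. a "headline" and a "date" column.
df = pd.read_csv("headlines.csv")

# One JSON object per record, matching the record-oriented data model.
df.to_json("headlines_upload.json", orient="records")
```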
**How can I download data, and what format does it have?**

You can download your data in our UI or via the Python SDK, where we also provide adapters, e.g. to Rasa. The export looks something like this:

```json
[
{
"running_id": "0",
"headline": "T. Rowe Price (TROW) Dips More Than Broader Markets",
"date": "Jun-30-22 06:00PM\u00a0\u00a0",
"headline__sentiment__MANUAL": null,
"headline__sentiment__WEAK_SUPERVISION": "NEGATIVE",
"headline__sentiment__WEAK_SUPERVISION__confidence": 0.62,
"headline__entities__MANUAL": null,
"headline__entities__WEAK_SUPERVISION": [
"STOCK", "STOCK", "STOCK", "STOCK", "STOCK", "STOCK", "O", "O", "O", "O", "O"
],
"headline__entities__WEAK_SUPERVISION__confidence": [
0.98, 0.98, 0.98, 0.98, 0.98, 0.98, 0.00, 0.00, 0.00, 0.00, 0.00
]
}
]
```

### Service and hosting questions
**Are there options for an enterprise on-prem solution?**

If you're interested in running the multi-user version on your premises, please reach out to us. We can help you set up the deployment and prepare your project(s), e.g. with workshops.
**I don't want to label myself. What are my options?**

Do you want to outsource your labeling and let your engineers use _refinery_ as mission control for your training data? Reach out to us, so we can discuss how we can help you with your use case.
**How can I reach support?**

In our open-source solution, you can reach out to us via Discord. For our managed version, you have an in-app chat to directly contact our support team.

## 🐍 Python SDK
You can extend your projects by using our [Python SDK](https://github.com/code-kern-ai/refinery-python). With it, you can easily export labeled data of your current project and import new files, both programmatically and via the CLI (`rsdk pull` and `rsdk push`). It also comes with adapters, e.g. to [Rasa](https://github.com/RasaHQ/rasa).
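As a rough idea of what downstream processing can look like, the sketch below loads an export (for example one fetched via `rsdk pull`) into pandas and filters by weak supervision confidence. The file name and the column names are assumptions based on the export example in the FAQ above, not a fixed contract.

```python
import json

import pandas as pd

# Hypothetical export file pulled from a refinery project.
with open("project_export.json") as f:
    records = json.load(f)

df = pd.DataFrame(records)

# Keep only weakly supervised sentiment labels with reasonably high confidence.
confident = df[df["headline__sentiment__WEAK_SUPERVISION__confidence"] > 0.7]
print(confident[["headline", "headline__sentiment__WEAK_SUPERVISION"]].head())
```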
## 🏗️ Architecture
Our architecture follows some main patterns:
- Shared service database to efficiently transfer large data loads. To avoid redundant code in the services, we use submodules to share the data model
- Containerized function execution for labeling functions, active learning and the record IDE
- Machine learning logic is implemented in stand-alone libraries (e.g. [sequence-learn](https://github.com/code-kern-ai/sequence-learn))

![Architecture _refinery_](architecture.svg)
Some edges are not displayed for simplicity's sake.
The colors of the edges have no implicit meaning and are only used for better readability.
**Service overview (maintained by Kern AI)**
| Service | Description |
|--- |--- |
| [ml-exec-env](https://github.com/code-kern-ai/refinery-ml-exec-env) | Execution environment for the active learning module. Containerized function as a service to build active learning models using scikit-learn and sequence-learn. |
| [embedder](https://github.com/code-kern-ai/refinery-embedder) | Embedder for _refinery_. Manages the creation of document- and token-level embeddings using the embedders library. |
| [weak-supervisor](https://github.com/code-kern-ai/refinery-weak-supervisor) | Weak supervision for _refinery_. Manages the integration of heuristics such as labeling functions, active learners or zero-shot classifiers. Uses the weak-nlp library for the actual integration logic and algorithms. |
| [record-ide-env](https://github.com/code-kern-ai/refinery-record-ide-env) | Execution environment for the record IDE. Containerized function as a service to build record-specific "quick-and-dirty" code snippets for exploration and debugging. |
| [config](https://github.com/code-kern-ai/refinery-config) | Configuration of _refinery_. Amongst others, this manages endpoints and available language models for spaCy. |
| [tokenizer](https://github.com/code-kern-ai/refinery-tokenizer) | Tokenizer for _refinery_. Manages the creation and storage of spaCy tokens for text-based record attributes and supports multiple language models. |
| [gateway](https://github.com/code-kern-ai/refinery-gateway) | Gateway for _refinery_. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used. |
| [authorizer](https://github.com/code-kern-ai/refinery-authorizer) | Evaluates whether a user has access to certain resources. |
| [websocket](https://github.com/code-kern-ai/refinery-websocket) | Websocket module for refinery. Enables asynchronous notifications inside the application. |
| [lf-exec-env](https://github.com/code-kern-ai/refinery-lf-exec-env) | Execution environment for labeling functions. Containerized function as a service to execute user-defined Python scripts. |
| [ac-exec-env](https://github.com/code-kern-ai/refinery-ac-exec-env) | Execution environment for attribute calculation. Containerized function as a service to generate new attributes using Python scripts. |
| [updater](https://github.com/code-kern-ai/refinery-updater) | Updater for _refinery_. Manages migration logic to new versions if required. |
| [neural-search](https://github.com/code-kern-ai/refinery-neural-search) | Neural search for _refinery_. Manages similarity search powered by Qdrant and outlier detection, both based on vector representations of the project records. |
| [zero-shot](https://github.com/code-kern-ai/refinery-zero-shot) | Zero-shot module for _refinery_. Enables the integration of 🤗 Hugging Face zero-shot classifiers as an off-the-shelf no-code heuristic. |
| [entry](https://github.com/code-kern-ai/refinery-entry) | Login and registration screen for refinery. Implemented via Ory Kratos. |
| [ui](https://github.com/code-kern-ai/refinery-ui) | UI for _refinery_. Used to interact with the whole system; to find out how to best work with the system, check out our docs. |
| [doc-ock](https://github.com/code-kern-ai/refinery-doc-ock) | Usage statistics collection for _refinery_. If users allow it, this collects product insight data used to optimize the user experience. |
| [gateway-proxy](https://github.com/code-kern-ai/refinery-gateway-proxy) | Gateway proxy for _refinery_. Manages incoming requests and forwards them to the gateway. Used by the Python SDK. |
| [parent-images](https://github.com/code-kern-ai/refinery-exec-env-parent-image) | Shared images used by _refinery_. Used to reduce the required space for _refinery_. _Not yet listed in architecture diagram_ |
| [ac-exec-env](https://github.com/code-kern-ai/refinery-ac-exec-env) | Execution environment for attribute calculation in _refinery_. Containerized function as a service to build custom attributes derived from the original data. _Not yet listed in architecture diagram_ |
| [alfred](https://github.com/code-kern-ai/alfred) | Controls the start process of the _refinery_ app. Named after Batman's butler Alfred. _Not yet listed in architecture diagram_ |

**Service overview (open-source 3rd party)**
| Service | Description |
|--- |--- |
| [qdrant/qdrant](https://github.com/qdrant/qdrant) | Qdrant - Vector Search Engine for the next generation of AI applications |
| [postgres/postgres](https://github.com/postgres/postgres) | PostgreSQL: The World's Most Advanced Open Source Relational Database |
| [minio/minio](https://github.com/minio/minio) | Multi-Cloud ☁️ Object Storage |
| [mailhog/MailHog](https://github.com/mailhog/MailHog) | Web and API based SMTP testing |
| [ory/kratos](https://github.com/ory/kratos) | Next-gen identity server (think Auth0, Okta, Firebase) with Ory-hardened authentication, MFA, FIDO2, TOTP, WebAuthn, profile management, identity schemas, social sign in, registration, account recovery, passwordless. Golang, headless, API-only - without templating or theming headaches. Available as a cloud service. |
| [ory/oathkeeper](https://github.com/ory/oathkeeper) | A cloud native Identity & Access Proxy / API (IAP) and Access Control Decision API that authenticates, authorizes, and mutates incoming HTTP(s) requests. Inspired by the BeyondCorp / Zero Trust white paper. Written in Go. |

**Integrations overview (maintained by Kern AI)**
| Integration | Description |
|--- |--- |
| [refinery-python](https://github.com/code-kern-ai/refinery-python) | Official Python SDK for Kern AI refinery. |
| [sequence-learn](https://github.com/code-kern-ai/sequence-learn) | With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier. |
| [embedders](https://github.com/code-kern-ai/embedders) | With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification. Integrates 🤗 Hugging Face transformer models. |
| [weak-nlp](https://github.com/code-kern-ai/weak-nlp) | With weak-nlp, you can integrate heuristics like labeling functions and active learners based on weak supervision. Automate data labeling and improve label quality. |

**Integrations overview (open-source 3rd party)**
| Integration | Description |
|--- |--- |
| [huggingface/transformers](https://github.com/huggingface/transformers) | 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. |
| [scikit-learn/scikit-learn](https://github.com/scikit-learn/scikit-learn) | scikit-learn: machine learning in Python |
| [explosion/spaCy](https://github.com/explosion/spaCy) | 💫 Industrial-strength Natural Language Processing (NLP) in Python |

**Submodules overview**
These are not listed in the architecture diagram, but we use git submodules for internal code management.
| Submodule | Description |
|--- |--- |
| [submodule-model](https://github.com/code-kern-ai/refinery-submodule-model) | Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway. |
| [submodule-s3](https://github.com/code-kern-ai/refinery-submodule-s3) | S3 related AWS and Minio logic. |

## 🏫 Glossary
| Term | Meaning |
| --------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Weak supervision | Technique/methodology to integrate different kinds of noisy and imperfect heuristics like labeling functions. It can be used not only to automate data labeling, but generally as an approach to improve your existing label quality. |
| Neural search | Embedding-based approach to retrieve information; instead of telling a machine a set of constraints, neural search analyzes the vector space of data (encoded via e.g. pre-trained neural networks). Can be used e.g. to find nearest neighbors. |
| Active learning | As data is labeled manually, a model is trained continuously to support the annotator. Can be used e.g. stand-alone, or as a heuristic for weak supervision. |
| Vector encoding (embedding) | Using pre-trained models such as transformers from [🤗 Hugging Face](https://www.huggingface.co), texts can be transformed into vector space. This is helpful both for neural search and for active learning (in the latter case, simple classifiers can be applied on top of the embedding, which enables fast re-training on the vector representations). |

Missing anything in the glossary? [Add the term](https://github.com/code-kern-ai/refinery/issues) in an issue with the tag "enhancement".
## 👩‍💻👨‍💻 Team and contributors
Henrik Wenck
Johannes HΓΆtter
Anton Pullem
Lina Lumburovska
Moritz Feuerpfeil
Leo PΓΌttmann
Simon Degraf
Felix Kirsch
Jens Wittmeyer
Mikhail Kochikov
Simon Witzke
Shamanth Shetty
Divyanshu Katiyar
## 🌟 Star History
[![Star History Chart](https://api.star-history.com/svg?repos=code-kern-ai/refinery&type=Date)](https://star-history.com/#code-kern-ai/refinery&Date)
## 📃 License
_refinery_ is licensed under the Apache License, Version 2.0. View a copy of the [License file](LICENSE).