Projects in Awesome Lists by code-kern-ai
A curated list of projects in awesome lists by code-kern-ai.
https://github.com/code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
active-learning annotations artificial-intelligence data-centric-ai data-labeling data-science deep-learning human-in-the-loop labeling labeling-tool machine-learning natural-language-processing neural-search nlp python spacy supervised-learning text-annotation text-classification transformers
Last synced: 10 Apr 2025
https://github.com/code-kern-ai/bricks
Open-source natural language enrichments at your fingertips.
best-practices natural-language-processing natural-language-understanding python templates
Last synced: 08 Apr 2025
https://github.com/code-kern-ai/refinery-sample-projects
Contains example projects you can use to test refinery. Select the use case from the branches.
chatbot example exercise rasa sentiment-analysis similarity-search tutorial
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/sequence-learn
With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.
machine-learning named-entity-recognition natural-language-processing ner nlp python
Last synced: 24 Apr 2025
https://github.com/code-kern-ai/embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
classification machine-learning named-entity-recognition natural-language-processing ner nlp python representation-learning similarity-search
Last synced: 24 Apr 2025
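The similarity-search use case named in the embedders description can be illustrated with a minimal sketch in plain Python. This does not use the embedders API: the toy three-dimensional vectors stand in for sentence-level embeddings a real model would produce, and the helper names `cosine_similarity` and `most_similar` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, corpus):
    """Return the key of the corpus vector with the highest cosine similarity to query."""
    return max(corpus, key=lambda key: cosine_similarity(query, corpus[key]))


# Toy vectors (assumed values, not real embeddings).
corpus = {
    "refund request": [0.9, 0.1, 0.0],
    "shipping delay": [0.1, 0.8, 0.2],
    "password reset": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]
best = most_similar(query, corpus)
print(best)
```

With real embeddings the vectors would have hundreds of dimensions, and a vector database would typically replace the linear scan over the corpus.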
https://github.com/code-kern-ai/refinery-python-sdk
Official Python SDK for Kern AI refinery.
active-learning data-centric-ai deep-learning labeling labeling-tool machine-learning natural-language-processing neural-search nlp python sdk spacy supervised-learning text-annotation text-classification transformer
Last synced: 29 Mar 2025
https://github.com/code-kern-ai/weak-nlp
With weak-nlp, you can integrate heuristics like labeling functions and active learners based on weak supervision. Automate data labeling and improve label quality.
Last synced: 24 Apr 2025
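As a rough illustration of the idea behind labeling functions and weak supervision, the sketch below combines a few keyword heuristics by simple majority vote. It is written in plain Python and is not the weak-nlp API or its aggregation algorithm; the labels, the function names, and the tie-handling rule are all assumptions made for this example.

```python
from collections import Counter

ABSTAIN = None  # a labeling function returns this when it has no opinion


def lf_contains_refund(text):
    return "complaint" if "refund" in text.lower() else ABSTAIN


def lf_contains_thanks(text):
    return "praise" if "thank" in text.lower() else ABSTAIN


def lf_many_exclamations(text):
    return "complaint" if text.count("!") >= 2 else ABSTAIN


LABELING_FUNCTIONS = [lf_contains_refund, lf_contains_thanks, lf_many_exclamations]


def weak_label(text):
    """Majority vote over non-abstaining labeling functions; abstain on ties or no votes."""
    votes = [lf(text) for lf in LABELING_FUNCTIONS]
    votes = [vote for vote in votes if vote is not ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN  # conflicting heuristics with equal support
    return ranked[0][0]


print(weak_label("I want a refund now!!"))
print(weak_label("Thank you so much"))
```

Abstaining on ties keeps conflicting records unlabeled, so they can be routed to manual review instead of receiving a noisy label.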
https://github.com/code-kern-ai/refinery-neural-search
Neural search for refinery. Manages similarity search powered by Qdrant and outlier detection, both based on vector representations of the project records.
Last synced: 24 Apr 2025
https://github.com/code-kern-ai/docs-chatbot-demo
A demo of how embeddings and LLMs can be used to quickly build a chatbot that lets you "chat" with documentation.
Last synced: 24 Apr 2025
https://github.com/code-kern-ai/refinery-submodule-model
Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway.
Last synced: 19 Feb 2025
https://github.com/code-kern-ai/refinery-embedder
Embedder for refinery. Manages the creation of document- and token-level embeddings using the embedders library.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-template-functions
Overview of recurring functions you can use for your projects.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-config
Configuration of refinery. Manages, among other things, endpoints and the available language models for spaCy.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-tokenizer
Tokenizer for refinery. Manages the creation and storage of spaCy tokens for text-based record attributes and supports multiple language models. It is used by the gateway.
Last synced: 24 Apr 2025
https://github.com/code-kern-ai/refinery-submodule-parent-images
Submodule which contains the requirements of the different parent images of refinery.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-weak-supervisor
Weak supervision for refinery. Manages the integration of heuristics such as labeling functions, active learners or zero-shot classifiers. Uses the weak-nlp library for the actual integration logic and algorithms.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-exec-env-parent-image
Defines parent image for the Docker images of the refinery services which provide an execution environment.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-zero-shot
Zero-shot module for refinery. Enables the integration of 🤗 Hugging Face zero-shot classifiers as an off-the-shelf no-code heuristic.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-doc-ock
Usage statistics collection for refinery. If users allow it, this collects product insight data used to optimize the user experience.
Last synced: 24 Apr 2025
https://github.com/code-kern-ai/refinery-torch-cuda-parent-image
Defines parent image for the Docker images of the refinery services that require torch (GPU).
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-mini-parent-image
Defines parent image for the Docker images of the refinery services with the smallest set of requirements.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-torch-cpu-parent-image
Defines parent image for the Docker images of the refinery services that require torch (CPU).
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/alfred
Controls the start process of the refinery app. Named after Batman's butler Alfred.
Last synced: 03 Apr 2025
https://github.com/code-kern-ai/cicd-deployment-scripts
Scripts used for Kern AI CI/CD efforts.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-authorizer
Evaluates whether a user has access to certain resources.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-gateway-proxy
Gateway proxy for refinery. Manages incoming requests and forwards them to the gateway. Used by the Python SDK.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-updater
Updater for refinery. Manages migration logic to new versions if required.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-websocket
Websocket module for refinery. Enables asynchronous notifications inside the application.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-lf-exec-env
Execution environment for labeling functions in refinery. Containerized function as a service to execute user-defined Python scripts.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-ac-exec-env
Execution environment for attribute calculation in refinery. Containerized function as a service to build custom attributes derived from the original data.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-common-parent-image
Defines parent image for the Docker images of the refinery services which require the integration of the model and the S3 submodule.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-record-ide-env
Execution environment for the record IDE in refinery. Containerized function as a service to build record-specific "quick-and-dirty" code snippets for exploration and debugging.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-gateway
Gateway for refinery. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used.
Last synced: 19 Feb 2025
https://github.com/code-kern-ai/refinery-ml-exec-env
Execution environment for the active learning module in refinery. Containerized function as a service to build active learning models using scikit-learn and sequence-learn.
Last synced: 25 Feb 2025
https://github.com/code-kern-ai/refinery-submodule-s3
S3-related logic for AWS and MinIO.
Last synced: 25 Feb 2025