Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yobulkdev/yobulkdev
🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative
csv-import csv-parser csv-reader data-engineering datacleaning embeddable javascript languagemodel mongodb nextjs nodejs open-source react stream streaming
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/yobulkdev/yobulkdev
- Owner: yobulkdev
- License: agpl-3.0
- Created: 2022-11-24T11:42:19.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-03T04:51:35.000Z (over 1 year ago)
- Last Synced: 2024-08-02T01:25:42.238Z (6 months ago)
- Topics: csv-import, csv-parser, csv-reader, data-engineering, datacleaning, embeddable, javascript, languagemodel, mongodb, nextjs, nodejs, open-source, react, stream, streaming
- Language: JavaScript
- Homepage: https://www.yobulk.dev/
- Size: 973 KB
- Stars: 865
- Watchers: 12
- Forks: 45
- Open Issues: 59
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
- awesome-ChatGPT-repositories - yobulkdev - 🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative (Others)
README
YoBulk
⭐️ Open Source Alternative To Flatfile.com ⭐️
A Data exchange platform for all businesses that run on CSV files.
Create a YoBulk CSV Import button and make CSV importing 10x faster.
[Chinese](markdown/languages/chinese.md)
[Japanese](markdown/languages/japanese.md)
[German](markdown/languages/german.md)
[Russian](markdown/languages/russian.md)

## User Installation
- [Docker](https://doc.yobulk.dev/GetStarted/Installation#yobulk-installation)
## Docker
### Option 1: Through Docker Compose
Docker Compose brings up both the YoBulk image and the MongoDB image.
A `docker-compose.yml` file is provided in the yobulkdev code repository.
```bash
git clone https://github.com/yobulkdev/yobulkdev.git
cd yobulkdev
docker-compose up -d
```
- To use the YoBulk OpenAI feature, follow the guideline below.

> Open the `docker-compose.yml` file and set the `OPENAI_SECRET_KEY` variable to the key generated from the [OpenAI API site](https://platform.openai.com/account/api-keys).

The YoBulk dashboard is ready to go at [http://localhost:5050/](http://localhost:5050/).
### Option 2: Through Docker Run
If you prefer plain Docker, you can run the image directly.
- Prerequisite: MongoDB must be installed and running on your local machine.
```bash
docker run --rm -it -p 5050:5050/tcp yobulk/yobulk
```
- To use the YoBulk OpenAI feature, follow the guideline below.

> Pass the OpenAI key generated from the [OpenAI API site](https://platform.openai.com/account/api-keys) with `--env` to explore the YoBulk AI features. Example:
```bash
docker run --rm -it -p 5050:5050/tcp --env="OPENAI_SECRET_KEY=****" yobulk/yobulk
```
The YoBulk dashboard is ready to go at [http://localhost:5050/](http://localhost:5050/).

## Building locally
If you are a developer and want to build the system locally, you need MongoDB running locally.
YoBulk is a full-stack Next.js application that uses MongoDB as its primary database.
```bash
git clone https://github.com/yobulkdev/yobulkdev
cd yobulkdev
yarn install
yarn run dev
```
- To use the YoBulk OpenAI feature, follow the guideline below.

> Set the `OPENAI_SECRET_KEY` variable in the `.env` file to the OpenAI key generated from the [OpenAI API site](https://platform.openai.com/account/api-keys).

The dashboard is ready to view at [http://localhost:5050/](http://localhost:5050/).
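
The README does not show how the application reads `OPENAI_SECRET_KEY`, so the following is only a minimal sketch of how a Node.js process could check whether the key is set before enabling AI features. The helper name `getOpenAiConfig` and the returned object shape are assumptions made for illustration; only the variable name comes from the instructions above.

```javascript
// Hypothetical helper: not taken from the YoBulk codebase.
// Only the variable name OPENAI_SECRET_KEY comes from the README.
function getOpenAiConfig(env = process.env) {
  const key = (env.OPENAI_SECRET_KEY || "").trim();
  // Treat a missing, empty, or all-asterisk placeholder value as "AI features off".
  const enabled = key.length > 0 && !/^\*+$/.test(key);
  return { enabled, key: enabled ? key : null };
}

// "sk-example" is a made-up placeholder, not a real key.
console.log(getOpenAiConfig({ OPENAI_SECRET_KEY: "sk-example" }).enabled); // true
console.log(getOpenAiConfig({}).enabled); // false
```

Passing the environment as a parameter keeps the check easy to try out without touching the real `.env` file.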
## YoBulk Cloud
YoBulk Cloud is the fastest and most reliable way to run YoBulk. You can get started with free credits in minutes. Sign up for [YoBulk Cloud](https://docs.google.com/forms/d/e/1FAIpQLSfpznd0nsnL5IGWIO1f4O4SBJugxAjI8bI12QqhDP_hqYvJ-A/viewform).
## Quick start
Please view the [documentation](https://doc.yobulk.dev/GetStarted/Quickstart) to explore the CSV import flow.
## Roadmap
- [YoBulk Roadmap ReadMe](https://doc.yobulk.dev/RoadMap/ProductVision)
# Features
### Supported Features
- ⚡ No code Template and Import button creation
- ⚡ Smart auto matching between the CSV column and Template column
- ⚡ Custom validation rule setting against a field
- ⚡ Delightful Data review experience
- ⚡ Scalable through streaming. You can import a CSV file as large as 1 GB
- ⚡ GPT-3 integration
- ⚡ Template creation through JSON
- ⚡ Regex creation and addition in custom validation
- ⚡ Collaboration & workspace creation
- ⚡ AI-based auto suggestion for correcting errors

Coming Soon:
- ⚡ Custom LLM models for auto suggestion and auto cleaning of CSV errors
- ⚡ Data mapping decision knowledge graph for different sectors

## Demos
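
The smart auto matching between CSV columns and template columns listed under Supported Features can be illustrated with a small sketch. How YoBulk actually matches columns is not described in this README, so the `normalize` and `autoMatch` functions below are hypothetical and rely on plain name normalization only.

```javascript
// Hypothetical matcher for illustration; YoBulk's actual matching logic
// (which may use fuzzier or AI-based techniques) is not shown in this README.

// Lowercase and strip everything except letters and digits,
// so "First Name", "first_name" and "FIRST-NAME" compare equal.
function normalize(name) {
  return name.toLowerCase().replace(/[^a-z0-9]/g, "");
}

// Map each template column to the CSV header with the same normalized name,
// or to null when no header matches.
function autoMatch(csvHeaders, templateColumns) {
  const byNormalized = new Map();
  for (const header of csvHeaders) {
    const key = normalize(header);
    if (!byNormalized.has(key)) byNormalized.set(key, header); // first header wins
  }
  const mapping = {};
  for (const column of templateColumns) {
    mapping[column] = byNormalized.get(normalize(column)) ?? null;
  }
  return mapping;
}

console.log(autoMatch(["First Name", "E-mail", "Phone #"], ["first_name", "email", "zip"]));
// { first_name: 'First Name', email: 'E-mail', zip: null }
```

Unmatched template columns map to `null`, which is where a manual mapping step or a smarter matcher would take over.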
### NoCode Template Creation Demo
https://user-images.githubusercontent.com/118799976/220317753-8e540eb0-af1a-47fe-b384-72081674d14e.mp4

### All DataType Validation Demo
https://user-images.githubusercontent.com/118799976/220321287-8cc38555-51ef-412b-885a-b84cba3e5571.mp4

### String Validation Demo
https://user-images.githubusercontent.com/118799976/220321602-6beebf43-b97e-486b-9aa6-aae9b3f68eca.mp4

### GPT Demo
https://user-images.githubusercontent.com/118799976/220322196-38bc6db3-0b4c-4343-9648-3958ab720c8b.mp4

## Why are we building this?
More than 70% of business data is shared in the form of CSV and Excel files. Less than 10% have truly adopted API integrations for data exchange.
As developers and product managers, we have gone through the pain of building a scalable CSV importer. Every day, thousands of developers go through the same pain, and hundreds of companies have to get enormous amounts of spreadsheet and CSV data into a database.
YoBulk is building an OSS standard for CSV and flat-file data onboarding that helps you build a data import solution in-house.
Companies like Flatfile.com are addressing this problem, but their offerings come with constraints such as weak data security and high pricing. While developers want an embeddable, scalable CSV importer for SaaS, the truth is that it is not possible to come up with a "one-size-fits-all" solution in such a fragmented market.
All these restrictions motivate our mission: to make it possible for anyone to create a data import solution that suits their needs.
To developers, we would like to offer a "bring your own validations" (BYOV) and "bring your own database" (BYOD) experience. You can design and modify validation rules in your own way using YoBulk.
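
To make the "bring your own validations" idea concrete, here is a minimal sketch of regex-based rules applied to one parsed CSV row. The rule shape (`required`, `pattern`) and the `validateRow` function are assumptions for illustration only; YoBulk's real template and validation format is defined in its documentation, not here.

```javascript
// Hypothetical rule shape for illustration; not YoBulk's actual format.
const rules = {
  email: { required: true, pattern: /^[^\s@]+@[^\s@]+\.[^\s@]+$/ },
  age: { required: false, pattern: /^\d{1,3}$/ },
};

// Validate one parsed CSV row (an object keyed by column name).
// Returns a list of { column, message } errors; an empty list means the row is valid.
function validateRow(row, ruleSet) {
  const errors = [];
  for (const [column, rule] of Object.entries(ruleSet)) {
    const value = String(row[column] ?? "").trim();
    if (value === "") {
      // Empty values only fail when the column is required.
      if (rule.required) errors.push({ column, message: "value is required" });
      continue;
    }
    if (rule.pattern && !rule.pattern.test(value)) {
      errors.push({ column, message: `"${value}" does not match ${rule.pattern}` });
    }
  }
  return errors;
}

console.log(validateRow({ email: "ada@example.com", age: "36" }, rules)); // []
console.log(validateRow({ email: "not-an-email", age: "" }, rules).length); // 1
```

Keeping the rules as plain data means they can be swapped per template without changing the validation loop.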
To enterprises, YoBulk is dedicated to bringing data security. Moreover, you keep 100% control of your data, along with long-term maintainability.
To be honest, we do not claim to outperform flatfile.com in terms of functionality and design, at least for now. Our priority at the moment is to add more functionality. We would like to cultivate a community to democratise the knowledge and bring all CSV and flat-file open source projects under a single framework.
Read more: [Open vs Closed Data Onboarding](https://doc.yobulk.dev/#yobulk-positioning-open-source-vs-closed-source-data-onboarding-platforms)
# Our Mission
Our mission is to provide the most powerful open source flat-file (CSV, spreadsheet) importer to every single internet business in the world.
An AI-first data exchange for businesses to share flat files, where you are in charge of your customer’s data onboarding, validation & transformation.
We strongly support Anderz's vision.

# Our Vision
YoBulk's vision is to create an AI-first knowledge graph for carrying out billions of data mapping decisions while exchanging data between businesses across various sectors through flat files such as CSV and Excel.
## Support
If you have any questions or problems, please visit our GitHub discussions page, and we'll try to help you as soon as possible.
[https://github.com/yobulkdev/yobulkdev/discussions](https://github.com/yobulkdev/yobulkdev/discussions)
## License
YoBulk is open-source software released under the [AGPL 3.0 license](https://github.com/yobulkdev/yobulkdev/blob/main/LICENSE.md).
## Contributors
We welcome contributions to YoBulk.