https://github.com/mongodb-developer/dotnethuggingfacedatasetuploader
A simple dotnet console app for uploading the movies dataset from AIATMongoDB on Hugging Face to your own cluster
https://github.com/mongodb-developer/dotnethuggingfacedatasetuploader
Last synced: 2 months ago
JSON representation
A simple dotnet console app for uploading the movies dataset from AIATMongoDB on Hugging Face to your own cluster
- Host: GitHub
- URL: https://github.com/mongodb-developer/dotnethuggingfacedatasetuploader
- Owner: mongodb-developer
- Created: 2024-02-20T12:17:45.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-02-22T13:20:44.000Z (over 1 year ago)
- Last Synced: 2025-02-16T02:44:03.443Z (4 months ago)
- Language: C#
- Size: 4.88 KB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# Hugging Face Dataset Uploader
The aim of this project is to provide a console application for inserting Hugging Face datasets into your MongoDB Cluster.
At the moment, it dowmnloads the [Embedded Movies dataset](https://huggingface.co/datasets/AIatMongoDB/embedded_movies) and uploads to a collection called embedded_movies insisde a sample_mflix database. If these don't exist, they will be created in your cluster automatically.
## Prerequisites
1. A MongoDB cluster deployed
2. Your connection string
3. DotNET SDK on your machine
4. This repo downloaded to your machine## Running the uploader
1. From the root of the project on your machine, run ```dotnet run```.
2. Enter your connection string when requestedOnce complete, you should see your data available inside your cluster.