Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wittline/csv-shuffler
A tool to automatically Shuffle lines in .csv files
https://github.com/wittline/csv-shuffler
big-data csv csv-files data-engineering large-files shuffle
Last synced: 3 months ago
JSON representation
A tool to automatically Shuffle lines in .csv files
- Host: GitHub
- URL: https://github.com/wittline/csv-shuffler
- Owner: Wittline
- License: mit
- Created: 2022-07-15T18:38:40.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-07-17T20:15:55.000Z (over 2 years ago)
- Last Synced: 2024-10-13T19:07:16.918Z (3 months ago)
- Topics: big-data, csv, csv-files, data-engineering, large-files, shuffle
- Language: Python
- Homepage:
- Size: 9.77 KB
- Stars: 4
- Watchers: 2
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# csv-shuffler
A tool to automatically Shuffle lines in .csv files![image](https://user-images.githubusercontent.com/8701464/179423167-35b1780b-5aa3-46e4-ad19-eec4ca8ba820.png)
# **Installing *csv\_shuffler* 🔧**
``` python
pip install csv-shuffler
```Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Collecting csv-shuffler
Downloading csv_shuffler-0.0.3-py3-none-any.whl (2.9 kB)
Installing collected packages: csv-shuffler
Successfully installed csv-shuffler-0.0.3# **Importing *csv\_shuffler* library ⚡**
``` python
from csv_shuffler import csv_shuffler
```# **Setting *csv\_shuffler* configuration ✍**
``` python
shuffler = csv_shuffler.ShuffleCSV(input_file='/content/sample_data/mnist_train_small.csv',header=True, batch_size=20000)
```# **Run shuffler 🏃**
``` python
shuffler.shuffle_csv()
```19999
## Contributing and Feedback
Any ideas or feedback about this repository?. Help me to improve it.## Authors
- Created by Ramses Alexander Coraspe Valdez
- Created on 2022## License
This project is licensed under the terms of the MIT License.