Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/similato87/covid19preprocess
TME6014
https://github.com/similato87/covid19preprocess
Last synced: about 10 hours ago
JSON representation
TME6014
- Host: GitHub
- URL: https://github.com/similato87/covid19preprocess
- Owner: similato87
- Created: 2021-10-21T19:45:59.000Z (about 3 years ago)
- Default Branch: master
- Last Pushed: 2021-10-21T19:48:14.000Z (about 3 years ago)
- Last Synced: 2024-11-05T15:04:12.240Z (about 2 months ago)
- Language: Python
- Size: 132 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# Data Preprocess
## overview
- This repo is created for sharing the resources and posting the notifications from our professor.
- This repo will be updated weekly according to our process.
***
## Week 1
### task
- perform cleansing on multiple CSV files you have found online and augment them into one spreadsheet.
- When you concatenate the instances from different csv files, you need to unify all possible features/attributes the way you described.
- perform scatter plots on across multiple attributes for visualization
### resource
- This is the link of the COVID-19 data.
- This data has been added to this project too.
- You may find some ideas from Read CSV in Python and Data frame.
- I tried to write a [demo](https://github.com/similato87/COVID19Preprocess/blob/master/main/pipeline.py) for reading data, and it seems to be successful.
### note
- The next meeting will be Monday at 12 noon at H133.
- Have a great weekend :)