Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/shuyib/teaching_data_collection

Learn data collection by putting a couple of things into consideration

best-practices data-collection data-science data-structures data-visualization makefile matplotlib pandas-dataframe polars-dataframe

# teaching_data_collection
Learn how to do data collection while keeping a few things in mind, such as:

1. [Tidy data principles](https://vita.had.co.nz/papers/tidy-data.pdf), including wide and long dataframes.
2. Error handling while calling [APIs](https://blog.postman.com/best-practices-for-api-error-handling/).
3. Keeping your secrets safe with the [`os` module](https://www.geeksforgeeks.org/python-os-getenv-method/).
4. [Retrying](https://medium.com/@API4AI/best-practice-implementing-retry-logic-in-http-api-clients-0b5469c08ced) API requests in case of failure.
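
Points 2 to 4 in the list above can be combined in a few lines. What follows is a minimal sketch, not the code from this repository: `get_secret`, `call_with_retries`, and `MissingSecretError` are hypothetical names, and treating `ConnectionError` and `TimeoutError` as the only retryable failures is an assumption you would adapt to your own HTTP client.

```python
import os
import time


class MissingSecretError(RuntimeError):
    """Raised when a required environment variable is not set (hypothetical helper)."""


def get_secret(name):
    """Read a secret from the environment instead of hard-coding it in the source."""
    value = os.getenv(name)
    if not value:
        raise MissingSecretError(f"Set the {name} environment variable first.")
    return value


def call_with_retries(func, max_attempts=3, base_delay=1.0,
                      retry_on=(ConnectionError, TimeoutError), sleep=time.sleep):
    """Call func() and retry with exponential backoff when a transient error occurs."""
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except retry_on as exc:
            if attempt == max_attempts:
                raise  # out of attempts: let the caller handle the error
            delay = base_delay * 2 ** (attempt - 1)  # base_delay, then double each time
            print(f"Attempt {attempt} failed ({exc}); retrying in {delay:.1f}s")
            sleep(delay)
```

A call might look like `call_with_retries(lambda: fetch(url))`, where `fetch` stands for whatever function performs your request. Errors outside `retry_on` are raised immediately, since retrying a bad API key or a malformed query rarely helps.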

You will learn how to query the News API and scrape information from Wikipedia, two common ways of
collecting data in data science. NB: You'll need an [API key](https://newsapi.org/) for the News API. Here we go!
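
To give a feel for the API half, here is a rough sketch of preparing a request and flattening the response into tidy rows (one row per article, one column per variable). It is an illustration under assumptions rather than the code in the `API` folder: the endpoint, the `X-Api-Key` header, and the response fields should be checked against the News API documentation, `build_request` and `articles_to_rows` are hypothetical helpers, and the sample payload is made up.

```python
from urllib.parse import urlencode

# Assumed endpoint; confirm it against the News API documentation.
NEWS_API_URL = "https://newsapi.org/v2/everything"


def build_request(query, api_key, page_size=20):
    """Return (url, headers); the key travels in a header so it stays out of logged URLs."""
    params = urlencode({"q": query, "pageSize": page_size})
    return f"{NEWS_API_URL}?{params}", {"X-Api-Key": api_key}


def articles_to_rows(payload):
    """Flatten a response payload into one flat dict per article."""
    if payload.get("status") != "ok":
        raise ValueError(f"API reported an error: {payload.get('message', 'unknown')}")
    rows = []
    for article in payload.get("articles", []):
        rows.append({
            "source": (article.get("source") or {}).get("name"),
            "title": article.get("title"),
            "published_at": article.get("publishedAt"),
            "url": article.get("url"),
        })
    return rows


# Made-up payload in the assumed response shape, for illustration only.
sample_payload = {
    "status": "ok",
    "articles": [
        {
            "source": {"id": None, "name": "Example News"},
            "title": "An example headline",
            "publishedAt": "2024-01-01T00:00:00Z",
            "url": "https://example.com/a",
        }
    ],
}

url, headers = build_request("data science", api_key="YOUR_KEY", page_size=5)
rows = articles_to_rows(sample_payload)
print(url)
print(rows[0]["source"], "|", rows[0]["title"])
```

A list of flat dictionaries like `rows` can be handed straight to `pandas.DataFrame(rows)` or `polars.DataFrame(rows)` for further analysis.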

Check out the folders for more information:

1. [Getting data off an API](API)
2. [Web scraping](Webscrape)
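
As a taste of the web scraping part, the sketch below pulls the rows out of an HTML table using only the standard library. It is an assumption-laden illustration, not the approach used in the `Webscrape` folder (which may rely on other libraries): `TableParser` and `parse_table` are hypothetical names, and the HTML fragment is a small made-up stand-in for a table you would download from a Wikipedia page.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row being read, or None outside <tr>
        self._cell = None  # text fragments of the current cell, or None outside a cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def parse_table(html):
    """Return the table rows found in an HTML string as lists of cell text."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


# Made-up fragment standing in for a downloaded page.
sample_html = """
<table>
  <tr><th>Language</th><th>First released</th></tr>
  <tr><td><a href="/wiki/Python">Python</a></td><td>1991</td></tr>
  <tr><td>R</td><td>1993</td></tr>
</table>
"""

header, *records = parse_table(sample_html)
print(header)
print(records)
```

Splitting the header from the records keeps the result close to tidy form: each record is one observation, and each header entry names one variable.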