Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jadianes/data-journalism
Data journalism and easy to replicate notebooks using Python, R, and Web visualisations
https://github.com/jadianes/data-journalism
data-analysis data-journalism data-visualisation data-visualization exploratory-data-analysis notebook
Last synced: 3 days ago
JSON representation
Data journalism and easy to replicate notebooks using Python, R, and Web visualisations
- Host: GitHub
- URL: https://github.com/jadianes/data-journalism
- Owner: jadianes
- License: other
- Created: 2015-09-02T10:45:00.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2018-04-18T16:53:02.000Z (over 6 years ago)
- Last Synced: 2024-12-06T21:27:47.757Z (about 1 month ago)
- Topics: data-analysis, data-journalism, data-visualisation, data-visualization, exploratory-data-analysis, notebook
- Language: HTML
- Homepage: http://jadianes.github.io/data-journalism
- Size: 9.32 MB
- Stars: 88
- Watchers: 10
- Forks: 17
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Data Journalism
If you are a [Data Journalist]((https://en.wikipedia.org/wiki/Data_journalism)) looking to improve your **coding skills**, or you work as a developer giving support in a **newsroom**, you arrived to the right place.
This is a repository of articles and tutorials, as [IPython/Jupyter](https://jupyter.org) notebooks or web products, about doing data journalism. The articles presented here, apart from analysing data to present some facts about the current, past, and sometimes future world situation, will show **programming instructions** explaining how to repeat the analysis by yourself. We live in a world where governments and the media, more often than not, serve the interests of a few. Our belief is that to empower people to do their own analysis and arrive to conclusions based on facts (data), is a way to make us all more aware and strong as a society.
The programming instructions will be given as web [Notebooks](https://en.wikipedia.org/wiki/IPython#Notebook) for the programming language or technology used (e.g. Python, R) or sometimes as a web code that yoy can inspect on the repo. This is an ideal way of sharing code combined with textual explanations, charts, images, etc. However, we will tend to favour the Pyhton language. But why Python? Well, we will sometimes use other technologies (mainly R but also JavaScript or Spark) but we think that Python has some characteristics that makes it a good environment for Data Journalism:
- It is a modern programming language, very clean and expressive, that promotes simplicity and elegance.
- It can be used to write scripts (as we will use it most of the time) and also to build complex software systems.
- There are lots of extensions (i.e. libraries) to perform all sorts of tasks, not just data analysis and visualisation ones, but also [web scraping](https://en.wikipedia.org/wiki/Web_scraping), [web development](https://en.wikipedia.org/wiki/Web_development), [natural language processing](https://en.wikipedia.org/wiki/Natural_language_processing), etc.
- You can share your code as notebooks!So our hope is that while finding our articles analysis and conclusions interesting, you will also learn how to repeat and extend them yourself and arrive to aditional conclusions.
## About me
My name is [Jose A. Dianes](http://jadianes.com/) and I am a data analyst and developer. During years I have been involved in all sorts of software projects including real-time systems, web enterprise systems, and bioinformatics. Eventually I arrived to data analysis and products, where I solve scalability problems and deliver producst that provide actionable knowledge.
You can contact me easily at [my personal website](http://jadianes.com/about/).
## Articles
#### [Tuberculosis world situation](https://github.com/jadianes/data-journalism-python/tree/master/articles/tuberculosis-world-situation)
Where we analyse the situation of infectious tuberculosis from 1990 to 2007 using WHO datasets.
#### [A visual on the evolution of Tuberculosis](https://github.com/jadianes/data-journalism-python/tree/master/articles/tuberculosis-evolution-visual/tuberculosis-evolution-visual.ipynb)
Where we use Bokeh to represent the same dataset with a simple heatmap and look for visual clues.
#### [Wine market analysis](https://github.com/jadianes/data-journalism-python/tree/master/articles/wine-market/wine-market.ipynb)
Where we show how to use a RESTful like API to get JSON data using [`Requests`](http://www.python-requests.org/en/latest/), save JSON data into a file, doing Exploratory Data Analysis using [`Pandas`](http://pandas.pydata.org/), and generating a data visualisation using [`Seaborn`](http://stanford.edu/~mwaskom/software/seaborn/) and [`mpld3`](http://mpld3.github.io/). All this in order to explore [**Wine.com**](http://www.wine.com/) catalog and get an impression of what the wine market looks like for them.
## Contributing
Contributions are welcome! For bug reports or requests please [submit an issue](https://github.com/jadianes/data-journalism-python/issues).
## Contact
Feel free to contact me to discuss any issues, questions, or comments.
* Twitter: [@ja_dianes](https://twitter.com/ja_dianes)
* GitHub: [jadianes](https://github.com/jadianes)
* LinkedIn: [jadianes](https://www.linkedin.com/in/jadianes)
* Website: [jadianes.me](http://jadianes.me)## License
This repository contains a variety of content; some developed by Jose A. Dianes, and some from third-parties. The third-party content is distributed under the license provided by those parties.
The content developed by Jose A. Dianes is distributed under the following license:
Copyright 2016 Jose A Dianes
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License athttp://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.