https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors' disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
- Host: GitHub
- URL: https://github.com/prangonghose/wikipedia-blocking-policies
- Owner: PrangonGhose
- License: gpl-3.0
- Created: 2024-02-23T09:43:08.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-03-01T10:13:38.000Z (12 months ago)
- Last Synced: 2024-12-08T20:49:15.334Z (2 months ago)
- Topics: data-analysis, data-visualization, matplotlib, open-source, pandas, python3, seaborn
- Language: Jupyter Notebook
- Homepage:
- Size: 1.6 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Wikipedia Blocking Policies
# Table of Contents
- [About the Project](#about-project)
  - [Built With](#built-with)
    - [Key Features](#key-features)
- [Getting Started](#getting-started)
  - [Setup](#setup)
  - [Prerequisites](#prerequisites)
  - [Install](#install)
- [Authors](#authors)
- [Contributing](#contributing)
- [Future Features](#future-features)
- [Show your support](#support)
- [Acknowledgements](#acknowledgements)

# About the Project
## Wikipedia Blocking Policies
This study investigates the relationship between editors' disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction. The study tests two hypotheses:
- Evolving policies do not reduce editor blocks and reblocks, and
- Biased treatment and vague punishment policies lead to more disruptive behavior.

The study finds evidence to support both hypotheses and discusses the implications for online governance and policymaking. The study also examines the diversity of language preferences and perspectives among editors and administrators, and suggests future directions for research.
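The data collection step mentioned above could look roughly like the following sketch, which pages through the revision history of the policy page with the MediaWiki Action API. This is not the authors' notebook code: the page title `Wikipedia:Blocking policy`, the chosen `rvprop` fields, the `User-Agent` string, and all function names are assumptions made for illustration.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"
PAGE_TITLE = "Wikipedia:Blocking policy"  # assumed title of the policy page


def build_revision_params(title, start="2004-01-01T00:00:00Z",
                          end="2022-12-31T23:59:59Z"):
    """Build query parameters for a page's revision history (hypothetical helper)."""
    return {
        "action": "query",
        "format": "json",
        "prop": "revisions",
        "titles": title,
        "rvprop": "timestamp|user|comment|size",
        "rvlimit": "max",
        "rvdir": "newer",  # oldest first, so rvstart is the earlier bound
        "rvstart": start,
        "rvend": end,
    }


def parse_revisions(response):
    """Flatten one API response into a list of plain revision dicts."""
    rows = []
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        for rev in page.get("revisions", []):
            rows.append({
                "timestamp": rev.get("timestamp"),
                "user": rev.get("user"),
                "comment": rev.get("comment", ""),
                "size": rev.get("size"),
            })
    return rows


def fetch_all_revisions(title):
    """Follow the API's 'continue' tokens until the history is exhausted."""
    params = build_revision_params(title)
    rows = []
    while True:
        url = API_URL + "?" + urllib.parse.urlencode(params)
        request = urllib.request.Request(
            url, headers={"User-Agent": "wikipedia-blocking-policies-example/0.1"}
        )
        with urllib.request.urlopen(request) as reply:
            data = json.load(reply)
        rows.extend(parse_revisions(data))
        if "continue" not in data:
            return rows
        params.update(data["continue"])
```

Calling `fetch_all_revisions(PAGE_TITLE)` requires network access and returns one dict per revision, which can then be loaded into a `pandas` DataFrame for the kind of analysis the study describes.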
**Detailed report on this project is available [here](https://docs.google.com/document/d/1xK5NjvH3bmb-nX4b5aHY3u800HZNGac5_eHZ0wO_3-k/edit?usp=sharing).**
## Built With
- `Python3`, `Pandas`, `NumPy`, `Matplotlib`, and `Seaborn`
### Key Features
- This project traces the overall evolution of community standards on Wikipedia.
- The authors carried out both qualitative and quantitative analysis of the data to test their two hypotheses.
- The project is open access and free for everyone to use.
- The collected data is also stored in the authors' Google Drive, from which anyone can download it.
- The project includes all code for data collection, data cleaning, and data visualization.
- The notebook also includes the authors' primary analysis.
- The project report is linked in the [project description](#about-project).
- A [requirements.txt](requirements.txt) file in the repository lets users install all the `Python3` packages the project needs in one step.
## Getting Started
To clone the repository to your local environment, follow these steps.
### Prerequisites
- [x] A web browser such as Google Chrome.
- [x] A code editor such as Visual Studio Code, with Git installed.

You can check whether Git is installed by running the following command in the terminal.
```
$ git --version
```
Likewise, check `python` and `pip`, which are needed for package installation.
```
$ python --version
$ pip --version
```
### Setup
Clone the repository using [this link](https://github.com/PrangonGhose/wikipedia-blocking-policies.git).
### Install
In the terminal, navigate to the directory where you want to keep the project and run this command.
```
$ git clone https://github.com/PrangonGhose/wikipedia-blocking-policies.git
```
### Get into development
In the terminal, run these commands to get into development.
```
$ cd wikipedia-blocking-policies
$ pip install -r requirements.txt
```
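After installing the requirements, a quick check along these lines can confirm that the main packages are importable before opening the notebook. The package list is taken from the libraries this README names, not from the actual contents of `requirements.txt`, and `missing_packages` is a hypothetical helper.

```python
from importlib.util import find_spec

# Import names of the libraries named in this README (assumed, not read
# from requirements.txt).
REQUIRED = ["pandas", "numpy", "matplotlib", "seaborn"]


def missing_packages(names):
    """Return the subset of top-level package names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are installed.")
```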
The required data folder can be downloaded manually from [here](https://drive.google.com/uc?id=1WHFJ6wm4ElKH4KIAsa3A1ph4Bt34VYsH) and then unzipped. Alternatively, this step can be done automatically later, while working through the project notebook.
Now you can open the [wikipedia notebook](wikipedia.ipynb) and continue with the project.
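For readers who would rather script the manual data download described above, the following is a minimal sketch using only the standard library. It is not the notebook's own download code; the local file name `data.zip`, the target folder `data`, and both helper functions are assumptions. Note also that Google Drive may return a confirmation page instead of the file for large downloads, in which case the manual route is more reliable.

```python
import urllib.request
import zipfile
from pathlib import Path

# Link given in this README for the data folder.
DATA_URL = "https://drive.google.com/uc?id=1WHFJ6wm4ElKH4KIAsa3A1ph4Bt34VYsH"


def download_archive(url, archive_path):
    """Download the archive unless a local copy already exists."""
    archive_path = Path(archive_path)
    if not archive_path.exists():
        urllib.request.urlretrieve(url, archive_path)
    return archive_path


def extract_archive(archive_path, target_dir):
    """Unzip the archive into target_dir and return the sorted member names."""
    target_dir = Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target_dir)
        return sorted(archive.namelist())


if __name__ == "__main__":
    # "data.zip" and "data" are hypothetical local names.
    archive = download_archive(DATA_URL, "data.zip")
    print(extract_archive(archive, "data"))
```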
## Authors
**Prangon Ghose**
- GitHub: [@PrangonGhose](https://github.com/PrangonGhose)
- LinkedIn: [Prangon Ghose](https://www.linkedin.com/in/prangon-ghose/)

**Md Abeed Hasan**
- LinkedIn: [Md Abeed Hasan](https://www.linkedin.com/in/md-abeed-hasan-a09469119/)

**Yushu Gong**
- LinkedIn: [Yushu Gong](https://www.linkedin.com/in/yushu-gong-41b68b21b/)

**Ziqiao Zhang**

**Ziyi Chen**
## Contributing
Contributions, issues, and feature requests are welcome! Add suggestions by opening new issues.
Feel free to check the [issues page](https://github.com/PrangonGhose/wikipedia-blocking-policies/issues).
## Future Features
- Using editors' IP addresses from the Wikipedia API could reveal the locations where violations are most common.
- Gender data on administrators would make it possible to explore gender diversity and potential biases among administrators.
- Correlations between edit frequency, language preferences, and information disparities across Wikipedia pages in different languages deserve further investigation.
## Show your support
Give a ⭐️ if you like this project!
## Acknowledgements
The authors would like to thank:
- Mentor: [Andrea Giraldo Sevilla](https://www.linkedin.com/in/andrea-giraldo-sevilla/)
- Mentor: [Marc Santolini](https://marcsantolini.com/)
- [Wikipedia API](https://en.wikipedia.org/w/api.php)
- [XTools](https://xtools.wmcloud.org/)