Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/soodoku/meta_ps
Production and Consumption of APSR, BJPS, Perspectives, PS, and World Politics Articles
https://github.com/soodoku/meta_ps
journal metadata
Last synced: 13 days ago
JSON representation
Production and Consumption of APSR, BJPS, Perspectives, PS, and World Politics Articles
- Host: GitHub
- URL: https://github.com/soodoku/meta_ps
- Owner: soodoku
- Created: 2015-08-01T19:44:17.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2023-06-12T06:49:25.000Z (over 1 year ago)
- Last Synced: 2024-10-11T12:17:03.985Z (27 days ago)
- Topics: journal, metadata
- Language: R
- Homepage:
- Size: 10.3 MB
- Stars: 14
- Watchers: 8
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: ReadMe.md
Awesome Lists containing this project
README
## Trends in Production and Consumption of Political Science Articles
Data are from five prominent political science journals: APSR, BJPS, Perspectives, PS, and World Politics.
### Table of Contents:
* [Preview of Results](#results)
* [Scraping the (meta) data](#get-the-meta-data)
* [The data and the codebook](#data)
* [Scripts for producing the graphs and graphs](#analyze-the-data)
* **Related:** [Proportion of precise quantitative statements in APSR abstracts](https://github.com/soodoku/quant-discipline)----
### Some Results
The data show that till well into the 1960s, the average article published in the [APSR](http://journals.cambridge.org/action/displayJournal?jid=PSR) was solo-authored. Today, co-authored papers are the norm.
![No. of authors over time](figs/n_authors_per_article_over_time.png)
Over the past 100 or so years, [article length](figs/article_length.pdf) has shown marked variability. There is a marked see-saw pattern in the average length of the article, but unlike top economics journals we don't see a marked trend towards longer articles. It is very likely, however, that the length of online appendices has grown substantially.
Number of views an [article](figs/fulltext_views.pdf) or [abstract](figs/abstract_views.pdf) has received follows the familiar power law distribution with most articles receiving very few views.
-----
### Get the (Meta) Data
All the data are from journals published by the Cambridge University Press (CUP). CUP has a single publishing platform for all its journals with the link differing only by 'JID'. For instance, to scrape meta data and abstract for all APSR articles from CUP, use [(http://journals.cambridge.org/action/displayBackIssues?jid=PSR](http://journals.cambridge.org/action/displayBackIssues?jid=PSR). The JIDs for various journals are easily found. They can also be scraped from the CUP page listing all the journals that it publishes.
To scrape the data, use [get_data.py](scripts/get_data.py). The script depends on `urllib2`. To run the script, `python get_data.py`
**Options:**
* Name of the output file. Specify `FINAL_OUTPUT_FILE` on line 11 of [get_data.py](scripts/get_data.py).
* Column names. Specify `HEADER` on Line 18 of [get_data.py](scripts/get_data.py).**Note:** The script allows for interruption. If interrupted, it will restart from where it stopped. And it will append the results to the existing output file.
### Data
* [APSR](data/apsr_data.csv)
* [BJPS](data/bjps_data.csv)
* [Perspectives](data/pps_data.csv)
* [PS](data/psc_data.csv)
* [World Politics](data/wpo_data.csv)
Each row in the csv is a separate article. And the columns are:
* article.url
* issue.year
* issue.volume
* issue.date.of.publication
* issue.pages
* article.title
* article.abstract
* article.pages
* 10 colums e.g author1, institution1, etc, ...,author1, institution1
* article.abstract.views
* article.full.text.views### Analyze the Data
* [Recode the data](scripts/meta_apsr.R)
* Article lengths (measured by number of pages) over time. ([Script](scripts/article_length.R), [Graph](figs/n_pages_per_article_over_time.pdf))
* Number of authors per article over time. ([Script](scripts/n_authors.R), [Graph](figs/n_authors_per_article_over_time.pdf))
* Number of articles per issue over time. ([Script](scripts/articles_per_issue.R), [Graph](figs/articles_per_issue_over_time.pdf))
* Number of pages per issue over time. ([Script](scripts/issue_length.R), [Graph](figs/pages_per_issue_over_time.pdf))
* Distribution of full text views. ([Script](scripts/fulltext_views.R), [Graph](figs/fulltext_views.pdf))
* Distribution of abstract views. ([Script](scripts/abstract_views.R), [Graph](figs/abstract_views.pdf))
* Proportion of Female Authors per article. ([Script](scripts/gender_authors.R), [Graph](figs/gender_authors_per_article_over_time.pdf))
* Based on the idea by [titleogy](http://datacolada.org/2013/12/04/titleogy/), Length of title. ([Script](scripts/title_length.R), [Graph](figs/title_len_over_time.pdf))### License
Released under [CC BY 2.0](https://creativecommons.org/licenses/by/2.0/).