https://github.com/creativecommons/quantifying

quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain
https://github.com/creativecommons/quantifying

Last synced: 10 months ago
JSON representation

quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain

Host: GitHub
URL: https://github.com/creativecommons/quantifying
Owner: creativecommons
License: mit
Created: 2022-09-07T17:02:39.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2024-12-13T20:42:13.000Z (over 1 year ago)
Last Synced: 2024-12-13T21:26:55.071Z (over 1 year ago)
Language: Jupyter Notebook
Homepage:
Size: 80 MB
Stars: 24
Watchers: 8
Forks: 33
Open Issues: 9
Metadata Files:
- Readme: README.md
- Changelog: history.md
- License: LICENSE
- Codeowners: .github/CODEOWNERS

Awesome Lists containing this project

README

          # quantifying

Quantifying the Commons

## Overview

This project seeks to quantify the size and diversity of the commons--the

collection of works that are openly licensed or in the public domain.

## Code of conduct

[`CODE_OF_CONDUCT.md`][org-coc]:

> The Creative Commons team is committed to fostering a welcoming community.

> This project and all other Creative Commons open source projects are governed

> by our [Code of Conduct][code_of_conduct]. Please report unacceptable

> behavior to [conduct@creativecommons.org](mailto:conduct@creativecommons.org)

> per our [reporting guidelines][reporting_guide].

[org-coc]: https://github.com/creativecommons/.github/blob/main/CODE_OF_CONDUCT.md

[code_of_conduct]: https://opensource.creativecommons.org/community/code-of-conduct/

[reporting_guide]: https://opensource.creativecommons.org/community/code-of-conduct/enforcement/

## Contributing

See [`CONTRIBUTING.md`][org-contrib].

[org-contrib]: https://github.com/creativecommons/.github/blob/main/CONTRIBUTING.md

### Project structure

Please note that in the directory tree below, all instances of `fetch`,

`process`, and `report` are referring to the three phases of data gathering,

processing, and report generation.

```

Quantifying/

├── .github/

│   ├── workflows/

│   │   ├── fetch.yml

│   │   ├── process.yml

│   │   ├── report.yml

│   │   └── static_analysis.yml

├── data/  # Data generated by script runs

│   ├── 20XXQX/

│   │   ├── 1-fetch/

│   │   ├── 2-process/

│   │   ├── 3-report/

│   │   │   └── README.md  # All generated reports are displayed in the README

│   └── ...

├── dev/

├── pre-automation/  # All Quantifying work prior to adding automation system

├── scripts/  # Run scripts for all phases

│   ├── 1-fetch/

│   ├── 2-process/

│   ├── 3-report/

│   └── shared.py

├── .cc-metadata.yml

├── .flake8  # Python tool configuration

├── .gitignore

├── .pre-commit-config.yaml  # Static analysis configuration

├── LICENSE

├── Pipfile  # Specifies the project's dependencies and Python version

├── Pipfile.lock

├── README.md

├── env.example

├── history.md

├── pyproject.toml  # Python tools configuration

└── sources.md

```

## Development

### Prerequisites

For information on learning and installing the prerequisite technologies for

this project, please see [Foundational technologies — Creative Commons Open

Source][found-tech].

This repository uses [pipenv][pipenvdocs] to manage the required Python

modules:

1. Install `pipenv`:

   - Linux: [Installing Pipenv][pipenvinstall]

   - macOS:

     1. Install [Homebrew][homebrew]

     2. Install pipenv:

        ```shell

        brew install pipenv

        ```

   - Windows: [Installing Pipenv][pipenvinstall]

2. Create the Python virtual environment and install prerequisites using

   `pipenv`:

    ```shell

    pipenv sync --dev

    ```

[found-tech]: https://opensource.creativecommons.org/contributing-code/foundational-tech/

[pipenvdocs]: https://pipenv.pypa.io/en/latest/

[pipenvinstall]: https://pipenv.pypa.io/en/latest/installation/

[homebrew]: https://brew.sh/

### Running scripts that require client credentials

To successfully run scripts that require client credentials, you will need to

follow these steps:

1. Copy the contents of the `env.example` file in the script's directory to

   `.env`:

    ```shell

    cp env.example .env

    ```

2. Uncomment the variables in the `.env` file and assign values as needed. See

   [`sources.md`](sources.md) on how to get credentials:

    ```

    GCS_DEVELOPER_KEY = your_api_key

    GCS_CX = your_pse_id

    ```

3. Save the changes to the `.env` file.

4. You should now be able to run scripts that require client credentials

   without any issues.

### Static analysis

Static analysis tools ensure the codebase adheres to consistent formatting and

style guidelines, enhancing readability and maintainability. Also see GitHub

Actions, below.

#### Using [`pre-commit`][pre-commit]

Pre-commit allows for static analysis tools (`black`, `flake8`, `isort`, etc.)

to be run manually or with every commit:

1. (Pre-commit is installed by completing Create the Python virtual environment

   and install prerequisites, above)

2. Install or run manually

   - Install the git hook scripts to enable automatic execution on every commit

       ```shell

       pipenv run pre-commit install

       ```

   - Run manually using helper dev script:

       ```shell

       ./dev/check.sh [FILE]

       ```

     If no file(s) are specified, then it runs against all files:

       ```shell

       ./dev/check.sh

       ```

3. _(Optional)_ review the configuration file:

   [`.pre-commit-config.yaml`](.pre-commit-config.yaml)

[pre-commit]: https://pre-commit.com/

### Resources

- **[Python Guidelines — Creative Commons Open Source][ccospyguide]**

- [Black][black]: _the uncompromising Python code formatter_

- [flake8][flake8]: _a python tool that glues together pep8, pyflakes, mccabe,

  and third-party plugins to check the style and quality of some python code._

- [isort][isort]: _A Python utility / library to sort imports_

  - (It doesn't import any libraries, it only sorts and formats them.)

- [ppypa/pipenv][pipenv]: _Python Development Workflow for Humans._

- [pre-commit][pre-commit]: _A framework for managing and maintaining

  multi-language pre-commit hooks._

- [Logging][logging]: _Utilize the built-in Python logging module to implement a flexible logging system from a shared module._

[ccospyguide]: https://opensource.creativecommons.org/contributing-code/python-guidelines/

[black]: https://github.com/psf/black

[flake8]: https://github.com/PyCQA/flake8

[isort]: https://pycqa.github.io/isort/

[pipenv]: https://github.com/pypa/pipenv

[pre-commit]: https://pre-commit.com/

[logging]: https://docs.python.org/3/library/logging.html

### GitHub Actions

The [`.github/workflows/python_static_analysis.yml`][workflow-static-analysis]

GitHub Actions workflow performs static analysis (`black`, `flake8`, and

`isort`) on committed changes. The workflow is triggered automatically when you

push changes to the main branch or open a pull request.

[workflow-static-analysis]: .github/workflows/static_analysis.yml

## Data sources

Kindly visit the [`sources.md`](sources.md) file for it.

## History

For information on past efforts, see [`history.md`](history.md).

## Copying & license

### Code

[`LICENSE`](LICENSE): the code within this repository is licensed under the

Expat/[MIT][mit] license.

[mit]: http://www.opensource.org/licenses/MIT "The MIT License | Open Source Initiative"

### Data

[![CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

button][cc-zero-png]][cc-zero]

The data within this repository is dedicated to the public domain under the

[CC0 1.0 Universal (CC0 1.0) Public Domain Dedication][cc-zero].

[cc-zero-png]: https://licensebuttons.net/l/zero/1.0/88x31.png "CC0 1.0 Universal (CC0 1.0) Public Domain Dedication button"

[cc-zero]: https://creativecommons.org/publicdomain/zero/1.0/

### Documentation

[![CC BY 4.0 license button][cc-by-png]][cc-by]

The documentation within the project is licensed under a [Creative Commons

Attribution 4.0 International License][cc-by].

[cc-by-png]: https://licensebuttons.net/l/by/4.0/88x31.png#floatleft "CC BY 4.0 license button"

[cc-by]: https://creativecommons.org/licenses/by/4.0/ "Creative Commons Attribution 4.0 International License"

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/creativecommons/quantifying

Awesome Lists containing this project

README