https://github.com/swisscom/cf-scraper

A simple app which scrapes information about Cloud Foundry orgs
https://github.com/swisscom/cf-scraper

hacktoberfest

Last synced: over 1 year ago
JSON representation

A simple app which scrapes information about Cloud Foundry orgs

Host: GitHub
URL: https://github.com/swisscom/cf-scraper
Owner: swisscom
License: mit
Created: 2019-05-27T21:07:04.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2024-10-16T15:16:36.000Z (almost 2 years ago)
Last Synced: 2025-04-15T10:19:40.882Z (over 1 year ago)
Topics: hacktoberfest
Language: JavaScript
Homepage:
Size: 65.4 KB
Stars: 1
Watchers: 19
Forks: 3
Open Issues: 5
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md

Awesome Lists containing this project

README

# cf-scraper
A simple app which scrapes information about Cloud Foundry orgs

## How to deploy

### Prepare the service instances
`cf-scraper` gets the credentials for accessing the Cloud Foundry API via a service of type `secret-store`. Furthermore, it gets the list input orgs from an S3 service instance and also writes the scrape output to that S3 service instance.

1. Create a user in your Cloud Foundry instance with the role `cloud_controller.global_auditor`.
````
uaac user add $AUDITOR_USER_NAME --emails $AUDITOR_EMAIL;
uaac member add cloud_controller.global_auditor $AUDITOR_USERNAME;
````
2. Create a secrets store called `cf-api-credentials`.
````
cf cs secrets-store json cf-api-credentials -c '{"username": "'$AUDITOR_USER_NAME'", "password": "'$AUDITOR_PASSWORD'"}'
````
3. Create an S3 service instance named `orgs-store`.
````
cf cs dynstrg-2 usage orgs-store
````

### Load the input orgs
The scraper uses a file called `input/input-orgs.json` in the `orgs-store` instance. The list can be changed at any time. The scraper will pick up the latest version when it starts the next run.

4. Prepare the file `input-orgs.json` to contain an array of org names.
````
[
"org-1",
"org-2",
"org-3",
...,
"org-n"
]
````

5. Upload the file to `orgs-store/input`, for example using [mc](https://github.com/minio/mc).

### Adapt the schedule
The scraper runs as a scheduled task. The schedule is defined as a cron expression in the environment variable `SYNC_SCHEDULE`.

6. Open `manifest.yml` and set `SYNC_SCHEDULE` to the desired cron expression (e.g. `*/15 * * * *` for "at every 15th minute").

### Push the app
7. Everything else is self configuring. Just push the app.
````
cf push
````

## Collect the scrape result
The scraper uploads the result of a scrape run to `orgs-store/output/scrape-result.json`. Before starting the upload, a backup copy of the previous result is made called `scrape-result-backup.json`.

8. Download `orgs-store/output/scrape-result.json`, for example using [mc](https://github.com/minio/mc).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/swisscom/cf-scraper

Awesome Lists containing this project

README