https://github.com/markpbaggett/whitebread
A simple Python library for mundane Fedora 3.8 tasks.
- Host: GitHub
- URL: https://github.com/markpbaggett/whitebread
- Owner: markpbaggett
- Archived: true
- Created: 2018-03-27T20:25:32.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2020-02-03T15:50:30.000Z (almost 5 years ago)
- Last Synced: 2024-08-05T19:35:32.663Z (5 months ago)
- Language: Python
- Size: 194 KB
- Stars: 3
- Watchers: 3
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- islandora_awesome - Whitebread - Simple Python scripts for those everyday, mundane Fedora tasks. The scripts can do the following: Count Matching Objects, Harvest Metadata, Download every copy of a specified datastream found in the objects within a specified collection, Update GSearch, Update fgsLabel, Purge All But the Newest Version of a Datastream (cleanup), Check MimeType of the Preservation Object, Find matching objects missing a specific datastream, Download FOXML for matching documents, Find Books that are bad or aren't done processing, Get the History of a Datastream, Get Version of Datastream at a Specific Date, and Get All Versions of a Datastream. (The Islandora 7.x-1.x List / Other)
README
# Whitebread Fedora
**Note**: This project has been deprecated and archived in favor of a statically compiled application built in Nim.
For an actively developed and maintained project that has 95% of the functionality here plus much more, please see
[moldybread](https://github.com/markpbaggett/moldybread). Compiled binaries for Linux x86 can be found in the
[releases](https://github.com/markpbaggett/moldybread/releases) section. If you're using another architecture,
instructions for compiling can be found in the
[README](https://github.com/markpbaggett/moldybread/blob/master/README.rst). All other documentation can be found online
in the project's [documentation repository](https://markpbaggett.github.io/moldybread/moldybread.html).

Simple Python scripts for those everyday, mundane Fedora tasks.
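
Every example in this README goes through `run.py` with the same handful of short flags. As a reading aid, here is a minimal sketch of that flag surface using `argparse`. Only the short flags (`-o`, `-p`, `-dc`, `-dcs`, `-ds`, `-xp`, `-d`) come from the examples; the long option names, `dest` names, and help texts are assumptions made for illustration and are not taken from whitebread's source.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the short flags used in this README."""
    parser = argparse.ArgumentParser(prog="run.py")
    # The operation to run, e.g. count_objects or harvest_metadata.
    parser.add_argument("-o", "--operation", dest="operation", required=True)
    # Namespace of the objects to match, e.g. smhc or vanvactor.
    parser.add_argument("-p", "--namespace", dest="namespace")
    # Dublin Core field and the string to match in it, e.g. type / StillImage.
    parser.add_argument("-dc", "--dc-field", dest="dc_field")
    parser.add_argument("-dcs", "--dc-string", dest="dc_string")
    # Datastream ID, e.g. MODS, DC, JP2, PDF.
    parser.add_argument("-ds", "--dsid", dest="dsid")
    # XPath used by update_labels.
    parser.add_argument("-xp", "--xpath", dest="xpath")
    # Date used by get_datastream_at_date, e.g. 2017-09-01.
    parser.add_argument("-d", "--date", dest="date")
    return parser


# Parse one of the README's own command lines from an explicit argument list.
args = build_parser().parse_args(["-o", "harvest_metadata", "-p", "smhc", "-ds", "DC"])
print(args.operation, args.namespace, args.dsid)
```

Because `-d`, `-dc`, `-dcs`, and `-ds` share a prefix, the sketch relies on `argparse` preferring an exact option match, so each flag resolves to its own destination.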
## Count Matching Objects
```
>>> python run.py -o count_objects -dc type -dcs StillImage
```
```
>>> python run.py -o count_objects -p vanvactor
```
## Harvest Metadata!
**By Matching Parent Namespace:**
```
>>> python run.py -p smhc -o harvest_metadata
```
**By Dublin Core Field Matching:**
```
>>> python run.py -o harvest_metadata -dc rights -dcs "In Copyright"
```
**Override Default DSID without Touching YAML:**
```
>>> python run.py -o harvest_metadata -p smhc -ds DC
```
## Harvest Metadata but Ignore Records for Pages
```
>>> python run.py -o harvest_metadata_no_pages -p smhc -ds MODS
```
## Grab Images
**Like above, but for image mime types**
```
>>> python run.py -p smhc -o grab_images -ds JP2
```
## Download Binaries!
**Just like the examples above, but with a different operator (use it for things that aren't images or text).**
```
>>> python run.py -o grab_other -p smhc -ds PDF
```
## Update GSearch!
**You guessed it!**
```
>>> python run.py -o update_gsearch -p smhc
```
## Update GSearch for all objects that aren't pages
```
>>> python run.py -o update_gsearch_no_pages -p smhc
```
## Update fgsLabel!
**Like above, but drop in an XPath value to match on. Only works with MODS right now.**
```
>>> python run.py -o update_labels -p swim -xp "//mods:titleInfo[@supplied='yes']/mods:title"
```
## Purge All But the Newest Version of a Datastream
```
>>> python run.py -o purge_old_dsids -p vanvactor -ds MODS
```
## Check MimeType of the Preservation Object
```
>>> python run.py -o test_obj_mimes -p vanvactor
```
## Find matching objects missing a specific datastream
```
>>> python run.py -o find_missing -p vanvactor -ds PDF
```
## Download FOXML for matching documents
```
>>> python run.py -o grab_foxml -p vanvactor
```
## Find Books that are bad or aren't done processing
```
>>> python run.py -o find_bad_books -p vanvactor
```
## Get the History of a Datastream
```
>>> python run.py -o get_history -ds MODS -p vanvactor
```
## Get Version of Datastream at a Specific Date
```
>>> python run.py -o get_datastream_at_date -ds MODS -p vanvactor -d 2017-09-01
```
## Get All Versions of a Datastream
```
>>> python run.py -o get_all_versions_of_datastream -ds MODS -p vanvactor
```
## Get Datastream Report
```
>>> python run.py -o get_datastream_report -p test
```
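
Several of the operations above work on datastream versions (`purge_old_dsids`, `get_datastream_at_date`, `get_all_versions_of_datastream`), but this README does not say how versions are chosen. The following is a minimal sketch of the selection logic one might expect, assuming each version can be reduced to a creation timestamp plus an identifier. The `Version` type, the function names, and the sample identifiers are hypothetical and are not whitebread's API.

```python
from datetime import datetime
from typing import List, NamedTuple, Optional


class Version(NamedTuple):
    """Hypothetical record for one version of a datastream."""
    created: datetime
    version_id: str


def version_at(versions: List[Version], as_of: datetime) -> Optional[Version]:
    """Return the latest version created on or before `as_of`, or None."""
    candidates = [v for v in versions if v.created <= as_of]
    return max(candidates, key=lambda v: v.created) if candidates else None


def purge_candidates(versions: List[Version]) -> List[Version]:
    """Return every version except the newest, oldest first."""
    ordered = sorted(versions, key=lambda v: v.created)
    return ordered[:-1]  # an empty or single-item list yields []


# Illustrative data only; these identifiers and dates are made up.
history = [
    Version(datetime(2017, 1, 1), "MODS.0"),
    Version(datetime(2017, 8, 15), "MODS.1"),
    Version(datetime(2018, 2, 1), "MODS.2"),
]
selected = version_at(history, datetime(2017, 9, 1))
to_purge = purge_candidates(history)
```

With this sample history, asking for the version as of 2017-09-01 selects the August 2017 entry, and the purge list contains the two older entries while the newest one is kept.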