Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/markpbaggett/whitebread

A simple Python library for mundane Fedora 3.8 tasks.
https://github.com/markpbaggett/whitebread

Last synced: 2 months ago
JSON representation

A simple Python library for mundane Fedora 3.8 tasks.

Host: GitHub
URL: https://github.com/markpbaggett/whitebread
Owner: markpbaggett
Archived: true
Created: 2018-03-27T20:25:32.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2020-02-03T15:50:30.000Z (over 4 years ago)
Last Synced: 2024-04-18T09:24:33.049Z (3 months ago)
Language: Python
Size: 194 KB
Stars: 3
Watchers: 3
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md

Lists

islandora_awesome - Whitebread - Simple python scripts for those everyday, mundane Fedora tasks. This script can do the following tasks; Count Matching Objects, Harvest Metadata, download all of a specified datastreams found in all of the object within a specified collection, Update GSearch, Update fgsLabel, Purge All But the Newest Version of a Datastream(cleanup), Check MimeType of the Preservation Object, Find matching objects missing a specific datastream, Download FOXML for matching documents, Find Books that are bad or aren't done processing, Get the History of a Datastream, Get Version of Datastream at a Specfic Date, and Get All Versions of a Datastream. (The Islandora 7.x-1.x List / Other)

README

        # Whitebread Fedora

**Note**: This project has been deprecated and is now archived in favor of a statically compiled application built in nim. 

For an actively developed and maintained project that has 95% of the functionality here plus much more, please see 

[moldybread](https://github.com/markpbaggett/moldybread).  Compilied binaries for Linux x86 can be found in the

[releases](https://github.com/markpbaggett/moldybread/releases) section.  If you're using another another architecture

instructions for compiling can be found in the 

[README](https://github.com/markpbaggett/moldybread/blob/master/README.rst). All other documentation can be found online

in the project's [documenation respository](https://markpbaggett.github.io/moldybread/moldybread.html).

Simple python scripts for those everyday, mundane Fedora tasks.

## Count Matching Objects

```

>>> python run.py -o count_objects -dc type -dcs StillImage

```

```

>>> python run.py -o count_objects -p vanvactor

```

## Harvest Metadata!

**By Matching Parent Namespace:**

```

>>> python run.py -p smhc -o harvest_metadata

```

**By Dublincore Field Matching:**

```

>>> python run.py -o harvest_metadata -dc rights -dcs "In Copyright"

```

**Override Default DSID without Touching YAML:**

```

>>> python run.py -o harvest_metadata -p smhc -ds DC

```

## Harvest Metadata but Ignore Records for Pages

```

>>> python run.py -o harvest_metadata_no_pages -p smhc -ds MODS

```

## Grab Images

**Like above, but for image mime types**

```

>>> python run.py -p smhc -o grab_images -ds JP2

```

## Download Binaries!

**Just like above examples but with a different operator (use for things that aren't images or test).**

```

>>> python run.py -o grab_other -p smhc -ds PDF

```

## Update GSearch!

**You guessed it!**

```

>>> python run.py -o update_gsearch -p smhc

```

## Update GSearch for all objects that aren't pages

```

>>> python run.py -o update_gsearch_no_pages -p smhc

```

## Update fgsLabel!

**Like above, but drop in an xpath value to match on. Only works with mods right now."**

```

>>> python run.py -o update_labels -p swim -xp "//mods:titleInfo[@supplied='yes']/mods:title"

```

## Purge All But the Newest Version of a Datastream

```

>>> python run.py -o purge_old_dsids -p vanvactor -ds MODS

```

## Check MimeType of the Preservation Object

```

>>> python run.py -o test_obj_mimes -p vanvactor

```

## Find matching objects missing a specific datastream

```

>>> python run.py -o find_missing -p vanvactor -ds PDF

```

## Download FOXML for matching documents

```

>>> python run.py -o grab_foxml -p vanvactor

```

## Find Books that are bad or aren't done processing

```

>>> python run.py -o find_bad_books -p vanvactor

```

## Get the History of a Datastream

```

>>> python run.py -o get_history -ds MODS -p vanvactor

```

## Get Version of Datastream at a Specfic Date

```

>>> python run.py -o get_datastream_at_date -ds MODS -p vanvactor -d 2017-09-01

```

## Get All Versions of a Datastream

```

>>> python run.py -o get_all_versions_of_datastream -ds MODS -p vanvactor

```

## Get Datastream Report

```

>>> python run.py -o get_datastream_report -p test

```