Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/utdata/openrefine

A lesson on using OpenRefine
https://github.com/utdata/openrefine

Last synced: 29 days ago
JSON representation

A lesson on using OpenRefine

Awesome Lists containing this project

README

        

# Using OpenRefine

By Christian McDonald\
Associate Professor of Practice\
School of Journalism and Media, Moody College of Communication\
University of Texas at Austin

[OpenRefine](https://openrefine.org/) is a free, open source, powerful tool for working with messy data. Our most common use for it is to clean and transform data.

## Installation

Download and the latest stable release from the [OpenRefine Downloads](https://openrefine.org/download.html) page.

- For Windows, you might try the "Windows kit with embedded Java".
- For Mac, you might check out [these installation tips](installation.md) to assist.

While there is an OpenRefine application you launch, the program itself launches and uses your web browser for the interface.

## Demos

These are detailed demos with illustrated step-ty-step instructions of commonly-used features of OpenRefine. If you are here to learn how to use OpenRefine, start with these.

- [Using facets](demo-facets.md)
- [Clustering](demo-cluster.md)

## Case studies

These are some example use cases using OpenRefine. They are not tutorials, but discussions of how the tools was used with perhaps key points explained.

- [AHRQ diagnostic codes](case-ahrq.md) pulled from a PDF.
- [Austin State Hospital cemetery](case-ash.md) burials where records are on more than one line.

## Resources

- [OpenRefine.org](https://openrefine.org/) homepage has some demo videos to give you an idea how it works.
- The [site also has links](https://openrefine.org/documentation.html) to tutorials, FAQs and more, including:
- OpenRefine's [user manual](https://docs.openrefine.org/):
- [Expressions](https://docs.openrefine.org/manual/expressions)
- [GREL functions](https://docs.openrefine.org/manual/grelfunctions)
- [Foundations tutorial course](https://courses.tranzf.org/course/view.php?id=18)

## Other tools

OpenRefine is not the only tool like this. Some others you might find useful:

- [Trifacta](https://www.trifacta.com/data-preparation/), including a [Cloud Dataprep by Trifacta](https://cloud.google.com/dataprep)
- [Tableau prep](https://www.tableau.com/trial/tableau-prep)