Class: Introduction to OpenRefine
February 3, 2021 @ 2:00 pm - 3:30 pm
OpenRefine is a powerful, free, open-source tool for working with messy data. Its spreadsheet-like interface allows quick exploration of large datasets through features such as faceting and filtering. Where it really shines is in helping fix inconsistencies in your data, such as differences in spelling, date format, and capitalization.
This workshop will introduce the most powerful features of OpenRefine using a sample bibliographic dataset. Participants are encouraged to install the tool on their own computers before the workshop and follow along. We will conclude with a short discussion of use cases: participants are welcome to share examples of datasets from their own work that need to be cleaned up and to discuss how OpenRefine can help. There are no prerequisites for this session. Familiarity with regular expressions can be useful when applying more advanced text matching functions, but is not required. This workshop draws inspiration from the Library Carpentry OpenRefine lesson.
By the end of the session, participants will be able to:
- Utilize facets and filters to explore large datasets
- Identify inconsistencies in a dataset and correct them
- Combine advanced operations such as clustering and text matching to quickly edit large datasets
To register for this class, visit the LYRASIS website.