Data Wrangling (Or Cleaning) With OpenRefine

  • Pre-workshop activities: 15 min
  • Introductory presentation: 15 min
  • Hands-on activities: 60 min

Why OpenRefine?

Anyone who has worked with large or historic datasets knows that standardizing data can be more time-consuming and difficult than the actual analysis. OpenRefine is free software that lets users clean and transform their datasets efficiently, leaving more time for meaningful analysis and fewer spreadsheet headaches.

Learning objectives

At the end of this workshop, you will be able to:

  1. Understand the importance of data cleaning
  2. Conduct key data cleaning practices using OpenRefine:
    • Analyzing the occurrence of values throughout a dataset
    • Clustering and standardizing values
    • Separating multiple values contained in the same field
    • Joining multiple values contained in separate fields

NEXT STEP: Pre-Workshop Activities