logo

Data Wrangling (Or Cleaning) With OpenRefine

  • Pre-workshop activities: 15 min
  • Introductory presentation: 15 min
  • Hands-on activities: 60 min

Why OpenRefine?

Anyone who has worked with large or historic datasets will know that the process of getting data into uniform and usable formats can be harder than the actual analysis. If you’re tired of spreadsheet headaches, you’ve come to the right workshop. OpenRefine is a free software that allows users to efficiently clean and transform their datasets, allowing for more time to be spent on meaningful analysis.

Learning objectives

At the end of this workshop, you will be able to:

  1. Understand the importance of data cleaning
  2. Conduct key data cleaning practices using OpenRefine:
    • Analyzing the occurrence of values throughout a data set
    • Clustering and standardizing values
    • Separating multiple values contained in the same field
    • Joining multiple values contained in separate fields

NEXT STEP: Pre-Workshop Activities