image description

RStudio Data Manipulation & Visualization

  • Pre-workshop activities: 10 min
  • Introductory presentation: 10 min
  • Hands-on activities: 80 min

Why R and Rstudio?

R is a free, open-source software and programming language for statistical analysis. It is available for Windows, Mac, and Linux. It allows for reproducible statistical analysis, including advanced methods, as well as the production of publication-quality graphs. Because it is free and open-source, users can also develop their methods and algorithms and release for others to use through the cration of packages.

RStudio is a free, open-source integrated development environment to facilitate the use of R (although it can also include other coding languages such as Python). RStudio also offers integration with other tools for reproducible research and data science such as Git, GitHub and Quarto.

“RStudio does not replace R: You must install R before you can install or use RStudio. Instead, RStudio enhances the R programming experience with helpful features such as code completion, syntax highlighting, graph and table previews, and more. RStudio’s interface is organized so that the user can clearly view graphs, data tables, R code, and output all at the same time.” - Kent State University Libraries

Who is this workshop for?: Users who are familiar with basic R concepts covered in the Data Analysis with R Studio - Introduction to R and summary statistics. workshop. In this workshop, we focus on how to check and clean your data, as well as how to vizualize your data. We will not cover how to perform statistical analysis, which are covered in the Data Analysis with R Studio - Introduction to R and summary statistics. and Data Analysis with R Studio - Intermediate data analysis workshops.

Learning objectives

At the end of this workshop, you will be able to:

  1. Execute the main steps of importing, checking, and cleaning your data
  2. Clean and manipulate data with the tidyverse and janitor packages
  3. Check and validate your data using the assertr package
  4. Create basic charts and plots (histogram, boxplot, scatter, bar and line plots) with the ggplot2 package

NEXT STEP: Pre-Workshop Activities