mydata dataset - Data Exploration and Cleanup

Load Data

Read in the CSV file using the readr package

library(readr)
mydata <- read_csv("mydata.csv")

Take a look at the dataset

Type View(mydata) at the prompt or run interactively from an R script or Rmarkdown document.

I will also be handing out a hard copy of these data so you can look at the data, the values and any patterns that emerge.

View(mydata)

Also take a look at the “codebook”

Feel free to click on links below or right click and “SaveAs” to save the file on your computer.

Open discussion on what you “see”

There are 11-12 problems with this little dataset. Which ones do you see?

  1. How might we use code to FIND (discover) these problems?

  2. How might we use code to FIX (correct) these problems?

Code to find and fix these issues

Cleanup Code