20  Data challenge

For this data challenge, I’ll ask you to work with a real-world cultural dataset, which is in need of some data cleaning.

You’re asked to carry out the whole process, from start to finish, including importing the raw data file and exporting the result.

The dataset is a record of performances and attendances at performing arts venues in the Netherlands, from 1999 to 2023. It’s a good example of the kinds of things you’ll often need to do to datasets in order to make them useable for visualisation and analysis. You can find the dataset here. click on Original Dataset on the left hand side, and then click on Download CSV.

Do the following in the Posit cloud workspace.

Upload the dataset from your computer to Posit cloud.

Import the dataset to your environment with a suitable name.

Take a look at the dataset.

The basic cleaning steps are to:

To make this dataset tidy, we need to do a number of things:

Some final steps before exporting:

Finally, use the data…