Before analyzing or visualizing data we’ve collected, it’s important to clean and prepare it. Typical tasks include fixing spelling, removing duplicate rows, finding and replacing text, changing number formats and merging and splitting columns.
This workshop will familiarize you with OpenRefine, a powerful data manipulation tool that cleans, reshapes and edit batches of messy and unstructured data. When using OpenRefine to clean and transform data, you can easily facet, cluster, edit cells, reconcile and extend web services to convert a dataset to a more structured format.
By the end of this workshop, you will be able to explain the importance of cleaning your data and complete basic data cleaning tasks using OpenRefine.