Category: The Outlier
-

OpenRefine Part 2: Removing duplicates and using version history
OpenRefine is one of The Outlier’s favourite tools when working with large datasets. This powerful open-source program is ideal for cleaning messy data. In this post, we focus on two essential features: removing duplicates and using version history to keep track of your changes.
-

OpenRefine Part 1: Installing and merging datasets
If you find yourself getting bogged down when you’re working with large datasets in Excel or Google Sheets, OpenRefine might be the solution you’ve been looking for.
