A free, open source power tool for working with messy data and improving it
OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
Using OpenRefine, by Ruben Verborgh and Max De Wilde, offers a great introduction to OpenRefine.
Another introduction in German is Datentransformation mit OpenRefine, part of a seminar for librarians.
Visit the official web site for videos and full documentation.
OpenRefine is a Java-based power tool that allows you to load data, understand it, clean it up, reconcile it, and augment it with data coming from the web, all from a web browser and the comfort and privacy of your own computer.
Run from source
If you have cloned this repository to your computer, you can run OpenRefine with:
./refine on Mac OS and Linux
Documentation and Videos
Contributing to the project
Licensing and legal issues
OpenRefine is open source software, licensed under the BSD license found in LICENSE.txt. See the licenses folder for information on the open source libraries that OpenRefine depends on.
This software was created by Metaweb Technologies, Inc. and originally written and conceived by David Huynh. Metaweb Technologies, Inc. was acquired by Google, Inc. in July 2010 and the product was renamed Google Refine. In October 2012, it was renamed OpenRefine as it transitioned to a community-supported product.