OpenRefine

A free, open source power tool for working with messy data and improving it

OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.

Using OpenRefine, by Ruben Verborgh and Max De Wilde, offers a great introduction to OpenRefine.

Another introduction in German is Datentransformation mit OpenRefine, part of a seminar for librarians.

Visit the official web site for videos and full documentation.

OpenRefine

DOI Join the chat at https://gitter.im/OpenRefine/OpenRefine Java CI Coverage Status Translation progress Total alerts

OpenRefine is a Java-based power tool that allows you to load data, understand it, clean it up, reconcile it, and augment it with data coming from the web. All from a web browser and the comfort and privacy of your own computer.

Download

Run from source

If you have cloned this repository to your computer, you can run OpenRefine with:

  • ./refine on Mac OS and Linux
  • refine.bat on Windows

This requires JDK 8 and Apache Maven.

Documentation and Videos

Contributing to the project

Contact us

Licensing and legal issues

OpenRefine is open source software and is licensed under the BSD license located in the LICENSE.txt. See the folder licenses for information on open source libraries that OpenRefine depends on.

Credits

This software was created by Metaweb Technologies, Inc. and originally written and conceived by David Huynh dfhuynh@google.com. Metaweb Technologies, Inc. was acquired by Google, Inc. in July 2010 and the product was renamed Google Refine. In October 2012, it was renamed OpenRefine as it transitioned to a community-supported product.

See AUTHORS.md for the list of OpenRefine contributors and CONTRIBUTING.md for instructions on how to contribute yourself.

Loading ...
Go back