Quantcast
Viewing all articles
Browse latest Browse all 49199

Could the data toolkit be a starting point?

A note was posted to the Sunlight GoogleGroup list recently pointing to:

http://www.datasciencetoolkit.org/

A free, self-contained, ready-to-run VM with apps+data for

  • geocoding (using geocoder.us. I'm assuming TIGER and OS data)
  • geodict "pulls country, city and region names from unstructured English text"
  • political boundary lookup (various sources)
  • lots of data processing tools, eg pdf ->text
  • more

It doesn't look like a data warehouse, but it looks like a natural complement to one, and has that "easy to install and use" approach that makes it very interesting. Of course that's "easy for the data hacker". :)

Thoughts?

Brian


Viewing all articles
Browse latest Browse all 49199

Trending Articles