Open Data Sets

One of the topics I will be writing about “soon” is about the work that my workgroup at CERN did on data analysis tools and open data.

There are numerous open data sets out on the web for those interested in learning how to do data analysis (data science, big data analysis, etc.). For those interested in particle physics, CERN provides a wealth of data at CERN Open Data Portal. Looking for local data sets is becoming as easy as googling “[your state] Open Data” or “traffic data [your city]”.

Another resource I have recently seen on HackerNews is Kaggle.com data sets . This appears to be a repository or collection of links to datasets that may be of use for those learning about data science. Kaggle.com itself sounds like a potentially useful training site for data analysis, but I’ve only just started to check it out.