Talk:Open Data

From Wikimania 2014 • London, United Kingdom

Things to add to the article

visualisations built on wikipedia http://seealso.org/
data sources on wikimedia https://meta.wikimedia.org/wiki/Research:Data

Big data

Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis, and visualization.

Open Data

Open data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. The goals of the open data movement are similar to those of other "Open" movements such as open source, open hardware, open content, and open access.

The Open Data movement has enabled the increase in transparency in traditionally closeted diplomatic service and governments. The Web Foundation published the first edition of its Open Data Index in 2012 with “a specific set of 14 indicators directly targeted at measuring open data worldwide”.

Wikipedia and Big/Open Data

Open Data Index(2012) produced by the Web Foundation on open data transparency and prevalence around the world

With the creation of Wikidata, Wikimedia has made its first explicit steps into the world of open data. This is a big and active area of activity that we can benefit from engaging.

Wikimania London is looking for volunteers who are interested in the Big data and open data science. Feel free to add yourself to the main conference volunteers page, or the list of Wikimedians who are big data advocates below. Comments and suggestions are welcome on the discussion page.

See the list of big data and open data events, mostly in London but some further afield. If you know of any events coming up that aren't on the list, please add them!

Examples of relevant Wikimedian projects:

Wikidata was released for public use earlier in 2013. A list of all Wikidata-related tools and projects can be found here: / Wikidata Tools and Gadgets

by / Tpt

This script imports automatically all interwiki-links from one Wikipedia article.

The gadget then can be launched by clicking on the "Import interwiki" link in the toolbox. After that a dialog appears where a language for import can be selected.

by YMS A script to semi-automatically import labels, descriptions and aliases for multiple languages based on the articles in Wikipedia and other Wikimedia projects. Suggests descriptions by extracting definitions from article introductions, aliases based on what's bolded in the article introduction, and labels from the article title, and provides a mask to compare existing values with the suggestions made and the article introductions, and to review and edit them all at once. When finished, optionally the next item with no label or description can be loaded automatically, allowing fast processing of many items.

by Magnus Manske

GeneaWiki is a simple graph viewer showing genealogy information in WikiData. Starting from a single person, it will try and construct the family tree according to father, mother, and child statements. For an example, see the Bach family.

by Ricordisamoa Tool to browse all chemical elements available on Wikidata, with atomic number, chemical symbol, and localized label. At the moment sorted only by atomic number. Code to be released soon (GPL).

by PlaneMad Get India's English-language's states names, then can translate into any language.


by Magnus Manske Show tempo-spatial information from Wikidata.

Examples: American civil war The Franco-Prussian War Hundred Years' War Père Lachaise Cemetery

Examples of relevant non-Wikimedian projects:

  • Open Data Institute
  • http://blog.okfn.org/2013/06/11/announcing-the-global-open-data-initiative/
  • Open Bank Project: The Open Bank Project provides an open source developer friendly “API for banks” that developers and companies can use to build innovative applications and services based on the account holders transaction data.
  • World Justice Project: This World Justice Project 'Rule of Law Index' uses polls of experts and the general populations of all countries to assess issues of government accountability, fundamental rights, openness of government and access to justice. The project is open but the tool is aimed at policymakers, civil society, practitioners, academics and businesses to improve the implementation and adherence of the rule of law in their countries.
  • opencorporates.org - I just sent an email to them to see if they would be interested to take part in some way EdSaperia (talk) 14:17, 17 October 2013 (UTC)[reply]

A message from Hera Hussain

"According to IBM, 90% of all the world's data has been collected in the last 2 years. It's a fascinating time to see the innovation coming from the open data and big data space. In 2013 we now produce 2.5 exabytes of information every day.

Open data allows insight into markets, opens formerly closed channels such as diplomacy and encourages cross-sectoral collaboration. My favourite projects in open data are related to open government. Opening data in agriculture for public use, for instance, has led to the sharing of best practise in agriculture by farmers. Similarly, there has been considerable success in raising public awareness on important (and sometimes dull) issues through data visualization of interactive maps and infographics. An interesting example is the Office of National Statistics Census Interactive Map.

Big data, on the other hand, is becoming increasingly important for businesses. A recent study of 75 North American retail executives found that 46% of retailers considered the volume of information with which they must contend to be the biggest challenge in dealing with Big Data. 34% say managing data variety – both structured and unstructured – is the top challenge.

Wikimania 2014 is a great opportunity for people across all sectors to meet and share ideas on how open and big data can be used to drive fruitful outcomes for society.

If you know of any project that should be listed here - please contribute to this page.