Jump to content

Submissions/Measuring Editor Collaborativeness With Economic Modelling

From Wikimania 2014 • London, United Kingdom

This is an accepted submission for Wikimania 2014.

Submission no. 1056
Title of the submission

Measuring Editor Collaborativeness With Economic Modelling

Type of submission (discussion, hot seat, panel, presentation, tutorial, workshop)

Presentation

Author of the submission

Max Klein - w:en:User:Maximilianklein

E-mail address

isalix@gmail.com

Username

w:en:User:Maximilianklein

Country of origin

USA

Affiliation, if any (organisation, company etc.)
Personal homepage or blog

http://notconfusing.com

Abstract (at least 300 words to describe your proposal)
SLIDES - https://slideviewer.herokuapp.com/github/notconfusing/wiki_econ_capability/blob/sunday/Measuring%20Editor%20Collaborativeness%20WIkimania%202014.ipynb?create=1#/

Even though Wikipedia is a vanguard of collaboration, Wikipedians unfortunately have access to few tools for performance metrics - most notably “Edit Count”. The Wikimedia Foundation's WikiMetrics cohort notion [1] has begun the search for higher-level metrics, but hasn't yet answered “how” users work together.

This presentation will outline new methods for measuring and understanding editor collaborativeness. Borrowing from Economic modelling, new insights on economic competitiveness make the analog of editor collaborativeness possible. A recent stream of research in Macroeconomics has shown simple techniques for predicting GDP with very little information [2] [3], which can be translated into the wiki realm. Using only the data of which countries export which products (not even how much of each product), one can quickly predict GDP rankings. Here we re-purpose the algorithm, so that Editors are countries, articles are products, and GDP is “Total Labour Hours” (an edit count derivative [4]).

By constructing a relation between editors and the articles they have touched, we are able to produce an entirely new perspective on Wikipedia (see the Figure below). The simplicity of this model can help us to quickly and easily determine which categories of articles are more likely to be hostile and power-user dominated, and which are more egalitarian and collaborative.

Incredibly, this borrowed method works even better for Wikis than Economies. Where the maximum achievable correlation in Macroeconomics is about 0.42, we can achieve correlations of up to 0.91. However, the real innovation comes from two factors which are tweaked to optimize the model:

  • importance of the high quality articles in an editor's contribution portfolio
  • (conversely) the importance of highly-invested editors in an article's contribution history.

These variables can range independently, and characterize our notion of "collaborativeness".

Collaborativeness is determined from the edit patterns of editors. Do they edit many articles? How well developed are the articles they edit? Consider these telling extremes:

  • The best editors in Category:Military history of the US—a category known for being very competitive—are characterized by emphasizing investment in touching many articles in the category. Less collaborative.
  • On the other end, the editors in Category:Sexual acts—a taboo subject where much editing could be considered perverse—are characterized by not touching many articles in the category. More collaborative.

We hope to receive critique on whether our algorithmic notion of collaborativeness is inline with community opinion. Additionally we hope to receive requests for different datasets to analyze for future research.

References

  1. Wikimetrics. [1]
  2. Hidalgo, Hausmann, The Building Blocks of Economic Complexity [2]
  3. Caldarelli et al. Firm Grounds. [3]
  4. Halfaker, Geiger, Using Edit Sessions [4]

Track
  • WikiCulture & Community
Length of session (if other than 30 minutes, specify how long)
30 minutes

30 Minutes

Will you attend Wikimania if your submission is not accepted?

Yes

Slides or further information (optional)

A triangular matrix from Wikipedia data

A rendering of a latex table

Special requests


Interested attendees

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest. Sign with a hash and four tildes. (# ~~~~).

  1. EpochFail (talk) 14:51, 28 March 2014 (UTC)[reply]
  2. Jodi.a.schneider (talk) 20:06, 31 March 2014 (UTC)[reply]
  3. Bluma.Gelley (talk) 20:29, 2 April 2014 (UTC)[reply]
  4. Ocaasi (talk) 23:33, 7 April 2014 (UTC)[reply]
  5. SarahStierch (talk) 19:26, 12 April 2014 (UTC)[reply]
  6. Masssly (talk) 13:33, 5 July 2014 (UTC)[reply]
  7. --Aubrey (talk) 08:13, 21 July 2014 (UTC)[reply]
  8. MCruz (WMF) (talk) 16:57, 22 July 2014 (UTC)[reply]
  9. Sshhiivv (talk) 22:17, 4 August 2014 (UTC)[reply]
  10. Sdivad (talk) 15:58, 8 August 2014 (UTC)[reply]
  11. Sjgknight (talk) 09:31, 9 August 2014 (UTC)[reply]
  12. Add your username here.