Wikipedia:Wikipedia Signpost/2018-10-28/Recent research

{{Wikipedia:Wikipedia Signpost/Templates/RSS description|1=If it weren't free, of course.}}{{Wikipedia:Signpost/Template:Signpost-header|||}}

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-article-header-v2|{{{1|Wikimedia Commons worth $28.9 billion }}}|By Isaac Johnson and Tilman Bayer| 23 September 2018}}

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-start-v2}}

{{WRN}}

=Estimating the Value of Wikimedia Commons=

:Reviewed by Isaac Johnson

Though Wikimedia projects like Wikipedia are clearly incredibly valuable to people worldwide (e.g., Wikipedia's status as the fifth most popular site worldwide), it has been harder to quantify other facets of this value. Anecdotally, the content from communities like Wikipedia has been incredibly important in the development of natural language processing tools,{{cite web |last1=Iderhoff |first1=Nicolas |title=nlp-datasets |url=https://github.com/niderhoff/nlp-datasets |website=GitHub |access-date=26 October 2018}} search engines like Google,{{cite web |last1=Singhal |first1=Amit |title=Introducing the Knowledge Graph: things, not strings |url=https://www.blog.google/products/search/introducing-knowledge-graph-things-not/ |website=The Keyword |publisher=Google |access-date=26 October 2018}} and an important resource when making life decisions.{{cite journal |last1=Singer |first1=Philipp |last2=Lemmerich |first2=Florian |last3=West |first3=Robert |last4=Zia |first4=Leila |last5=Wulczyn |first5=Ellery |last6=Strohmaier |first6=Markus |last7=Leskovec |first7=Jure |title=Why We Read Wikipedia |date=3 April 2017 |pages=1591–1600 |doi=10.1145/3038912.3052716 |url=https://dl.acm.org/citation.cfm?doid=3038912.3052716 |publisher=International World Wide Web Conferences Steering Committee}}

This OpenSym 2018 paper, "What is the Commons Worth? Estimating the Value of Wikimedia Imagery by Observing Downstream Use",{{cite journal |last1=Erickson |first1=Kristofer |last2=Perez |first2=Felix Rodriguez |last3=Perez |first3=Jesus Rodriguez |title=What is the Commons Worth?: Estimating the Value of Wikimedia Imagery by Observing Downstream Use |journal=OpenSym |date=22 August 2018 |pages=9 |doi=10.1145/3233391.3233533 |url=https://dl.acm.org/citation.cfm?id=3233533 |publisher=ACM}} attempts to quantify the monetary value of Wikimedia Commons, a peer-produced repository of free-use imagery and video that in part holds the images readers come across on Wikipedia. To do so, the authors pose a counterfactual question: how much would the licensing of this content generate if it operated under a for-profit model such as that of Getty Images? They collect a random dataset of 10,000 images from Commons and do a reverse image-search on them to detect how often they are being used across the internet. The domain of each re-use is then evaluated to determine whether, for instance, it was a commercial entity. Using Getty's licensing model of USD $175 for commercial use and USD $60 for non-commercial use, they extrapolate out how often on average each image is used (and where) to reach a total estimate of USD $28.9 billion for Wikimedia Commons.

While there are interesting discussions to be held about some of the methodological choices that led to their final estimate of USD $28.9 billion for the entirety of Commons – e.g., what is a more reasonable estimate of what proportion of images would be paid for if under license – the general approach and motivation are sound and certainly raise important questions about how we value resources like Wikimedia Commons. This research complements previous estimates of the value of Commons.{{cite journal |last1=Heald |first1=Paul J. |last2=Erickson |first2=Kris |last3=Kretschmer |first3=Martin |title=The Valuation of Unprotected Works: A Case Study of Public Domain Photographs on Wikipedia |journal=SSRN Electronic Journal |date=2015 |doi=10.2139/ssrn.2560572 |url=https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2560572 |language=en |issn=1556-5068}} These are not easy questions, but I'll be excited as more research adds to our understanding of the value of these communities' work.

Cf. earlier coverage: "Estimate for economic benefit of Wikipedia: $50 million by 2006 already"

=Briefly=

==Conferences and events==

See the research events page on Meta-wiki for upcoming conferences and events, including submission deadlines.

=Other recent publications=

Other recent publications that could not be covered in time for this issue include the items listed below. Contributions are always welcome for reviewing or summarizing newly published research.

:Compiled by Tilman Bayer

=="Web caching evaluation from Wikipedia request statistics"==

From the abstract:{{Cite conference| doi = 10.23919/WIOPT.2017.7959873| conference = 2017 15th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)| pages = 1–6| last1 = Hasslinger| first1 = G.| last2 = Kunbaz| first2 = M.| last3 = Hasslinger| first3 = F.| last4 = Bauschert| first4 = T.| title = Web caching evaluation from Wikipedia request statistics|book-title= 2017 15th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)| date = May 2017}} {{closed access}}{{pb}}Freely available version: [http://dl.ifip.org/db/conf/wiopt/wiopt2017/1570343131.pdf Web Caching Evaluation from Wikipedia Request Statistics], The 2nd Content Caching and Delivery in Wireless Networks Workshop (CCDWN), 2017 "We use publically available statistics about the top-1000 most popular pages on each day to estimate the efficiency of caches for support of the platform. While the data volumes are moderate, the main goal of Wikipedia caches is to reduce access times for page views and edits. We study the impact of most popular pages on the achievable cache hit rate in comparison to Zipf request distributions and we include daily dynamics in popularity."

=="Hacking Academic Collaboration with GLAM Edit-a-thons"==

From the abstract:{{Cite journal| volume = 1| issue = 1| pages = 65–95| last1 = Thorndike-Breeze| first1 = Rebecca| last2 = Suiter| first2 = Greta Kuriger| title = Hacking Academic Collaboration with GLAM Edit-a-thons| journal = WikiStudies| date = 2017-09-29| url = http://wikistudies.org/index.php?journal=wikistudies&page=article&op=view&path[]=3}} "At MIT, librarians, archivists, writing instructors, and local Wikipedians have collaborated to host several edit-a-thons with the common goals of addressing content gaps on Wikipedia and offering the public and the MIT community (including students, staff, alumni and faculty) new ways to engage with the institute's archives and special collections. [...] This article shares results from MIT's GLAM edit-a-thons, and argues that approaching projects from the perspective of Wikipedia's collaborative culture can enhance other kinds of academic collaboration."

=="Connecting Wikipedia and the Archive"==

From the abstract:{{Cite journal| volume = 1| issue = 1| pages = 40–64| last = Matsuuchi| first = Ann| title = Connecting Wikipedia and the Archive| journal = WikiStudies| date = 2017-09-25| url = http://wikistudies.org/index.php?journal=wikistudies&page=article&op=view&path[]=2}} "The described project that was started in 2015, was collaboratively designed by archivists and historians with the La Guardia & Wagner Archives ("the Archives") and LaGuardia Community College's faculty and librarians, and involves beginning college students in the production of a needed public history of the outbreak and impact of HIV/AIDS in New York City. [...] Utilization of a Wikipedia as a non-commercial, public, open access information source also succeeds in raising web traffic, visibility and accessibility for unique and valuable archival collections."

=="Nonhuman language agents in online collaborative communities: Comparing Hebrew Wikipedia and Facebook translations"==

From the abstract:{{Cite journal| doi = 10.1016/j.dcm.2017.10.002| issn = 2211-6958| volume = 21| issue = Supplement C| pages = 10–17| last1 = Vaisman| first1 = Carmel L.| last2 = Gonen| first2 = Illan| last3 = Pinter| first3 = Yuval| title =Nonhuman language agents in online collaborative communities: Comparing Hebrew Wikipedia and Facebook translations| journal = Discourse, Context & Media| date = 2018-03-01| url = http://www.sciencedirect.com/science/article/pii/S2211695817301848}} {{closed access}} "This study compared language policies in Hebrew Wikipedia and the Hebrew Facebook translation app. Hebrew Wikipedia designed a strict linguistic guide that promotes a neutral Hebrew register, rejecting both colloquial and high registers, enforced by an algorithm post factum."

=="Wikipedia's gaps in coverage: are Wikiprojects a solution? A study of the Cambodian Wikiproject"==

From the abstract:{{Cite journal| doi = 10.1108/OIR-06-2017-0199| issn = 1468-4527| volume = 42| issue = 2| pages = 238–249| last = Luyt| first = Brendan| title = Wikipedia's gaps in coverage: are Wikiprojects a solution? A study of the Cambodian Wikiproject| journal = Online Information Review|date = 2018-02-01| url = https://www.emeraldinsight.com/doi/abs/10.1108/OIR-06-2017-0199}} {{closed access}} "The purpose of this paper is to examine the rather unsuccessful Wikiproject for Cambodia. Despite its lack of success, it is a case that can be used to draw lessons for dealing with the issue of geographical under-representation on Wikipedia as a whole. ... The author takes a broadly qualitative approach to the study of Wikipedia. For this study, the Cambodia Wikiproject main page, as well as the various talk page archives associated with it, was downloaded in November 2016 and subjected to a content analysis. Descriptive statistics are also used when necessary to build the argument. Findings: Wikiproject Cambodia has failed to appreciably improve the coverage of Cambodian topics. This is likely due to its inability to attract for a prolonged period of time a champion able to anchor the project and provide a sense that someone is listening. But the makeup of the project members also suggests that even if a champion could be found, the question of who gets to represent whom remains difficult to deal with. It is unlikely that Cambodia will anytime soon develop a strong community of Wikipedia editors given the economic and social constraints the country imposes on the most of its population."

=="Representing Metro Manila on Wikipedia"==

From the abstract:{{Cite journal| doi = 10.1108/OIR-10-2016-0308| issn = 1468-4527| volume = 42| issue = 1| pages = 16–27| last = Luyt| first = Brendan| title = Representing Metro Manila on Wikipedia| journal = Online Information Review| date = 2017-11-30| url = https://www.emeraldinsight.com/doi/abs/10.1108/OIR-10-2016-0308}} {{closed access}} "While the Wikipedia article on Manila cannot be classified as promotional, it is clear that much of the city remains invisible in this work. Such a puzzle becomes understandable when we examine the urban studies literature where we find that the spatial logic of the city itself helps conceal much from view, so that what we read on Wikipedia is a view from the islands of privilege rather than the oceans of marginalization that make up much of the city's spatial form. If such a spatial structure is to change, representations such as found on Wikipedia need to be challenged."

=="How does communicative memory become cultural memory? Negotiation processes on the Wikipedia talk page in case of the White Rose"==

:"Wie wird kommunikatives zu kulturellem Gedächtnis? Aushandlungsprozesse auf den Wikipedia-Diskussionsseiten am Beispiel der Weißen Rose" (in German){{Cite book| publisher = Springer VS, Wiesbaden| isbn = 9783658195120 | pages = 143–167| last1 = Heinrich| first1 = Horst-Alfred| last2 = Gilowsky| first2 = Julia| title = (Digitale) Medien und soziale Gedächtnisse| chapter = Wie wird kommunikatives zu kulturellem Gedächtnis? Aushandlungsprozesse auf den Wikipedia-Diskussionsseiten am Beispiel der Weißen Rose| series = Soziales Gedächtnis, Erinnern und Vergessen – Memory Studies|date = 2018|chapter-url= https://link.springer.com/chapter/10.1007/978-3-658-19513-7_7}} {{closed access}} [https://books.google.com/books?id=zRRBDwAAQBAJ&pg=PA149v=onepage&q=wikipedia&f=false Google Books preview]

From [https://books.google.com/books?id=zRRBDwAAQBAJ&v=onepage&q=wikipedia&f=false#v=onepage&q=%22ist%20dieser%20Befund%20beachtlich%22&f=false the paper] (translated): "Finally, the [talk page comments classfied in] the category of personal attacks are remarkable because of their insignificant quantitative dimension. In the context of the White Rose, there was only a single incident of this kind. On the backdrop of widespread hate attacks on the Internet this finding is notable, considering that the resistance against national socialism has never been uncontroversial."

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-end-v2}}

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-start-v2|fullwidth}}

=References=

{{reflist|30em}}

:Supplementary references:

{{Reflist|30em|group=supp}}

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-end-v2}}

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-article-end-v2}}

{{Wikipedia:Signpost/Template:Signpost-article-comments-end||2018-10-01|2018-12-01}}