WebCite
{{Short description|On-demand archiving service}}
{{Distinguish|Website}}
{{Use mdy dates|date=August 2022}}
{{Self reference|For a guide to using WebCite within Wikipedia, see Help:Using WebCite}}
{{bots|deny=Medic,WaybackMedic,GreenC,GreenC bot}}
{{Infobox website
| name = WebCite
| logo = WebCite.svg
| screenshot =
| caption =
| url = {{URL|webcitation.org/|WebCitation.org}}
| commercial = No
| type =
| language = English
| registration =
| owner = University of Toronto
| author = Gunther Eysenbach
| launch_date = {{Start date and age|1997}}
| current_status = View historical archives only, no new archives
| revenue =
}}
WebCite is an intermittently available archive site, originally designed to digitally preserve scientific and educationally important material on the web by taking snapshots of Internet contents as they existed at the time when a blogger or a scholar cited or quoted from it. The preservation service enabled verifiability of claims supported by the cited sources even when the original web pages are being revised, removed, or disappear for other reasons, an effect known as link rot.
As of June 2023, the site no longer accepts new archive requests; old archive snapshots can still be viewed.
The site is frequently offline with no explanation, and for lengthy periods of time. For example it was offline between October 29, 2021 and June 24, 2023 (1 year and 8 months) during which it reported "DB Connection failed". The site is owned and maintained by Gunther Eysenbach.
Service features
WebCite allowed for preservation of all types of web content, including HTML web pages, PDF files, style sheets, JavaScript and digital images. It also archived metadata about the collected resources such as access time, MIME type, and content length.
WebCite was a non-profit consortium supported by publishers and editors,{{who|date=July 2019}} and it could be used by individuals without charge.{{clarify|reason=Does it charge for institutional use?|date=July 2019}} It was one of the first services to offer on-demand archiving of pages, a feature later adopted by many other archiving services, such as archive.today and the Wayback Machine. It did not do web page crawling.
History
Conceived in 1997 by Gunther Eysenbach, WebCite was publicly described the following year when an article on Internet quality control declared that such a service could also measure the citation impact of web pages.{{Cite journal |last1=Eysenbach |first1=Gunther |author-link=Gunther Eysenbach |last2=Diepgen, Thomas L. |date=November 28, 1998 |title=Towards quality management of medical information on the internet: evaluation, labelling, and filtering of information |journal=The BMJ |volume=317 |issue=7171 |pages=1496–1502 |doi=10.1136/bmj.317.7171.1496 |issn=0959-8146 |oclc=206118688 |pmc=1114339 |pmid=9831581 |id=BL Shelfmark 2330.000000}} In the next year, a pilot service was set up at the address webcite.net. Although it seemed that the need for WebCite decreased when Google's short term copies of web pages began to be offered by Google Cache and the Internet Archive expanded their crawling (which started in 1996),{{Cite web |date=October 25, 2013 |title=Fixing Broken Links on the Internet |url=http://blog.archive.org/2013/10/25/fixing-broken-links/ |website=Internet Archive blog}} WebCite was the only one allowing "on-demand" archiving by users. WebCite also offered interfaces to scholarly journals and publishers to automate the archiving of cited links. By 2008, over 200 journals had begun routinely using WebCite.{{Cite journal |last1=Eysenbach |first1=Gunther |author-link=Gunther Eysenbach |last2=Trudel, Mathieu |year=2005 |title=Going, Going, Still There: Using the WebCite Service to Permanently Archive Cited Web Pages |journal=Journal of Medical Internet Research |volume=7 |issue=5 |pages=e60 |doi=10.2196/jmir.7.5.e60 |issn=1438-8871 |oclc=107198227 |pmc=1550686 |pmid=16403724 |doi-access=free }}
WebCite was formerly a member of the International Internet Preservation Consortium.{{Cite web |title=WebCite Consortium FAQ |url=http://www.webcitation.org/faq |website=WebCitation.org |publisher=WebCite |via=Internet Archive |access-date=May 15, 2018 |archive-date=August 11, 2021 |archive-url=https://web.archive.org/web/20210811235133/https://webcitation.org/faq |url-status=dead }}{{cbignore}} In response a 2012 message on Twitter relating to WebCite's former membership of the consortium, Eysenbach commented that "WebCite has no funding, and IIPC charges €4000 per year in annual membership fees."{{cite tweet |user=eysenbach |number=212380809464782849 |title=@ReaderMeter @sennoma WebCite has no funding, and IIPC charges 4000 Euro/yr in membership fees |last=Eysenbach |first=Gunther |archive-url=https://web.archive.org/web/20220103170211/https://twitter.com/eysenbach/status/212380809464782849 |archive-date=January 3, 2022 |url-status=live}}
WebCite "feeds its content" to other digital preservation projects, including the Internet Archive. Lawrence Lessig, an American academic who writes extensively on copyright and technology, used WebCite in his amicus brief in the Supreme Court of the United States case of MGM Studios, Inc. v. Grokster, Ltd.{{Cite news |last=Cohen |first=Norm |date=January 29, 2007 |title=Courts Turn to Wikipedia, but Selectively |work=The New York Times |url=https://www.nytimes.com/2007/01/29/technology/29wikipedia.html}}
Sometime between July 9 and 17, 2019, WebCite stopped accepting new archiving requests.{{Cite web |date=July 17, 2019 |title=WebCite 17th July 2019 |url=https://webcitation.org/index |url-status=live |archive-url=https://web.archive.org/web/20190717131123/https://webcitation.org/index |archive-date=July 17, 2019 |access-date=January 17, 2021}}{{cbignore}}{{cite web | title = Where did the archive go? Part 4: WebCite |url = https://ws-dl.blogspot.com/2019/10/2019-10-21-where-did-archive-go-part-4.html |work=Web Science and Digital Libraries Research Group |publisher=Old Dominion University |via=Blogger |date=2019-10-21 |access-date=2024-11-25 }} In a further outage, between about October 29, 2021 and June 24, 2023, no archived content was available, only the main page worked.
Fundraising
WebCite ran a fund-raising campaign using FundRazr from January 2013 with a target of $22,500, a sum which its operators stated was needed to maintain and modernize the service beyond the end of 2013.{{Cite web |title=Fund WebCite |url=http://meta.wikimedia.org/wiki/WebCite |access-date=December 6, 2013 |publisher=Wikimedia Foundation}} This includes relocating the service to Amazon EC2 cloud hosting and legal support. {{As of | 2013}} it remained undecided whether WebCite would continue as a non-profit or as a for-profit entity.{{Cite web |title=Conversation between GiveWell and WebCite on 4/10/13 |url=http://www.givewell.org/files/conversations/Webcite%20conversation%20notes%20(public).pdf |access-date=October 18, 2009 |publisher=GiveWell |quote=Dr. Eysenbach is trying to decide whether WebCite should continue as a non-profit project or a business with revenue streams built into the system.}}
Business model
The term "WebCite" is a registered trademark.{{Cite web |title=WebCite Legal and Copyright Information |url=http://webcitation.org/license |access-date=June 16, 2009 |website=WebCitation.org |publisher=WebCite |archive-date=July 25, 2008 |archive-url=https://web.archive.org/web/20080725040851/http://www.webcitation.org/license |url-status=dead }}{{cbignore}} WebCite did not charge individual users, journal editors and publishers{{Cite web |title=WebCite Member List |url=http://webcitation.org/members |access-date=June 16, 2009 |website=WebCitation.org |publisher=WebCite Consortium |quote=Membership is currently free |archive-date=July 25, 2008 |archive-url=https://web.archive.org/web/20080725031939/http://www.webcitation.org/members |url-status=dead }}{{cbignore}} any fee to use their service. WebCite earned revenue from publishers who wanted to "have their publications analyzed and cited webreferences archived". Early support was from the University of Toronto.
Copyright issues
WebCite maintained the legal position that its archiving activities are allowed by the copyright doctrines of fair use and implied license. To support the fair use argument, WebCite noted that its archived copies are transformative, socially valuable for academic research, and not harmful to the market value of any copyrighted work. WebCite argued that caching and archiving web pages was not considered a copyright infringement when the archiver offers the copyright owner an opportunity to "opt-out" of the archive system, thus creating an implied license. To that end, WebCite would not archive in violation of Web site "do-not-cache" and "no-archive" metadata, as well as robot exclusion standards, the absence of which creates an "implied license" for web archive services to preserve the content.
In a similar case involving Google's web caching activities, on January 19, 2006, the United States District Court for the District of Nevada agreed with that argument in the case of Field v. Google (CV-S-04-0413-RCJ-LRL), holding that fair use and an "implied license" meant that Google's caching of Web pages did not constitute copyright violation. The "implied license" referred to general Internet standards.
=DMCA requests=
According to their policy, after receiving legitimate DMCA requests from the copyright holders, WebCite would remove saved pages from public access, as the archived pages are still under the safe harbor of being citations. The pages were removed to a "dark archive" and in cases of legal controversies or evidence requests, there was pay-per-view access of "$200 (up to 5 snapshots) plus $100 for each further 10 snapshots" to the copyrighted content.{{Cite web |title=WebCite takedown requests policy |url=https://www.webcitation.org/policy.php |archive-url=https://web.archive.org/web/20210422075627/https://www.webcitation.org/policy.php |archive-date=April 22, 2021 |access-date=May 14, 2017 |website=WebCitation.org |publisher=WebCite}}{{cbignore}}
See also
{{Portal|Internet}}
References
{{reflist|30em}}
External links
- {{Official website}}
{{Authority control}}
{{DEFAULTSORT:Webcite}}
Category:Internet properties established in 2004