Bibliographic coupling#History
{{Citation metrics}}
Bibliographic coupling, like co-citation, is a similarity measure that uses citation analysis to establish a similarity relationship between documents. Bibliographic coupling occurs when two works reference a common third work in their bibliographies. It is an indication that a probability exists that the two works treat a related subject matter.{{cite journal | last1 = Martyn | first1 = J | year = 1964 | title = Bibliographic coupling | journal = Journal of Documentation | volume = 20 | issue = 4| page = 236 | doi=10.1108/eb026352}}
Two documents are bibliographically coupled if they both cite one or more documents in common. The "coupling strength" of two given documents is higher the more citations to other documents they share. The figure to the right illustrates the concept of bibliographic coupling. In the figure, documents A and B both cite documents C, D and E. Thus, documents A and B have a bibliographic coupling strength of 3 - the number of elements in the intersection of their two reference lists.
Similarly, two authors are bibliographically coupled if the cumulative reference lists of their respective oeuvres each contain a reference to a common document, and their coupling strength also increases with the citations to other documents that their share. If the cumulative reference list of an author's oeuvre is determined as the multiset union of the documents that the author has co-authored, then the author bibliographic coupling strength of two authors (or more precisely, of their oeuvres) is defined as the size of the multiset intersection of their cumulative reference lists, however.{{cite journal | last1 = Zhao | first1 = D. | last2 = Strotmann | first2 = A. | year = 2008 | title = Evolution of research activities and intellectual influences in information science 1996–2005: Introducing author bibliographic-coupling analysis | journal = Journal of the American Society for Information Science and Technology | volume = 59 | issue = 13| pages = 2070–2086 | doi = 10.1002/asi.20910 | doi-access = free }}
Bibliographic coupling can be useful in a wide variety of fields, since it helps researchers find related research done in the past. On the other hand, two documents are co-cited if they are both independently cited by one or more documents.
History
The concept of bibliographic coupling was introduced by M. M. Kessler of MIT in a paper published in 1963,"Bibliographic coupling between scientific papers," American Documentation 24 (1963), pp. 123-131. and has been embraced in the work of the information scientist Eugene Garfield.See for example "Multiple Independent Discovery and Creativity in Science," Current Contents, Nov. 3, 1980, pp. 5-10, reprinted in [http://www.garfield.library.upenn.edu/essays.html Essays of an Information Scientist], vol. 4 (1979-80), pp. 660-665. It is one of the earliest citation analysis methods for document similarity computation and some have questioned its usefulness, pointing out that two works may reference completely unrelated subject matter in the third. Furthermore, bibliographic coupling is a retrospective similarity measure,Garfield Eugene, 2001.[http://garfield.library.upenn.edu/papers/drexelbelvergriffith92001.pdf From Bibliographic Coupling to Co-Citation Analysis via Algorithmic Historio-Bibliography] presented at Drexel University, Philadelphia, PA meaning the information used to establish the similarity relationship between documents lies in the past and is static, i.e. bibliographic coupling strength cannot change over time, since outgoing citation counts are fixed.
The co-citation analysis approach introduced by Henry Small and published in 1973 addressed this shortcoming of bibliographic coupling by considering a document's incoming citations to assess similarity, a measure that can change over time. Additionally, the co-citation measure reflects the opinion of many authors and thus represents a better indicator of subject similarity.Henry Small, 1973. [http://polaris.gseis.ucla.edu/gleazer/296_readings/small.pdf "Co-citation in the scientific literature: A new measure of the relationship between two documents"] {{webarchive|url=https://web.archive.org/web/20121202085010/http://polaris.gseis.ucla.edu/gleazer/296_readings/small.pdf |date=2012-12-02 }}. Journal of the American Society for Information Science (JASIS), volume 24(4), pp. 265-269. doi = 10.1002/asi.4630240406
In 1972 Robert Amsler published a paperRobert Amsler, Dec. 1972 [https://openlibrary.org/works/OL12801639W/Applications_of_citation-based_automatic_classification?v=2 "Applications of citation-based automatic classification"], Linguistics Research Center, University Texas at Austin, Technical Report 72-14. describing a measure for determining subject similarity between two documents by fusing bibliographic coupling and co-citation analysis.[http://webla.sourceforge.net/javadocs/pt/tumba/links/Amsler.html Class Amsler] written by Bruno Martins and developed by the XLDB group of the Department of Informatics of the Faculty of Sciences of the University of Lisbon in Portugal
In 1981 Howard White and Belver Griffith introduced author co-citation analysis (ACA).{{cite journal | last1 = White | first1 = Howard D. | last2 = Griffith | first2 = Belver C. | year = 1981 | title = Author Cocitation: A Literature Measure of Intellectual Structure | journal = Journal of the American Society for Information Science | volume = 32 | issue = 3| pages = 163–171 | doi = 10.1002/asi.4630320302 }} Not until 2008 did Dangzhi Zhao and Andreas Strotmann combine their work and that of M. M. Kessler to define author bibliographic coupling analysis (ABCA), noting that as long as authors are active this metric is not static and that it is particularly useful when combined with ACA.
More recently, in 2009, Gipp and Beel introduced a new approach termed Co-citation Proximity Analysis (CPA). CPA is based on the concept of co-citation, but represents a refinement to Small's measure in that CPA additionally considers the placement and proximity of citations within a document's full-text. The assumption is that citations in closer proximity are more likely to exhibit a stronger similarity relationship.Bela Gipp and Joeran Beel, 2009 [http://gipp.com/wp-content/papercite-data/pdf/gipp09a.pdf Citation Proximity Analysis (CPA) – A new approach for identifying related work based on Co-Citation Analysis] in Proceedings of the 12th international conference on scientometrics and informetrics (issi’09), Rio de Janeiro (Brazil), 2009, pp. 571-575.
In summary, a chronological overview of citation analysis methods includes:
- Bibliographic coupling (1963)
- Co-citation analysis (published 1973)
- Amsler measure (1972)
- Author co-citation analysis (1981)
- Author bibliographic coupling analysis (2008)
- Co-citation proximity analysis (CPA) (2009)
Applications
Online sites that make use of bibliographic coupling include
[http://liinwww.ira.uka.de/bibliography/ The Collection of Computer Science Bibliographies] {{Webarchive|url=https://web.archive.org/web/20110607123538/http://liinwww.ira.uka.de/bibliography/ |date=2011-06-07 }} and [http://citeseer.ist.psu.edu/cs CiteSeer.IST]
See also
- Technical Information Project, early exploration of the concept by Meyer Mike Kessler
Notes
{{reflist}}
References
{{Excessive citations|section|date=January 2019}}
=Bibliographic Coupling=
- {{cite journal | last1 = Kessler | first1 = M. M. | year = 1963 | title = Bibliographic coupling between scientific papers | journal = American Documentation | volume = 14 | issue = 1| pages = 10–25 | doi=10.1002/asi.5090140103}}
- {{cite journal | last1 = Kessler | first1 = M. M. | year = 1963 | title = An experimental study of bibliographic coupling between technical papers | journal = IEEE Transactions on Information Theory | volume = 9 | issue = 1| page = 49 | doi=10.1109/tit.1963.1057800}}
=Author Bibliographic Coupling=
- {{cite journal | last1 = Zhao | first1 = D. | last2 = Strotmann | first2 = A. | year = 2008 | title = Evolution of research activities and intellectual influences in information science 1996–2005: Introducing author bibliographic-coupling analysis | journal = Journal of the American Society for Information Science and Technology | volume = 59 | issue = 13| pages = 2070–2086 | doi = 10.1002/asi.20910 | doi-access = free }}
=Co-citation analysis =
- {{cite journal | last1 = Small | first1 = Henry | year = 1973 | title = Co-citation in the scientific literature: a new measure of the relationship between two documents | journal = Journal of the American Society for Information Science | volume = 24 | issue = 4| pages = 265–269 | doi=10.1002/asi.4630240406| s2cid = 17845928 }}
- {{cite journal | last1 = Small | first1 = Henry | last2 = Griffith | first2 = B. C. | year = 1974 | title = The structure of scientific literatures (I) Identifying and graphing specialties | journal = Science Studies | volume = 4 | issue = 1| pages = 17–40 | doi=10.1177/030631277400400102| s2cid = 146684402 }}
- {{cite journal | last1 = Griffith | first1 = B. C. | display-authors = etal | year = 1974 | title = The structure of scientific literatures (II) Towards a macro- and micro-structure for science | journal = Science Studies | volume = 4 | issue = 4| pages = 339–365 | doi=10.1177/030631277400400402| s2cid = 145811357 }}
- {{cite journal | last1 = Collins | first1 = H. M. | year = 1974 | title = The TEA set: Tacit knowledge and scientific networks | journal = Science Studies | volume = 4 | issue = 2| pages = 165–186 | doi=10.1177/030631277400400203| s2cid = 26917303 }}
=Co-citation Proximity Analysis ([[Co-citation Proximity Analysis|CPA]])=
- Bela Gipp, (Co-)Citation Proximity Analysis – A Measure to Identify Related Work, Feb., 2006. Doctoral Proposal, [http://www.vlba-lab.de VLBA-Lab], Otto-von-Guericke University, Magdeburg, Supervisor: Prof. Claus Rautenstrauch
- {{cite journal | last1 = Gipp | first1 = Bela | last2 = Beel | first2 = Joeran | year = 2006 | title = Citation Proximity Analysis (CPA) – A New Approach for Identifying Related Work Based on Co-Citation Analysis | url = http://sciplore.org/wp-content/papercite-data/pdf/gipp09a.pdf | journal = Proceedings of the 12th International Conference on Scientometrics and Informetrics (ISSI'09) | volume = Rio de Janeiro, Brazil, 2009}}
- {{cite book | last1 = Gipp | first1 = Bela | last2= Taylor | first2= Adriana | last3 = Beel | first3 = Joeran | year = 2010 | chapter= Link Proximity Analysis - Clustering Websites by Examining Link Proximity | chapter-url= https://www.gipp.com/wp-content/papercite-data/pdf/gipp10b.pdf |editor= Lalmas M. |editor2=Jose J. |editor3=Rauber A. |editor4=Sebastiani F. |editor5=Frommholz I. |title=Research and Advanced Technology for Digital Libraries. ECDL 2010. |series=Lecture Notes in Computer Science |volume=6273 |publisher=Springer}}
=Author Co-citation Analysis (ACA)=
- {{cite journal | last1 = White | first1 = H. D. | last2 = Griffith | first2 = B. C. | year = 1981 | title = Author co-citation: a literature measure of intellectual structure | journal = Journal of the American Society for Information Science | volume = 32 | issue = 3| pages = 163–171 | doi=10.1002/asi.4630320302}}
- {{cite journal | last1 = McCain | first1 = K. W. | year = 1986 | title = Co-cited author mapping as a valid representation of intellectual structure | journal = Journal of the American Society for Information Science | volume = 37 | issue = 3| pages = 111–122 | doi=10.1002/(sici)1097-4571(198605)37:3<111::aid-asi2>3.0.co;2-d}}
- {{cite journal | last1 = Culnan | first1 = M. J. | year = 1987 | title = Mapping the intellectual structure of MIS, 1980-1985: A co-citation analysis | journal = MIS Quarterly | volume = 11 | issue = 3| pages = 341–353 | doi=10.2307/248680| jstor = 248680 }}
- {{cite journal | last1 = McCain | first1 = K. W. | year = 1990 | title = Mapping authors in intellectual space: a technical overview | journal = Journal of the American Society for Information Science | volume = 41 | issue = 6| pages = 433–443 | doi=10.1002/(sici)1097-4571(199009)41:6<433::aid-asi11>3.0.co;2-q}}
- {{cite journal | last1 = Hoffman | first1 = D. L. | last2 = Holbrook | first2 = M. B. | year = 1993 | title = The intellectual structure of consumer research: A bibliometrics study of author co-citations in the first 15 years of the journal of consumer research | journal = Journal of Consumer Research | volume = 19 | issue = 4| pages = 505–517 | doi=10.1086/209319}}
- {{cite journal | last1 = Eom | first1 = S. B. | year = 1996 | title = Mapping the intellectual structure of research in decision support systems through author cocitation analysis (1971-1993) | journal = Decision Support Systems | volume = 16 | issue = 4| pages = 315–338 | doi=10.1016/0167-9236(95)00026-7}}
=Citation Studies in a More General Context=
- {{cite journal | last1 = Small | first1 = Henry | year = 1978 | title = Cited Documents as Concept Symbols | url = http://www.garfield.library.upenn.edu/small/hsmallsocstudsciv8y1978.pdf | journal = Social Studies of Science | volume = 8 | issue = 3| pages = 327–340 | doi=10.1177/030631277800800305| s2cid = 145538259 }}
- Henry Small (1982). "Citation context analysis." In: Brenda Dervin and M. J. Voigt, eds., Progress in Communication Sciences, volume 3, pp. 287–310. Ablex Publishing, 1982.
- {{cite journal | last1 = Blair | first1 = David C. | author-link = David Blair (information technologist) | author-link2 = M. E. Maron | last2 = Maron | first2 = M. E. | s2cid = 5144091 | year = 1985 | title = An evaluation of retrieval effectiveness for a full-text document-retrieval system | journal = Communications of the ACM | volume = 28 | issue = 3| pages = 289–299 | doi=10.1145/3166.3197| hdl = 2027.42/35415 | hdl-access = free }}
- {{cite journal | last1 = Brin | first1 = Sergey | author-link = Sergey Brin | author-link2 = Lawrence Page | last2 = Page | first2 = Lawrence | year = 1998 | title = The anatomy of a large-scale hypertextual Web search engine | journal = Computer Networks and ISDN Systems | volume = 30 | issue = 1–7| pages = 107–117 | doi=10.1016/s0169-7552(98)00110-x| citeseerx = 10.1.1.115.5930 | s2cid = 7587743 }}
- {{cite journal | last1 = He | first1 = Yulan | last2 = Cheung Hui | first2 = Siu | year = 2002 | title = Mining a web citation database for author co-citation analysis | journal = Information Processing and Management| volume = 38 | issue = 4| pages = 491–508 | doi=10.1016/s0306-4573(01)00046-2}}
- {{cite book|doi=10.1007/978-3-540-45175-4_45|chapter=Reference Directed Indexing: Redeeming Relevance for Subject Search in Citation Indexes|title=Research and Advanced Technology for Digital Libraries|volume=2769|pages=499–510|series=Lecture Notes in Computer Science|year=2003|last1=Bradshaw|first1=Shannon|isbn=978-3-540-40726-3}}
- {{cite book|doi=10.3115/1220835.1220885|chapter=Creating a test collection for citation-based IR experiments|title=Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics -|pages=391–398|year=2006|last1=Ritchie|first1=Anna|last2=Teufel|first2=Simone|last3=Robertson|first3=Stephen|s2cid=16879847|author-link3=Stephen E. Robertson}}
- {{cite journal | last1 = Iwayama | first1 = Makoto | last2 = Fujii | first2 = Atsushi | last3 = Kando | first3 = Noriko | last4 = Marukawa | first4 = Yozo | year = 2006 | title = Evaluating patent retrieval in the third NTCIR workshop | journal = Information Processing and Management| volume = 42 | issue = 1| pages = 207–221 | doi=10.1016/j.ipm.2004.08.012}}
- {{cite book|doi=10.1145/1277741.1277912|chapter=Enhancing patent retrieval by citation analysis|title=Proceedings of the 30th Annual international ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '07|pages=793–794|year=2007|last1=Fujii|first1=Atsushi|s2cid=12433507|isbn=9781595935977}}
- {{cite book|doi=10.1145/1277741.1277868|chapter=Recommending citations for academic papers|title=Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '07|pages=705–706|year=2007|last1=Strohman|first1=Trevor|last2=Croft|first2=W. Bruce|last3=Jensen|first3=David|s2cid=11304924|isbn=9781595935977}}
- {{cite book|doi=10.1145/1458082.1458113|chapter=Comparing citation contexts for information retrieval|title=Proceedings of the 17th ACM Conference on Information and Knowledge Mining - CIKM '08|pages=213–222|year=2008|last1=Ritchie|first1=Anna|last2=Robertson|first2=Stephen|last3=Teufel|first3=Simone|s2cid=15585395|isbn=9781595939913}}
- {{cite book|doi=10.1145/2910896.2910908|chapter-url=https://www.gipp.com/wp-content/papercite-data/pdf/schwarzer2016.pdf|chapter=Evaluating Link-based Recommendations for Wikipedia|title=Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries - JCDL '16|pages=191–200|year=2016|last1=Schwarzer|first1=Malte|last2=Schubotz|first2=Moritz|last3=Meuschke|first3=Norman|last4=Breitinger|first4=Corinna|last5=Markl|first5=Volker|author-link5=Volker Markl|last6=Gipp|first6=Bela|isbn=9781450342292|s2cid=2597308|url=http://nbn-resolving.de/urn:nbn:de:bsz:352-2-1unn74ajzyyz15}}
Further reading
For an interesting summary of the progression of the study of citations see.{{cite journal | last1 = Small | first1 = Henry | year = 2001 | title = Belver and Henry | doi = 10.1023/a:1019690918490 | journal = Scientometrics | volume = 51 | issue = 3| pages = 489–497 | s2cid = 5962665 }} The paper is more a memoir than a research paper, filled with decisions, research expectations, interests and motivations—including the story of how Henry Small approached Belver Griffith with the idea of co-citation and they became collaborators, mapping science as a whole.
External links
- [http://www.isg.uni-konstanz.de/projects/citrec/ CITREC], an evaluation framework for citation-based similarity measures including Bibliographic Coupling, Co-citation, Co-citation Proximity Analysis and others.Bela Gipp, Norman Meuschke & Mario Lipinski, 2015. [http://gipp.com/wp-content/papercite-data/pdf/gipp15b.pdf "CITREC: An Evaluation Framework for Citation-Based Similarity Measures based on TREC Genomics and PubMed Central"] in Proceedings of the iConference 2015, Newport Beach, California, 2015.
- Jeppe Nicolaisen, [https://web.archive.org/web/20120315074624/http://www.iva.dk/bh/Core%20Concepts%20in%20LIS/articles%20a-z/Bibliographic%20coupling.htm Bibliographic coupling] in Birger Hjørland, ed., [https://web.archive.org/web/20120226041700/http://www.iva.dk/bh/Core%20Concepts%20in%20LIS/home.htm Core Concepts in Library and Information Science]
{{Academic publishing}}