Special Interest Group on Knowledge Discovery and Data Mining

{{Short description|Group within the Association for Computing Machinery}}

{{Multiple issues|

{{more citations needed|date=September 2016}}

{{cleanup|date=September 2016|reason=too promotional, prose needs rewriting}}

}}

SIGKDD, representing the Association for Computing Machinery's (ACM) Special Interest Group (SIG) on Knowledge Discovery and Data Mining, hosts an influential annual conference.

Conference history

The KDD Conference grew from KDD (Knowledge Discovery and Data Mining) workshops at AAAI conferences, which were started by Gregory I. Piatetsky-Shapiro in 1989, 1991, and 1993, and Usama Fayyad in 1994.{{cite web |url=http://www.sigkdd.org/conferences.php |title=ACM SIGKDD: Conferences |website=www.sigkdd.org |url-status=dead |archive-url=https://web.archive.org/web/20060615083649/http://sigkdd.org/conferences.php |archive-date=2006-06-15}} Conference papers of each proceedings of the SIGKDD International Conference on Knowledge Discovery and Data Mining are published through ACM.{{cite web|url=http://dl.acm.org/event.cfm?id=RE329|title=Event: KDD|work=acm.org|access-date=2011-09-01|archive-date=2017-06-16|archive-url=https://web.archive.org/web/20170616151933/http://dl.acm.org/event.cfm?id=RE329|url-status=live}} KDD is widely considered the most influential forum for knowledge discovery and data mining research.{{Cite web|url=http://www.conferenceranks.com/visualization/msar2014.html?field=Data+Mining&visualization=Bars|title=Conference Ranks|website=www.conferenceranks.com|access-date=2019-10-30|archive-date=2020-10-22|archive-url=https://web.archive.org/web/20201022084004/http://www.conferenceranks.com/visualization/msar2014.html?field=Data+Mining&visualization=Bars|url-status=live}}{{Cite web|url=http://www.conferenceranks.com/index.html?searchall=sigkdd|title=Conference Ranks|website=www.conferenceranks.com|access-date=2016-08-30|archive-date=2016-09-11|archive-url=https://web.archive.org/web/20160911121544/http://www.conferenceranks.com/index.html?searchall=sigkdd|url-status=live}}

class="wikitable" align="right"
Year

! Conference location

2011

| San Diego, United States

2012

| Beijing, China

2013

| Chicago, IL, United States

2014

| New York City, NY, United States

2015

| Sydney, Australia

2016

| San Francisco, CA, United States

2017

| Halifax, Canada

2018

| London, England

2019

| Anchorage, AK, United States

2020

|San Diego, CA, United States

2021

|Virtual Conference

2022

|Washington, D.C., United States

2023

|Long Beach, California, United States

2024{{Cite web |title=KDD 2024 |url=https://kdd2024.kdd.org/ |access-date=2023-12-14 |website=ACM KDD 2024 |language=en-US |archive-date=2023-12-14 |archive-url=https://web.archive.org/web/20231214143212/https://kdd2024.kdd.org/ |url-status=live }}

|Barcelona, Spain

2025

|Toronto, ON, Canada

The KDD conference has been held each year since 1995, and SIGKDD became an official ACM Special Interest Group in 1998. Past conference locations are listed on the KDD conference web site.{{Cite web|url=https://www.kdd.org/conferences|title=SIGKDD - Conferences|website=www.kdd.org|access-date=2019-03-08|archive-date=2019-04-01|archive-url=https://web.archive.org/web/20190401211603/https://www.kdd.org/conferences|url-status=live}}

The annual ACM SIGKDD conference is recognized as a flagship venue in the field. Based on statistics provided by independent researcher Lexing Xie in her analysis “Visualizing Citation Patterns of Computer Science Conferences“{{Cite web|url=http://cm.cecs.anu.edu.au/citation/KDD/|title=KDD - Knowledge Discovery and Data Mining (1994-2015)|website=cm.cecs.anu.edu.au|access-date=2017-11-19|archive-date=2017-12-01|archive-url=https://web.archive.org/web/20171201034758/http://cm.cecs.anu.edu.au/citation/KDD/|url-status=live}} as part of the research in Computation Media Lab at Australian National University:

  • 4489 papers were published at ACM SIGKDD conference over in the years 1994–2015 (inclusive).
  • These 4489 papers had received 112570 citations in total across 3033 venues.
  • 56% of these 3033 venues are recognized as top 25 venues in the field.

The annual conference of ACM SIGKDD has received the highest rating A* from independent organization Computing Research and Education (a.k.a. CORE).{{Cite web|url=http://core.edu.au/conference-portal|title=CORE Rankings Portal - Computing Research & Education|website=core.edu.au|access-date=2019-10-30|archive-date=2019-10-21|archive-url=https://web.archive.org/web/20191021053743/http://www.core.edu.au/conference-portal|url-status=live}}

= Selection Criteria =

Like all flagship conferences, SIGKDD imposes a high requirement to present and publish submitted papers. The focus is on innovative research in data mining, knowledge discovery, and large-scale data analytics. Papers emphasizing theoretical foundations are particularly encouraged, as are novel modeling and algorithmic approaches to specific data mining problems in scientific, business, medical, and engineering applications. Visionary papers on new and emerging topics are particularly welcomed. Authors are explicitly discouraged from submitting papers that contain only incremental results or that do not provide significant advances over existing approaches.{{Cite web|url=https://www.kdd.org/kdd2014/calls.html|title=[Closed] Call for papers, workshop proposals, tutorial proposals | KDD 2014, 8/24-27, New York: Data Mining for Social Good|website=www.kdd.org|access-date=2019-10-30|archive-date=2019-10-30|archive-url=https://web.archive.org/web/20191030184756/https://www.kdd.org/kdd2014/calls.html|url-status=live}}

In 2014, over 2,600 authors from at least fourteen countries submitted over a thousand papers to the conference. A final 151 papers were accepted for presentation and publication, representing an acceptance rate of 14.6%.{{Cite web|url=https://www.slideshare.net/jleskovec/data-science-view-of-the-kdd-review-process|title=Data Science view of the KDD 2014|date=August 27, 2014|access-date=November 18, 2017|archive-date=December 21, 2015|archive-url=https://web.archive.org/web/20151221040315/http://www.slideshare.net/jleskovec/data-science-view-of-the-kdd-review-process|url-status=live}} This acceptance rate is slightly lower than those of other top computer science conferences, which typically have a rate of 15–25%.{{Cite web|url=http://haofengjia.weebly.com/computer-science.html|title=Computer Science Conferences Acceptance Rate|website=Haofeng Jia's Homepage|access-date=2017-11-18|archive-date=2017-12-01|archive-url=https://web.archive.org/web/20171201041855/http://haofengjia.weebly.com/computer-science.html|url-status=live}} The acceptance rate of a conference is only a proxy measure of its quality. For example, in the field of information retrieval, the [http://www.wsdm-conference.org/ WSDM conference] has a lower acceptance rate than the higher-ranked SIGIR.{{Cite web |title=Top Computer Science Conferences - Computer Science Conference Ranking |url=http://www.guide2research.com/topconf/ |website=research.com |access-date=2019-09-24 |archive-date=2019-09-30 |archive-url=https://web.archive.org/web/20190930120920/http://www.guide2research.com/topconf/ |url-status=live }}

Awards

The group recognizes members of the KDD community with its annual [http://www.kdd.org/sigkdd-innovation-award Innovation Award] and Service Award.{{cite web |url=http://www.kdd.org/innovation-service-awards |title=Awards {{!}} Sig KDD |website=www.kdd.org |url-status=dead |archive-url=https://web.archive.org/web/20120526134128/http://www.kdd.org/innovation-service-awards |archive-date=2012-05-26}}

Each year KDD presents a Best Paper Award{{cite web | url=http://jeffhuang.com/best_paper_awards.html#kdd | title=KDD Conference Best Paper Awards | accessdate=2012-04-07 | archive-date=2011-07-13 | archive-url=https://web.archive.org/web/20110713100135/http://jeffhuang.com/best_paper_awards.html#kdd | url-status=live }} to recognizes papers presented at the annual SIGKDD conference that advance the fundamental understanding of the field of knowledge discovery in data and data mining. Two research paper awards are granted: Best Research Paper Award Recipients and Best Student Paper Award Recipients.{{cite web | url=http://www.kdd.org/awards/sigkdd-best-research-paper-awards | title=SIGKDD BEST RESEARCH PAPER AWARDS | accessdate=2017-11-17 | archive-date=2017-12-07 | archive-url=https://web.archive.org/web/20171207102516/http://www.kdd.org/awards/sigkdd-best-research-paper-awards | url-status=live }}

= Best Paper Award (Best Research Track Paper) =

Winning the ACM SIGKDD Best Paper Award (Best Research Track Paper) is widely considered an internationally recognized significant achievement in a researcher's career.{{By whom|date=September 2019}} Authors compete with established professionals in the field, such as tenured professors, executives, and eminent industry experts from top institutions. It is common to find press articles and news announcements from the awardees’ institutions and professional media to celebrate this achievement.{{Cite web |title=Yahoo Wins Best Paper Award at KDD 2009 {{!}} research.yahoo.com |url=https://research.yahoo.com/news/yahoo-wins-best-paper-award-kdd-2009 |access-date=2023-10-23 |website=research.yahoo.com |archive-date=2023-10-30 |archive-url=https://web.archive.org/web/20231030102426/https://research.yahoo.com/news/yahoo-wins-best-paper-award-kdd-2009 |url-status=live }}{{Cite web |date=2015-08-17 |title=KDD 2015 Best Research Paper Award: "Algorithms for Public-Private Social Networks" |url=https://blog.research.google/2015/08/kdd-2015-best-research-paper-award.html |access-date=2023-10-23 |website=blog.research.google |language=en |archive-date=2023-10-30 |archive-url=https://web.archive.org/web/20231030102427/https://blog.research.google/2015/08/kdd-2015-best-research-paper-award.html |url-status=live }}

This award recognizes innovative scholarly articles that advance the fundamental understanding of the field of knowledge discovery in data and data mining.

Each year, the award is given to authors of the strongest paper by this criterion, selected by a rigorous process.

== Selection Process ==

The selection process follows multiple rounds of peer reviews under stringent criteria. The selection committee consists of leading experts who provide insightful and independent analysis on the merits and degree of innovation of the scholarly articles submitted by each author. The reviewers are required to be recognized subject experts who had extensive contributions to the specific subject area addressed by the paper. Reviewers are also required to be completely unaffiliated with the authors.

First, all papers submitted to the ACM SIGKDD conference are reviewed by research track program committee members. Each submitted paper is extensively reviewed by multiple committee members and detailed feedback is given to each author. After review, decisions are made by the committee members to accept or reject the paper based on the paper’s novelty, technical quality, potential impact, clarity, and whether the experimental methods and results are clear, well executed, and repeatable. During the process, committee members also evaluate the merits of each paper based on above factors, and make decision on recommending candidates for Best Paper Award (Best Research Track Paper).

The candidates for Best Paper Award (Best Research Track Paper) are extensively reviewed by conference chairs and the best paper award committee. The final determination of the award is based on the level of advancement made by authors through the paper to the understanding of the field of knowledge discovery and data mining. Authors of a single paper who are judged to have contributed the highest level of advancement to the field are selected as recipients of this award. Anyone who submits a scholarly article to SIGKDD is considered for this award.

== Previous winners ==

The ACM SIGKDD Best Paper Award (Best Research Track Paper) was given to 49 individuals between 1997 and 2014. Among these individuals, most are distinguished persons and established professionals with celebrated careers, who have made significant contributions to the field.

class="wikitable"
YearNamePositionAffiliation
1997Foster ProvostProfessorNew York University
1997Tom FawcettPrincipal Data ScientistSilicon Valley Data Science
1998, 1999Pedro DomingosProfessorUniversity of Washington
2000[http://people.cs.uchicago.edu/~amr/ Anne Rogers]Associate ProfessorUniversity of Chicago
2000Daryl Pregibon(Former) Head of Statistical ResearchAT&T Labs and Bell Labs
2000Kathleen FisherChair & ProfessorTufts University
2000Corinna CortesHead of ResearchGoogle
2001[https://www.stat.ubc.ca/~ruben/website/ Ruben H. Zamar]ProfessorUniversity of British Columbia
2001[https://www.cs.ubc.ca/~rng/ Raymond T. Ng]ProfessorUniversity of British Columbia
2001[http://www.cs.ubc.ca/~knorr/ Edwin M. Knorr]Tenured Senior InstructorUniversity of British Columbia
rowspan="2" | 2002rowspan="2" | Padhraic SmythProfessorUniversity of California, Irvine
Associate DirectorCenter for Machine Learning and Intelligent Systems
2002Darya ChudovaVP of BioinformaticsGuardant Health
2003Éva TardosProfessor & DeanCornell University
rowspan="4" | 2003, 2005rowspan="4" | Jon KleinbergProfessorCornell University
rowspan="3" | MemberNational Academy of Sciences
National Academy of Engineering
American Academy of Arts and Sciences
2003[http://www-bcf.usc.edu/~dkempe/ David Kempe]Associate ProfessorUniversity of Southern California
2004[https://www.cs.utexas.edu/~mooney/ Raymond J. Mooney]ProfessorThe University of Texas at Austin
2004Mikhail (Misha) BilenkoHead of AI and ResearchYandex
2004Sugato BasuPrincipal ScientistGoogle
rowspan="2" | 2004, 2005rowspan="2" | Christos FaloutsosProfessorCarnegie Mellon University
FellowACM
rowspan="3" | 2005rowspan="3" | Jure LeskovecAssociate ProfessorStanford University
Chief ScientistPinterest
Member, Board of DirectorsACM SIGKDD
rowspan="2" | 2006rowspan="2" | [http://www.cs.cornell.edu/people/tj/ Thorsten Joachims]Chair & ProfessorCornell University
FellowACM, AAAI, Humboldt
2007Srujana MeruguPrincipal Data ScientistFlipkart
rowspan="3" | 2007rowspan="3" | Deepak AgarwalVP of EngineeringLinkedIn
FellowAmerican Statistical Association
Member, Board of DirectorsACM SIGKDD
rowspan="2" | 2008rowspan="2" | [http://web.cs.ucla.edu/~weiwang/ Wei Wang]Chair & ProfessorUniversity of California, Los Angeles
DirectorScalable Analytics Institute
2008[http://biostat.ufl.edu/about/people/faculty/zou-fei/ Fei Zhou]ProfessorUniversity of Florida
2008[https://faculty.ist.psu.edu/xzz89/ Xiang Zhang]Associate ProfessorPennsylvania State University
2009Yehuda KorenStaff Research ScientistGoogle
rowspan="3" | 2010rowspan="3" |Carlos GuestrinDirector of Machine LearningApple Inc
ProfessorUniversity of Washington
Co-founder, CEOTuri (a.k.a. Dato, GraphLab)
2010Dafna ShahafAssistant ProfessorThe Hebrew University of Jerusalem
2010Kai-Wei ChangAssistant ProfessorUniversity of California, Los Angeles
2010[http://www.stat.ucdavis.edu/~chohsieh/rf/ Cho-Jui Hsieh]Assistant ProfessorUniversity of California, Davis
2010Hsiang-Fu YuApplied ScientistAmazon
rowspan="2" | 2010rowspan="2" | Chih-Jen LinDistinguished ProfessorNational Taiwan University
FellowACM, AAAI, IEEE
rowspan="2" | 2011rowspan="2" | Claudia PerlichChief ScientistDstillery
Adjunct ProfessorNew York University
2011[http://www.tau.ac.il/~saharon/ Saharon Rosset]Associate ProfessorTel Aviv University
2011Shachar KaufmanSenior Data ScientistMetromile
2012Thanawin RakthanmanonAssistant ProfessorKasetsart University, Thailand
2012Bilson CampanaStaff Software EngineerGoogle
2012[http://www.cs.unm.edu/~mueen/ Abdullah Mueen]Assistant ProfessorUniversity of New Mexico
2012[http://conteudo.icmc.usp.br/pessoas/gbatista/ Gustavo Batista]Associate ProfessorUniversidade de São Paulo
2012Brandon WestoverDirector, Critical Care EEG Monitoring ServiceMassachusetts General Hospital
2012Qiang ZhuData Science ManagerAirbnb
2012Jesin ZakariaSoftware EngineerMicrosoft
2012[http://www.cs.ucr.edu/~eamonn/ Eamonn Keogh]ProfessorUniversity of California, Riverside
rowspan="2" | 2013rowspan="2" | Edo LibertyPrincipal ScientistAmazon
Group ManagerAmazon AI Algorithms
rowspan="2" | 2014rowspan="2" | Alex SmolaDirector of Machine Learning and Deep LearningAmazon
ProfessorCarnegie Mellon University
2014[http://www.sravi.org/ Sujith Ravi]Staff Research ScientistGoogle
2014Amr AhmedStaff Research ScientistGoogle
rowspan="2" | 2014rowspan="2" | Aaron LiFounder[http://qokka.ai Qokka.ai]
(Former) Lead Inference EngineerScaled Inference

= Best Student Paper Award =

This only difference between "Best Student Paper Award" and "Best Paper Award (Best Research Track Paper)" is the limitation in competition.

All authors participating the conference are considered equally for "Best Paper Award (Best Research Track Paper)", and the award does not limit competition to any particular region, population, or age group.

However, "Best Student Paper Award" is limited to student authors only. "Best Student Paper Award" recognizes papers presented at the annual SIGKDD conference, with a student as a first author, that advance the fundamental understanding of the field of knowledge discovery in data and data mining.

KDD-Cup

SIGKDD sponsors the KDD Cup{{cite web |url=http://www.kdd.org/kddcup/ |title=ACM KDD CUP |website=www.kdd.org |url-status=dead |archive-url=https://web.archive.org/web/20110318184723/http://www.kdd.org/kddcup |archive-date=2011-03-18}} data mining competition every year in conjunction with the annual conference. It is aimed at members of the industry and academia, particularly students, interested in KDD.

SIGKDD Explorations

SIGKDD has also published a biannual academic journal titled SIGKDD Explorations{{cite web|url=http://www.kdd.org/explorations/|title=SIGKDD Explorations|author=SIGKDD Blog|work=kdd.org|access-date=2007-07-28|archive-date=2011-07-26|archive-url=https://web.archive.org/web/20110726153114/http://www.kdd.org/explorations/|url-status=live}} since June 1999{{cite web|title = SIGKDD Explorations : June 1999, Volume 1, Issue 1|url = http://www.kdd.org/explorations/view/june-1999-volume-1-issue-1|website = www.kdd.org|accessdate = 2015-12-31|last = Fayyad|first = Usama|publisher = ACM|archive-date = 2016-01-13|archive-url = https://web.archive.org/web/20160113055046/http://www.kdd.org/explorations/view/june-1999-volume-1-issue-1|url-status = live}} when Usama Fayyad took on role of Founding Editor-inChief as ACM SIGKDD was formed. Editors in Chief:

  • Charu Aggarwal (since 2014)
  • Bart Goethals (2010–2013)
  • Osmar R. Zaiane (2008–2010)
  • Ramakrishnan Srikant{{cite web|url=http://www.rsrikant.com/|title=Srikant's Home Page|work=rsrikant.com|access-date=2009-12-18|archive-date=2010-03-16|archive-url=https://web.archive.org/web/20100316174228/http://rsrikant.com/|url-status=live}} (2006–2007)
  • Sunita Sarawagi (2003–2006)
  • Usama Fayyad (Founding Editor-in-Chief) (1999–2002)

People

The original founding board of directors of SIGKDD in 1998 consist of:

  • Won Kim, president, Cyber Database Solutions, SIGKDD Chair
  • Rakesh Agrawal, IBM Almaden, SIGKDD Secretary/Treasurer
  • Usama Fayyad, Microsoft Research, SIGKDD Director and Editor-in-Chief of SIGKDD Explorations Newsletter
  • Gregory Piatetsky-Shapiro, Knowledge Stream Partners, SIGKDD Director
  • Daryl Pregibon, AT&T Labs, SIGKDD Director
  • Padhraic Smyth, U. of California Irvine, SIGKDD Director

Current chair:

Former Chairpersons:

Former Executive Committee (2009–2013)

Information Directors:

  • Ankur Teredesai (2011–)
  • Gabor Melli (2004–2011)
  • Ramakrishnan Srikant (1998–2003)

See also

References

{{reflist|30em}}