Data collection system

A data collection system (DCS) is a computer application that facilitates data collection, allowing specific, structured information to be gathered systematically so that it can subsequently be analyzed.{{cite web | url = https://www.techopedia.com/definition/11311/data-collection-system-dcs | title = What is a Data Collection System (DCS)? - Definition from Techopedia | date = 12 February 2016 | publisher = Techopedia.com | accessdate = 2016-10-14 }}{{cite web | url = https://www.rita.dot.gov/bts/sites/rita.dot.gov.bts/files/subject_areas/statistical_policy_and_research/bts_statistical_standards_manual/html/chapter_02.html | title = Planning and Design of Data Collection Systems | publisher = U.S. Department of Transportation (US DOT) | date = 2005-08-15 | accessdate = 2016-10-14}}{{cite web | url = https://www.cdc.gov/nchs/surveys.htm | title = Surveys and Data Collection Systems | publisher = U.S. Department of Health & Human Services | date = 2016-04-16 | accessdate = 2016-10-14}} Typically, a DCS displays a form that accepts input from a user and validates that input before committing the data to persistent storage such as a database.
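The accept, validate, and store cycle described above can be illustrated with a minimal sketch. It assumes a hypothetical two-field form (`respondent`, `age`) and an SQLite table named `responses`; these names, the validation rules, and the helper functions `open_store`, `validate`, and `submit` are illustrative assumptions, not part of any particular DCS.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the persistent store and make sure the (hypothetical) table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS responses ("
        "respondent TEXT NOT NULL, age INTEGER NOT NULL)"
    )
    return conn


def validate(form):
    """Return a list of error messages; an empty list means the input is valid."""
    errors = []
    if not str(form.get("respondent", "")).strip():
        errors.append("respondent is required")
    try:
        age = int(form.get("age", ""))
    except (TypeError, ValueError):
        errors.append("age must be a whole number")
    else:
        if not 0 <= age <= 130:
            errors.append("age must be between 0 and 130")
    return errors


def submit(conn, form):
    """Validate the form and commit it to storage only if no errors were found."""
    errors = validate(form)
    if errors:
        return errors  # nothing is written when validation fails
    with conn:  # commits on success, rolls back on an exception
        conn.execute(
            "INSERT INTO responses (respondent, age) VALUES (?, ?)",
            (str(form["respondent"]).strip(), int(form["age"])),
        )
    return []


store = open_store()
print(submit(store, {"respondent": "Ada", "age": "36"}))  # []
print(submit(store, {"respondent": "", "age": "abc"}))
```

Keeping validation separate from the insert means that rejected input never reaches the database, which is the ordering the paragraph above describes.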

Many computer systems implement data entry forms, but data collection systems tend to be more complex, with possibly many related forms containing detailed user input fields, data validations, and navigation links among the forms.

DCSs can be considered a specialized form of content management system (CMS), particularly when they allow the information being gathered to be published, edited, modified, deleted, and maintained. Some general-purpose CMSs include features of DCSs.{{cite web | url = https://support.office.com/en-za/article/Create-data-forms-using-SharePoint-Designer-5b5e3970-af22-45d5-a796-edfe7dda15f6 | title = Using SharePoint Forms for Data Collection | publisher = Microsoft Corporation | accessdate = 2016-10-14 }}{{cite web | url = https://www.drupal.org/node/145576 | title = Using Drupal for Multi-Page Collection of Data from Users | publisher = The Drupal Association | date = 2009-07-03 | accessdate = 2016-10-14}}

Importance

Accurate data collection is essential to many business processes,{{cite web|title=Data collection|url=http://searchcio.techtarget.com/definition/data-collection|website=SearchCIO|publisher=TechTarget|accessdate=20 December 2016}}{{cite web|title=Which Data Collection Method Should I Choose?|url=https://www.b2binternational.com/research/methods/faq/which-data-collection-method-should-i-choose/|website=B2B International|accessdate=20 December 2016}}{{cite web|title=How and Why Data Will Save Small Business|url=https://smallbiztrends.com/2015/03/small-business-data-collection.html|website=Small Business Trends|publisher=Small Business Trends LLC|accessdate=20 December 2016|date=2015-03-20}} to the enforcement of many government regulations,{{cite web|title=FAQ: Data Collection Requirements for Broker-Dealers|url=http://www.finra.org/industry/faq-data-collection-requirements-broker-dealers|website=FINRA.org|publisher=Financial Industry Regulatory Authority, Inc. on behalf of the U.S. Securities and Exchange Commission (SEC)|accessdate=4 February 2017}} and to maintaining the integrity of scientific research.{{cite book|last1=Sapsford|first1=Roger|last2=Jupp|first2=Victor|title=Data Collection and Analysis|isbn=0-7619-5046-X}}

Data collection systems are an end product of software development. Recognizing that a piece of software, or a software sub-system, has aspects of a data collection system, or is one outright, allows existing knowledge about such systems to be gathered and applied in the design and implementation of future systems. Software design benefits from identifying generalizations and patterns and from reusing existing knowledge whenever possible.{{cite journal|title=The role of opportunism in the software design reuse process|journal=IEEE Transactions on Software Engineering|volume=23|issue=7|pages=418–436|doi=10.1109/32.605760|year=1997|last1=Sen|first1=A.}}

Types

Generally the computer software used for data collection falls into one of the following categories of practical application.{{cite web|title=Data Collection Software|url=https://www.getapp.com/customer-management-software/data-collection/|website=GetApp|publisher=Nubera eBusiness S.L.|accessdate=20 December 2016}}

  • Surveys or questionnaires{{cite web | url = http://www.norc.org/research/capabilities/pages/data-collection-and-management/survey-data-collection.aspx | title = Survey Data Collection | publisher = NORC at the University of Chicago | year = 2016 | accessdate = 2016-10-14}}{{cite web | url = https://surveys.nces.ed.gov/ipeds/ViewContent.aspx?contentId=16 | title = Using the Data Collection System | publisher = U.S. Department of Education | year = 2016 | accessdate = 2016-10-14}}
  • Data registries{{cite web | url = http://cvquality.acc.org/en/NCDR-Home/Data-Collection/How-to-Collect-Data.aspx | title = How to Collect Data | publisher = American College of Cardiology | date = 2016 | accessdate = 2016-10-14}}{{cite journal|title=eRegistries: Electronic registries for maternal and child health|volume=16|pages=11|pmc=4721069|journal=BMC Pregnancy and Childbirth|year=2016|last1=Frøen|first1=J. F.|last2=Myhre|first2=S. L.|last3=Frost|first3=M. J.|last4=Chou|first4=D.|last5=Mehl|first5=G.|last6=Say|first6=L.|last7=Cheng|first7=S.|last8=Fjeldheim|first8=I.|last9=Friberg|first9=I. K.|last10=French|first10=S.|last11=Jani|first11=J. V.|last12=Kaye|first12=J.|last13=Lewis|first13=J.|last14=Lunde|first14=A.|last15=Mørkrid|first15=K.|last16=Nankabirwa|first16=V.|last17=Nyanchoka|first17=L.|last18=Stone|first18=H.|last19=Venkateswaran|first19=M.|last20=Wojcieszek|first20=A. M.|last21=Temmerman|first21=M.|last22=Flenady|first22=V. J.|pmid=26791790|doi=10.1186/s12884-016-0801-7 |doi-access=free }}
  • Case management systems{{cite journal|title=Electronic Data Collection Options for Practice-Based Research Networks|volume=3|issue=Suppl 1|pages=s21–s29|pmc=1466955|journal=Annals of Family Medicine|year=2005|last1=Pace|first1=W. D.|last2=Staton|first2=E. W.|pmid=15928215|doi=10.1370/afm.270}}
  • Performance measurement systems{{cite web|title=MANAGING DATA FOR PERFORMANCE IMPROVEMENT|url=https://www.hrsa.gov/sites/default/files/quality/toolbox/508pdfs/managingdataperformanceimprovement.pdf|website=U. S. Department of Health and Human Services Health Resources and Services Administration}}{{cite journal|title=Collecting and Reporting Data for Performance Measurement: Moving Toward Alignment|journal=Proceedings of the AHRQ Conference on Health Care Data Collection and Reporting|date=November 8–9, 2006|volume=AHRQ Publication No. 07-0033-EF|issue=March 2007|url=http://bok.ahima.org/PdfView?oid=70430|accessdate=4 February 2017}}
  • Exams and quizzes{{cite web|title=Quiz - Drupal.org|url=https://www.drupal.org/project/quiz|website=Drupal.org|date=6 July 2005 |publisher=Dries Buytaert|accessdate=20 December 2016}}{{cite web|title=Online QuizBuilder web app built with Laravel|url=http://webxity.com/portfolios/online-quizbuilder-web-app-built-with-laravel/|website=Webxity|publisher=Webxity Technologies}}
  • Online forms and form filing and reporting systems{{cite web|title=Regulatory Filing|url=http://www.finra.org/industry/regulatory-filings|website=FINRA.org|publisher=Financial Industry Regulatory Authority, Inc. on behalf of the U.S. Securities and Exchange Commission (SEC)|accessdate=4 February 2017}}

Vocabulary

There is a taxonomic scheme associated with data collection systems, with readily identifiable synonyms used by different industries and organizations.{{cite book|last1=Hay|first1=David C.|title=Data model patterns a metadata map|date=2006|publisher=Elsevier Morgan Kaufmann|location=Amsterdam|isbn=978-0120887989|page=40|edition= [Repr.].|url=https://books.google.com/books?id=YxDBaWj9itkC&pg=PA40|accessdate=5 February 2017}}{{cite web|title=Classification, Taxonomies and You|url=http://www.weitkamper.com/download/verity/verity_mk0648.pdf|website=Verity|publisher=Verity, Inc.|accessdate=6 February 2017}}{{cite journal|last1=Bayona-Oré|first1=Sussy|last2=Calvo-Manzano|first2=Jose A.|last3=Cuevas|first3=Gonzalo|last4=San-Feliu|first4=Tomas|title=Critical success factors taxonomy for software process deployment|journal=Software Quality Journal|date=21 December 2012|volume=22|issue=1|pages=21–48|doi=10.1007/s11219-012-9190-y|s2cid=18047921}} Cataloging the most commonly used and widely accepted vocabulary improves efficiencies, helps reduce variations, and improves data quality.{{cite journal|title=Collecting and Reporting Data for Performance Measurement: Moving Toward Alignment|journal=Proceedings of the AHRQ Conference on Health Care Data Collection and Reporting|page= 13 of 50|date=November 8–9, 2006|volume=AHRQ Publication No. 07-0033-EF|issue=March 2007|url=http://bok.ahima.org/PdfView?oid=70430|accessdate=4 February 2017}}{{cite web|last1=Busch|first1=Joseph|title=Conducting Taxonomy Validation: Healthcare Example|url=http://taxonomystrategies.com/wp-content/uploads/2016/02/Conducting%20Taxonomy%20Validation-Healthcare%20Example.pdf|website=Taxonomy Strategies|publisher=Taxonomy Strategies LLC|accessdate=7 February 2017}}{{cite web|title=6 Challenges: Performance Measurement Data Collection & Reporting|url=http://www.extractsystems.com/healthydata-blog/2016/12/2/6-challenges-of-performance-measurement-data-collection-and-reporting|website=Extract Systems|date=15 December 2016 |accessdate=7 February 2017}}

The vocabulary of data collection systems stems from the fact that these systems are often a software representation of what would otherwise be a paper data collection form with a complex internal structure of sections and sub-sections. Modeling these structures and relationships in software yields technical terms describing the hierarchy of data containers, along with a set of industry-specific synonyms.{{cite book|last1=Hay|first1=David C.|title=Data model patterns : conventions of thought|date=1996|publisher=Dorset House Pub.|location=New York|isbn=978-0932633293|page=218ff|url=https://books.google.com/books?id=IUVsAQAAQBAJ&pg=PA218|accessdate=6 February 2017}}{{cite journal|last1=Wendicke|first1=Annemarie|title=What Makes Data Meaningful? The Important Role of Data Structures|journal=Journal of AHIMA|volume=87|issue=3|pages=34–36|date=March 2016|pmid=27039625|url=http://bok.ahima.org/doc?oid=301394#.WJlpzBAnb8o|accessdate=7 February 2017}}
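As a rough illustration of how a paper form's sections and sub-sections might be modeled as a hierarchy of data containers, the following sketch nests four levels using terms cataloged in the subsections below (collection, data model, section, data element). The class names, the `data_type` field, the `element_paths` helper, and the sample registry are all hypothetical choices made for this example, not a standard schema.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class DataElement:
    """Lowest level of the model: describes one piece of data to collect."""
    name: str
    data_type: str


@dataclass
class Section:
    """A section may hold data elements and nested sub-sections."""
    name: str
    children: List[Union["Section", DataElement]] = field(default_factory=list)


@dataclass
class DataModel:
    """One form or document, made up of sections."""
    name: str
    children: List[Section] = field(default_factory=list)


@dataclass
class Collection:
    """Topmost container grouping related data models."""
    name: str
    children: List[DataModel] = field(default_factory=list)


def element_paths(node, prefix=()):
    """Yield the slash-separated path of every data element under node."""
    here = prefix + (node.name,)
    if isinstance(node, DataElement):
        yield "/".join(here)
    else:
        for child in node.children:
            yield from element_paths(child, here)


# Hypothetical example: a registry with one visit form.
registry = Collection("registry", [
    DataModel("visit_form", [
        Section("vitals", [
            DataElement("heart_rate", "integer"),
            Section("blood_pressure", [DataElement("systolic", "integer")]),
        ]),
    ]),
])

for path in element_paths(registry):
    print(path)
```

Because every container exposes `name` and `children`, a single recursive walk can reach each data element regardless of how deeply the sections are nested.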

=Collection synonyms=

A collection (used as a noun) is the topmost container for grouping related documents, data models, and datasets. Typical vocabulary at this level includes the terms:

{{columns-list|colwidth=15em|

  • Project
  • Registry
  • Repository
  • System
  • Top-level Container
  • Library
  • Study
  • Organization
  • Party
  • Site

}}

=Data model synonyms=

Each document or dataset within a collection is modeled in software. Constructing these models is part of designing or "authoring" the expected data to be collected. The terminology for these data models includes:

{{columns-list|colwidth=15em|

  • Datamodel
  • Data dictionary
  • Schema
  • Form
  • Document
  • Survey
  • Instrument
  • Questionnaire
  • Data Sheet
  • Expected Measurements
  • Expected Observations
  • Encounter Form
  • Study Visit Form

}}

=Sub-collection or master-detail synonyms=

Data models are often hierarchical, containing sub-collections or master–detail structures described with terms such as:

{{columns-list|colwidth=30em|

  • Section, Sub-section
  • Block
  • Module
  • Sub-document
  • Roster
  • Parent-Child{{cite web|title=NCDR® AFib Ablation Registry™ v1.0 - Data Dictionary - Full Specifications [PDF]|page=36 of 143|url=http://cvquality.acc.org/~/media/QII/NCDR/AFib/AFA_v1_DataDictionaryFullSpecifications_FINAL%20July%202015.ashx|website=ACC Quality Improvement for Institutions|publisher=American College of Cardiology|accessdate=9 February 2017|language=en}}
  • Dynamic List

}}

=Data element synonyms=

At the lowest level of the data model are the data elements that describe individual pieces of data. Synonyms include:{{cite web|title=Data Element: Federal Standard 1037C: Glossary of Telecommunications Terms|url=https://www.its.bldrdoc.gov/fs-1037/fs-1037c.htm|website=www.its.bldrdoc.gov|publisher=U.S. Dept. of Commerce, Institute for Telecommunication Sciences|accessdate=7 February 2017|archive-date=1 March 2011|archive-url=https://web.archive.org/web/20110301100455/http://www.its.bldrdoc.gov/fs-1037/fs-1037c.htm|url-status=dead}}

{{columns-list|colwidth=30em|

}}

=Data point synonyms=

Moving from the abstract domain model to the concrete data that is actually collected, the lowest level is the data point within a dataset. Synonyms for data point include:

{{columns-list|colwidth=30em|

  • Value
  • Input
  • Answer
  • Response
  • Observation
  • Measurement
  • Parameter Value
  • Column Value

}}

=Dataset synonyms=

Finally, the synonyms for dataset include:

{{columns-list|colwidth=30em|

  • Row
  • Record
  • Occurrence
  • Instance
  • (Document) Filing
  • Episode
  • Submission
  • Observation Point
  • Case
  • Test
  • (Individual) Sample

}}
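The relationship between a dataset and its data points can be sketched in the same spirit: a dataset is one filled-in instance (a row or record), and each data point is a single value recorded against a named data element. The `DataPoint` and `Dataset` classes, the `missing_elements` helper, and the sample values below are hypothetical and only meant to make the terminology concrete.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class DataPoint:
    """One collected value (answer, response, measurement)."""
    element: str  # name of the data element this value belongs to
    value: Any


@dataclass
class Dataset:
    """One record: the data points gathered in a single submission."""
    model: str  # name of the data model (form) that was filled in
    points: List[DataPoint] = field(default_factory=list)

    def as_row(self) -> Dict[str, Any]:
        """Flatten the data points into a column-name to value mapping."""
        return {p.element: p.value for p in self.points}


def missing_elements(dataset: Dataset, expected: List[str]) -> List[str]:
    """List the expected data elements that have no data point in the dataset."""
    present = {p.element for p in dataset.points}
    return [name for name in expected if name not in present]


# Hypothetical submission against a form expecting three data elements.
visit = Dataset("visit_form", [
    DataPoint("heart_rate", 72),
    DataPoint("systolic", 118),
])
print(visit.as_row())
print(missing_elements(visit, ["heart_rate", "systolic", "diastolic"]))
```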

References

{{Reflist|30em}}