Difference between revisions of "Data Quality"

Line 3:
  ''"The term data quality refers both to the characteristics associated with high quality data and to the processes used to measure or improve the quality of data."''<ref>DAMA-DMBOK2, 1.3.1 Data Quality, p.453</ref><br><br>
- The '''Strong-Wang''' framework (1996)<ref><nowiki>http://mitiq.mit.edu/Documents/Publications/TDQMpub/14_Beyond_Accuracy.pdf</nowiki></ref> focuses on data consumers' perceptions of data. It describes 15 dimensions across four general categories of data quality:<ref>DAMA-DMBOK2, 1.3.3. Data Quality Dimensions, p.455</ref>
+ The [http://mitiq.mit.edu/Documents/Publications/TDQMpub/14_Beyond_Accuracy.pdf Strong-Wang] framework (1996)<ref><nowiki>http://mitiq.mit.edu/Documents/Publications/TDQMpub/14_Beyond_Accuracy.pdf</nowiki></ref> focuses on data consumers' perceptions of data. It describes 15 dimensions across four general categories of data quality:<ref>DAMA-DMBOK2, 1.3.3. Data Quality Dimensions, p.455</ref>
  * '''Intrinsic DQ:'''
  ** Accuracy
Line 22:
  ** Accessibility
  ** Access Security<br><br>
- <references />
+
+ [https://www.informatica.com/ca/services-and-training/glossary-of-terms/data-quality-definition.html Informatica] defines Data Quality as ''"The overall utility of a dataset as a function of its ability to be easily processed and analyzed for other users, usually by a database, data warehouse, or data analytics system."''<ref><nowiki>https://www.informatica.com/ca/services-and-training/glossary-of-terms/data-quality-definition.html</nowiki></ref><br></br>

Revision as of 16:51, 18 September 2020

The DAMA-DMBOK2 defines Data Quality (DQ) as “the planning, implementation, and control of activities that apply quality management techniques to data, in order to assure it is fit for consumption and meet the needs of data consumers.”[1]

It also notes: "The term data quality refers both to the characteristics associated with high quality data and to the processes used to measure or improve the quality of data."[2]

The Strong-Wang framework (1996)[3] focuses on data consumers' perceptions of data. It describes 15 dimensions across four general categories of data quality:[4]

  • Intrinsic DQ:
    • Accuracy
    • Objectivity
    • Believability
    • Reputation
  • Contextual DQ:
    • Value-added
    • Relevancy
    • Timeliness
    • Completeness
    • Appropriate amount of data
  • Representational DQ:
    • Interpretability
    • Ease of understanding
    • Representational consistency
    • Concise representation
  • Accessibility DQ:
    • Accessibility
    • Access Security
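
The taxonomy above can be written down as a small lookup structure, which is sometimes handy when tagging data quality rules or issues by dimension. The sketch below is illustrative only: the names `WANG_STRONG_DIMENSIONS`, `category_of`, and `total_dimensions` are hypothetical and do not come from DAMA-DMBOK2 or the original paper. The Contextual category is listed with Timeliness, as in Wang and Strong's 1996 paper, which brings the total to the 15 dimensions the framework describes.

```python
# Hypothetical representation of the framework's four categories and 15 dimensions.
# The structure and all names here are assumptions made for illustration.
WANG_STRONG_DIMENSIONS = {
    "Intrinsic": [
        "Accuracy",
        "Objectivity",
        "Believability",
        "Reputation",
    ],
    "Contextual": [
        "Value-added",
        "Relevancy",
        "Timeliness",
        "Completeness",
        "Appropriate amount of data",
    ],
    "Representational": [
        "Interpretability",
        "Ease of understanding",
        "Representational consistency",
        "Concise representation",
    ],
    "Accessibility": [
        "Accessibility",
        "Access security",
    ],
}


def category_of(dimension: str) -> str:
    """Return the category that contains the given dimension (case-insensitive)."""
    wanted = dimension.strip().lower()
    for category, dimensions in WANG_STRONG_DIMENSIONS.items():
        if wanted in (d.lower() for d in dimensions):
            return category
    raise KeyError(f"unknown dimension: {dimension!r}")


def total_dimensions() -> int:
    """Count the dimensions across all categories."""
    return sum(len(dimensions) for dimensions in WANG_STRONG_DIMENSIONS.values())
```

For example, `category_of("Completeness")` looks up the category for a dimension, and `total_dimensions()` offers a quick check that the structure matches the count stated in the text.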

Informatica defines Data Quality as "The overall utility of a dataset as a function of its ability to be easily processed and analyzed for other users, usually by a database, data warehouse, or data analytics system."[5]

  1. DAMA-DMBOK2, Figure 91 Context Diagram: Data Quality, p.451
  2. DAMA-DMBOK2, 1.3.1 Data Quality, p.453
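  3. Wang, R. Y. and Strong, D. M. (1996), "Beyond Accuracy: What Data Quality Means to Data Consumers", http://mitiq.mit.edu/Documents/Publications/TDQMpub/14_Beyond_Accuracy.pdf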
  4. DAMA-DMBOK2, 1.3.3. Data Quality Dimensions, p.455
  5. https://www.informatica.com/ca/services-and-training/glossary-of-terms/data-quality-definition.html