Minimizing the data quality problem of information systems: A process-based method
作者:
Highlights:
• A process-based optimization method for minimizing the data quality problems is proposed.
• The method considers the impact of data operation nodes and data flow structures on the propagation and accumulation of data quality problem.
• The practical applicability of the method is demonstrated by using a realistic information system.
• The method identified the cost-effective resource-allocation strategy.
摘要
The low quality of data in information systems poses enormous risks to business operations and decision making. In this paper, a single-period resource allocation problem for controlling the information system's data quality problem is considered. We develop a Data-Quality-Petri net to capture the process through which data quality problem generates, propagates, and accumulates in the information system. The net considers not only the factors leading to the production of the data quality problem by the data operation nodes and the data flow structure, but also the data transfer ratio of the nodes. Then, we propose a nonlinear programming optimization model with control resource constraints. The result of the model provides an optimal strategy to allocate resources for minimizing the expected data quality problem of an information system. Further, we examine the impact of the data flow structure on optimal resource allocation. The result shows that the optimal resource input level for a data operation node is proportional to its potential for downstream propagation. A warehouse management system of an e-commerce company is utilized to illustrate the model. Our study provides a method for data managers to control the information system's data quality problem by employing a process perspective.
论文关键词:Data quality,Information system,Petri net,Optimization model,Process model
论文评审过程:Received 23 January 2020, Revised 28 May 2020, Accepted 31 July 2020, Available online 8 August 2020, Version of Record 19 August 2020.
论文官网地址:https://doi.org/10.1016/j.dss.2020.113381