Big Data? Statistical Process Control Can Help!

被引:32
|
作者
Qiu, Peihua [1 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, Gainesville, FL 32610 USA
来源
AMERICAN STATISTICIAN | 2020年 / 74卷 / 04期
基金
美国国家科学基金会;
关键词
Covariates; Data-rich applications; Dynamic processes; Feature extraction; Image data; Spatio-temporal data; DYNAMIC SCREENING SYSTEM; PHASE-I ANALYSIS; CONTROL CHARTS; NONPARAMETRIC CUSUM; AUTOCORRELATED PROCESSES; IMPROVED DESIGN; SPC METHODS; MODEL; LANDSAT; PROFILE;
D O I
10.1080/00031305.2019.1700163
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
"Big data" is a buzzword these days due to an enormous amount of data-rich applications in different industries and research projects. In practice, big data often take the form of data streams in the sense that new batches of data keep being collected over time. One fundamental research problem when analyzing big data in a given application is to monitor the underlying sequential process of the observed data to see whether it is longitudinally stable, or how its distribution changes over time. To monitor a sequential process, one major statistical tool is the statistical process control (SPC) charts, which have been developed and used mainly for monitoring production lines in the manufacturing industries during the past several decades. With many new and versatile SPC methods developed in the recent research, it is our belief that SPC can become a powerful tool for handling many big data applications that are beyond the production line monitoring. In this article, we introduce some recent SPC methods, and discuss their potential to solve some big data problems. Certain challenges in the interface between the current SPC research and some big data applications are also discussed.
引用
收藏
页码:329 / 344
页数:16
相关论文
共 50 条