FORECASTING INDUSTRIAL PH LEVELS: COMPARATIVE STUDY OF SARIMA, REGRESSION TREES AND CONTROL CHART DIAGNOSTICS

Authors

  • MOSTAFA ESSAM EISSA Independent Researcher and Consultant, Cairo, Egypt.

DOI:

https://doi.org/10.55197/qjoest.v6i3.235

Keywords:

CART, SARIMA, statistical process control, box-cox transformation, EWMA, run chart

Abstract

Implementation of Statistical Process Control (SPC) techniques in food and beverage industry are crucial to deliver consumable product that meets customer expectations. This study investigated industrial pH forecasting and process stability in a syrup manufacturing facility. We analyzed 1,247 pH observations with three objectives: (1) Quantify instability via control charts, (2) Model pH dynamics using Seasonal Autoregressive Integrated Moving Average (SARIMA) and Classification And Regression Trees (CART), and (3) Develop diagnostic frameworks for unstable processes. Methodologically, Exponentially Weighted Moving Average (EWMA) charts assessed stability; Box-Cox transformed SARIMA (λ=2) with seasonal differencing was used for forecasting; CART identified variable importance. Control charts revealed profound instability: 83.3% of points violated 3σ limits; run tests significant (p<0.001). For SARIMA, (1,0,1)(0,1,1)₁₂ achieved significant parameters (p<0.0001) with improved residual diagnostics versus non-seasonal ARIMA, though minor autocorrelation remained at lag 12 (p=0.003). CART explained training R²=18.86% and test R²=17.93% of pH variation, identifying filling weight and sodium benzoate as key predictors. Crucially, this study demonstrates that forecasting retains diagnostic utility even in unstable environments: SARIMA residuals provide seasonal fingerprints of assignable causes, while CART thresholds guide intervention priorities. SARIMA(1,0,1)(0,1,1)₁₂ demonstrated superior residual properties: eliminated back forecast warnings (present in ARIMA), reduced autocorrelation at lag 24 (p=0.017 vs 0.040), and explicitly modeled 12-period seasonality. While process instability persists, SARIMA provides diagnostic fingerprints of assignable causes through seasonal parameters (SMA₁₂=0.9846, T=513.12) and residual patterns. We conclude that SARIMA offers enhanced short-term forecasting capability, but process intervention remains essential for reliability. The study advocates for integrated instability-informed forecasting combining SARIMA diagnostics, real-time control charts, and expanded sensor deployment.

References

[1] Aparisi, F., García-Díaz, J.C. (2007): Design and optimization of EWMA control charts for in-control, indifference, and out-of-control regions. – Computers & Operations Research 34(7): 2096-2108.

[2] Blower, P.E., Cross, K.P. (2006): Decision tree methods in pharmaceutical research. – Current Topics in Medicinal Chemistry 6(1): 31-39.

[3] Box, G.E.P., Cox, D.R. (1964): An analysis of transformations. – Journal of the Royal Statistical Society: Series B 26(2): 211-243.

[4] Box, G.E., Jenkins, G.M., Reinsel, G.C., Ljung, G.M. (2015): Time series analysis: forecasting and control. – John Wiley & Sons 720p.

[5] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J. (2017): Classification and regression trees. – Chapman and Hall/CRC, NY, USA 368p.

[6] Chen, L., Ao, F., Ge, X., Shen, W. (2020): Food-grade Pickering emulsions: Preparation, stabilization and applications. – Molecules 25(14): 24p.

[7] Chen, L., Wu, T., Wang, Z., Lin, X., Cai, Y. (2023): A novel hybrid BPNN model based on adaptive evolutionary Artificial Bee Colony Algorithm for water quality index prediction. – Ecological Indicators 146: 15p.

[8] De Ketelaere, B., Hubert, M., Schmitt, E. (2015): Overview of PCA-based statistical process-monitoring methods for time-dependent, high-dimensional data. – Journal of Quality Technology 47(4): 318-335.

[9] Domański, P.D. (2016): Non-Gaussian and persistence measures for control loop quality assessment. – Chaos: An Interdisciplinary Journal of Nonlinear Science 26(4): 18p.

[10] Fellows, P.J. (2022): Food processing technology: Principles and practice. – Woodhead Publishing 804p.

[11] Hyndman, R.J., Athanasopoulos, G. (2018): Forecasting: Principles and practice. – OTexts 384p.

[12] International Commission on Microbiological Specifications for Foods (ICMSF) (2018): Microorganisms in foods 7: Microbiological testing in food safety management. – Springer 487p.

[13] Kotsiantis, S.B. (2013): Decision trees: A recent overview. – Artificial Intelligence Review 39(4): 261-283.

[14] Ljung, G.M., Box, G.E.P. (1978): On a measure of lack of fit in time series models. – Biometrika 65(2): 297-303.

[15] Lucas, J.M., Saccucci, M.S. (1990): Exponentially weighted moving average control schemes: Properties and enhancements. – Technometrics 32(1): 1-12.

[16] Minitab, L.L.C. (2023): Getting started with Minitab statistical software. – State College, PA: Minitab, Inc. 63p.

[17] Montgomery, D.C. (2020): Introduction to statistical quality control. – John Wiley & Sons 768p.

[18] Ryan, T.P. (2011): Statistical methods for quality improvement. – John Wiley & Sons 704p.

[19] Woodall, W.H. (2000): Controversies and contradictions in statistical process control. – Journal of Quality Technology 32(4): 341-350.

[20] Woodall, W.H., Montgomery, D.C. (2014): Some current directions in the theory and application of statistical process monitoring. – Journal of Quality Technology 46(1): 78-94.

[21] Xu, H., Hu, B., Huang, W., Du, X., Shao, C., Xie, K., Li, W. (2023): A hybrid knowledge-based and data-driven method for aging-dependent reliability evaluation of high-voltage circuit breaker. – IEEE Transactions on Power Delivery 38(6): 4384-4396.

Downloads

Published

2025-09-29

Issue

Section

Articles

How to Cite

FORECASTING INDUSTRIAL PH LEVELS: COMPARATIVE STUDY OF SARIMA, REGRESSION TREES AND CONTROL CHART DIAGNOSTICS. (2025). Quantum Journal of Engineering, Science and Technology, 6(3), 14-22. https://doi.org/10.55197/qjoest.v6i3.235