• Please log in or register to access this feature.

SEARCH

SEARCH BY CITATION

Keywords:

  • multiphase batch processes;
  • between-phase transition;
  • subspace separation;
  • quality interpretation and prediction

Abstract

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

The quality-concerned between-phase transition analysis is performed and an improved calibration modeling strategy is designed for quality prediction and interpretation in multiphase batch processes. From the between-phase viewpoint, the quality-related phase behaviors are decomposed and two subspaces are separated. In common subspace, the underlying quality-relevant variation stays invariable between the neighboring phases, showing the common contribution to quality. The other part changes with the alternation of phases and has the different influences on quality interpretation, termed specific subspace here. Based on subspace separation, between-phase transition regions are distinguished from steady phases. Different models are developed in steady phases and transition regions respectively for online quality prediction. Offline quality analyses are also conducted in two subspaces to explore the time cumulative effects. The proposed method gives an interesting insight into the phase behaviors and between-phase transitions for quality prediction. The feasibility and performance of the proposed method are illustrated with a typical multiphase batch process. © 2012 American Institute of Chemical Engineers AIChE J, 59: 108–119, 2013


Introduction

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

Data-based multivariate calibration methods have been widely used to establish a quantitative relationship between process measurement (X) and quality property (Y). Accurate qualitative and quantitative calibration analysis may help avoiding cumbersome and costly chemical measurements. In practice, calibration modeling and analysis can often be accomplished with familiar, conventional statistical techniques,[1-11] such as multiple linear regression, principal component regression, canonical correlation analysis (CCA), and partial least squares (PLS). The subject of calibration modeling and quality interpretation arouses new issues and demand specific solutions when it refers to multiphase batch processes where various phases generally operate orderly under the domination of different physical phenomena, revealing different effects on the final quality. How to characterize them efficiently is a challenging problem.

Multiway partial least squares (MPLS) model[12] uses process variables over the entire batch course as the input, which reveals well the time correlations throughout the cycle and, thus, shows efficiency for analysis of cumulative effects on quality. However, for multiphase batches, it is generally deemed that if the data are handled in a single matrix, the effect of one segment tends to be hidden more or less by the influence of another. The resulting tribulation is that the hidden effect could be useful in quality-concerned process analysis and control. It is commonly accepted that in multiphase cases, more underlying information can be explored by dividing the data into meaningful blocks either by the types of variables or by the part of the process they originate from and building multiple specific models instead of single modeling of all data. The effect of each block can be seen clearly and, thus, more comprehensive process understanding and quality analysis can be expected.

Considering that the phase multiplicity is an inherent nature of many batch processes, various strategies have been reported and can be put into quality prediction for online or offline applications.[13-25] One is to model the variable correlation within each phase under the influence of other phases by multiblock PLS (MBPLS).[13-18] Compared with MPLS, the advantage of MBPLS is mainly to allow easier interpretation of both the roles of each smaller meaningful block and the integrated contribution of all blocks. Qin et al.[14] have made a comprehensive review of multiblock algorithms and reported the super scores of MBPLS were identical to the scores of regular MPLS and, thus, achieved the same quality prediction performance. Further, Reinikainen and Hoskuldsson et al.[19] reported a priority PLS regression analysis method and its successful application to a multistep industrial process, which gave multiple phases descending priority following operation time sequence. The idea was to compute the predictor information first only on the basis of first phase. When no more significant latent variables (LV)s could be found from the first phase, the next predictor information was extracted from the data of the second phase so that the quality information that is not modeled by the previous phases will be left to be explained by the following phases. Zhao et al.[20] presented a two-level multiphase statistical analysis strategy to probe the phase-wise local and cumulative effects on quality interpretation and prediction by calibration modeling in both predictor and quality spaces. It revealed within each phase which part of process variation was responsible for quality variations and which part of quality variation was dominated. For online quality prediction, the subPLS modeling algorithm was first developed by Lu and Gao,[21] where all measurements within the same phase could use the same regression model. It was based on such a presupposition that despite the time-varying batch operation trajectory, the correlations between process and quality variables should remain similar within the same phase. Further development[22-25] has also been reported to further extend this research line.

In this work, a between-phase calibration modeling strategy is proposed for transition analysis and quality prediction and interpretation. For multiphase batch processes, how the effects of process variation on quality evolve with the phase changes is an interesting issue. The key point or most concerned is, thus, how to capture them along with phase alternation and process evolvement. This also requires the attention to the between-phase transitions and their difference from the steady phases. From this motivation, a multiset regression modeling method, termed multiset regression analysis (MsRA),[26] which was proposed to relate the inherent quality-related predictor variation across multiple data sets, can be used here as the basic modeling method. Following the theoretical developments and its property analysis in the previous work,[26] this study addresses the potential of using the said algorithm to solve one practical problem, the quality prediction and interpretation in multiphase batch processes with between-phase transitions. It should be noted that here MsRA only focuses on two neighboring phases for between-phase transition analysis and quality prediction. From the quality-concerned viewpoint, each of the neighboring phases can be separated into two subspaces, termed between-phase common and specific subspaces, respectively. In the between-phase common subspace, the quality-concerned process variability stays invariable from one phase to another, thus, revealing the same contributions to quality properties. That is, they explain the same part of quality variability repetitively. In the left between-phase specific subspace, the process variability is more specific, revealing different effects on quality interpretation between the two neighboring phases. That is, they explain different parts of quality characteristics in a complementary way. For one extreme case, if the quality-concerned systematic variations in the two neighboring phases can be all characterized in the common subspace, they will have quite the same quality contributions even though they may be described by different regression weights. The purposes of between-phase calibration analysis are two-fold: to explore the transition regions between two neighboring phases and give more meaningful insight into the phase behaviors with respect to quality prediction. Here, it should be noted that for quality analysis, the between-phase transition is different from that in process monitoring, as they check the changes of different underlying process variations. In comparison with the traditional calibration methods, the separation of the common and specific predictor variations and the dual care paid to both are the major differences.

Methodology

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

Transition problem in quality prediction

For multiphase batches, in general, different phases have different characteristics and reveal different effects on quality. From the quality-concerned viewpoint, the process may be divided into several modeling phases as indicated by the changes of the quality-relevant inherent characteristics. In process monitoring, the between-phase transition problem has been noticed.[27] It is in general deemed that the process actually covers steady status and dynamic transition status from a starting phase to the target phase. Valid process monitoring during transitions are attracting increasing attention.[27-29] Commonly, the transition pattern shows the gradual changes between two neighboring phases. The underlying characteristics of the transition patterns at the beginning resemble the characteristics of the starting phase, whereas those later transition patterns resemble the characteristics of the target phase. Based on the above recognition, some corresponding monitoring strategies have been developed.[27-29]

However, the between-phase transition problem is different and requires special attention with respect to quality prediction. Instead of only considering the process variables, dual attention is paid to both process variable correlations and those between process variables and quality. In this study, it is deemed that the contribution to quality may not completely change from one phase to another. That is, although the regression relationship may change as indicated by the prediction models, one part of the underlying quality-related predictor variation will stay invariable or consistent between the two neighboring phases. This reveals the same power of quality prediction and interpretation, as the prediction is made by directly regressing quality on the extracted regression scores (that is, the predictor variation). The difference in between-phase predictor variation reflects the different quality interpretability. Therefore, how the predictor variation information changes from one phase to another actually reveals the between-phase changeover of quality interpretability. Instead of isolating the effects of single phase on qualities, it is necessary to gain a detailed quality-concerned insight into the underlying phase characteristics from the between-phase viewpoint, which can provide important information of the changing quality interpretability from one phase to another. The key is how to separate the two different types of predictor variations in each phase.

Based on the above analysis, the phase behavior should be modeled in a quite different way from the conventional calibration methods. The MsRA[26] algorithm includes two versions, MsRA-score and MsRA-weight, which extract the similar regression scores and weights, respectively. Actually, the quality prediction and interpretation is directly related with the regression scores where the same scores will make the same contribution to quality. The regression weights are just for the calculation of scores, where even the similar weights can be figured out, the regression scores may be quite different, revealing quite different quality interpretability. In the study, as the quality interpretability is the concerned, the regression scores should be used as the analysis subject and MsRA-score algorithm can be used here as the basic modeling method for between-phase analysis. With this method, the original phase measurement space can be separated into two different subspaces, each subspace containing different underlying predictor information and revealing different effects on qualities. One subspace is called the common subspace, which is composed of the similar predictor variations between two neighboring phases. The other is called the between-phase specific predictor subspace, which is supported by the unique predictor variation in each phase. The two subspaces give the similar and different quality interpretability between two neighboring phases, respectively. The separation of the common and specific phase information and the attention given to both provide a good platform for the identification of transition regions.

Between-phase subspace separation

In each batch run, assume that J process variables are measured online at k = 1, 2, …, K time instances throughout the batch and Jy quality variables are obtained offline. Then, process observations collected from similar I batches can be organized as a three-way array X(I × J × K) and a corresponding quality matrix Y(I × Jy) as shown in Figure 1a. At each time, the means of each column are subtracted to approximately eliminate the main nonlinearity. Each variable is scaled to unit variance to handle different measurement units, thus, giving each equal weight. In this work, the batches are of equal length without special declaration so that the specific process time can be used as an indicator for data normalization.

Figure 1. Illustration of the between-phase regression modeling scheme (a) Batch-wise unfolding and data normalization, (b) between-phase subspace separation for the ith and i+1th phases, and (c) regression modeling for online quality prediction with respect to the ith steady phase.

Download figure to PowerPoint

image

Phase information should be identified before between-phase analysis. In multiphase modeling, one important issue is how to get the phase marks (i.e., phase division) before designing the phase model. Various strategies[21, 23, 30-34] have been reported from different viewpoints and based on different principles, providing a rich database for phase division, and can be put into practical process monitoring. In this work, assuming that no prior process knowledge is available, C phases can be readily identified along time direction by clustering algorithm as what was done in previous work,[21] where the PLS weights are used to evaluate the similarity of time-slices, revealing the changes of process-quality relationships. Certainly, the phase clustering information can also be modified using prior expertise. Then, the process data in each phase are arranged as Xi(I × JNi) (i = 1, 2, …, C), where I is the number of batches and Ni is the number of time samples in each phase. They are corresponding to the same quality data, Y(I × Jy).

The between-phase similarity in quality prediction and interpretation is deemed to be driven by some common regression scores, which are directly related to quality. Using MsRA-score algorithm,[26] the common regression scores are extracted focusing on two adjacent phases, which can describe the process variables by linear combinations. Here, it should be noted that for each phase, when it is analyzed with different adjacent phases, the subspace separation results may be different, revealing different between-phase relationships with respect to quality analysis.

As shown in Figure 1b, for any two neighboring phases, Xi(I × JNi) and Xi+1 (I × JNi+1), they may have a different number of variables but the same samples. Under the dual supervision of betweens-set relationship and quality variables (Y(I × Jy)), the between-phase common regression scores (Ti (I × inline image), where inline image is the number of retained common scores) are obtained in each of the two neighboring phases by the two-step MsRA algorithm[26]

  • display math(1)

where Ai is composed of the weight vectors ai calculated in the first step, and Pi is the regression loading matrix for Xi by the first-step MsRA. The weight coefficients (Ri) to compute scores (Ti) directly from original Xi is expressed based on the similar calculation in PLS. Ti is the scores from the first-step calculation. Ai is composed of weight vectors ai obtained in the second step. The final weight vectors (Ri) are obtained by the two-step calculation to compute scores (Ti) directly from the original Xi.

For two neighboring phases, Xi (I × JNi) and Xi+1 (I × JNi+1), the common scores are extracted to achieve the maximal correlations. Then, the representative global/common scores can be calculated by averaging the common scores from the two phases

  • display math(2)

Thus, each original observed data space (Xi) is separated into two different parts, inline image and inline image

  • display math(3)

where inline image are the inline image)-dimensional loadings or linear combination coefficients corresponding to the common prediction scores. Actually, GTg = Tg (TgT Tg) −1TgT is the orthogonal projector onto the column space of Tg, and HTg = IGTg = ITg (TgT Tg)TgT is the antiprojector with respect to the column space of Tg. Therefore, from another viewpoint, the two subspaces can also be obtained by projecting Xi onto two different projectors as XiGTg and XiHTg. It is clear that the two predictor subspaces are orthogonal to each other as inline image inline image = XiTGTgXi = 0.

Between-phase transition identification

In the between-phase common subspace, the variables enclose the same systematic predictor variations, as all of them are the linear combinations of common scores where the combination coefficients are indicated by inline image (JNi × inline image) in each phase. The reconstructed predictor variation inline image (I × JNi) is explained by the common scores and their linear combination relationship inline image (JNi × inline image). For each time-slice process data, the corresponding linear combinations is a J × inline image matrix inline image (J × inline image)) separated from inline image (JNi × inline image), which is composed of linear combination coefficients associated with inline image common scores for J process variables at each time. It is deemed that the linear combinations should be similar except with normal oscillations along time direction within the same phase. The linear combinations would be different from one phase to its neighboring one, representing different reconstruction relationships of common scores and different ways common scores act on predictor information. Therefore, by checking the changes of time-slice inline image (J × inline image) along time direction, the between-phase changeover can be indicated to capture the between-phase transitions.

The similarity of inline image (J × inline image) along time direction should be evaluated. Here, based on the phase division result, the phase center can be calculated by averaging all inline image (J × inline image) within the same phase

  • display math(4)

Then, the similarity between each time-slice and the phase center is calculated based on the angle between each pair of loadings

  • display math(5)

where, || || is the module operator, and 〈 〉 calculates the inner product. cos θj means the angle between the phase center and time-slice loading corresponding to each score as indicated by subscript j. The weights (λj) attached to each loading direction is the correlation coefficient between the extracted common scores. Clearly, the similarity index ranges within [0, 1], in which, the larger one means more significant similarity.

Another useful analysis subject to help understanding the between-phase transition is time-slice inline image (I × J) which are separated from the phase-representative inline image (I × JNi+1). The reconstructed predictor variation information for quality prediction in the common subspace can be quantitatively evaluated at each time by

  • display math(6)

where, inline image is the reconstructed jth process variable for mth batch and kth samples by common scores. xm,k,j is the corresponding measurement.

It is regarded in general that R2 inline image may be similar and only with normal oscillation within the same phase. It should be noted that sometimes, R2 inline image may be similar for two neighboring phases, indicating the similar ratio of common predictor variation to all predictor variation. When the process evolves to the transition regions, R2 inline image will give the corresponding indication. The time-varying profile of R2 inline image can be used to help that of Simi index for identifying the transition patterns between two neighboring phases. Here, to more clearly reveal their changing trend, their profiles can be smoothed by Kalman filter[35, 36] to remove the influence of those singular points and normal oscillations. By plotting the time-varying profile, gradually increasing or decreasing trends are distinguished from those steady statuses and identified as the transition patterns. Comparatively, the time when some successive Simi values show small changes is identified as the turning point for steady phase. In this way, the steady phase and the transition region will be separated from each other.

Online quality prediction model

After the analysis of between-phase transition, the phase information can be updated by adjusting the phase boundary, called steady phase here. The time-slices in each steady phase (i) and the following transition region are denoted as Xi (I × J × Ñi) and Xit (I × J × Ñit), respectively, where Ñi is the number of time-slices in the steady phase (i) and Ñit is the number in transition region following the ith steady phase. In general, Ñi should be smaller than the original number (Ni) in each phase, as some samples are separated and classified into the transition region. All samples within the same steady phase are arranged by variable-unfolding, inline image (i × J). Correspondingly, the quality data are duplicated within the same steady phase to get a consistent observation dimension, inline image (i × Jy), as shown in Figure 1c. PLS-CCA algorithm,[6] where CCA is used as a postprocessing on the PLS results, is used to design the steady phase-based regression model:

  • display math(7)

where the steady phase-based PLS-CCA weights model, inline image (J × inline image) is intrinsically doubly controlled by PLS and CCA weights; inline image is the dimension of the PLS-CCA model, that is, the number of retained PLS-CCA scores. inline image are the phase loadings for qualities. inline image is the regression coefficients matrix for online quality prediction.

For transition regions, to reveal their more time-varying characteristics, the time-slice PLS-CCA model will be developed and used for online quality prediction

  • display math(8)

where the terms have the similar denotation with those shown in Eq. (7).

Therefore, for the new measurement vector inline image (1 × J) at each time in each steady phase (i) or the transition region (it), a realtime quality prediction can be obtained online

  • display math(9)

The online quality predictions may be time-varying within the same steady phase, which may be caused by the measurement noises and modeling errors, just to name a few. The oscillation of predicted quality variation in transition region can also be caused by the changes of prediction relationship and predictor information besides the aforementioned oscillatory factors.

Offline quality interpretation

Despite of the realtime strength, online quality prediction, however, only uses the predictor information isolated at each time. It fails in revealing the effects of cumulative process behaviors and between-phase relationship on quality interpretation. Some questions naturally arise: How do the process patterns cumulatively act on quality in each steady phase and transition region? Moreover, under the influences of between-phase relationship, will the phase behaviors play differently? On the basis of subspace separation, this would be analyzed during offline between-phase quality analysis.

First, all time-slices within the same steady phase and the following transition region share the same common scores (Tg) which is calculated by Eq. (2). The quality information explained by the common scores can be readily calculated

  • display math(10)

where, inline image is the loading matrix for quality with respect to the common scores. inline image is the predicted quality variation in the common subspace.

In the phase-specific subspace, the process data contain no information associated with the common scores and, thus, reveal different and specific quality-related variation to each phase. All time-slices in the specific subspace within the same steady phase are arranged by batch-unfolding, inline image (I × ÑiJ). Similarly, the time-slices in the specific subspace within the following transition region are arranged as inline image (I × inline image J). The PLS–CCA decomposition[6] is performed in each specific subspace with respect to the steady phase and transition region, respectively

  • display math(11)
  • display math(12)

where inline image (N × inline image) is the specific regression scores in each steady phase calculated by PLS–CCA weights inline image (i × inline image) (inline image is the number of retained scores). inline image (Jy × inline image) is the loading matrix for quality by the specific scores. inline image (N × Jy) is the predicted quality variation in the specific subspace, which is orthogonal to inline image in the common subspace in each steady phase, revealing that the two different types of phase predictor information contribute to different parts of quality variations. Similarly, the model structures in Eq. (12) are termed with respect to transition region. In general, inline image (N × inline image) should be more different between phases, as those common scores are excluded. Moreover, it is clear that the specific scores are orthogonal to those common scores.

In summary, the quality variation can be determined by the joint Tt, which is composed of both between-phase common and specific scores, inline image, with respect to the steady phase and transition region

  • display math(13)
  • display math(14)

Here, it should be noted that the predicted quality variation by common scores are the same for steady phase and transition region. The difference of quality interpretability lies in the specific subspace. In this way, the cumulative effects on quality variation in two different subspaces are modeled from the between-phase viewpoint.

To quantitatively evaluate the quality interpretability, the goodness-of-fit of the prediction models is measured by multiple coefficient of determination[37]

  • display math(15)

where ŷm,jy is the quality prediction for the jyth quality index either in steady phase or in the transition region; the subscript m indicates the batch. ym,jy is the actual measurement.

Illustrations and Discussions

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

Process description and data preparation

Injection molding,[38, 39] a key process in polymer processing, transforms polymer materials into various shapes and types of products. A typical injection molding process consists of three operation phases, injection of molten plastic into the mold, packing-holding of the material under pressure, and cooling of the plastic in the mold until the part becomes sufficiently rigid for ejection. Besides, plastication takes place in the barrel in the early cooling phase, where polymer is melted and conveyed to the barrel front by screw rotation, preparing for next cycle. All key process conditions such as the temperatures, pressures, displacement, and velocity can be measured online by their corresponding transducers, providing abundant process information.

The material used in this work is high-density polyethylene. Twelve process variables and four quality variables are selected for modeling. The process variables can be collected online with a set of sensors. Two dimension indices, product length (mm), and weight (g), whose real values can be directly measured by instruments, and two surface defects, jetting, and record grooves, whose real values can be quantified by a process operator expert before modeling, are chosen to evaluate the product qualities. Totally 32 normal batch runs are conducted under various operation conditions by design of experiment (DOE) method.[21] Using injection stroke as indicator variable, the reference batches are unified to have even duration (1300 samples in this experiment) by data interpolation, which, thus, results in a regular descriptor array X (32 × 12 × 1300). The measurement data are filtered to remove those obvious noises such as spikes. The qualities are only measured at the end of process, generating the dependent matrix Y(32 × 4). The first 25 batches are used for modeling, whereas the other seven cycles are used for model validation.

Illustration results and analyses

First, 1300 normalized time-slices Xk (25 × 12) are obtained from X (25 × 12 × 1300) as well as the normalized quality variables Y(25 × 4). Then, 1300 time-slice weight matrices are obtained focusing on the data pairs, {Xk,Y}. They are weighted using the time-varying variances of PLS LVs and then fed to the clustering algorithm.[21] The quality-relevant operation process is partitioned into five main clusters, in which, operation time information is included so that process samples are consecutive within the same clustering. The phase clustering result is shown in Figure 2 along with the time-varying process operation trajectory. The clusters II–V are deemed to be consistent with the real four physical operation phases: injection, packing-holding, plastication, and cooling. Moreover, a short period before cluster II is also separated as one individual phase, which have different characteristics and effects on quality, as illustrated later. Thus, five predictor data blocks are prepared by batch-unfolding, Xi (25 × JNi)(i = 1,2,…,5) (where Ni denotes the phase duration), which are associated with the same quality data set Y(25 × 4).

Figure 2. Phase clustering information along with process operation trajectories.

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

First, the between-phase subspace separation is performed. There are four pairs of between-phase combinations with respect to five phases. The first four between-phase common scores are illustrated in Figure 3 for each between-phase pair over the process. In general, the first two common scores are almost the same between the two neighboring phases. The latter two may be different more or less but still show a similar trend. It is found that the first common score vector for Phases 1 and 2 is quite similar to that for Phases 2 and 3. It is also similar to the second common score for Phases 3 and 4 with respect to their general trend. It is not similar with anyone of those for Phases 4 and 5. This tells that with process evolvement, the phases which are far from each other are more different so the common scores are more different. Based on the extracted common scores, the loadings for each phase are calculated and phase centers are obtained. By Eq. (5), the similarity between time-slice loadings and the phase center is computed as shown in each subplot in Figure 4 for each between-phase pair. Moreover, the predictor variation explained by common scores is evaluated by R2 inline image index and the values are used to help understanding the changing common predictor variation between two neighboring phases as illustrated in each subplot in Figure 4. Note that both similarity and R2 inline image profiles are processed by Kalman filter so that the major changing trend can be captured more clearly by excluding those normal oscillations to a certain extent. The similarity profile with respect to the starting phase is used to confirm the beginning of transition region, which is the time when the steady similarity values begin to show a decreasing trend. The similarity values with respect to the target phase are used to confirm the ending of transition region, which is the time when the increasing similarity begins to present a stable trend. Combining the similarity evaluation results and R2 inline image, steady phases and transition regions are separated as shown in the sketch map in Figure 5. Between different neighboring steady phases, the transition regions may be of different durations and of different characteristics. At the very start of process, the transition region between steady Phases 1 and 2 is longer than steady Phase 1. Actually, the steady Phase 1 may be also called transition region considering that the beginning operation may be oscillatory in each cycle. Here, it should be noted that due to the uncertainty of transition patterns, the transition regions may not be determined definitely. Actually, the analysis of transition regions can be always influenced by subjective factors.

Figure 3. The profiles of the first four between-phase common scores by MsRA algorithm for (a) Phases 1 and 2, (b) Phases 2 and 3, (c) Phases 3 and 4, and (d) Phases 4 and 5.

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

Figure 4. Changing trend of similarity values between time-slice loadings and phase centers as well as the predictor variations (R2 inline image) modeled by common scores for between (a) Phases 1 and 2, (b) Phases 2 and 3, (c) Phases 3 and 4, and (d) Phases 4 and 5 analyses.

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

Figure 5. The steady phase and transition region information along time direction.

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

Based on the updated phase division information, phase representative model for each steady phase and transition model for each transition time are developed using PLS-CCA algorithm and used for online quality prediction. Here, it is called between-phase subPLS-CCA (Bp-subPLS-CCA) model. To assess the influence of between-phase transitions on model development, the phase models are also developed based on the original phase division results and PLS-CCA algorithm. That is, transition patterns are not distinguished from steady phase. Instead, they are included in some phases for model development. Here, it is called subPLS-CCA model. The phase representative regression models for the two different methods are compared in Figures 6a–d for four quality indices, respectively. In general, for all quality variables, the model coefficients are more different in Phases 2 and 3. It can be seen clearly that the exclusion of transition patterns from steady phases will result in different phase models more or less. To evaluate whether the exclusion of transition patterns from the steady phase can improve prediction performance, R2Y values with respect to all quality indices are compared between the two methods and shown in Figure 7. They are plotted along time direction through steady phases and between-phase transition regions over the whole process with respect to both training batches and testing batches. In general, the quality predictions are similar with small normal oscillations in the steady phase and more oscillatory in transition regions. It can be seen that when transition patterns are removed from steady phase, the representability of phase models can be improved, generating more stable and better quality prediction, especially for steady Phases 2 and 3. In transition regions, subPLS-CCA method uses some phase model and bad predictions are observed. Comparatively, the prediction performance by Bp-subPLS-CCA is significantly improved. At some transition time, the prediction performance is better than that based on the neighboring steady phase models.

Figure 6. Regression coefficients for process variables for online quality prediction and two different algorithms with respect to (a) the first quality variable, (b) the second quality variable, (c) the third quality variable, and (d) the fourth quality variable (from top to bottom: Phases 1 to 5).

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

Figure 7. R2 plot along time direction for online quality prediction with respect to four quality indices (top: training batches; bottom: testing batches).

[Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

Download figure to PowerPoint

image

During online application, the end-of-phase quality values are calculated by averaging all real-time predicted values within the steady phase. Moreover, to reveal the variability or diversity of quality predictions in steady phases or transition regions, the mean and standard deviation (STD) indices are plotted in Figure 8 taking the product length (i.e., the second quality variable) for instance, indicating how much variation or dispersion there is from the average prediction, that is, the end-of-phase value. A low STD indicates that all predictions within the same steady phase tend to be very close to the mean, whereas high STD indicates that the predictions are spread out over a large range of values. For each test batch, in different steady phases, the variability is different more or less. In general, steady Phases 3 and 5 generate the least STD values for all batches, indicating the least variability along time direction.

Figure 8. End-of-phase quality prediction results (mean ± STD) for testing batches using the proposed method in different steady phases (from top to bottom: steady Phases 1–5).

Download figure to PowerPoint

image

By performing between-phase modeling, the underlying predictor information is decomposed into two parts. Actually, the underlying predictor information in each phase can be forcefully and completely extracted as “common” scores one by one and ordered by descending quality-related between-phase relationship. In this way, no between-phase specific model is developed. However, the latter “common” scores may be quite different from each other and are actually pseudo “common” scores. To illustrate this, focusing on the original phase division, each phase is modeled only by “common” scores. This results in a different regression model (Bpc-MPLS-CCA) in each phase from the conventional calibration methods. It is compared with P-MPLS-CCA model where each single phase is separately modeled by MPLS-CCA. Here, no transition regions are considered. The prediction results are comparatively shown in Table 1 for each quality index. In general, they are similar, revealing that the predictor information extracted only by between-phase “common” scores is comparable to that by conventional single phase algorithms so that they make the similar contributions to quality. It should be noted that when one phase is associated with different neighboring phases, different between-phase models are identified, which, however, shows the similar prediction results for the same phase. Moreover, for different quality indices, different phases are significant for prediction. For example, for the fourth quality index, record grooves, the first three phases (including injection and packing-holding physical operation phases) are more important, which well agrees with the real case.

Table 1. Offline Prediction Comparison [R2Y (%)] for Bpc-PLS-CCA Model and P-MPLS-CCA Model
Quality indexMethodPhase No.
1 (2)a2 (1)2 (3)3 (2)3 (4)4 (3)4 (5)5 (4)
  1. a

    Each phase is associated with one neighboring phase as indicated by the value in bracket.

1Bpc-PLS-CCA35.971.272.589.887.888.989.458.9
P-MPLS-CCA36.170.488.388.959.8
2Bpc-PLS-CCA30.678.872.889.989.993.893.754.4
P-MPLS-CCA30.176.890.493.853.0
3Bpc-PLS-CCA77.856.054.831.628.331.930.732.3
P-MPLS-CCA68.652.722.630.726.2
4Bpc-PLS-CCA99.998.899.698.196.948.648.138.5
P-MPLS-CCA99.999.197.149.031.7

Then, based on the updated phase information, the offline quality prediction models are developed for steady phases and transition regions with respect to the between-phase subspace separation. To reveal the influence of cumulative effects in each time region (steady phase and transition region) on quality prediction, the online predictions and offline predictions based on between-phase calibration modeling are compared in Table 2. Different from the Bpc-MPLS-CCA in Table 1, here, the predictor and quality are interpreted jointly by common and specific scores for offline quality analysis. The quality prediction for each steady phase is the sum of prediction values in common and specific subspaces, as the common scores and specific scores are orthogonal, revealing complementary quality predictions. Moreover, when each steady phase is associated with different neighboring steady phases, different common scores may be extracted, resulting in different prediction models and performance. The better quality prediction results are chosen and show in Table 2. Compared with end-of-phase quality prediction for real-time application, the offline results are better based on paired t test (α = 0.5),[37] revealing that the consideration of cumulative effects can improve the prediction performance. The offline prediction results in transition regions are compared with real-time quality prediction in the same region. When the changing prediction performance in transition region is cumulatively explained by offline analysis, the prediction improvement is also statistically significant in comparison with online quality predictions based on paired t test (α = 0.5).

Table 2. Online and Offline Quality Prediction Results (R2Y (%)) in Steady Phases (SP) and Transition Regions
ApplicationRegion
SP 1transitionSP 2TransitionSP 3transitionSP 4transitionSP 5
Offline74.968.681.882.779.974.879.654.954.6
Online51.661.765.879.065.173.962.050.636.0

Conclusions

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

In this article, between-phase transition is analyzed for calibration modeling and quality prediction in batch processes. The separation of between-phase common subspace and specific subspace is the key factor, which provides a good analysis platform for understanding quality-related phase behaviors. Based on this, the transition regions between two neighboring phases can be identified and distinguished from steady phases. Steady-phase models and transition models are thus developed respectively and used for online quality prediction, revealing the changing of prediction power along time direction. Besides the realtime application, the cumulative effects on quality prediction can be jointly captured in two subspaces. The case study concludes with illustrations of how the proposed method performs on one real case, revealing the desirable improvement in quality interpretation.

The proposed method gives an interesting insight into the effects of phase behaviors and between-phase transitions on quality prediction and interpretation. There may be still many specific issues to be investigated, but the promising results reported in the article should encourage further development of this research topic and constitute an important step forward. The other applications, for example, process monitoring of different process variations, could be an interesting but different topic, which should motivate a worthwhile extension of this research line.

Acknowledgments

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited

This work is supported by National Program on Key Basic Research Project (973 Program), under grant 2012CB720505.

Literature Cited

  1. Top of page
  2. Abstract
  3. Introduction
  4. Methodology
  5. Illustrations and Discussions
  6. Conclusions
  7. Acknowledgments
  8. Literature Cited