SEARCH

SEARCH BY CITATION

Keywords:

  • glass transition temperature;
  • modeling and predicting;
  • polymethacrylates;
  • support vector regression;
  • regression analysis

Abstract

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements
Thumbnail image of graphical abstract

Based on six quantum chemical descriptors (|L-1.356|, Etotal, qC6, α, q, and Etherm), the hybrid PSO-SVR is proposed to establish a model for predicting the glass transition temperature (Tg) of 37 polymethacrylates. The prediction performance of SVR was compared with those of reported MLR and ANN models. The results show that the RMSE, MAPE, and R2 calculated by SVR are superior to those achieved by MLR or ANN model for the identical training set and test set. This investigation reveals that the SVR model is more suitable to be used for prediction of the Tg values for unknown polymethacrylates possessing similar structure than the conventional MLR or ANN model, and provides a clue that the method proposed in this study may be useful in computer-aided design of new polymethacrylates with desired Tg.


1. Introduction

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements

When heating a polymeric material over a certain temperature, the material would begin transfer from a glassy state to a rubbery state. This transition is referred to as the glass transition, which occurs over a range of a few degrees of temperature. The mid-point or the point of inflection of the transition is called the glass transition temperature (Tg). The Tg affects many other polymer properties such as thermal expansion coefficient, modulus, heat capacity and hardness, etc. These physical properties may change drastically below and above the Tg such that the Tg index is usually employed to determine the application scope of a polymeric material. In general, it is difficult to measure the Tg experimentally because the transition takes place over a comparatively wide temperature range and depends on experimental conditions, such as the method of measurement, duration of the experiment and pressure, etc.1 Moreover, Tg is quite dependent on the polymeric structural and constitutional features.2, 3 Thus, the development of theoretical modeling methods for prediction of the Tg is necessary.

The mechanical properties of thermoplastic polymers, particularly methacrylates polymers, play a decisive role in practical application. Many researchers have ever attempted to develop quantitative structure property relationship (QSPR) models for prediction of the Tg of polymethacrylates. For example, Camelio et al.4 chosen QSPR method involving the EVM (energy, volume, mass) model to predict the Tg of acrylate and methacrylate polymers with R2 = 0.96. Dai et al.5 developed a QSPR model with R2 = 0.980 and s = 8.3 K in terms of three parameters to calculate the Tg of 13 polyacrylates and 9 polymethacrylates. Liu et al.6 and Liu et al.7 used artificial neural network (ANN) to estimate the Tg for 113 polymers via four descriptors and 32 polymers via six descriptors, the root mean square error (RMSE) to the validation set achieved 17.53 and 8.35 K, respectively. Yu and Yi8 applied a QSPR model constructed by back-propagation neural network to regress the Tg of 192 vinyl polymers, including 95 polystyrenes, 73 polyacrylates, and 24 polymethacrylates, giving a validation RMSE of 20.195K. Yu and Wang9 constructed a QSPR model to correlate the Tg of aromatic heterocyclic polymers with the mean relative errors of 3.299% (R = 0.963, n = 28) for the training set and 3.064% (R = 0.959, n = 23) for the test set. Xu et al.10 used QSPR for the prediction of Tg of polystyrenes based on a set of 107 polystyrenes using ANN combined with genetic function approximation. In addition, other researchers11–13 utilized different QSPR models to predict the Tg values of polymers based on diverse parameters.

All the models stated above were based on constitutional (or topological) descriptors and quantum chemical descriptors, and developed with multiple linear/nonlinear regression (MLR/MNR) method and/or the ANN method. However research using the support vector regression (SVR) theory combined with particle swarm optimization (PSO) to model and predict the Tg of methacrylate polymers has not been reported previously.

In this study, motivated by Liu et al.'s previous work,7 we introduce a novel machine learning method, SVR hybrided with PSO, to model and predict the Tg of polymethacrylates based on six quantum chemical descriptors (|L-1.356|, Etotal, qC6, α, q, and Etherm), and its predictive errors were also compared with those of the MLR model and ANN model.

2. Materials and Methods

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements

2.1. Support Vector Regression

The support vector machine (SVM), proposed by Vapnik et al.,14 is a statistical learning approach based on the structure risk minimization principle. In recent years, the SVM has been regarded as the state-of-the-art technique for solving a variety of learning, classification, regression and prediction problems. SVR, as a regression version of SVM, is capable of solving non-linear regression problems using kernel functions, which is generally superior to other pattern recognition and regression approaches. SVR has been proved to exhibit a number of significant advantages such as excellent learning performance of small samples, good generalization ability, small errors, high calculation accuracy, etc.15 At present, it has become a focus in machine learning research and is extensively employed in a wide range of real-world problems.16–30

Suppose a sample is described by (x, y), where x represents the independent variable and y the dependent response. In general, the relationship between y and x is quite complex and highly nonlinear. For SVR, the basic idea is to map x from the original space X into a higher dimensional feature space F via a nonlinear mapping function ϕ(x), and then to conduct a linear regression in F space. Therefore, the purpose of SVR is to find a linear Equation (1) based on a given training dataset {(x1,y1), …, (xn,yn)}:

  • equation image((1))

where w is a vector for regression coefficient, b is a bias. They are estimated by minimizing the regularized risk function R(C), namely:

  • equation image((2))
  • equation image((3))

where n is the number of training samples, C is a regularization factor, ε is a prescribed parameter controlling the tolerance to error, and (1/2)||w||2 is used as a measurement of function flatness. The second term, equation image, is the so-called empirical risk and measured by the ε-insensitive loss function Lε(f(xi)–yi), which indicates that it does not penalize errors below ε.

In order to control function complexity and regression errors according to the desired precision, the slack variables ξ and ξ* are introduced to deal with the data points that do not satisfy Equation (3). Equation (2) and (3) can be transformed into the primal problem equation image:

  • equation image
  • equation image((4))

In Equation (4), the first term increases the smoothness of the regression function to improve its generalization ability and the second term reduces errors. The regularization factor C is a positive constant, determining the tradeoff between the training error and the model flatness.

In order to obtain w and b in Equation (1), the Lagrange equation is built:

  • equation image((5))

where αi and αi* are Lagrange multipliers to be solved. Only the nonzero values of the Lagrange multipliers are useful in the regression, and their corresponding samples are known as support vectors (SVs). The Equation (5) is obtained by setting the partial differential coefficients for w, b, ξi, and equation image equal to zero:

  • equation image((6))

Substituting Equation (6) into Equation (5), the dual optimization problem can be written as:

  • equation image
  • equation image((7))

Therefore, the function regression problem on SVR may come down to a quadratic programming problem. By minimizing Q, the array w can be written in terms of the Lagrange multipliers and training samples as:

  • $${\bf w} = \sum\limits_{i = 1}^{l} {{\rm (}\alpha _{i} } - \alpha _{i}^{*} {\rm )}\phi {\rm (}{\bf x}_{i} {\rm )}$$, ((8))

where l is the number of SVs. Finally, the linear Equation (1) has the following explicit form:

  • equation image((9))

In Equation (9), k(x,xi) = ϕ(x)•ϕ(xi) is a kernel function. There exist several types of commonly used kernel functions, such as linear kernel, radial basis kernel, polynomial kernel, sigmoid kernel, etc. In this study, the sigmoid kernel function (10) is used as the kernel function of the SVR because it tends to achieve better performance.

  • equation image((10))

2.2. Choosing of SVR Parameters with PSO

The PSO method was proposed by Kennedy and Eberhart,31 being motivated by the social behavior of organisms such as bird flocking and fish schooling. It is an optimization technique. The generalization ability of SVR relies entirely on four parameters, i.e., ε of the ε-insensitive loss function, the regularization constant C, and the kernel function parameters α and β. Therefore, it is a key step to search for the optimal parameters (ε, C, α, β) for SVR. In this study, PSO was utilized to search for the optimal parameters (ε, C, α, β) of SVR, and the mean absolute percentage error (MAPE), which directly reflects the modeling performance of SVR, was chosen as the fitness function:

  • equation image((11))

where n denotes the number of training samples, yi represents the actual measured value and equation image is the estimated value for the ith training sample.

2.3. Dataset and Modeling Method

The training set and test set used in this study was generated by Liu et al.7, which was originally taken from ref. 32 and is composed of 32 polymethacrylates. Table 1 tabulates the Tg values and the indices of related six quantum chemical descriptors (|L-1.356|, Etotal, qC6, α, q and Etherm) for 25 training samples and seven test samples. Here, L is side-chain length, referring to the distance from the ester O5 to its furthest atom on the side chain R6 (see Figure 1), here R6 = CnH2n+1 (n is the atomic number of carbon of the alkyl on the side chain of polymethacrylates, n = 1, 2, 3, 4, 6, 8, 10, 12, 14, 16). After analyzing the relationship between the side-chain length L and Tg, Liu et al.7 found that when L = 1.356 nm (n = 10), the Tg would decline to the lowest point where the free volume and the intermolecular attraction strike a balance. Thus taking L = 1.356 nm as the length of the side chain at which Tg is lowest, irrespective of whether the side chain is longer or shorter than 1.356 nm.7 Etotal is the total energy of the macromolecular. qC6 is the net charge of carbon atom C6 connected directly to ester O5. α is the molecular average polarizability. q is the net charge of the most negative atom and Etherm is the thermal energy of the polymethacrylates. All the quantum chemical descriptors were calculated directly from the structure of the monomer with the Gaussian 03 program, at the DFT/B3LYP/6-31G(d) level with the keywords OPT, POLAR, FREQ, and the optimized structure was characterized as true local energy minima on the potential energy surface, without imaginary frequencies. In this study, the polymethacrylates were represented by their repeating units end-capped by hydrogen atoms to calculate molecular descriptors. The detailed definition and calculation for above quantum chemical descriptors are also given in ref. 7. In addition, other five independent samples as tabulated in Table 3 were taken from the literatures.2, 8

Table 1. Quantum chemical descriptors and glass transition temperature Tg for 32 polymethacrylates.7
No.Polymer|L-1.356| [nm]Etotal [au]qC6 [au]q [au]α [au]Etherm [kJ  ·  mol−1]Exp.Tg [K]
  • a)

    Test sample.

1Poly(4-cyanopheny1-methacylate)0.68−6310.3604−0.5274125.33555.98428
2Poly(phenyl methacrylate)0.83−538.750.3562−0.5336104.34550.12407
3a)Poly(tert-butyl methacrylate)1.02−464.970.3118−0.496489.6642.01380
4Poly(4-methoxycarbonyphenyl-methacrylate)0.54−766.40.3561−0.5273139.68680.28379
5Poly(methyl methacrylate)1.15−347.02−0.2169−0.480558.09410.71373
6Poly(4-tertbutylphenyl methacrylate)0.6−696.010.3444−0.5318152.5866.35371
7Poly(2-chloroehtyl-methacrylate)0.85−845.93−0.028−0.478779.26466.24365
8a)Poly(2-hydroxyehtyl-methacrylate)0.96−461.54−0.0555−0.60972.93502.01358
9Poly(cycHexyl-methacrylate)0.93−542.390.1318−0.4913108.94742.84356
10Poly(1,1,1-trifluoro-2-propyl-methacrylate)1−723.37−0.0032−0.475879.55510.31354
11a)Poly(isoproyl methacrylate)1.02−425.650.1303−0.485979.49565.94354
12Poly(2-hydroxypropyl-methacrylate)0.9−500.86−0.0369−0.609983.17581.1349
13Poly(2-cyanoethyl-methacrylate)0.88−478.57−0.0328−0.476581.35487.92347
14Poly(ethyl methacrylate)1.02−386.33−0.0315−0.483269.08488.81338
15a)Poly(sec-butyl methacrylate)0.96−464.970.1328−0.485689.9642.15333
16Poly(phenethyl-methacrylate)0.6−617.39−0.0266−0.4806126.9714.2329
17Poly(benzyl methacrylate)0.75−577.99−0.1442−0.485114.84636.01327
18Poly(isobutyl methacrylate)1−464.96−0.0233−0.482890.42644.9326
19Poly(2-bromoehtyl-methacrylate)0.94−2957.44−0.0336−0.479486.46467.36325
20a)Poly(3,3-dimethylbutyl-methacrylate)0.96−543.59−0.0422−0.4844111.59800.59318
21Poly(neopenyl methacrylate)0.9−504.28−0.0207−0.4826100.58719.13312
22Poly(proyl methacrylate)0.9−425.65−0.0258−0.48380.01564.88308
23Poly(2-methoxyethyl-methacrylate)0.79−500.85−0.0631−0.478783.89582.85297
24a)Poly(butyl methacrylate)0.77−464.96−0.0315−0.483590.99643.75293
25Poly(hexdecyl methacrylate)0.77−936.73−0.0312−0.4837224.751590.82288
26Poly(pentyl methacrylate)0.64−504.28−0.045−0.4841101.65725.07283
27Poly(Hexyl methacrylate)0.51−543.59−0.0312−0.4835113.14803.64273
28Poly(2-ethylhexyl-methacrylate)0.52−622.21−0.0487−0.489134.51959.99263
29Poly(octyl methacrylate)0.26−622.22−0.0311−0.4836135.37961.05253
30Poly(Tetradecyl-methacrylate)0.51−858.1−0.0312−0.4837202.371433.38233
31Poly(dodecyl methacrylate)0.26−779.47−0.0312−0.48371801275.94208
32a)Poly(decyl-methacrylate)0−700.85−0.044−0.4847157.511118.66203

Figure 1. The calculated models of polymethacrylate.

Download figure to PowerPoint

thumbnail image

In this study, in order to directly compare the MLR, ANN, and SVR models' modeling, prediction and generalization performance, the training and test procedures were carried out in terms of the same training and test samples as previously conducted by Liu et al.7. Therefore, as indicated in Table 1, 25 samples were employed as a training dataset in the training process to develop a model, the other seven samples acted as the test set. The five independent samples acted as the independent set to further validate the generalization ability of the established SVR model.

The SVR model was self-adjusted to make the training error (MAPE) as small as possible via learning with the training samples and optimizing the parameters through continuous training adjustment for 10 000 times. Finally, an optimal SVR model was obtained via a sigmoid kernel with the penalty factor C of 56700186162.908470, insensitivity factor ε of 4.011811 and kernel function parameters (α, β) of (0.150243, −4.425184).

2.4. Generalization Performance Evaluation

The Tg values of the seven test samples and five independent samples were calculated by using the constructed SVR model. Besides the index of MAPE, three other indices, i.e., mean absolute error (MAE), RMSE, and correlation coefficient (R2), were used for performance evaluation, as defined by Equation (12–14), respectively:

  • equation image((12))
  • equation image((13))
  • equation image((14))

where m denotes the number of test/independent samples, yj and equation image stand for the targets (measured) and predicted values of the jth test/independent sample respectively, equation image is the mean experimental value as well as equation image is the mean predicted value for all test/independent samples.

3. Results and Discussion

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements

Table 2 gives a list of the Tg values calculated by MLR7 (see Equation (15)), ANN7 and SVR for 25 training samples and seven test samples. Table 3 lists five independent samples for further assessment of the established SVR model. Table 4 summarizes the statistical RMSE, MAE, MAPE, and R2 for the calculations by using the MLR, ANN, and SVR models for the training set, test set and independent set, respectively. Figure 2 depicts a pair wise comparison of the experimental values versus estimated values calculated via SVR.

  • equation image((15))
Table 2. Comparison the experimental and calculated Tg of 32 polymers among different regression methods.
No.PolymerTg [K]
Exp.MLRANNSVR
Cal.Error [%]Cal.Error [%]Cal.Error [%]
  • a)

    Test sample.

1Poly(4-cyanopheny1-methacylate)428420−1.87418−2.34424−0.93
2Poly(phenylmethacrylate)407405−0.494151.974080.25
3Poly(tert-butyl methacrylate)380368−3.16369a)−2.89380a)0.00
4Poly(4-methoxycarbonyphenyl-methacrylate)3793913.173820.793790.00
5Poly(methyl methacrylate)373347−6.97359−3.753781.34
6Poly(4-tertbutylphenyl methacrylate)371370−0.27365−1.62368−0.81
7Poly(2-chloroehtyl-methacrylate)365339−7.12346−5.21360−1.37
8Poly(2-hydroxyethyl-methacrylate)3583590.28347a)−3.07364a)1.68
9Poly(cycloHexyl-methacrylate)356346−2.81342−3.93353−0.84
10Poly(1,1,1-trifluoro-2-propyl-methacrylate)354350−1.133581.133591.41
11Poly(isopropyl methacrylate)3543550.28353a)−0.28347a)−1.98
12Poly(2-hydroxypropyl-methacrylate)349348−0.293500.29344−1.43
13Poly(2-cyanoethyl-methacrylate)347346−0.29333−4.03343−1.15
14Poly(ethyl methacrylate)3383441.78337−0.30335−0.89
15Poly(sec-butyl methacrylate)3333443.30332a)−0.30338a)1.50
16Poly(phenethyl-methacrylate)329328−0.30315−4.263310.61
17Poly(benzyl methacrylate)3273383.363342.143331.83
18Poly(isobutyl methacrylate)3263352.763280.61324−0.61
19Poly(2-bromoehtyl-methacrylate)3253291.23322−0.923311.85
20Poly(3,3-dimethylbutyl-methacrylate)3183221.26317a)−0.31316a)−0.63
21Poly(neopentyl methacrylate)3123192.24308−1.28308−1.28
22Poly(propyl methacrylate)3083276.173090.323090.32
23Poly(2-methoxyethyl-methacrylate)2973114.713001.012980.34
24Poly(butyl methacrylate)2933074.78290a)−1.02291a)−0.68
25Poly(hexdecyl methacrylate)288272−5.56280−2.78283−1.74
26Poly(pentyl methacrylate)2832861.06280−1.06278−1.77
27Poly(Hexyl methacrylate)273269−1.472761.10271−0.73
28Poly(2-ethylhexyl-methacrylate)2632630.002723.422661.14
29Poly(octyl methacrylate)253232−8.30245−3.16249−1.58
30Poly(Tetradecyl-methacrylate)2332465.582403.002361.29
31Poly(dodecyl methacrylate)2082216.252111.442111.44
32Poly(decyl-methacrylate)203194−4.43210a)3.45205a)0.99
Table 3. Parameters, experimental and calculated Tgs by SVR model for five independent polymethacrylates.
No.Polymer|L–1.356| [nm]Etotal [au]qC6 [au]q [au]α [au]Etherm [kJ · mol−1]Exp. Tg [K]Cal. Tg [K]
1Poly(2-ethylbutyl methacrylate)0.81−543.59−0.0395−0.4834111.64802.69284294
2Poly(cyclopentyl methacrylate)0.89−503.070.1413−0.485197.1665.9348338
3Poly(cyclobutyl methcarylate)0.94−463.720.1543−0.483687.106584.07351349
4Poly(3,3,5-trimethylhexyl methacrylate)0.53−661.52−0.0562−0.4854143.691036.37274262
5Poly(cyclodecyl methacrylate)0.75−699.610.1367−0.4872149.681061.9331331
Table 4. Prediction performance comparison among different regression methods.
Sample classMethodRMSE [K]MAE [K]MAPE [%]R2
Training samplesMLR12.019.283.010.9468
ANN8.356.762.070.9776
SVR3.793.431.090.9948
Test samplesMLR8.947.422.500.9748
ANN6.585.001.620.9958
SVR4.043.281.010.9946
Independent samplesSVR8.346.82.260.9417

Figure 2. Plot of experimental Tg versus calculated Tg by SVR.

Download figure to PowerPoint

thumbnail image

From Table 2, it can be viewed that the percentage errors of most samples calculated by SVR model are smaller than those calculated by the MLR model and ANN model. It can be found by simple statistics that, there are 15 samples with a small absolute percentage error (APE) in the range of 0–1% achieved by SVR model, while there are only eight and nine samples whose APEs are in this range calculated by the MLR and ANN models, respectively. From Table 2, it also can be seen that the maximum APE 1.98% predicted by SVR for sample #11, is smaller than those (8.30%, #29) achieved by MLR model and (5.21%, #7) calculated by ANN model.

The maximum APE (|(262-274)/274 × 100%| = 4.38%, #4) calculated by the established SVR model in Table 3 is also smaller than that (5.21%) achieved by the ANN model. By comparison the calculated Tg values with the measured Tg indices among MLR, ANN and SVR in Table 2, it reflects that the Tg values predicted by SVR model are in quite better agreement with the experimental values than those calculated by the MLR model or ANN model.

From Table 4, it can be found that the MAPE of the SVR model for training samples, test samples were 1.09% and 1.01%, which were smaller than those (3.01%, 2.5%) achieved by MLR model and (2.07%, 1.62%) by ANN model, respectively. The RMSEs calculated by MLR and ANN models for both training set and test set were >5 K, while the RMSE predicted by the SVR model for the training set and test set were 3.79 and 4.04 K, which were both lower than those of MLR and ANN models. These mean that the prediction result of the SVR model is more accurate than both of the MLR and ANN models. Meanwhile, the MAE of Tg calculated by the established SVR model for the independent set was 6.8 K. It has been reported that there exists measurement error of about 2–4 degrees,33, 34 or even over 10 degrees35 measured by different experimental techniques. These imply that the established SVR model is acceptable in practice to satisfactorily predict the Tg of an unknown polymethacrylate with similar structure outside of the training set.

The correlation coefficient R2 between the measured Tgs and SVR regressed Tgs for the training set, test set were 0.9948, 0.9946, both were greater than those (0.9468, 0.9748) and (0.9776, 0.9958) for MLR model and the ANN model. Again illustrate that the regressed Tgs for the training samples or the predicted Tgs for the test samples by SVR model are in more agreement with the measured Tgs than those calculated by the MLR model or the ANN model. From Figure 2, it also can be seen that there is strong agreement among the training results, the test results and the evaluation results, indicating no over-fitting issues with SVR. These results strongly demonstrate that the SVR model is more suitable for accurately making Tg prediction.

Moreover, in order to further convince the statistical results of SVR model, the whole 37 samples used in this study is also partitioned into the training, test and independent sets in the ratio 50%, 25%, and 25% by using k-Nearest Neighbor (kNN). Therefore, in this typical split the training, test and independent sets are comprised of 19, 9, and 9 samples, respectively. For this partition, the modeling and prediction results of SVR are tabulated in Table 5, and the related statistical RMSE, MAE, MAPE, and R2 are listed in Table 6. It can be viewed from Table 6 that the MAEs of Tg calculated by the SVR model for the test set and independent set are 7.44 and 8.0 K, respectively, which is also acceptable in practice. From Tables 4 and 6, a phenomenon can be found by comparison the prediction results for the test and independent samples that, in general, the more the number of training samples is, the greater the prediction and generalization ability of SVR would be.

Table 5. The 19 training samples, 9 test sample and 9 independent samples along with their measured Tgs and calculated Tgs by SVR.
No.PolymerExp. Tg [K]Cal. Tg [K]
  • a)

    Test samples;

  • b)

    Independent samples.

1Poly(4-cyanopheny1-methacylate)428428
2Poly(phenyl methacrylate)407407
3Poly(tert-butyl methacrylate)380380
4Poly(4-methoxycarbonyphenyl-methacrylate)379379
5a)Poly(methyl methacrylate)373363
6Poly(4-tertbutylphenyl methacrylate)371371
7b)Poly(2-chloroehtyl-methacrylate)365334
8a)Poly(2-hydroxyehtyl-methacrylate)358366
9Poly(cycHexyl-methacrylate)356356
10b)Poly(1,1,1-trifluoro-2-propyl-methacrylate)354350
11Poly(isopropyl methacrylate)354354
12Poly(2-hydroxypropyl-methacrylate)349349
13b)Poly(2-cyanoethyl-methacrylate)347337
14Poly(ethyl methacrylate)338338
15Poly(sec-butyl methacrylate)333343
16Poly(phenethyl-methacrylate)329329
17Poly(benzyl methacrylate)327327
18a)Poly(isobutyl methacrylate)326328
19a)Poly(2-bromoehtyl-methacrylate)325350
20Poly(3,3-dimethylbutyl-methacrylate)318318
21b)Poly(neopenyl methacrylate)312310
22Poly(proyl methacrylate)308313
23Poly(2-methoxyethyl-methacrylate)297297
24Poly(butyl methacrylate)293293
25b)Poly(hexdecyl methacrylate)288285
26b)Poly(pentyl methacrylate)283278
27a)Poly(Hexyl methacrylate)273272
28Poly(2-ethylhexyl-methacrylate)263263
29Poly(octyl methacrylate)253252
30a)Poly(Tetradecyl-methacrylate)233233
31a)Poly(dodecyl methacrylate)208204
32Poly(decyl-methacrylate)203203
33b)Poly(2-ethylbutyl methacrylate)284294
34b)Poly(cyclopentyl methacrylate)348341
35a)Poly(cyclobutyl methcarylate)351353
36a)Poly(3,3,5-trimethylhexyl methacrylate)274259
37b)Poly(cyclodecyl methacrylate)331331
Table 6. Prediction performance of SVR for 19 training samples, 9 test samples and 9 independent samples
Sample classRMSE [K]MAE [K]MAPE [%]R2
Training samples2.580.840.260.9978
Test samples10.747.442.400.9681
Independent samples11.858.02.390.9034

4. Conclusion

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements

In this paper, the SVR combined with PSO was proposed to estimate the glass transition temperature of polymethacrylates based on six quantum chemical descriptors (|L-1.356|, Etotal, qC6, α, q, and Etherm). The modeling performance of the proposed hybrid PSO-SVR method was compared with those of MLR and ANN methods, the comparison results revealed that the calculated errors of the SVR model were smaller than those of MLR model or ANN model for the identical training set and test set. The generalization ability of the SVR model was also superior to those of the MLR and ANN models. The evaluation results via independent samples confirmed that the established SVR model is acceptable in practice. This investigation demonstrates that the methodology introduced in this study can successfully rationalize the Tg of polymethacrylates, and the derived SVR model is more suitable to be used for prediction of the Tg values for unknown polymethacrylates possessing similar structure than the conventional multivariable linear regression model or ANN.

Acknowledgements

  1. Top of page
  2. Abstract
  3. 1. Introduction
  4. 2. Materials and Methods
  5. 3. Results and Discussion
  6. 4. Conclusion
  7. Acknowledgements

This work was supported by the Fundamental Research Funds for the Central Universities (CDJXS10101107), the Innovative Talent Funds for Project 985 (WLYJSBJRCTD201102), the Scientific Research Foundation for the Returned Overseas Chinese Scholars of Ministry of Education, China (2008101-1), the Natural Science Foundation of Chongqing, China (CSTC2006BB5240) and the Program for New Century Excellent Talents in Universities of China (NCET-07-0903).