 |
|
|
| 論文10. Application of Artificial Neural Network
for the Estimation of Solubility of New Blowing and Cleaning Agents |
| |
| Shingo Urata*, Jun Irisawa* and Akira Takada* |
|
|
The Artificial Neural Network (ANN) technique was applied to the
solubility
estimation in order to screen new alternative blowing and cleaning
agents. In this study, two solubility estimation studies were carried
out. One was the n - eicosane (n - C10H22)
solubility estimation in hydrofluoroethers (HFEs), hydrofluoroketones
(HFKs), and chlorine containing compounds, and the other was the estimation
of HFEs solubility in polyol PE - 315 (aromatic polyol, OH value =
315mgKOH/g). Information on the intermolecular interactions was evaluated
using the molecular simulation technique with the code ‘Cerius2/Blends
(MSI)’, and was used as a descriptor. The computationally calculated
fractional charges of atoms in solute, and several measured physical
properties, such as liquid density, molecular volume, and molecular
weight were also prepared as descriptors, and several of these descriptors
were selected empirically in each study. In both studies, the optimal
thresholds for training of ANN were determined, in order to avoid
over - fitting using the cross validation test, and two accurate ANNs
were constructed. Finally, the two constructed ANNs were applied to
screen new alternatives, and CHF2CF2OCHF2
(HFE - 236pc) was consequently discovered to be a new candidate for
a blowing agent. |
|
|
| |
|
| 1.Introduction |
|
Chlorofluorocarbons (CFCs)and hydrochloro fluoro - carbons (HCFCs)have
been extensively utilized as refrigerants, blowing agents, and cleaning
agents. However, since these compounds have the potential to deplete
the ozone layer, they should be replaced with alternative compounds.
For this purpose, we have mainly developed hydrofluoroethers (HFEs)and
hydrofluoro - ketones (HFKs)as new alternatives to CFCs and HCFCs(1
), because they show almost zero ozone depletion potential (ODP)(2
)and low global warming potential (GWP)(3 )-(6
), and, moreover, have physical properties similar to those
of CFCs and HCFCs.(1 )
Solubility is one of the most important physical properties in selecting
suitable compounds for each type of practical use. For example, the
high solubility of oil is required in the use of cleaning agents,
and solubility in polyol is important for use
in blowing, because low solubility in polyol negatively influences
the performance of polyurethane foam. In our project, the solubility
of n - eicosane (n - C20H42)was
measured as an index of ability as cleaning agents, and the solubility
in polyol PE - 315 (aromatic polyol, OH value = 315mgKOH/g)was measured
to confirm the performance as blowing agents .(7)、(8)
However, it is difficult to measure the solubility of all HFEs and
HFKs, because they have a lot of isomers. In such cases, molecular
simulation techniques or empirical estimation methods like quantitative
structure property relationships (QSPR)are powerful enough to use
to accelerate the screening of candidates, without the necessity of
conducting laborious experiments. Nowadays, molecular simulation techniques
have certainly become useful tools, but they require long CPU times,
which are still insufficient to quantitatively estimate the properties
of complex mixtures. However, empirical estimation methods are rapid
and effective if there is enough observed data. Therefore, we tried
to estimate solubilities empirically in this work.
One of our objectives in this study was to estimate the n - eicosane
solubilities in HFEs or HFKs, and the other was to estimate the solubilities
of HFEs in PE - 315. These studies were carried out by means of the
Artificial Neural Network (ANN). The three - layered feed forward
type ANN was used, and was optimized by using the general back - propagation
learning algorithm.(9) As the descriptors of
ANN, information on intermolecular interactions evaluated by using
the Monte Carlo technique and several basic physical properties such
as molecular weight, molecular volume, and liquid density, were empirically
selected. |
|
| ▲PageTop |
|
|
|
| 2.Methodology |
|
| 2.1 Artificial Neural Network
The three - layered feed forward type ANN, in which one output
neuron is used, was applied in this study. In this three - layered
neural network, signals x(i), which are input parameters
of ANN, are transferred from the input layer to the output layer
through the hidden layer. These layers consist of several units
called neurons. The neurons of the hidden layer y(j)are
described by the sigmoidal transfer functions as follows: |
 |
| where w12 (i , j)are the connections
between the input and the hidden neurons, and αis an adjustable parameter
for the non - linearity of sigmoidal transfer functions. The neuron
in the output layer z is represented by the linear summation
of y(j). |
 |
where w23 (j)are the connections between the hidden and
output neurons. In this work, the input parameters were normalized
between 0.1 and 0.9, and the output values were normalized between
0.0 and 1.0.
The training of ANN was carried out using a general back propagation
algorithm(9 ). This training procedure was
repeated until error E, given by eq. (3), fell under threshold
Eth. |
 |
where T(p)is the training value set number p.
In order to avoid the over - fitting, the predictability of the constructed
ANN was confirmed by means of the cross - validation test method at
various Eth, and the optimal threshold
was decided. As the cross - validation test, we applied the “leave
- one - out ”test method, in which one set of the training data is
removed, and the rest are used for training. After training, the estimated
value and training value for the dataset left out are compared. This
test method was applied to all datasets.
|
2.2 Descriptor Generation
The information on the intermolecular interaction used as a descriptor
of ANN was evaluated using the code “Cerius2/Blends (MSI)”(10
) , which is the Monte Carlo program combined with a modified
Flory - Huggins theory(11). In this software,
the distribution of intermolecular energy Eij
is evaluated from a lot of random configurations of two molecules,
and the coordination number of molecule j around molecule
i (Zij)is also available.
In addition, the interaction parameter χis directly calculated
using these parameters as follows: |
 |
These values can be calculated from only the molecular structure
of two compounds. In this work, 100, 000 random configurations were
generated to estimate Eij, and
Zij was calculated as an average value
of 500 samples. The geometry of each molecule was optimized by using
the ab initio molecular orbital calculation at the HF/6 - 31G* level.
The intermolecular interaction was represented by van der Waals and
the electrostatic interactions. The van der Waals interaction was
evaluated by using Dreiding 2.21(12)as a force
field. In order to calculate electrostatic interaction, the fractional
charge of each atom in the molecule was optimized at the B3LYP/6 -
31+G**//HF/6 - 31G*level with the CHELPG(13)
method. All molecular orbital calculations were carried out by means
of the Gaussian94 suite of programs(14).
We also used several basic physical properties, such as molecular
weight (MW), molecular volume (V), and liquid density (ρ)as descriptors.
The liquid densities are observed values near room temperature. The
molecular volume was given by MW/ρ. However, such observed liquid
densities would not be useful for the screening of new compounds.
Therefore, liquid densities were simplified to one decimal place in
[ g/cm3 ] and were used as input parameters,
in accordance with the predictable accuracy of liquid density. Such
precision of liquid density is predictable by using a combination
of the conventional methods(15)or ANN estimation
methods(16), (17)from
the formula of compounds. |
| |
|
| ▲PageTop |
|
|
|
| 3.Results and Discussion |
|
| 3.1 Solubility of n - eicosane
In order to estimate the solubility of n - eicosane, we selected
the interaction parameter χcalculated by using eq. (4)as a descriptor
of ANN. In the calculation of this χparameter, n - eicosane was
represented by a combination of two n - decanes (n - C10H22), since
its size was too long to be represented by one segment (Fig.1).In
addition to interaction parameter χ, other presumably important
properties, such as molecular weight and liquid density, were chosen
as descriptors empirically. A schematic diagram of the constructed
ANN is shown in Fig.2.
For the input layer, we used the three descriptors described above
and a bias neuron, which always puts out unit value. Four neurons
were used in the hidden layer, whose a value was 2.5.
The observed solubility values of 31 compounds, which were measured
at 297 K, were used as training values to construct the ANN. In
the training procedure, the logarithmic values of the volume fraction
of these solubility data were used, since there were differences
in figures between the fluorinated and chlorinated compounds.
As a beginning, the relation between threshold
Eth and the average of the absolute
deviation (AAD) of the predicted values were studied to avoid over
- fitting. Figure
3 shows the relation between threshold and predictability.
Consequently, it was revealed that over - fitting occurred when
Eth was set under 0.1.Therefore, the
training of ANN using all 31 data was carried out until error
E became 0.1.Here, error E was calculated using outputs
normalized from 0 to 1, with the following to eq.(3).
The results of the training and leave - one - out test at Eth
=0.1 are graphically shown in Fig.4,
and these values and input parameters are listed in Table
1. The correlation coefficients (R)of the trained and
estimated values were 0.976 and 0.948 respectively, and the AAD
for trained and estimated values were 0.209 and 0.296.This result
shows that the proposed estimation method using ANN is accurate
enough to estimate solubility, and that the parameters used were
suitable as descriptors.
3.2 Solubility in polyol PE - 315
The constitution of ANN, which succeeded in estimating the solubility
of n - eicosane, failed to estimate the HFEs ’solubilities in PE
- 315. We reasoned that the failure was due to the high polarity
of PE - 315, since the χparameter only represented the uniformly
averaged information of intermolecular interactions. To overcome
that problem, two new parameters were introduced instead of the
χparameter. First, the change of the most stable interaction energy
between before and after mixing (ΔEmin),
which was defined by eq. (5), was used as a descriptor. |
 |
where Eijmin is the minimum interaction
energy between molecules i and j. The most stable
configurations were selected from 100, 000 configurations generated
by the Blends code(12). In this calculation,
a simplified structure of PE - 315 was used (Fig.5).
This parameter may represent the ease of mixing. Secondly, the maximum
fractional charge of hydrogen of each solute was added to the descriptors
to represent the interaction between solute and the polar chain of
PE - 315. We found that most of the conformations given minimum interaction
energy were specific arrangements (Fig.6).
Therefore, it seems to be difficult to represent the intermolecular
interaction between solute and PE - 315 by using only the parameter
(ΔEmin )defined as eq. (5). In addition
to
these two parameters, liquid density and molecular volume were chosen
as descriptors in the same way with the n - eicosane solubility study.Figure
7 shows a schematic diagram of used ANN. For the input
layer, we used the four descriptors described above and a bias neuron.
The hidden layer was composed of four neurons whose a value was 2.0.
The observed solubilities of 25 compounds are listed in Table
2. Since the solubilities were roughly observed, we used
the center value of each observed data as the training values.
At first, the relation between threshold Eth
and predictability in the leave - one - out cross validation test
was studied to avoid over - fitting. In this case, over - fitting
was observed when Eth was set under
0.17 (Fig.8).
Thus, we set Eth on 0.18 to avoid
over - fitting in the training procedure using all 25 data. Here,
error E was also calculated using outputs normalized from
0 to 1. The results of the training and leave - one - out test are
graphically shown in Fig.9,
and these values and input parameters are listed
in Table 2.
Consequently, the correlation coefficients R for the trained
and estimated values were 0.935 and 0.827, respectively. And their
AAD was 5.5 wt%and 10.1 wt%. The predictability was a little worse
than the accuracy of trained values. However, it was accurate enough
to select the new candidates, which will have high solubility, if
the accuracy of observation is considered. From this result, it was
concluded that these selected input parameters are reasonable as descriptors,
and the proposed method is useful in the estimation of solubility
for the polar compounds. 3.3 Application for
screening
Since the purpose of this study is not only the construction of
the solubility estimation method but also the screening of alternative
compounds for CFCs or HCFCs, we tried to estimate the solubility by
using two constructed ANNs for unobserved compounds. Here, estimated
liquid densities were used for the new compounds ’input parameters
by means of the ANN estimation method(16 ), (17 )
.In the
estimation of the n - eicosane solubility, there was no compound that
showed high solubility, however, for the solubility with PE - 315,
we could find four promising HFEs. They were listed in Table
3. HFE - 236pc as a new candidate for the blowing
agents among them. After our computer estimation, we measured the
HFE - 236pc solubility with PE - 315, and it was revealed that HFE
- 236pc had solubility over 60wt%. This result indicated that this
constructed ANN is useful enough to screen alternative compounds without
further experimental work. |
|
| ▲PageTop |
|
|
|
| 4.Conclusion |
|
A couple of solubility estimation studies were carried out by means
of the Artificial Neural Network technique. One of them was the estimation
of n - eicosane solubilities in HFEs, HFKs and chlorine containing
compounds, and the other was the estimation of solubilities of HFEs
in polyol PE - 315. The former solubility estimation was studied by
using the interaction parameter c evaluated by the Monte Carlo technique,
with the observed liquid density and molecular weight as descriptors.
In the later study, the change of minimum interaction energy between
before and after mixing (ΔEmin), the
maximum fractional charge of hydrogen in solute, the observed liquid
density, and the molecular volume were selected as descriptors empirically.
The change of minimum interaction energy was evaluated by using the
Monte Carlo technique. The maximum fractional charge of hydrogen was
calculated by using molecular orbital calculation. In both studies,
the most suitable threshold of the learning procedure was decided,
and the predictability in the leave - one - out cross validation test
was considered. Owing
to such procedures, over - fitting was avoided, and accurate estimation
methods were constructed.
Finally, we tried to screen new candidates for cleaning and blowing
agents using constructed ANNs. In this way, we were able to find a
promising compound, HFE - 236pc (CHF2CF2OCHF2),
as a new blowing agent. -Acknowledgements -
We would like to thank Mr. Ryouichi Tamai (Central Glass Co. , Ltd.
), Yuji Kurokawa (Kanto Denka Kogyo Co. , Ltd. )and Akira Sekiya (National
Institute of Advanced Industrial Science and Technology)for their
support on this study, in addition we are greatful to Dr. Hiroshi
Yamamoto (Asahi Glass Co. , LTD. )for his helpful discussions on molecular
simulation and the artificial neural network. This research was carried
out under the
project “Department for the New Refrigerant and Other Substances Research
(F. Y. 1994 - 2001, in Japan)”conducted by the Research Institute
of Innovative Technology for the Earth (RITE)with financial support
from the New Energy and Industrial Technology Development Organization
(NEDO). -References - |
| (1) |
A. Sekiya and S. Misaki, J. Fluorine
Chem. , 101 , 215 (2000). |
| (2) |
A. R. Ravishankara, A. A. Turnipseed, N. R. Jensen, S. Barone,
M. Mills, C. J. Howark and S. Solomon, Science 263
, 71 (1994). |
| (3) |
V. Ramaswamy, O. Boucher, J. Haigh, D. Hauglustaine, J. Haywood,
G. Myhre, T. Nakajima, G. Y. Shi and S. Solomon:Radiative Forcing
of Climate Change. In: Climate Change 2001:The Scientific Basis,
Contribution of Working Group I to the Third Assessment Report
of the Intergovernmental panel on Climate Change [ J. T. Houghton,
Y. Ding, D. J. Griggs, M. Noguer, P. J. van der Linden, X. Dai,
K. Maskell and C. A. Johnson, (eds. )] , Cambridge University
Press, 2001. |
| (4) |
K. Tokuhashi, A. Takahashi, M.Kaise, S. . Kondo, A. Sekiya,
S. Yamashita and H. Ito, Int. J. Chem. Kinet. , 31
, 846 (1999). |
| (5) |
K. Tokuhashi, A. Takahashi, M. Kaise, S. Kondo, A. Sekiya,
S. Yamashita, S. and H. Ito, J. Phys. Chem. A, 104
, 1165 (2000). |
| (6) |
K. Tokuhashi, A. Takahashi, M. Kaise and S. Kondo, J.
Geophys. Res. , 104 , 18681 (1999). |
| (7) |
N. Takada, R. Tamai, H.Yamamoto and A. Sekiya, J. Cellular
Plastics, 35 , 389 (1999). |
| (8) |
R. Tamai, S. Urata, H. Takeyasu, H. Sato and A. Sekiya, Proceeding
of Polyurethane Conference 2000, 553 (2000). |
| (9) |
D. E. Rumelhart, G. E. Hinton and R. J. Williams, Nature
(London), 323 , 533 (1986). |
| (10) |
C. F. Fan, B. D. Olafson, M. Blanco, S. L. Hsu, Macromolecules,
25 , 3667 (1992). |
| (11) |
A. I. Pesci, K. F. Freed, J. Chem. Phys. , 90
, 2017 (1989). |
| (12) |
S. L. Mayo, B. D. Olafson and W. A. Goddard III, J. Phys.
Chem. , 94 , 8897 (1990). |
| (13) |
C. M. Breneman and K. B. Wiberg, J. Comp. Chem. ,
11 , 361 (1990). |
| (14) |
Gaussian 94, M. J. Frisch, G. W. Trucks, H. B. Schlegel,
P. M. W. Gill, B. G. Johnson, M. A. Robb, J. R. Cheeseman, T.
Keith, G. A. Petersson, J. A. Montgomery, K. Raghavachari, M.
A. Al - Laham, V. G. Zakrzewski, J. V. Ortiz, J. B. Foresman,
J. Cioslowski, B. B. Stefanov, A. Nanayakkara, M. Challacombe,
C. Y. Peng, P. Y. Ayala, W. Chen, M. W. Wong, J. L. Andres,
E. S. Replogle, R. Gomperts, R. L. Martin, D. J. Fox, J. S.
Binkley, D. J. Defrees, J. Baker, J. P. Stewart, M. Head - Gordon,
C. Gonzalez, and J. A. Pople, Gaussian, Inc. , Pittsburgh PA,
1995. |
| (15) |
R. C. Reid, J. M. Prausnitz and T. K. Sherwood, The properties
of Gases and Liquids Third Edition;McGraw - Hill Book Company;1977.
|
| (16) |
H. Yamamoto, JCPE Journal, 11 ,
65 (1999). |
| (17) |
H. Yamamoto, Report of the Research Laboratory,
Asahi Glass Co. , Ltd. , 49 , 53 (1999).
|
|
|
| ▲PageTop |
|
|
|
| |
| |
 |