
Table 3. Contribution of the core rules.


Figure 1. Decision tree model for classification of pesticide persistence in environment. Each decision node is accompanied by the numbers of compounds that arrive at the node and flow away. Terminal leaves are marked by double board; their index numbers are given in circles. Algorithmic part is shown in the BASIC-like style.

Figure 2. Chart of topological descriptors. Parameters nDB and nBO are the constitutional descriptors; UNSATW is the empirical parameter. Each other predictor is the number of non-overlapped occurrences of the corresponding fragment in a molecular structure.
Table 4. Summary of classification results for the training set.
| Interclass mistake (Observed - Predicted) | Count |
| -2 | 3 |
| -1 | 17 |
| 0 | 272 |
| 1 | 21 |
| 2 | 2 |
Table 5. Summary of classification results for the test set.
| Interclass mistake (Observed - Predicted) | Count |
| -2 | 0 |
| -1 | 9 |
| 0 | 82 |
| 1 | 11 |
| 2 | 3 |
This work was partially supported by Japan Chemical Industry Association. The authors are also thankful to the U.S. Environmental Protection Agency, the U.S. National Library of Medicine, and the Pesticide Management Education Program at Cornel University for free access to corresponding databases.
References
[ 1] We accepted the common tendency and used the QSBR abbreviation (Quantitative Structure-Biodegradation Relationship). In fact, the biodegradation is one of the major ways of chemical decay, and often is associated with the whole degradation process.
[ 2] G. Klopman and M. Tu, Encyclopedia of Computational Chemistry, Wiley, Chichester (1998), pp. 128-135.
[ 3] W. J. G. M. Peijnenburg, Pure Appl. Chem., 66, 1931 (1994).
[ 4] J. R. Parsons and H. A. J. Govers, Ecotoxicol. Environ. Safety, 19, 212 (1990).
[ 5] G. J. Niemi, G. D. Veith, R. R. Regal, and D. D. Vaishnav, Environ. Toxicol. Chem., 6, 515 (1987).
[ 6] R. S. Boethling, B. Gregg, F. R. Gabel, N. W. Campbell, and A. Sablijic, Ecotoxicol. Environ. Safety, 18, 252 (1989).
[ 7] S. M. Desai, R.Govind, and H. H. Tabak, Environ. Toxicol. Chem., 9, 473 (1990).
[ 8] P. Bhagat, Chem. Eng. Prog., 86, 55 (1990).
[ 9] G. Klopman and M. J. McGonigal, J. Chem. Inf. Comput. Sci., 21, 48 (1981).
[10] K. Hiromatsu, Y. Yakabe, K. Katagiri, and Tsu. Nishihara, Chemosphere, 41, 1749 (2000).
[11] H. H. Tabak, C. Gao, S. Desai, and R. Govind, Water Sci. Technol., 26, 763 (1992).
[12] H. H. Tabak and R. Govind, Environ. Technol. Chem., 12, 251 (1993).
[13] BIODEG, Environmental Fate Database of Syracuse Research Corporation, Environmental Science Center division, 301 Plainfield Road, Syracuse, NY 13212 USA.
URL: http://esc.syrres.com/
[14] P. H. Howard, R. S. Boethling, W. M. Stiteler, W. M. Meylan, A. E. Hueber, H. A. Beauman, and M. E. Larosche, Environ. Toxicol. Chem., 11, 593 (1992).
[15] G. Klopman, D. M. Balthasar, and H. S. Rosendranz, Environ. Toxicol. Chem., 12, 231 (1993).
[16] J. Devillers, D. Domine, and R. S. Boethling, Neural Networks in QSAR and Drug Design, ed by J. Devillers, Academic Press, New York (1996), pp. 65-82.
[17] Correlation coefficient for this test set had been absent in the original, but was calculated by ourselves just from the tabulated results of original's Table II.
[18] Japan Chemical Industry Ecology-Toxicology & Information Center (JETOC), "Biodegradation and Bioaccumulation Data of Existing Chemicals Based on the Chemical Substances Control Law (CSCL Japan)," Tokyo (1992).
[19] H. Loonen, F. Lindgren, B. Hansen, W. Karcher, J. Niemela, K. Hiromatsu, M. Takatsuki, W. Peijnenburg, E. Rorije, J. Struijs, Environ. Toxicol. Chem., 18, 1763 (1999).
[20] A. Sabljic and W. Peijnenburg, Pure Appl. Chem., 73, 1331 (2001).
[21] A. C. Waldron, "Pesticides and Groundwater Contamination, "Ohio State University Extension Bulletin, 820, Columbus (Ohio) (1992). Available on the Internet at http://ohioline.ag.ohio-state.edu/b820/index.html.
[22] A. G. Hornsby, R. Don Wauchope, and A. E. Herner, Pesticide Properties in the Environment, Springer, New York (1995).
[23] Pesticide Property Database of the Alternate Crops and Systems Laboratory of Beltsville Agricultural Research Center. Available on the Internet at http://wizard.arsusda.gov/acsl/ppdb.html.
[24] Hazardous Substances Data Bank of U.S. National Library of Medicine. Available on the Internet at http://toxnet.nlm.nih.gov/cgi-bin/sis/htmlgen?HSDB.
[25] Pesticide Management Education Program at Cornell University. Available on the Internet at http://pmep.cce.cornell.edu.
[26] A. McCulloch, Chemosphere, 47, 667 (2002).
[27] Very short record for each chemical is given in the Table 1, because of space limitation. Detailed information about all HL vales and molecular structures is available from authors on request.
[28] Compendium of Pesticide Common Names. Available on the Internet at http://www.hclrss.demon.co.uk.
[29] ChemIDPlus Database of U.S. National Library of Medicine. Available on the Internet at http://chem.sis.nlm.nih.gov/chemidplus/.
[30] J. March, Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, Wiley, New York (1992).
[31] M. K. Cyranski, T. M. Krygowski, A. R. Katritzky, and P. von R. Schleyer, J. Org. Chem., 67, 1333 (2002).
[32] M. Karelson, Molecular Descriptors in QSAR/QSPR, Wiley, New York (2000).
[33] J. Devillers, Encyclopedia of Computational Chemistry, Wiley, Chichester (1998), pp. 932-941.
[34] J. W. Raymond, T. N. Rogers, D. R. Shonnard, and A. A. Kline, J. Hazard. Mater., 84, 189 (2001).
[35] "KDnuggets News," the e-newsletter on Data Mining, Data Mining Books section. Available on the Internet at http://www.kdnuggets.com/publications/books.html.
[36] J. R. Rose, Encyclopedia of Computational Chemistry, Wiley, Chichester (1998), pp. 1521-1525.
[37] L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, Classification and Regression Trees, Wadsworth, Belmont (1984).
[38] Data analysis and statistical programming environment STATISTICA v. 5-6. Information is available on the Internet at http://www.statsoft.com.
[39] R. Bartha, J. Agr. Food Chem., 19, 385 (1971).
[40] EKeeper software for evaluation of the level of persistence of chemicals in environment. Available on Internet at http://www.mis.tutkie.tut.ac.jp/; go to "English" / "MIS-services".
Return