

Author: Korhonen S.-P. Tuppurainen K. Laatikainen R. Peräkylä M.
Publisher: Taylor & Francis Ltd
ISSN: 1062-936X
Source: SAR and QSAR in Environmental Research, Vol.16, Iss.6, 2005-12, pp. : 567-579
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
Self-Organizing Molecular Field Analysis (SOMFA) comes with a built-in regression methodology, the Self-Organizing Regression (SOR), instead of relying on external methods such as PLS. In this article we present a proof of the equivalence between SOR and SIMPLS with one principal component. Thus, the modest performance of SOMFA on complex datasets can be primarily attributed to the low performance of the SOMFA regression methodology. A multi-component extension of the original SOR methodology (MCSOR) is introduced, and the performances of SOR, MCSOR and SIMPLS are compared using several datasets. The results indicate that in general the performance of SOMFA models is greatly improved if SOR is replaced with a more sophisticated regression method. The results obtained for the Cramer ( CBG ) dataset further underline the fact that it is a very poor benchmark dataset and should not be used to evaluate the performance of QSAR techniques.
Related content


By Urry Francis M. Kushnir Mark Nelson Gordon McDowell Mitzi Jennison Tom
Journal of Analytical Toxicology, Vol. 20, Iss. 7, 1996-11 ,pp. :




Multivariate Statistical Methods in Environmental Forensics
Environmental Forensics, Vol. 8, Iss. 1-2, 2007-01 ,pp. :


Improving Corporate Environmental Performance
Environmental Management and Health, Vol. 5, Iss. 2, 1994-02 ,pp. :


Improving Performance in Supply Chain
MATEC Web of conference, Vol. 137, Iss. issue, 2017-11 ,pp. :