

Author: Jin Rong Si Luo Chan Christina
Publisher: Inderscience Publishers
ISSN: 1748-5673
Source: International Journal of Data Mining and Bioinformatics, Vol.2, Iss.3, 2008-09, pp. : 250-267
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
This paper addresses the sparse data problem in the linear regression model, namely the number of variables is significantly larger than the number of the data points for regression. We assume that in addition to the measured data points, the prior knowledge about the input variables may be provided in the form of pair wise similarity. We presented a full Bayesian framework to effectively exploit the similarity information of the input variables for linear regression. Empirical studies with gene expression data show that the regression errors can be reduced significantly by incorporating the similarity information derived from gene ontology.
Related content


An Active Lattice Model in a Bayesian Framework
Computer Vision and Image Understanding, Vol. 63, Iss. 2, 1996-03 ,pp. :








A Regression Model for Fuzzy Initial Data
By Domrachev V. G. Poleshuk O. M.
Automation and Remote Control, Vol. 64, Iss. 11, 2003-11 ,pp. :