Sequence analysis using logic regression

Citation
C. Kooperberg et al., Sequence analysis using logic regression, GENET EPID, 21, 2001, pp. S626-S631
Citations number
6
Language
INGLESE
art.tipo
Article
Categorie Soggetti
Molecular Biology & Genetics
Journal title
GENETIC EPIDEMIOLOGY
ISSN journal
0741-0395 → ACNP
Volume
21
Year of publication
2001
Supplement
1
Pages
S626 - S631
Database
ISI
SICI code
0741-0395(2001)21:<S626:SAULR>2.0.ZU;2-G
Abstract
Logic Regression is a new adaptive regression methodology that attempts to construct predictors as Boolean combinations of (binary) covariates. In thi s paper we use this algorithm to deal with single-nucleotide polymorphism ( SNP) sequence data. The predictors that are found are interpretable as risk factors of the disease. Significance of these risk factors is assessed usi ng techniques like cross-validation, permutation tests, and independent tes t sets. These model selection techniques remain valid when data is dependen t, as is the case for the family data used here. In our analysis of the Gen etic Analysis Workshop 12 data we identify the exact locations of mutations on gene I and gene 6 and a number of mutations on gene 2 that are associat ed with the affected status, without selecting any false positives. (C) 200 1 Wiley-Liss, Inc.