This database was collected on patients carrying a colon adenocarcinoma. It
has 104 observations on 33 binary qualitative explanatory variables and one
response variable y
representing the cancer stage according to the to
Astler-Coller classification (Astler and Coller, 1954). This dataset has
some missing data due to technical limits. A microsattelite is a non-coding
DNA sequence.
Format
A data frame with 104 observations on the following 34 variables.
- y
the response: a binary vector (Astler-Coller score).
- D2S138
a binary vector that indicates whether this microsatellite is altered or not.
- D18S61
a binary vector that indicates whether this microsatellite is altered or not.
- D16S422
a binary vector that indicates whether this microsatellite is altered or not.
- D17S794
a binary vector that indicates whether this microsatellite is altered or not.
- D6S264
a binary vector that indicates whether this microsatellite is altered or not.
- D14S65
a binary vector that indicates whether this microsatellite is altered or not.
- D18S53
a binary vector that indicates whether this microsatellite is altered or not.
- D17S790
a binary vector that indicates whether this microsatellite is altered or not.
- D1S225
a binary vector that indicates whether this microsatellite is altered or not.
- D3S1282
a binary vector that indicates whether this microsatellite is altered or not.
- D9S179
a binary vector that indicates whether this microsatellite is altered or not.
- D5S430
a binary vector that indicates whether this microsatellite is altered or not.
- D8S283
a binary vector that indicates whether this microsatellite is altered or not.
- D11S916
a binary vector that indicates whether this microsatellite is altered or not.
- D2S159
a binary vector that indicates whether this microsatellite is altered or not.
- D16S408
a binary vector that indicates whether this microsatellite is altered or not.
- D5S346
a binary vector that indicates whether this microsatellite is altered or not.
- D10S191
a binary vector that indicates whether this microsatellite is altered or not.
- D13S173
a binary vector that indicates whether this microsatellite is altered or not.
- D6S275
a binary vector that indicates whether this microsatellite is altered or not.
- D15S127
a binary vector that indicates whether this microsatellite is altered or not.
- D1S305
a binary vector that indicates whether this microsatellite is altered or not.
- D4S394
a binary vector that indicates whether this microsatellite is altered or not.
- D20S107
a binary vector that indicates whether this microsatellite is altered or not.
- D1S197
a binary vector that indicates whether this microsatellite is altered or not.
- D1S207
a binary vector that indicates whether this microsatellite is altered or not.
- D10S192
a binary vector that indicates whether this microsatellite is altered or not.
- D3S1283
a binary vector that indicates whether this microsatellite is altered or not.
- D4S414
a binary vector that indicates whether this microsatellite is altered or not.
- D8S264
a binary vector that indicates whether this microsatellite is altered or not.
- D22S928
a binary vector that indicates whether this microsatellite is altered or not.
- TP53
a binary vector that indicates whether this microsatellite is altered or not.
- D9S171
a binary vector that indicates whether this microsatellite is altered or not.
Source
Weber et al. (2007). Allelotyping analyzes of synchronous primary and metastasis CIN colon cancers identified different subtypes. Int J Cancer, 120(3), pages 524-32.
References
Nicolas Meyer, Myriam Maumy-Bertrand et Frédéric Bertrand (2010). Comparing the linear and the logistic PLS regression with qualitative predictors: application to allelotyping data. Journal de la Société Française de Statistique, 151(2), pages 1-18.
Examples
data(aze)
str(aze)
#> 'data.frame': 104 obs. of 34 variables:
#> $ y : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ D2S138 : int 0 0 NA NA 0 0 0 0 1 0 ...
#> $ D18S61 : int 1 0 0 0 0 0 1 NA 1 1 ...
#> $ D16S422: int 1 1 0 0 NA 0 0 1 1 0 ...
#> $ D17S794: int NA 1 NA NA 1 0 NA 0 0 NA ...
#> $ D6S264 : int 0 NA 0 0 NA 0 NA NA 0 0 ...
#> $ D14S65 : int 1 1 1 NA 0 NA NA NA 0 0 ...
#> $ D18S53 : int 1 0 NA NA 0 0 1 1 NA 0 ...
#> $ D17S790: int 0 NA 0 0 NA 0 1 0 0 1 ...
#> $ D1S225 : int 0 NA 0 0 0 0 1 0 1 0 ...
#> $ D3S1282: int 0 0 NA 0 0 0 NA NA 0 0 ...
#> $ D9S179 : int NA 1 NA 0 0 0 0 0 NA NA ...
#> $ D5S430 : int NA 1 0 NA NA 0 NA 0 NA 0 ...
#> $ D8S283 : int 1 NA 0 1 0 0 0 1 NA 0 ...
#> $ D11S916: int 0 0 0 0 0 0 0 0 1 1 ...
#> $ D2S159 : int NA 0 0 0 0 0 0 0 1 0 ...
#> $ D16S408: int 1 1 0 NA 0 0 NA 1 1 0 ...
#> $ D5S346 : int 0 1 0 1 0 0 0 1 1 0 ...
#> $ D10S191: int 0 0 0 0 0 0 0 1 1 0 ...
#> $ D13S173: int 1 1 NA NA NA 0 0 1 0 1 ...
#> $ D6S275 : int 0 1 NA 0 0 NA NA 0 1 NA ...
#> $ D15S127: int NA 0 0 0 0 NA 1 0 NA 0 ...
#> $ D1S305 : int 0 0 0 0 0 0 NA 0 1 0 ...
#> $ D4S394 : int 0 0 NA 0 NA 0 NA 0 1 1 ...
#> $ D20S107: int 1 1 NA NA 0 0 1 NA 1 1 ...
#> $ D1S197 : int 0 0 NA 0 0 1 NA NA 1 0 ...
#> $ D1S207 : int 0 0 0 0 0 0 1 0 1 0 ...
#> $ D10S192: int NA NA NA 1 0 0 1 0 1 NA ...
#> $ D3S1283: int 0 0 NA NA 0 0 NA 1 1 0 ...
#> $ D4S414 : int 0 1 0 0 0 0 NA 0 1 1 ...
#> $ D8S264 : int 0 NA 0 0 0 0 0 1 1 0 ...
#> $ D22S928: int 0 NA 0 0 0 0 0 1 1 0 ...
#> $ TP53 : int 1 1 0 0 1 0 NA 1 1 1 ...
#> $ D9S171 : int 0 1 0 NA NA 0 0 NA 1 NA ...