Skip to main content

Table 1 The clinical variables in the METABRIC dataset

From: Discovering causal interactions using Bayesian network scoring and information gain

Variable Description Values
age_at_diagnosis age at diagnosis of the disease 0-39, 39–54, 54–69, 69–84, 84-100
menopausal_status inferred menopausal status pre, post
size size of tumor in cm 0-20, 20–50, 50-180
lymph_nodes_positive number of positive lymph nodes 0, 1, 2–3, 4–5, 6–9. ≥ 10
lymph_nodes_removed number of lymph nodes removed 0, 1–3, 4–9, 10–20, ≥ 21
percent_nodes_positive percent of removed nodes positive 0-0.2, 0.2-0.4, 0.4-0.6, 0.6-0.8, 0.8-1
grade grade of disease 1, 2, 3
stage composite of size and # positive nodes 0,1,2,3,4
histological tumor histology IDC, Other
ER_Expr estrogen receptor expression +, −
PR_Expr progesterone receptor expression +, −
HER2_status HER2 expression +, −
P53_mutation_status whether P53 is mutated +, −
chemo whether patient had chemotherapy yes, no
radiation whether patient had radiation therapy yes, no
hormone whether patient had hormone therapy yes, no