impute_mean.Rd
Replaces any values < 0 with NA, calculates the subgroup mean, then replaces missing values with the subgroup mean.
impute_mean(
data,
var_names,
remove_zeros = FALSE,
strat_vars = c("year", "sex", "imd_quintile", "age_cat")
)
Data table - the health survey data
Character vector - the variable names to be imputed (numeric variables only)
Logical - should zeros be treated as missing data
Character vector - the variables by which to stratify the subgroup means
Returns an updated version of data in which the variables specified have had their missing values imputed with the subgroup means.
If not all NAs can be imputed with the fine scale starting amount of stratification, imputation is attempted again, removing the stratification variable specified last.
if (FALSE) {
data <- read_2001()
data <- clean_age(data)
data <- clean_demographic(data)
data <- impute_mean(data, var_names = c("d7many"))
}