Title: | Penalized High-Dimensional Joint Model |
---|---|
Description: | Joint models have been widely used to study the associations between longitudinal biomarkers and a survival outcome. However, existing joint models only consider one or a few longitudinal biomarkers and cannot deal with high-dimensional longitudinal biomarkers. This package can be used to fit our recently developed penalized joint model that can handle high-dimensional longitudinal biomarkers. Specifically, an adaptive lasso penalty is imposed on the parameters for the effects of the longitudinal biomarkers on the survival outcome, which allows for variable selection. Also, our algorithm is computationally efficient, which is based on the Gaussian variational approximation method. |
Authors: | Jiehuan Sun [aut, cre] |
Maintainer: | Jiehuan Sun <[email protected]> |
License: | GPL-2 |
Version: | 0.1.0 |
Built: | 2024-11-10 04:14:19 UTC |
Source: | https://github.com/cran/HDJM |
This list contains a list of parameters specifying the joint model.
ID_name the variable name for the patient ID in both longitudinal data and survival data.
item_name the variable name for the longitudinal outcomes in the longitudinal data.
value_name the variable name for the longitudinal measurements in the longitudinal data.
time_name the variable name for the measurement timepoints in the longitudinal data.
fix_cov a set of variables names indicating the covariates of fixed-effects in the longitudinal submodel. If NULL, not baseline covariates are included.
random_cov a set of variables names indicating the covariates of random-effects in the longitudinal submodel. If NULL, not baseline covariates are included.
FUN a function specifying the time-related basis functions in the longitudinal submodel.
ran_time_ind a vector of integers specifying which time-related basis functions are also included with random-effects in the longitudinal submodel.
surv_time_name the variable name for the survival time in the survival data.
surv_status_name the variable name for the censoring indicator in the survival data.
surv_cov a set of variables names specifying the baseline covariates in the survival submodel.
n_points an integer indicating the numebr of nodes being used in the Gaussian quadrature.
Jiehuan Sun [email protected]
The function is used to fit the penalized HDJM with adpative lasso penalty.
HDJM_fit( LongData = NULL, SurvData = NULL, marker.name = NULL, control_list = NULL, nlam = 50, ridge = 0, pmax = 10, min_ratio = 0.01, maxiter = 100, eps = 1e-04, UseSurvN = FALSE )
HDJM_fit( LongData = NULL, SurvData = NULL, marker.name = NULL, control_list = NULL, nlam = 50, ridge = 0, pmax = 10, min_ratio = 0.01, maxiter = 100, eps = 1e-04, UseSurvN = FALSE )
LongData |
a data frame containing the longitudinal data
(see |
SurvData |
a data frame containing the survival data
(see |
marker.name |
a vector indicating which set of longitudinal biomarkers to be analyzed. If NULL, all biomarkers in LongData will be used. |
control_list |
a list of parameters specifying the joint model
(see |
nlam |
number of tuning parameters. |
ridge |
ridge penalty. |
pmax |
the maximum of biomarkers being selected. The algorithm will stop early if the maximum has been reached. |
min_ratio |
the ratio between the largest possible penalty and the smallest penalty to tune. |
maxiter |
the maximum number of iterations. |
eps |
threshold for convergence. |
UseSurvN |
a logical variable indicating whether the effective sample size (i.e., the number of events) should be used in calculating BIC. |
return a list with the following objects.
marker.name |
the names for biomarkers being analyzed. |
alpha |
the estimates for the effects of biomarkers in the survival submodel. |
weib |
the estimates for the Weibull baseline hazard in the survival submodel. |
gamma |
the estimates for the effects of baseline covariates in the survival submodel. |
beta |
the estimates for the fixed-effects in the longitudinal submodel. |
sig2 |
the estimates for the noise variances in the longitudinal submodel. |
Sigma |
the estimates for the covariance matrices of the random effects in the longitudinal submodel. |
Jiehuan Sun and Sanjib Basu. "Penalized Joint Models of High-Dimensional Longitudinal Biomarkers and A Survival Outcome".
data(HDJMdata) flex_time_fun <- function(x=NULL){ xx = matrix(x, ncol = 1) colnames(xx) = c("year_l") xx } ran_time_ind = 1 ## random time-trend effects control_list = list( ID_name = "ID", item_name = "item", value_name = "value", time_name = "years", fix_cov = NULL, random_cov = NULL, FUN = flex_time_fun, ran_time_ind=ran_time_ind, surv_time_name = "ftime", surv_status_name = "fstat", surv_cov = "x", n_points = 5 ) ## takes about one minute. res = HDJM_fit(LongData=LongData, SurvData=SurvData, control_list=control_list)
data(HDJMdata) flex_time_fun <- function(x=NULL){ xx = matrix(x, ncol = 1) colnames(xx) = c("year_l") xx } ran_time_ind = 1 ## random time-trend effects control_list = list( ID_name = "ID", item_name = "item", value_name = "value", time_name = "years", fix_cov = NULL, random_cov = NULL, FUN = flex_time_fun, ran_time_ind=ran_time_ind, surv_time_name = "ftime", surv_status_name = "fstat", surv_cov = "x", n_points = 5 ) ## takes about one minute. res = HDJM_fit(LongData=LongData, SurvData=SurvData, control_list=control_list)
This dataset contains longitudinal outcomes.
data(HDJMdata)
data(HDJMdata)
A data frame with 48700 rows and 4 variables
ID patient ID
item types of longitudinal outcome
years measurement timepoints
value measurements
Jiehuan Sun [email protected]
This dataset contains survival outcome.
data(HDJMdata)
data(HDJMdata)
A data frame with 100 rows and 4 variables
ID patient ID
fstat censoring indicator
ftime survival time
x baseline covariates
Jiehuan Sun [email protected]