Skip to contents

Example

The example uses the cancer data set from the survival package and compares survival, defined by the time variable time (recoded to years) and the event variable status (recoded to 1 = death, 0 = censored), by sex.

library(rifttable)
data(cancer, package = "survival")

cancer <- cancer |>
  tibble::as_tibble() |>
  dplyr::mutate(
    # The exposure (here, 'sex') must be categorical (a factor)
    sex = factor(
      sex,
      levels = 1:2,
      labels = c("Male", "Female")
    ),
    time = time / 365.25, # transform to years
    status = status - 1
  )

tibble::tribble(
  ~label,                                  ~type,
  "**Absolute estimates**",                "",
  "*Counts and sums*",                     "",
  "  Observations, *N*",                   "total",
  "  Events, *n*",                         "events",
  "  Events/observations",                 "events/total",
  "  Events/person-years",                 "events/time",
  "*Follow-up*",                           "",
  "  Person-years",                        "time",
  "  Maximum follow-up, years",            "maxfu",
  "  Median follow-up, years",             "medfu",
  "  Median follow-up (IQR), years",       "medfu (iqr)",
  "*Rates*",                               "",
  "  Rate per 1000 person-years",          "rate",
  "  Rate per 1000 person-years (95% CI)", "rate (ci)",
  "  Events/py (rate per 1000 py)",        "events/time (rate)",
  "*Risks*",                               "",
  "  1-year survival",                     "surv",
  "  1-year survival (95% CI)",            "surv (ci)",
  "  1-year risk/cumulative incidence",    "cuminc",
  "  1-year risk (95% CI)",                "cuminc (ci)",
  "  Median survival, years",              "medsurv",
  "  Median survival (95 CI), years",      "medsurv (ci)",
  "",                                      "",
  "**Comparative estimates**",             "",
  "  1-year survival difference",          "survdiff",
  "  1-year risk difference",              "cumincdiff",
  "  1-year survival ratio",               "survratio",
  "  1-year risk ratio",                   "cumincratio",
  "  Hazard ratio (95% CI)",               "hr"
) |>
  dplyr::mutate(
    time = "time",
    event = "status",
    exposure = "sex",
    arguments = list(list(timepoint = 1))
  ) |>
  rifttable(
    data = cancer,
    overall = TRUE
  ) |>
  rt_gt()
Summary Overall Male Female
Absolute estimates
Counts and sums
Observations, N 228 138 90
Events, n 165 112 53
Events/observations 165/228 112/138 53/90
Events/person-years 165/191 112/107 53/84
Follow-up
Person-years 191 107 84
Maximum follow-up, years 2.80 2.80 2.64
Median follow-up, years 1.61 2.30 1.45
Median follow-up (IQR), years 1.61 (0.82, 2.64) 2.30 (1.11, 2.77) 1.45 (0.76, 2.25)
Rates
Rate per 1000 person-years 866.0 1046.6 634.6
Rate per 1000 person-years (95% CI) 866.0 (743.4, 1008.7) 1046.6 (869.7, 1259.6) 634.6 (484.8, 830.6)
Events/py (rate per 1000 py) 165/191 (866.0) 112/107 (1046.6) 53/84 (634.6)
Risks
1-year survival 0.41 0.34 0.53
1-year survival (95% CI) 0.41 (0.34, 0.49) 0.34 (0.26, 0.43) 0.53 (0.42, 0.66)
1-year risk/cumulative incidence 0.59 0.66 0.47
1-year risk (95% CI) 0.59 (0.51, 0.66) 0.66 (0.57, 0.74) 0.47 (0.34, 0.58)
Median survival, years 0.85 0.74 1.17
Median survival (95 CI), years 0.85 (0.78, 0.99) 0.74 (0.58, 0.85) 1.17 (0.95, 1.51)
Comparative estimates
1-year survival difference 0 (reference) 0.19 (0.05, 0.34)
1-year risk difference 0 (reference) -0.19 (-0.33, -0.04)
1-year survival ratio 1 (reference) 1.57 (1.12, 2.19)
1-year risk ratio 1 (reference) 0.71 (0.55, 1.00)
Hazard ratio (95% CI) 1 (reference) 0.59 (0.42, 0.82)

Absolute estimates per exposure category

type Description Options (arguments =)
"cuminc" Cumulative incidence (“risk”) from the Kaplan-Meier estimator or, if competing risks are present, its generalized form, the Aalen-Johansen estimator. If no time point is provided, returns cumulative incidence at end of follow-up. Change between display as proportion or percent using the parameter risk_percent of rifttable(). timepoint, id. Example: list(timepoint = 2.5) to show 2.5-year risk.
"cuminc (ci)" Cumulative incidence (“risk”), as above, with confidence intervals (default: 95%; Greenwood standard errors with log transformation, the default of the survival package/survival::survfit()). See "cuminc".
"events" Event count.
"events/time" Events slash person-time.
"events/time (rate)" A combination: Events slash time followed by rate in parentheses.
"events/total" Events slash number of observations.
"rate" Event rate: event count divided by person-time, multiplied by the rifttable() parameter factor.
"rate (ci)" Event rate with confidence interval (default: 95%; Poisson-type large-sample interval).
"surv" Survival from the Kaplan-Meier estimator. Not estimated if competing risks are present. If no time point is provided, returns survival at end of follow-up. Change between display as proportion or percent using the parameter risk_percent of rifttable(). See "cuminc".
"surv (ci)" Survival from the Kaplan-Meier estimator, as above, with confidence interval (default: 95%; Greenwood standard errors with log transformation, the default of the survival package/survival::survfit()). See "cuminc".
"time" Person-time.
"maxfu" Maximum follow-up time.
"medfu" Median follow-up (“reverse Kaplan-Meier”), equals median survival for censoring. If competing risks are present, events other than the event of interest are also considered censoring to estimate the median follow-up for the event of interest.
"medfu (iqr)" Median and interquartile range for follow-up, as above.
"medsurv" Median survival.
"medsurv (ci)" Median survival with confidence interval (default: 95%).

Comparative estimates with confidence intervals

type Description Options (arguments =)
"cumincdiff" Difference in cumulative incidence (risk difference) from Kaplan-Meier estimator or, if competing risks are present, its generalized form, the Aalen-Johansen estimator, with confidence interval (default: 95%). Uses rifttable::survdiff_ci(). timepoint. Example: list(timepoint = 2.5) to calculate difference in 2.5-year risk.
"cumincratio" Ratio of cumulative incidence (risk ratio), with confidence interval (default: 95%), similar to "cumincdiff". timepoint
"hr" Hazard ratio from Cox proportional hazards regression, with confidence interval (default: 95%). If competing events are present, hazard ratios are cause-specific for the event of interest. list(robust = TRUE) for robust (sandwich) standard errors. Use "+ cluster(id_variable)" in confounders to obtain clustering specifically for Cox models, or use the id argument of the main rifttable() call.
"survdiff" Difference in survival from Kaplan-Meier estimator, with confidence interval (default: 95%). Not estimated if competing risks are present. Uses rifttable::survdiff_ci(). See "cumincdiff".
"survratio" Ratio of survival from Kaplan-Meier estimator, with confidence interval (default: 95%), similar to "survdiff". See "cumincdiff".

Competing events

With only one event type, the event variable only has two levels: censoring, typically encoded as 0, and the event, typically encoded as 1.

With competing events, the event variable will have additional levels. The survival::Surv() function used by rifttable assumes that the first-ordered level represents censoring and the others are different non-censoring events. For example, if the event variable is a factor, then "Censoring" needs to be the first of the factor’s levels().

It is necessary to specify the event of interest in the design if competing events are present. For example, if the event variable is a factor variable status_competing, with levels "Censored", "Outcome of interest", and "Other-cause death", then specify event = "status_competing@Outcome of interest" in the table design.

See the tables above for how competing events are handled. When no details are noted, the event of interest is recoded as the sole event and other events are considered censoring.

Weighted estimates

The name of a weights variable, for example inverse-probability weights, can be provided via the column weights of the design table. Weights, if present, are used by all comparative estimators of survival as well as by type = "cuminc" and type = "surv", and are ignored otherwise.

Clustering and robust variance

An id variable identifying individuals in the data must be provided when time and time2 are used in the setting of competing events, or when non-integer weights are present. If no id variable is provided, each row is taken as one individual. Robust variance will then automatically be calculated by estimators of survival/cumulative incidence and Cox models.