Shengxin Tu dissertation defense – May 10
PhD candidate Shengxin Tu will defend her dissertation on Friday, May 10, at 8 a.m. Central Time, at 2525 West End Avenue, in the 11th floor large conference room (suite 1100, room 11105). Her advisor is Bryan Shepherd. All are invited and encouraged to attend.
Rank-Based Analyses and Designs with Clustered Data
Clustered data are common in biomedical research. It is often of interest to evaluate the correlations within clusters and between variables with clustered data. Conventional approaches, including intraclass correlation coefficients (ICCs) and Pearson correlations, are commonly used in analyses with clustered data. However, these conventional approaches are sensitive to extreme values and skewness. They also depend on the scale of the data and are not applicable to ordered categorical data. In this dissertation, we define population parameters for the rank ICC and between- and within-cluster Spearman rank correlations. These definitions are natural extensions of the conventional correlations to the rank scale. We show that the total Spearman rank correlation approximates a weighted sum of between- and within-cluster Spearman rank correlations, with weights determined by the rank ICCs of the two random variables. We also describe estimation and inference for these four rank-based correlations, conduct simulations to evaluate the performance of our estimators, and illustrate their use with real data examples. Furthermore, we apply the rank ICC in the design of clustered randomized controlled trials (RCTs), proposing unified and simple sample size calculations for cluster RCTs with skewed or ordinal outcomes. Our calculation involves inflating the sample size for an adequately powered individual RCT for an ordinal outcome with a design effect that incorporates the rank ICC. For continuous outcomes, our calculation sets the number of distinct ordinal levels to the sample size. We show that with continuous data, our calculations closely approximate more complicated sample size calculations based on clustered Wilcoxon rank-sum tests. We conduct simulations to evaluate our calculations’ performance and illustrate their use in the design of two cluster RCTs, one with a skewed continuous outcome and a non-inferiority trial with an irregularly distributed count outcome.