Title: | MAny-Group MAtching |
---|---|
Description: | Balancing quasi-experimental field research for effects of covariates is fundamental for drawing causal inference. Propensity Score Matching deals with this issue but current techniques are restricted to binary treatment variables. Moreover, they provide several solutions without providing a comprehensive framework on choosing the best model. The MAGMA R-package addresses these restrictions by offering nearest neighbor matching for two to four groups. It also includes the option to match data of a 2x2 design. In addition, MAGMA includes a framework for evaluating the post-matching balance. The package includes functions for the matching process and matching reporting. We provide a tutorial on MAGMA as vignette. More information on MAGMA can be found in Feuchter, M. D., Urban, J., Scherrer V., Breit, M. L., and Preckel F. (2022) <https://osf.io/p47nc/>. |
Authors: | Julian Urban [aut, cre], Markus D. Feuchter [aut], Vsevolod Scherrer [aut], Moritz L. Breit [aut], Franzis Preckel [aut] |
Maintainer: | Julian Urban <[email protected]> |
License: | GPL-3 |
Version: | 1.0.3 |
Built: | 2025-03-04 13:30:07 UTC |
Source: | https://github.com/julianurban/magma |
This function extracts the balance criteria or pairwise effects of a
Balance_MAGMA
result for a specified sample size.
Balance_extract(Balance, samplesize, effects = FALSE)
Balance_extract(Balance, samplesize, effects = FALSE)
Balance |
A result of |
samplesize |
An integer indicating the sample size for which the balance criteria or pairwise effects should be extracted. |
effects |
Indicates whether balance criteria or pairwise effects should be extracted. The default value FALSE returns the balance criteria, while TRUE leads to the extraction of pairwise effects. |
Given a previous computed Balance_MAGMA
object, this function
enables the extraction of balance criteria or pairwise effects for any
desired sample size. This makes it possible to independently check the
balance for each possible sample size.
Depends on the effects argument. If FALSE, it returns in a vector containing the balance criteria. If TRUE, it returns a vector containing all possible pairwise effects.
Julian Urban
# Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") # Balance criteria for 100 cases per group # Balance criteria Balance_100_gifted <- Balance_extract(Balance = Balance_gifted, samplesize = 100, effects = FALSE) Balance_100_gifted # Pairwise effects Balance_100_gifted_effects <- Balance_extract(Balance = Balance_gifted, samplesize = 100, effects = TRUE) Balance_100_gifted_effects # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") # Step created during matching str(Balance_2x2) # Balance criteria for 125 cases per group # Balance criteria Balance_125_2x2 <- Balance_extract(Balance = Balance_2x2, samplesize = 125, effects = FALSE) Balance_125_2x2 # Pairwise effects Balance_125_2x2_effects <- Balance_extract(Balance = Balance_2x2, samplesize = 125, effects = TRUE) Balance_125_2x2_effects
# Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") # Balance criteria for 100 cases per group # Balance criteria Balance_100_gifted <- Balance_extract(Balance = Balance_gifted, samplesize = 100, effects = FALSE) Balance_100_gifted # Pairwise effects Balance_100_gifted_effects <- Balance_extract(Balance = Balance_gifted, samplesize = 100, effects = TRUE) Balance_100_gifted_effects # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") # Step created during matching str(Balance_2x2) # Balance criteria for 125 cases per group # Balance criteria Balance_125_2x2 <- Balance_extract(Balance = Balance_2x2, samplesize = 125, effects = FALSE) Balance_125_2x2 # Pairwise effects Balance_125_2x2_effects <- Balance_extract(Balance = Balance_2x2, samplesize = 125, effects = TRUE) Balance_125_2x2_effects
This function computes all four balance criteria of 'MAGMA.R', namely Pillai's Trace, d-ratio, mean g, and adjusted d-ratio. The estimation considers the scale level of the variables. Balance estimation is performed across various sample sizes. See Details for more information.
Balance_MAGMA( Data, group, covariates, step = "step", verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
Balance_MAGMA( Data, group, covariates, step = "step", verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
Data |
A data frame containing at least the grouping variable, the step variable from the main MAGMA-function (or other matching algorithms), and all covariates of interest. |
group |
A character specifying the name of your grouping variable in data. Note that MAGMA can only match your data for a maximum of four groups. For matching over two grouping variables (e.g., 2x2 design) is possible by specifying group as a character vector with a length of two. In this case, each of the two grouping variables can only have two levels. |
covariates |
A character vector listing the names of all binary and metric covariates of interest. |
step |
A character specifying the step variable of the matching. Per default, it is set to step, which corresponds the resulting name of the main MAGMA function. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
covariates_ordinal |
A character vector listing the names of all ordinal covariates of interest. |
covariates_nominal |
A character vector listing the names of all nominal covariates of interest. |
This function computes all four balance criteria of 'MAGMA.R', namely Pillai's Trace, d-ratio, mean g, and adjusted d-ratio. This is an iterative process including more cases with each iteration according to the step variable. Thus, starting with cases having a small within-match distance, larger distances are included with increasing iterations. As a minimum the function specifies n >= 20 per group. This does not imply that balance criteria with such a small sample size can be estimated consistently. For Pillai's Trace a higher minimum sample size can be possible. It depends on the number of covariates to ensure a positive model identification. Missing data for Pillai's Trace are excluded listwise, while for the other balance criteria pairwise exclusion is applied.
A list of length four containing all balance criteria and all pairwise effects with respect to group sample size.
Julian Urban
Pastore, M., Loro, P.A.D., Mingione, M., Calcagni, A. (2022). overlapping: Estimation of Overlapping in Empirical Distributions. R package version 2.1, (https://CRAN.R-project.org/package=overlapping).
Revelle, W. (2023). psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois. R package version 2.3.6, (https://CRAN.R-project.org/package=psych).
Viechtbauer, W. (2010). Conducting meta-analyses in R with the metafor package. Journal of Statistical Software, 36(3), 1-48. (doi:10.18637/jss.v036.i03)
Fisher, Z., Tipton, E., Zhipeng, H. (2023). robumeta: Robust Variance Meta-Regression. R package version 2.1, (https://CRAN.R-project.org/package=robumeta).
# Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) # Using subsample only for faster execution. Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") str(Balance_gifted) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") # step created during matching str(Balance_2x2)
# Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) # Using subsample only for faster execution. Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") str(Balance_gifted) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") # step created during matching str(Balance_2x2)
This function shows and quantifies the kernel density overlap of a variable for two or more groups.
Density_overlap( Data, variable, group, variable_name = NULL, group_labels = NULL, group_name = NULL, step_num = NULL, step_var = NULL, verbose = TRUE )
Density_overlap( Data, variable, group, variable_name = NULL, group_labels = NULL, group_name = NULL, step_num = NULL, step_var = NULL, verbose = TRUE )
Data |
A data frame that contains the desired variable for density plotting as well as the specified grouping variable. |
variable |
A character specifying the variable for which the density should be plotted (e.g., "ps_gifted"). |
group |
A character specifying the groups for which the density should be plotted. Can be an independent group comparison (e.g., comparing matched groups) or the comparison of pre and post matched samples. |
variable_name |
A character specifying the name to appear in the plot for the variable. |
group_labels |
A character vector specifying the labels for the groups to ppear in the legend of the plot. |
group_name |
A character specifying the name of the grouping variable to appear in the title of the legend. |
step_num |
An integer specifying the number of cases to be included per group in this post matching comparison. Is based on the step variable of MAGMA. |
step_var |
A character specifying the name of the step variable. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
This function enables the comparison of the density of variables for two or more groups. It plots the kernel density separately for each group and quantifies the amount of overlap.
A plot showing the kernel density for a specified variable separately for specified groups and the quantification of this overlap.
Julian Urban
Pastore, M., Loro, P.A.D., Mingione, M., Calcagni, A. (2022). overlapping: Estimation of Overlapping in Empirical Distributions. R package version 2.1, https://CRAN.R-project.org/package=overlapping.
# Estimating density overlap using the data set 'MAGMA_sim_data # Estimating density overlap for 'ps_gifted' (propensity scores for # giftedness support) # Defining plot aesthetics with 'group', 'variable_name', "group_lables', # and 'group_name' # Estimating pre-matching density overlap by not specifying 'step_num' and # 'step_var' Density_overlap(Data = MAGMA_sim_data, variable = "ps_gifted", group = "gifted_support", step_num = NULL, step_var = NULL, variable_name = "Propensity Score", group_labels = c("No Support", "Support"), group_name = "Gifted Support") # Estimating density overlap using the matched data set #'MAGMA_sim_data_gifted' # Estimating density overlap for 'ps_gifted' (propensity scores for # giftedness support) # Defining plot aesthetics with 'group', 'variable_name', 'group_lables', # and 'group_name' # Estimating post-matching overlap for 250 cases per group ('step_num') # Name of the step variable is 'step' Density_overlap(Data = MAGMA_sim_data, variable = "ps_gifted", group = "gifted_support", step_num = 250, step_var = "step_gifted", variable_name = "Propensity Score", group_labels = c("No Support", "Support"), group_name = "Gifted Support") # Estimating density overlap using the data set 'MAGMA_sim_data # Estimating density overlap for 'teacher_ability_rating' (ability rated # from teacher as below average, average, or above average) # Defining plot aesthetics with 'group', 'variable_name', 'group_lables', # and 'group_name' # Estimating pre-matching density overlap by not specifying 'step_num' and # 'step_var' Density_overlap(Data = MAGMA_sim_data, variable = "GPA_school", group = "teacher_ability_rating", variable_name = "School Achievement", group_labels = c("Low", "Medium", "High"), group_name = "Rating")
# Estimating density overlap using the data set 'MAGMA_sim_data # Estimating density overlap for 'ps_gifted' (propensity scores for # giftedness support) # Defining plot aesthetics with 'group', 'variable_name', "group_lables', # and 'group_name' # Estimating pre-matching density overlap by not specifying 'step_num' and # 'step_var' Density_overlap(Data = MAGMA_sim_data, variable = "ps_gifted", group = "gifted_support", step_num = NULL, step_var = NULL, variable_name = "Propensity Score", group_labels = c("No Support", "Support"), group_name = "Gifted Support") # Estimating density overlap using the matched data set #'MAGMA_sim_data_gifted' # Estimating density overlap for 'ps_gifted' (propensity scores for # giftedness support) # Defining plot aesthetics with 'group', 'variable_name', 'group_lables', # and 'group_name' # Estimating post-matching overlap for 250 cases per group ('step_num') # Name of the step variable is 'step' Density_overlap(Data = MAGMA_sim_data, variable = "ps_gifted", group = "gifted_support", step_num = 250, step_var = "step_gifted", variable_name = "Propensity Score", group_labels = c("No Support", "Support"), group_name = "Gifted Support") # Estimating density overlap using the data set 'MAGMA_sim_data # Estimating density overlap for 'teacher_ability_rating' (ability rated # from teacher as below average, average, or above average) # Defining plot aesthetics with 'group', 'variable_name', 'group_lables', # and 'group_name' # Estimating pre-matching density overlap by not specifying 'step_num' and # 'step_var' Density_overlap(Data = MAGMA_sim_data, variable = "GPA_school", group = "teacher_ability_rating", variable_name = "School Achievement", group_labels = c("Low", "Medium", "High"), group_name = "Rating")
This function computes all four balance criteria of 'MAGMA.R,' namely Pillai's Trace, d-ratiO, mean g, and adjusted d-ratio for the unmatched data set. This enables comparison of initial unbalance with the balance after matching.
initial_unbalance( Data, group, covariates, verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
initial_unbalance( Data, group, covariates, verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
Data |
A data frame containing at least the grouping variable and all covariates of interest. |
group |
A character specifying the name of your grouping variable in data. Note that MAGMA can only match your data for a maximum of 4 groups. For matching over two grouping variables (e.g., 2x2 design) is possible by specifying group as a character vector with a length of two. In this case each or the two grouping variables can only have two levels. |
covariates |
A character vector listing the names of all binary and metric covariates of interest. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
covariates_ordinal |
A character vector listing the names of all ordinal covariates of interest. |
covariates_nominal |
A character vector listing the names of all nominal covariates of interest. |
This function computes all four Balance criteria of 'MAGMA.R', namely Pillai's Trace, d-ratio, mean g, and adjusted d-ratio for the overall samples. Missing data for Pillai's Trace are excluded listwise, while for the other balance criteria pairwise exclusion is applied.
A numeric vector of length 4 containing the balance criteria for the unmatched sample.
Julian Urban
Pastore, M., Loro, P.A.D., Mingione, M., Calcagni, A. (2022). overlapping: Estimation of Overlapping in Empirical Distributions. R package version 2.1, (https://CRAN.R-project.org/package=overlapping).
Revelle, W. (2023). psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois. R package version 2.3.6, (https://CRAN.R-project.org/package=psych)
Viechtbauer, W. (2010). Conducting meta-analyses in R with the metafor package. Journal of Statistical Software, 36(3), 1-48. (doi:10.18637/jss.v036.i03)
Fisher, Z., Tipton, E., Zhipeng, H. (2023). robumeta: Robust Variance Meta-Regression. R package version 2.1, (https://CRAN.R-project.org/package=robumeta).
# Defining covariates for balance estimation covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variable 'gifted_support' (received # giftedness support yes or no) unbalance_gifted <- initial_unbalance(Data = MAGMA_sim_data, group = "gifted_support", covariates = covariates_vector) unbalance_gifted # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variable 'teacher_ability_rating' # (ability rated from teacher as below average, average, or above average) unbalance_tar <- initial_unbalance(Data = MAGMA_sim_data, group = "teacher_ability_rating", covariates = covariates_vector) unbalance_tar # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variables 'gifted_support' (received # giftedness support yes or no) and 'enrichment' (participated in enrichment # or not) unbalance_2x2 <- initial_unbalance(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), covariates = covariates_vector) unbalance_2x2
# Defining covariates for balance estimation covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variable 'gifted_support' (received # giftedness support yes or no) unbalance_gifted <- initial_unbalance(Data = MAGMA_sim_data, group = "gifted_support", covariates = covariates_vector) unbalance_gifted # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variable 'teacher_ability_rating' # (ability rated from teacher as below average, average, or above average) unbalance_tar <- initial_unbalance(Data = MAGMA_sim_data, group = "teacher_ability_rating", covariates = covariates_vector) unbalance_tar # Computing initial unbalance using the data set 'MAGMA_sim_data' # Computing initial unbalance for the variables 'gifted_support' (received # giftedness support yes or no) and 'enrichment' (participated in enrichment # or not) unbalance_2x2 <- initial_unbalance(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), covariates = covariates_vector) unbalance_2x2
This function conducts many group matching for 2 to 4 groups. It augments the original data set by the relevant 'MAGMA.R' variables. For details, see below.
MAGMA(Data, group, dist, cores = 1, verbose = TRUE)
MAGMA(Data, group, dist, cores = 1, verbose = TRUE)
Data |
A data frame or tibble containing at least your grouping and distance variable. Data needs to be specified in your environment. |
group |
A character specifying the name of your grouping variable in the data. Note that MAGMA can only match your data for a maximum of 4 groups. Matching over two grouping variables (e.g., 2x2 Design) is possible by specifying group as a character vector with a length of two. In this case, each or the 2 grouping variables can only have two levels. |
dist |
A character specifying the name of your distance variable in data. |
cores |
An integer defining the number of cores used for parallel computation. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
This function conducts nearest neighbor many group matching. It is applicable for two to four groups or a 2x2 design. As output, this function augments your original data by the variables weight, step, distance, and ID. Weight indicates whether a case was matched. Step specifies the iteration in which a case was matched. It also shows which cases were matched together. Distance indicates the mean difference within a match. Since matches with a lower distance are matched in an earlier iteration, step and distance are strongly correlated. This function has some CPU and RAM load. In most four-group applications and three-group applications with large sample size, RAM may be not sufficient. Therefore MAGMA switches to random quasi-systematic matching. If this is the case, MAGMA informs you. The output of the function does not change, but balance might be slightly affected.
Your input data frame augmented with matching relevant variables, namely weight, step, distance, and ID. In case of missing values on the distance or group variable, MAGMA excludes them for the matching process. The returned data set does not contain those excluded cases. For more information, see Details.
Julian Urban
# Running this code will take a while # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'gifted_support' (received giftedness support yes or no) # 'MAGMA_sim_data_gifted' contains the result of the matching MAGMA_sim_data_gifted <- MAGMA(Data = MAGMA_sim_data, group = "gifted_support", dist = "ps_gifted", cores = 1) head(MAGMA_sim_data_gifted) # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'teacher_ability_rating' (ability rated from teacher as # below average, average, or above average) # MAGMA_sim_data_tar' contains the result of the matching # Cores per default = 1 MAGMA_sim_data_tar <- MAGMA(Data = MAGMA_sim_data, group = "teacher_ability_rating", dist = "ps_tar") head(MAGMA_sim_data_tar) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equal to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) head(MAGMA_sim_data_gift_enrich)
# Running this code will take a while # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'gifted_support' (received giftedness support yes or no) # 'MAGMA_sim_data_gifted' contains the result of the matching MAGMA_sim_data_gifted <- MAGMA(Data = MAGMA_sim_data, group = "gifted_support", dist = "ps_gifted", cores = 1) head(MAGMA_sim_data_gifted) # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'teacher_ability_rating' (ability rated from teacher as # below average, average, or above average) # MAGMA_sim_data_tar' contains the result of the matching # Cores per default = 1 MAGMA_sim_data_tar <- MAGMA(Data = MAGMA_sim_data, group = "teacher_ability_rating", dist = "ps_tar") head(MAGMA_sim_data_tar) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equal to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) head(MAGMA_sim_data_gift_enrich)
This function provides pre- and post-matching descriptive statistics and effects.
MAGMA_desc( Data, covariates, group, step_num = NULL, step_var = NULL, filename = NULL, verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
MAGMA_desc( Data, covariates, group, step_num = NULL, step_var = NULL, filename = NULL, verbose = TRUE, covariates_ordinal = NULL, covariates_nominal = NULL )
Data |
A data frame that contains the desired variable for density plotting as well as the specified grouping variable. |
covariates |
A character vector specifying the variable names of the binary and metric variables for which the descriptive statistics should be computed. |
group |
A character (vector) specifying the groups for which differentiated statistics should be computed. |
step_num |
An integer specifying the number of cases to be included per group in this post matching comparison (e.g., 100). If no value is specified, pre-matching statistics are computed. Is based on the step variable of MAGMA. Optional argument. |
step_var |
A character specifying the name of the step variable in the data set. If no value is specified, pre matching statistics are computed. Optional argument. |
filename |
A character specifying the filename that the resulting Word document with the Table should have. Optional argument. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
covariates_ordinal |
A character vector specifying the variable names of the ordinal variables for which the descriptive statistics should be computed. |
covariates_nominal |
A character vector specifying the variable names of the nominal variables for which the descriptive statistics should be computed. |
This function enables the computation of descriptive statistics for the overall sample and specified groups. Additional, pairwise effects according to the respective scale level are computed.
A table of descriptive statistics and pairwise effects for pre- or post-matching samples.
Julian Urban
# Defining covariates covariates_gifted <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating pre-matching descriptive statistics and pairwise effects using # the data set 'MAGMA_sim_data' # Estimating statistics for grouping variable 'gifted support' (received # giftedness support yes or no) MAGMA_desc(Data = MAGMA_sim_data, covariates = covariates_gifted, group = "gifted_support") # Estimating post-matching descriptive statistics and pairwise effects using # the data set 'MAGMA_sim_data' # Estimating statistics for grouping variable 'gifted support' (received # giftedness support yes or no) # Estimating statistics for 100 cases per group MAGMA_desc(Data = MAGMA_sim_data, covariates = covariates_gifted, group = "gifted_support", step_num = 100, step_var = "step_gifted")
# Defining covariates covariates_gifted <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating pre-matching descriptive statistics and pairwise effects using # the data set 'MAGMA_sim_data' # Estimating statistics for grouping variable 'gifted support' (received # giftedness support yes or no) MAGMA_desc(Data = MAGMA_sim_data, covariates = covariates_gifted, group = "gifted_support") # Estimating post-matching descriptive statistics and pairwise effects using # the data set 'MAGMA_sim_data' # Estimating statistics for grouping variable 'gifted support' (received # giftedness support yes or no) # Estimating statistics for 100 cases per group MAGMA_desc(Data = MAGMA_sim_data, covariates = covariates_gifted, group = "gifted_support", step_num = 100, step_var = "step_gifted")
This function conducts exact many group matching for 2 to 4 groups. Exact means that only cases with the same value on the exact variable can be matched. It augments the original data set by relevant 'MAGMA.R' variables. For details, see below.
MAGMA_exact(Data, group, dist, exact, cores = 1, verbose = TRUE)
MAGMA_exact(Data, group, dist, exact, cores = 1, verbose = TRUE)
Data |
A data frame or tibble containing at least your grouping and distance variable. Data needs to be specified in your environment. |
group |
A character specifying the name of your grouping variable in the data. Note that MAGMA can only match your data for a maximum of 4 groups. Matching over two grouping variables (e.g., 2x2 Design) is possible by specifying group as a character vector with a length of two. In this case, each or the 2 grouping variables can only have two levels. |
dist |
A character specifying the name of your distance variable in data. |
exact |
A character specifying the name of the exact variable. Only cases with the same value on this variable can be matched. |
cores |
An integer defining the number of cores used for parallel computation. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
This function conducts nearest neighbor exact many group matching. It is
applicable for two to four groups or a 2x2 design. As output, this function
augments your original data by the variables weight, step, distance,
and ID. Weight indicates whether a case was matched. Step specifies the
iteration in which a case was matched. It also shows which cases were
matched together. Distance indicates the mean difference within a match.
Since matches with a lower distance are matched in an earlier iteration,
step and distance are strongly correlated.
Exact matching means that only cases with the same value on the exact
variable can be matched. As example, only person of the same gender, the
same school, or the same organization are possible matches. For standard
matching, see MAGMA
Your input data frame of valid cases augmented with matching relevant variables, namely weight, step, distance, and ID. In case of missing values on the distance or group variable, MAGMA_exact excludes them for the matching process. The returned data set does not contain those excluded cases. For more information, see Details.
Julian Urban
# Running this code will take a while # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'gifted_support' (received giftedness support yes or no) # 'MAGMA_sim_data_gifted_exact' contains the result of the matching # Exact matching for 'enrichment' (participated in enrichment or not) # Students that participated can only be matched with other # students that participated and vice versa MAGMA_sim_data_gifted_exact <- MAGMA_exact(Data = MAGMA_sim_data[c(1:100 ), ], group = "gifted_support", dist = "ps_gifted", exact = "enrichment", cores = 1) head(MAGMA_sim_data_gifted_exact) # Conducting three-group matching using the data set 'MAGMA_sim_data' # Matching variable 'teacher_ability_rating' (ability rated from teacher as # below average, average, or above average) # 'MAGMA_sim_data_tar_exact' contains the result of the matching # Exact matching for gender (male or female) # Male students can only be matched to male students, female students can only # be matched to female students # Cores per default = 1 MAGMA_sim_data_tar_exact<- MAGMA_exact(Data = MAGMA_sim_data, group = "teacher_ability_rating", dist = "ps_tar", exact = "gender") head(MAGMA_sim_data_tar_exact) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich_exact' contains the result of the matching # 2x2 matching is equal to four-group matching # Exact matching for for teacher rated ability (ability rated from teacher as # below average, average, or above average) # Below average students can only be matched to other below average rated # students, average rated students can be matched with other average rated # students, and above average rated students can only be matched to other # above average rated students MAGMA_sim_data_gift_enrich_exact <- MAGMA_exact(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", exact = "teacher_ability_rating", cores = 2) head(MAGMA_sim_data_gift_enrich_exact)
# Running this code will take a while # Two-group exact matching using the data set 'MAGMA_sim_data' # Matching variable 'gifted_support' (received giftedness support yes or no) # 'MAGMA_sim_data_gifted_exact' contains the result of the matching # Exact matching for 'enrichment' (participated in enrichment or not) # Students that participated can only be matched with other # students that participated and vice versa MAGMA_sim_data_gifted_exact <- MAGMA_exact(Data = MAGMA_sim_data[c(1:100 ), ], group = "gifted_support", dist = "ps_gifted", exact = "enrichment", cores = 1) head(MAGMA_sim_data_gifted_exact) # Conducting three-group matching using the data set 'MAGMA_sim_data' # Matching variable 'teacher_ability_rating' (ability rated from teacher as # below average, average, or above average) # 'MAGMA_sim_data_tar_exact' contains the result of the matching # Exact matching for gender (male or female) # Male students can only be matched to male students, female students can only # be matched to female students # Cores per default = 1 MAGMA_sim_data_tar_exact<- MAGMA_exact(Data = MAGMA_sim_data, group = "teacher_ability_rating", dist = "ps_tar", exact = "gender") head(MAGMA_sim_data_tar_exact) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich_exact' contains the result of the matching # 2x2 matching is equal to four-group matching # Exact matching for for teacher rated ability (ability rated from teacher as # below average, average, or above average) # Below average students can only be matched to other below average rated # students, average rated students can be matched with other average rated # students, and above average rated students can only be matched to other # above average rated students MAGMA_sim_data_gift_enrich_exact <- MAGMA_exact(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", exact = "teacher_ability_rating", cores = 2) head(MAGMA_sim_data_gift_enrich_exact)
The 'MAGMA.R' simulated data set contains 14 variables of 800 cases. It is used as example in the vignette and the help pages.
MAGMA_sim_data
MAGMA_sim_data
A data frame with 17 variables of 800 cases.
: Individual ID for each case
: Binary variable indicating gender of a participant
: Binary variable that specifies whether a case received giftedness support (1) or not (0)
: Three-step ordinal variable ranging from 1 to 3 indicating increasing eacher rated ability of a case
: Binary variable that indicates whether a case participated in an afternoon enrichment program
: Binary variable that indicates whether at least one parent of a case has an academic background
: Variable ranging from 1 to 6 that indicates a case's high school GPA. Lower values indicate higher achievement
: Variable indicating the normed IQ score of a case
: The scale score of a case in a motivational questionnaire
: variable ranging from 1 to 6 that indicates a case's college GPA. Lower values indicate higher achievement
: Multinomial variable representing the combination of gifted support and enrichment
: Propensity score of twangs mnps function for teacher_ability_rating
: Propensity score of twangs mnps function for support_enrichment
: Propensity score of twangs ps function for gifted support
: Indicates step of MAGMA-matching for gifted support
: Indicates weight of MAGMA-matching for gifted support
: Indicates distance of MAGMA-matching for gifted support
Simulated data
Plots for balance with respect to sample size.
Plot_MAGMA( Balance, criterion = c("Pillai", "d_ratio", "mean_g", "Adj_d_ratio") )
Plot_MAGMA( Balance, criterion = c("Pillai", "d_ratio", "mean_g", "Adj_d_ratio") )
Balance |
A result of Balance_MAGMA. Compare the function
|
criterion |
A character vector specifying for which balance criteria a plot should be created. Default is all criteria. |
This function creates R-Plots using ggplot2 to show the balance trend over sample size.
R Plots showing the balance trend over sample size.
Julian Urban
# This function bases on a MAGMA function as well as Balance_MAGMA # To run examples, copy them into your console or script # Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 150, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") Plot_MAGMA(Balance = Balance_gifted, criterion = "Adj_d_ratio") #Using default to plot all criteria # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") #step created during matching Plot_MAGMA(Balance = Balance_2x2, criterion = c("d_ration", "Adj_d_ratio"))
# This function bases on a MAGMA function as well as Balance_MAGMA # To run examples, copy them into your console or script # Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 150, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") Plot_MAGMA(Balance = Balance_gifted, criterion = "Adj_d_ratio") #Using default to plot all criteria # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") #step created during matching Plot_MAGMA(Balance = Balance_2x2, criterion = c("d_ration", "Adj_d_ratio"))
This function prints an APA Table of the Balance criteria. It displays the balance criteria for four different sample sizes per group. In each scenario, one balance criteria has its optimal value. Thus, the table is a 4x5 table showing the four balance criteria and the respective sample size per group for the four scenarios.
Table_MAGMA(Balance, filename = NULL, verbose = TRUE)
Table_MAGMA(Balance, filename = NULL, verbose = TRUE)
Balance |
A result of |
filename |
Optional argument. A character specifying the filename that the resulting Word document with the table should have. |
verbose |
TRUE or FALSE indicating whether matching information should be printed to the console. |
This function creates an APA Table including the optimal models for each balance criterion, the other criteria for the respective sample size per group as well as the sample size itself. With an optional argument you can save a the APA table in Word.
A 4x5 APA table showing the four balance criteria and the respective sample sizes per group for four scenarios. In each of these scenarios, one balance criteria has its optimal value. It can print a Word Document with this table.
Julian Urban
# This function bases on a MAGMA function as well as Balance_MAGMA # Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") Table_MAGMA(Balance_gifted) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating Balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") #step created during matching Table_MAGMA(Balance_2x2)
# This function bases on a MAGMA function as well as Balance_MAGMA # Defining the names of the metric and binary covariates covariates_vector <- c("GPA_school", "IQ_score", "Motivation", "parents_academic", "gender") # Estimating balance of a two-group matching using the data set # 'MAGMA_sim_data'. # Matching variable 'gifted_support' (received giftedness support yes or no) Balance_gifted <- Balance_MAGMA(Data = MAGMA_sim_data[MAGMA_sim_data$step_gifted < 200, ], group = "gifted_support", covariates = covariates_vector, step = "step_gifted") Table_MAGMA(Balance_gifted) # 2x2 matching using the data set 'MAGMA_sim_data' # Matching variables are 'gifted_support' (received giftedness support yes # or no) and 'enrichment' (participated in enrichment or not) # 'MAGMA_sim_data_gift_enrich' contains the result of the matching # 2x2 matching is equivalent to four-group matching MAGMA_sim_data_gift_enrich <- MAGMA(Data = MAGMA_sim_data, group = c("gifted_support", "enrichment"), dist = "ps_2x2", cores = 2) # Estimating Balance. Covariates same as above Balance_2x2 <- Balance_MAGMA(Data = MAGMA_sim_data_gift_enrich, group = c("gifted_support", "enrichment"), covariates = covariates_vector, step = "step") #step created during matching Table_MAGMA(Balance_2x2)