Keywords
Network Analysis; Randomised Trial; Generalised Anxiety Disorder; Adult Psychiatry; Mental Health
Research has suggested that network analysis (NA) can be used to identify important pathology symptoms and inform targeted treatment plans that could lead to more efficacious outcomes in clinical trials. However, unless it can be demonstrated that network models are stable, including when accounting for moderating variables, NA-derived treatment plans may not be appropriate to implement.
We aim to assess the stability and invariance properties of two commonly used anxiety outcome measures to determine the suitability of NA methods to inform treatment plan design in clinical settings.
Individual participant data (IPD) for large multi-trial samples will be accessed via Vivli.org. Exploratory graph analysis will be used to model empirical networks pre- (baseline) and post-treatment (outcome) for the two most commonly used outcome measures in anxiolytic clinical trials, namely the Hamilton Rating Scale for Anxiety (HAM-A) and the anxiety subscale of the Hospital Anxiety and Depression Scale (HADS-A). Bootstrapping and permutation techniques will be used to determine the stability and invariance properties of empirical networks in relation to a range of moderating variables, such as age, sex, treatment type and symptom severity. For networks that are unstable or partially invariant, we will examine item redundancy and remove non-performing items to pursue stable/invariant abbreviated models.
This study will determine the suitability of applying NA methods in clinical trials. Findings could inform the way in which clinical trials, and other such research, are conducted. If outcome measures are stable and invariant, then NA methods will have demonstrable utility to inform more efficacious treatment plans. However, if NA is not found to be suitable, its validity as a robust analytical approach will be questionable.
Symptoms caused by psychopathologies such as anxiety or depression can vary from person to person, and there are a number of different ways these can be assessed. The most common approach is the use of clinician-rated scales or patient-reported outcome measures. These measures allow clinicians and researchers to assess the severity of common symptoms, such as low mood, low energy and insomnia. Symptom severity is typically rated using a response scale, for example from 1 to 5, with 1 indicating mild severity and 5 indicating high severity (Vagias, 2006). A wide array of outcome measures is currently available, such as the Hamilton Anxiety Rating Scale (HAM-A; Hamilton, 1959), Hospital Anxiety and Depression Scale (HADS; Zigmond & Snaith, 1983) and Beck Anxiety Inventory (BAI; Beck et al., 1988a), which measure anxiety, or the Hamilton Rating Scale for Depression (HRSD; Hamilton, 1960), Montgomery-Åsberg Depression Rating Scale (MADRS; Montgomery & Åsberg, 1979) and Beck Depression Inventory (BDI; Beck et al., 1988b), which measure depression.
While there is some overlap in both diagnostic criteria and symptoms assessed between and among anxiety and depression measures, different measures can consist of very different symptoms. For example, one study examined the convergent validity of 52 symptoms across seven commonly used depression measures, including the HRSD, MADRS and BDI (Fried, 2017). It was found that 40% of the symptoms assessed were unique to a single measure. These findings were supported by a recent systematic review of 388 different outcome measures used across 450 depression randomised trials of various treatment types (Veal et al., 2024), which noted that the most commonly used measure (the HRSD) accounted for only 59% of 80 depression domains that matter to patients. Even if high levels of convergent validity are observed, it is still possible that measures that purport to assess depression may in fact assess completely different constructs. Fried (2017) provided a reproducible example in which 40 disparate items with minimal inter-item correlations (r = 0.1) were distributed into two non-overlapping measures of 20 items each, yet the resulting sum scores correlated highly (r = 0.69). This was argued to demonstrate that high convergent validity can be achieved even between measures that consist of minimally related individual items. Not only does this make it difficult to operationalise or ‘reify’ a unitary latent construct of conditions like anxiety or depression (Jones & Robinaugh, 2021), it also becomes difficult to identify what might be the most important and impactful symptoms.
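Fried's arithmetic can be reproduced directly. The following is a minimal sketch in R; the sample size, seed and object names are our own illustrative choices, not part of the original example:

```r
# Two non-overlapping 20-item "measures" drawn from 40 items with uniform
# inter-item correlations of r = 0.1 still yield highly correlated sum scores.
library(MASS)

set.seed(1)
n_items <- 40
sigma <- matrix(0.1, n_items, n_items)  # every inter-item correlation = 0.1
diag(sigma) <- 1

# Simulate 10,000 respondents from a multivariate normal distribution
scores <- mvrnorm(n = 10000, mu = rep(0, n_items), Sigma = sigma)

# Split the items into two disjoint 20-item scales and compute sum scores
scale_a <- rowSums(scores[, 1:20])
scale_b <- rowSums(scores[, 21:40])

cor(scale_a, scale_b)  # approx. 0.69, despite no shared items
```

Analytically, each 20-item sum score has variance 20 + (20 × 19 × 0.1) = 58, and the two sums share covariance 20 × 20 × 0.1 = 40, giving r = 40/58 ≈ 0.69, as Fried reported.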
Network analysis (NA) is a modern psychometric method that can be used to explore complex patterns and interactions in outcome measure data. Network theory eschews common assumptions of psychopathology that psychological problems are caused by disease entities which are independent of their symptoms (i.e. latent constructs; Borsboom, 2017). Rather, NA methods adhere to assumptions that are more broadly accepted in clinical practice, such as the interaction of thoughts, feelings and behaviours, outside of the influence of latent constructs (Jones & Robinaugh, 2021). This is achieved by plotting inter-relationships, or “edges”, among symptoms, or “nodes”, while appreciating that these symptoms can be empirically related, often for unknown reasons (Borsboom, 2017). This allows clinicians and researchers to move away from the need to reify constructs like depression as latent variables and focus on identifying the key, or “central”, symptoms in patient depression networks, both statistically in terms of measurable treatment outcomes and experientially in terms of patient experience of given symptoms (Borsboom & Cramer, 2013; Borsboom, 2017).
In recent years, network analysis has seen a surge in popularity and has been used to help understand pathology and predict different types of outcomes across a wide range of domains. For example, a recent study (Elliott et al., 2020) used NA methods to analyse data from a clinical trial of people with anorexia nervosa (n=142) and found that high-centrality symptoms, including “Feeling Fat” and “Fear of Weight Gain”, were strongly related to prognostic utility (R²=0.52 and 0.55, respectively). Another study examined a clinical sample (n=58) of people with mood and anxiety disorders with the aim of predicting patient dropout (Lutz et al., 2018). Baseline network models were found to be non-invariant between completers and dropouts, meaning the way in which symptoms related to one another differed significantly between the two groups. Using two-tailed p-value tests, the difference in the dropout model was found to be characterised by low centrality of ‘feeling nervous’ and ‘being active’, suggesting these symptoms were less influential within the dropout group’s network. Analyses based on these models correctly identified 47 of 58 patients who subsequently left the study, outperforming any other single predictor, such as sex, and it was argued that inspection of baseline network models could be used to predict dropout in such trials.
Several studies have examined the potential for NA to inform more efficacious treatment plans in antidepressant clinical trials and other research, with promising results (Bringmann et al., 2015; Maciaszek et al., 2023; Park et al., 2021). In one such example (Maciaszek et al., 2023), treatment efficacy was calculated as the percentage change between pre- and post-treatment outcome scores for 88 patients in a clinical trial of the antidepressant duloxetine, as measured using the HRSD (Hamilton, 1960). This treatment efficacy variable was then included in a pre-treatment (baseline) network model of depression symptom scores. NA identified “depressed mood” among the most central symptoms and found that treatment efficacy was most strongly related to this symptom. This suggested that duloxetine may be most efficacious when depression is characterised by high levels of depressed mood. In addition, the UKU Side Effect Rating Scale (UKU SERS; Lindström et al., 2001) was used to monitor adverse outcomes and this was also modelled in the baseline network. Adverse outcomes were directly related to anxiety (which was the most central symptom) and this relationship was stronger than that between treatment efficacy and depressed mood, suggesting that the efficacy of duloxetine may be attenuated by high levels of anxiety, even when depression networks are characterised by depressed mood. These findings point toward the potentially significant implications NA methods could have in augmenting the design of more efficacious treatment plans in clinical trial settings.
There is a notable absence of research examining the implications of NA in relation to anti-anxiety treatment clinical trials, and the research that does exist often draws on community samples and typically examines anxiety as a comorbidity within broader network models (e.g., Fisher et al., 2017; Jin et al., 2022; Levinson et al., 2017; Yohannes et al., 2022). While it is useful and informative to build an understanding of the relationship between anxiety and different comorbidities, a key limitation here is that it is not possible to determine important underlying network modelling assumptions (such as configural or metric invariance) of different anxiety measures (Christensen & Golino, 2021; Jamison et al., 2022). Configural invariance assesses the stability of a network model and exists when the network structure is consistent and sub-groups of nodes, called “communities”, are stable across moderating variables, such as age or sex. When configural invariance holds, it suggests the basic symptom network structure is comparable across groups. Metric invariance reflects the importance or “strength” of different nodes and exists when node strength remains similar across different moderating variables, indicating the influence of particular symptoms is similar across groups. These properties are important because, unless configural and metric invariance can be demonstrated in network models, clinicians and researchers may not be able to utilise NA to inform treatment plan design, as recommended treatments may not be appropriate for all participants or generalisable to the broader population.
This study aims to evaluate the suitability of NA for use in clinical trials by examining configural and metric invariance of commonly used outcome measures of anxiety. Specific aims are: (a) to specify network models for outcome measures frequently used in clinical trials of pharmacotherapies for anxiety, pre- and post-treatment; (b) to assess stability characteristics (i.e. configural invariance) of resulting models; (c) to assess metric invariance of network models in relation to important moderating variables; and (d) to determine optimum outcome measures for use with NA methods in clinical trial settings by comparing stability and invariance indices of different measures.
This study will conduct secondary analyses of existing data from randomised trials of anxiolytics. Individual participant data (IPD) will be accessed via the data repository Vivli.org (2024). We will specifically target treatment trials of generalised anxiety disorder (GAD), as measured using two commonly adopted outcome measures, namely the Hamilton Anxiety Rating Scale (HAM-A; Hamilton, 1959) and the anxiety subscale of the Hospital Anxiety and Depression Scale (HADS-A; Zigmond & Snaith, 1983). Preliminary searches of the Vivli repository indicate that six GAD treatment trials (n=2,334) used both outcome measures, allowing for direct comparison of performance, and may be eligible for inclusion. Data will be collated according to outcome measure, with potential moderating variables representing age, sex, geographic location, treatment type and symptom severity (as well as other such potential modifiers, as may be available in the data). Network models will be estimated pre- (baseline) and post-treatment (outcome). The post-treatment follow-up interval will be 8 weeks, in line with previous similar research and to maximise sample size (Byrne et al., 2025a; Byrne et al., 2025b; Cipriani et al., 2018; Doyle et al., 2023). Invariance at pre- and post-treatment will be assessed at two levels. Configural invariance will be assessed to ensure the structure of the model and constituent communities are stable across moderating variable groups. Metric invariance will be evaluated to determine whether node centrality for each symptom remains similar across moderating variable groups. If these measures prove to be unstable or non-invariant, further analyses will be undertaken to remove underperforming items in an attempt to specify optimal abbreviated network models. The abbreviated models will then be subject to invariance analyses to determine their suitability for use in clinical trial settings.
Ethical approval for this study was awarded by the RCSI University of Medicine and Health Sciences Ethics Committee (Ref: REC202410010).
A data access request will be submitted to Vivli.org to obtain access to analysis-ready data for six already-completed GAD treatment trials (n=2,334). Inclusion/exclusion criteria are specified in Table 1. Each of the six trials for which data have been requested used both the HADS-A and HAM-A as outcome measures.
We will evaluate and report the stability and invariance properties of the 14-item HAM-A and the anxiety subscale of the HADS (HADS-A; totalling seven items) in relation to each moderating variable.
Hamilton Rating Scale for Anxiety. The HAM-A (Hamilton, 1959) is a unidimensional measure of anxiety commonly used in randomised clinical trials (Amsterdam et al., 2009; Bradley et al., 2018; Llorca et al., 2002), and assesses anxiety symptoms such as anxious mood, tension, insomnia, low mood and somatic symptoms. Severity of each symptom is rated on a five-point scale from 0 to 4, with higher values indicating increased symptom severity (Hamilton, 1959).
Hospital Anxiety and Depression Scale. The HADS measures both anxiety and depression and consists of 14 items split into two subscales of seven items each. Items are measured on a four-point Likert scale, with 0 indicating the lowest severity and 3 indicating the highest (Zigmond & Snaith, 1983). Only the seven items of the anxiety sub-scale will be included in the current analyses. Similar to the HAM-A, these items reflect issues such as mood, tension and somatic symptoms (see Table 2 for a list of symptoms assessed by each measure).
Specifying Empirical Network Models. Analyses will be conducted using R v4.2.2 (2024). NA modelling will be conducted for baseline (pre-treatment) and 8-week outcome (post-treatment) data, and stability and invariance analyses will be conducted using the package ‘EGAnet’ (Golino et al., 2024). Graphical lasso (glasso) estimation will be used to calculate networks, which EGAnet conducts via the qgraph package (Epskamp, 2023). A walktrap algorithm will be used for community detection, to identify clusters of nodes that are more closely connected to each other than to the rest of the network, using the igraph package (Csárdi et al., 2024).
Empirical networks will be modelled for baseline and outcome data using the ‘EGA’ function in EGAnet. The walktrap algorithm will be used to identify communities of nodes by implementing random walks between nodes to identify the strongest relationships among sub-groups of nodes in the network. Network loadings will be calculated to represent the between- and within-community strength of each outcome measure node; these are analogous to the factor loadings calculated during factor analysis (Golino et al., 2024; Hallquist et al., 2021). A minimal sketch of this estimation step is given below.
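The following illustrates the planned estimation step, assuming an item-level data frame named hama_baseline containing HAM-A baseline scores (the object name is hypothetical):

```r
library(EGAnet)

# Estimate the empirical network: glasso regularisation with
# walktrap community detection
ega_baseline <- EGA(
  data      = hama_baseline,  # hypothetical item-level data frame
  model     = "glasso",
  algorithm = "walktrap",
  plot.EGA  = TRUE            # plot the network and detected communities
)

# Network loadings: within- and between-community strength of each node,
# analogous to factor loadings
net.loads(ega_baseline)
```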
Stability analyses will be conducted to determine the stability, and thus configural invariance, of empirical networks (Byrne et al., 2025b; Christensen & Golino, 2021; Golino et al., 2024). To assess network stability, the ‘bootEGA’ function will be used to bootstrap the empirical models for 1,000 iterations using glasso estimation and a walktrap algorithm, as per the empirical networks. The resulting models will be used to form a distribution of simulated samples, against which item and dimension stability characteristics of the empirical model will be assessed. Multivariate normality testing will be conducted to determine whether bootstrapping should use a parametric or resampling technique. When the bootstrapped sample distribution is computed, the ‘dimensionStability’ function will then be used to compute stability indices. Item replication scores, which indicate the proportion (percentage) of times each node replicates with a given community, will be inspected to determine node stability across bootstrapped samples. Community replication scores, which are mean replication scores for constituent nodes within a given community, will be examined. Node and community replication will be assessed in relation to a lower threshold of 0.65, below which they are considered to be unstable (Christensen & Golino, 2021). The stability of the specified communities will also be assessed in relation to the frequency with which a given number of communities is identified in the network model during bootstrapping. These analyses will determine if the configuration of the network can be considered stable, thus establishing configural invariance, and will be performed for baseline and outcome models.
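A sketch of the planned stability analysis, continuing from the hypothetical hama_baseline example above; the choice between parametric and resampling bootstrapping will follow the multivariate normality testing just described:

```r
# Bootstrap the empirical network for 1,000 iterations
boot_baseline <- bootEGA(
  data      = hama_baseline,
  iter      = 1000,
  type      = "resampling",   # or "parametric", depending on normality testing
  model     = "glasso",
  algorithm = "walktrap",
  seed      = 2024            # illustrative seed for reproducibility
)

# Stability indices: item replication proportions per node and
# structural consistency of each community
stab <- dimensionStability(boot_baseline)
stab$item.stability         # flag nodes replicating below the 0.65 threshold
stab$dimension.stability    # community-level stability
```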
If a configurally invariant network model is found, metric invariance analyses will then be conducted using the ‘invariance’ function, as per recommendations by Jamison et al. (2022). Centrality scores will be calculated for each node according to each moderating variable group (e.g. for ‘male’ and ‘female’ participants) and the difference in node centrality between groups will be computed. The resulting values are termed the ‘empirical values’. The configurally invariant model will then be permuted for 1,000 samples independently for each group (e.g. ‘male’ and ‘female’) and respective centrality scores will be computed. The difference in centrality scores between groups will be determined, resulting in a null distribution. The empirical values will then be compared with the null distribution using two-tailed p-values to determine which nodes meet the criteria of metric invariance and which are non-invariant.
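As an illustration, a sketch of the metric invariance step for a single moderating variable (sex), where sex is a hypothetical vector of group labels aligned with the rows of hama_baseline; the exact structure of the returned object may vary across EGAnet versions:

```r
# Permutation-based metric invariance test across participant sex
inv_sex <- invariance(
  data   = hama_baseline,
  groups = sex,     # hypothetical grouping vector ("male"/"female")
  iter   = 1000,    # number of permuted samples
  model  = "glasso"
)

# Per-node differences in strength between groups with two-tailed p-values;
# non-significant differences indicate metric invariance for that symptom
summary(inv_sex)
```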
Exploring Revised Network Models. If empirical network models are found to be configurally non-invariant, additional stability analyses will be undertaken to identify revised invariant models. Nodes found to have a stability score lower than the 0.65 threshold will be removed and the network will be modelled again. This will continue until all nodes demonstrate acceptable stability scores (see the sketch below). If network models consist of multiple communities, the stability of these will be assessed in terms of the number of times bootstrapped models result in a given number of dimensions, as per above. Metric invariance will then be assessed for revised configurally invariant models. If partial invariance is detected, additional analyses may be conducted to remove non-invariant items, with the aim of identifying revised models that are both configurally and metrically invariant. There are currently no agreed-upon guidelines for acceptable levels of partial invariance (Jamison et al., 2022), so a threshold of >70% of nodes demonstrating invariance will be adopted as a partial invariance criterion, reflecting the mean threshold for item stability (Christensen & Golino, 2021). If fewer than 70% of nodes are invariant, additional analyses will not be undertaken.
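The iterative pruning could proceed along the following lines (a sketch only; current_data is the hypothetical item-level data frame, and the accessor path for item stability values may differ across EGAnet versions):

```r
# Iteratively drop nodes with item stability below 0.65, re-estimating
# and re-bootstrapping until all remaining nodes are acceptably stable
current_data <- hama_baseline
repeat {
  boot <- bootEGA(current_data, iter = 1000,
                  model = "glasso", algorithm = "walktrap")
  stab <- dimensionStability(boot)
  # Named vector of item replication proportions (accessor may vary by version)
  item_stab <- stab$item.stability$item.stability$empirical.dimensions
  unstable  <- names(item_stab)[item_stab < 0.65]
  if (length(unstable) == 0) break  # all nodes stable: stop pruning
  current_data <- current_data[, setdiff(colnames(current_data), unstable)]
}
```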
Descriptive comparisons of configural and metric invariance outcomes, as well as edge weight and centrality indices, will be conducted between HAM-A and HADS network models to determine which scale (or sub-scale, in the case of the HADS) may be a more stable measure of anxiety symptoms. If the HADS-A network is unstable, further analysis will be considered whereby the full 14-item scale will be modelled to explore if this provides a stable network.
This study will be the first of its kind to assess network configural and metric invariance of the two most commonly adopted outcome measures in anxiolytic clinical trials using a large multi-trial sample. Findings from this study could significantly impact the way in which clinical trials, and other such research, are conducted and analysed in two ways. First, the potential to identify invariant measurement models would open up the utility of NA methods to the design of more efficacious treatment plans by allowing clinicians to identify and target central symptoms. As demonstrated by Lutz et al. (2018), this utility could also be applied to other important outcomes, such as predicting and redressing patient dropout. Conversely, findings indicating that outcome measures are unstable or non-invariant would have adverse implications for the use of NA methods, as any recommended treatment plans based on such analyses may not be appropriate for all patients or generalisable to a broader population.
A key strength of this study is the potential to analyse a large multi-trial sample, which is less susceptible to Type I errors during invariance analyses. The sample consisting of data from multiple trials also broadens the scope of moderating variables that could be assessed, which would strengthen any potential invariance claims (Christensen & Golino, 2021; Jamison et al., 2022). Another strength is the use of the invariance function in EGAnet. Simulation studies have shown that this can outperform other types of metric invariance analyses, particularly with small or unequal samples (Jamison et al., 2022).
This study will have limitations. Analyses will be conducted within a secure research environment provisioned and hosted by Vivli.org, and will therefore be contingent upon the resources available within that environment. The sample will also be limited to trials that are accessible through the Vivli repository. Results obtained may not be generalisable to other outcome measures, or to the use of the HAM-A or HADS in uncontrolled or observational studies. In addition, some studies accessed may have particular sampling characteristics (e.g. age-, sex- or geolocation-specific sampling), which may impact the generalisability of findings. Finally, the HAM-A is a clinician-rated outcome measure, whereas the HADS is patient-rated. This could influence network indices and invariance analyses in relation to moderating variables, and interpretations of outcomes will need to account for these differing methods and perspectives.
Consent was not a requisite for this study, as no data were collected or analysed for this manuscript. Planned analyses will be conducted using secondary data and participant consent for the use of their data in research subsequent to the original study was obtained by respective study sponsors.