Silvia Metelli

1 Clinical paper (Target: BMJ)
1 Software paper (Target: Journal of Statistical Software)
1 Tutorial paper: NMAstudio tutorial available here

Some slides here: Dynamic Treatment Regimes/Just-in-Time adaptive interventions

Manuscript under preparation (Target: Annals of Applied Statistics)

Abstract:

Networks of interventions can be used to summarise evidence from studies involving multiple treatments pertaining the same health condition, and to compare their effectiveness. Complex interventions are increasingly encountered in networks of studies. These interventions consist of multiple, potentially common and interactive components. In digital health, they tipically involve interventions delivered through mobile devices, such as notifications to support behaviour change. When pooling results from studies involving complex interventions, the main interest does not only revolve around whether the intervention works per se, but rather what components contribute the most to the overall effectiveness and how they interact with each other. However, this is a statistically challenging problem as the effects of interacting components are hard to disentangle and may have complicated causal pathways. Furthermore, complex interventions tend to be very heterogeneous, due to increased variation across patient populations and interventions, which makes a reliable quantitative synthesis more challenging. In this work, we propose a model that considers each intervention component as a potential mediator in the path from treatment to outcome. We frame the mediating pathway as a novel Bayesian latent class model which allows to decompose each component effect into additive effects of distinct mediating paths when there exist multiple mediators (i.e. components here) that are correlated and interacting. A key underlying assumption of the model is to implicitly treat each combination of components as forming ‘their own’ class of interventions, and so the model may naturally incorporate interactions as class-specific parameters and better explain between-study variability. We illustrate our methods using both synthetic data and real-world data in the field of physical activity.

Manuscript under preparation (presented at 43rd ISCB Annual Conference, Aug 2022, Newcastle, UK)

Abstract:

Network meta-analysis (NMA) usually provides estimates of the relative effects with the highest possible precision. However, sparse networks with few available studies and limited direct evidence can arise, threatening the robustness and reliability of NMA estimates. In these cases, the limited amount of available information can hamper the formal evaluation of the underlying NMA assumptions of transitivity and consistency. In addition, NMA estimates from sparse networks are expected to be imprecise and possibly biased as they rely on large sample approximations which are invalid in the absence of sufficient data. We propose a Bayesian framework that allows sharing of information between two networks that pertain to different population subgroups. Specifically, we use the results from a subgroup with a lot of direct evidence (a dense network) to construct informative priors for the relative effects in the target subgroup (a sparse network). This is a two-stage approach where at the first stage we extrapolate the results of the dense network to those expected from the sparse network. This takes place by using a modified hierarchical NMA model where we add a location parameter that shifts the distribution of the relative effects to make them applicable to the target population. At the second stage, these extrapolated results are used as prior information for the sparse network. We illustrate our approach through a motivating example of psychiatric patients. Our approach results in more precise and robust estimates of the relative effects and can adequately inform clinical practice in presence of sparse networks.

Link to paper.

This manuscript received the 2022 ISCB Student Conference Award (awardee: Theodoros Evrenoglou)

Abstract:

In a network meta-analysis, some of the collected studies may deviate markedly from the others, for example having very unusual effect sizes. These deviating studies can be regarded as outlying with respect to the rest of the network and can be influential on the pooled results. Thus, it could be inappropriate to synthesize those studies without further investigation. In this paper, we propose two Bayesian methods to detect outliers in a network meta-analysis via: (a) a mean-shifted outlier model and (b), posterior predictive p-values constructed from ad-hoc discrepancy measures. The former method uses Bayes factors to formally test each study against outliers while the latter provides a score of outlyingness for each study in the network, which allows to numerically quantify the uncertainty associated with being outlier. Furthermore, we present a simple method based on informative priors as part of the network meta-analysis model to down-weight the detected outliers. We conduct extensive simulations to evaluate the effectiveness of the proposed methodology while comparing it to some alternative, available outlier diagnostic tools. Two real networks of interventions are then used to demonstrate our methods in practice.

Link to paper.
Supplementary Material

Abstract:

Monitoring computer network traffic for anomalous behaviour presents an important security challenge. Arrivals of new edges in a network graph represent connections between a client and server pair not previously observed, and in rare cases these might suggest the presence of intruders or malicious implants. We propose a Bayesian model and anomaly detection method for simultaneously characterising existing network structure and modelling likely new edge formation. The method is demonstrated on real computer network authentication data and successfully identifies some machines which are known to be compromised.

Link to paper.
Supplementary Material

Abstract:

Computer networks are complex and the analysis of their structure in search for anomalous behaviour is both a challenging and important task for cyber security. For instance, new edges, i.e. connections from a host or user to a computer that has not been connected to before, provide potentially strong statistical evidence for detecting anomalies. Unusual new edges can sometimes be indicative of both legitimate activity, such as automated update requests permitted by the client, and illegitimate activity, such as denial of service (DoS) attacks to cause service disruption or intruders escalating privileges by traversing through the host network. In both cases, capturing and accumulating evidence of anomalous new edge formation represents an important security application. Computer networks tend to exhibit an underlying cluster structure, where nodes are naturally grouped together based on similar connection patterns. What constitutes anomalous behaviour may strongly differ between clusters, so inferring these peer groups constitutes an important step in modelling the types of new connections a user would make. In this article, we present a two-step Bayesian statistical method aimed at clustering similar users inside the network and simultaneously modelling new edge activity, exploiting both overall-level and cluster-level covariates.

Link to paper.

Abstract:

In multilevel models for binary responses, estimation is computationally challenging due to the need to evaluate intractable integrals. In this paper, we investigate the performance of integrated nested Laplace approximation (INLA), a fast deterministic method for Bayesian inference. In particular, we conduct an extensive simulation study to compare the results obtained with INLA to the results obtained with a traditional stochastic method for Bayesian inference (MCMC Gibbs sampling), and with maximum likelihood through adaptive quadrature. Particular attention is devoted to the case of small number of clusters. The specification of the prior distribution for the cluster variance plays a crucial role and it turns out to be more relevant than the choice of the estimation method. The simulations show that INLA has an excellent performance as it achieves good accuracy (similar to MCMC) with reduced computational times (similar to adaptive quadrature).

Link to paper.

Abstract:

Anomalous connections in a computer network graph can be a signal of malicious behaviours. For instance, a compromised computer node tends to form a large number of new client edges in the network graph, connecting to server IP (Internet Protocol) addresses which have not previously been visited. This behaviour can be caused by malware (malicious software) performing a denial of service (DoS) attack, to cause disruption or further spread malware, alternatively, the rapid formation of new edges by a compromised node can be caused by an intruder seeking to escalate privileges by traversing through the host network. However, study of computer network flow data suggests new edges are also regularly formed by uninfected hosts, and often in bursts. Statistically detecting anomalous formation of new edges requires reliable models of the normal rate of new edges formed by each host. Network traffic data are complex, and so the potential number of variables which might be included in such a statistical model can be large, and without proper treatment this would lead to overfitting of models with poor predictive performance. In this paper, Bayesian variable selection is applied to a logistic regression model for new edge formation for the purpose of selecting the best subset of variables to include.

Link to paper.

Abstract:

Background There is no consensus on the most efficient catheter ablation (CA) strategy for patients with atrial fibrillation (AF). The objective of this study was to compare the efficacy and safety of different CA strategies for AF ablation through network meta-analysis (NMA).
Methods A systematic search of PubMed, Web of Science, and CENTRAL was performed up to October 5th, 2020. Randomized controlled trials (RCT) comparing different CA approaches were included. Efficacy was defined as arrhythmia recurrence after CA and safety as any reported complication related to the procedure during a minimum follow-up time of 6 months.
Results In total, 67 RCTs (n=9871) comparing 19 different CA strategies were included. The risk of recurrence was significantly decreased compared to pulmonary vein isolation (PVI) alone for PVI with renal denervation (RR: 0.60, CI: 0.38–0.94), PVI with ganglia-plexi ablation (RR: 0.62, CI: 0.41–0.94), PVI with additional ablation lines (RR: 0.8, CI: 0.68–0.95) and PVI in combination with bi-atrial modification (RR: 0.32, CI: 0.11–0.88). Strategies including PVI appeared superior to non-PVI strategies such as electrogram-based approaches. No significant differences in safety were observed.
Conclusions This NMA showed that PVI in combination with additional CA strategies, such as autonomic modulation and additional lines, seem to increase the efficacy of PVI alone. These strategies can be considered in treating patients with AF, since, additionally, no differences in safety were observed. This study provides decision-makers with comprehensive and comparative evidence about the efficacy and safety of different CA strategies.

Link to paper.

Abstract:

Objective: Meta-analyses of observational studies are frequently published in the literature, but they are generally considered suboptimal to those involving randomised controlled trials (RCTs) only. This is due to the increased risk of biases that observational studies may entail as well as because of the high heterogeneity that might be present. In this article, we highlight aspects of meta-analyses with observational studies that need more careful consideration in comparison to meta-analyses of RCTs.

Methods: We present an overview of recommendations from the literature with respect to how the different steps of a meta-analysis involving observational studies should be comprehensively conducted. We focus more on issues arising at the step of the quantitative synthesis, in terms of handling heterogeneity and biases. We briefly describe some sophisticated synthesis methods, which may allow for more flexible modelling approaches than common meta-analysis models. We illustrate the issues encountered in the presence of observational studies using an example from mental health, which assesses the risk of myocardial infarction in antipsychotic drug users.

Results: The increased heterogeneity observed among studies challenges the interpretation of the diamond, while the inclusion of short exposure studies may lead to an exaggerated risk for myocardial infarction in this population.

Conclusions: In the presence of observational study designs, prior to synthesis, investigators should carefully consider whether all studies at hand are able to answer the same clinical question. The potential for a quantitative synthesis should be guided through examination of the amount of clinical and methodological heterogeneity and assessment of possible biases.

Link to paper.

Abstract:

Blood Abstract here

Link to report.
Download pdf

PhD Thesis
Some slides here

Development of an optimal allocation strategy for patients in a waiting list for lung transplantation. We develop a method based on Survival Random Forests to estimate counterfactual individualised treatment effects, subsequently used to derive the optimal allocation rule. (First author: M. Slaoui, PostDoc)

Application of multistate transition modelling to predict adverse events and in-hospital mortality for patients with acute kidney injury using the MIMIC III database: an emulated trials approach. (with Estelle Lu, MSc student).

TBA

Research.

Working Papers.

Submitted & Preprints.

Selected Publications.

Statistics.

Multidisciplinary.

Book Chapters.

Editorials.

Reports.

PhD Thesis.

Other Contributed Projects.