• Non ci sono risultati.

Missing data imputation in longitudinal trial of endometrial cancer patients

N/A
N/A
Protected

Academic year: 2021

Condividi "Missing data imputation in longitudinal trial of endometrial cancer patients"

Copied!
2
0
0

Testo completo

(1)

27 July 2021

AperTO - Archivio Istituzionale Open Access dell'Università di Torino

Original Citation:

Missing data imputation in longitudinal trial of endometrial cancer patients

Terms of use:

Open Access

(Article begins on next page)

Anyone can freely access the full text of works made available as "Open Access". Works made available under a Creative Commons license can be used according to the terms and conditions of said license. Use of all other works requires consent of the right holder (author or publisher) if not exempted from copyright protection by the applicable law.

Availability:

This is the author's manuscript

(2)

Missing data imputation in longitudinal trial of endometrial cancer patients

Rosato R, Pagano E, Piovano E, Fuso L, Tripodi E, Mitidieri M, Ceccarelli M, Zola P, Di Cuonzo D

Aims

Missing data is a common problem in health-related quality-of-life studies, especially with longitudinal study design.

Several techniques for the multiple imputation (MI) for missing data have been developed and validated on simulated data.

This study aims to investigate the missing data generating process, define models for MI and validate them on real data.

Methods

Data of all the patients recruited from September 2008 to May 2014 in an ongoing randomized clinical trial between two follow up regimens in endometrial cancer surgically treated patients were used.

Participants were followed since the end of primary treatment with repeated clinical evaluation and quality of life (QoL) assessments (baseline, 6, 12, 24,36,48, 60 months) with the Short Form 12-item survey (SF-12).

QoL missing data did not satisfy the assumption of multivariate normality.

We used two different techniques to impute non-monotonic data: Markov-chain Monte Carlo (MCMC) and fully conditional specification (FCS).

In order to impute missing data using FCS method, we specified a separate regression model for each variable with missing.

Validation of the quality of the process of imputation was performed on data collected from May 2014 to November 2015 for the patients still in follow-up.

For each MI approach, we assessed bias (average difference between estimate and real data) and coverage of the estimate 95% confidence interval for each coefficient estimate to determine which procedure is most appropriate.

Results

In the study period, 822 patients were enrolled.

Using the two different methods of imputation similar results were obtained on all the measurement points.

We found no evidence that MCMC method performed less well, despite the unrealistic assumption of multivariate normal distribution.

Both the methods show a good performance.

For example, the absolute bias for Mental Component varied between 0.03 and 1.6. Performance was slightly reduced in the long term data.

Conclusions

To the best of our knowledge, this study is the first to use real data to validate the two techniques of imputation on longitudinal records with a large number of measurement points and a large

proportion of missing data.

Riferimenti

Documenti correlati

Atomic resolution images (high-angle annular dark-field scanning TEM (HAADF-STEM)) were used to obtain the experimental strain fields of two different pairs of parallel threading

In particular, when data are characterised by a low internal homogeneity (as in the Large businesses case) it is more difficult to build any type of tree. In general,

La Parte Terza, con due Capitoli, dispone un’ampia riflessione sulla metodologia, sulle tecniche di ricerca e sulla teoria sociologica che hanno guidato l’intero studio

The study may show to what extent the proportion of missing data, the missing data mechanism, the characteristics of the scale, and the sample size affect the performance of

The former calculates Köppen’s climate classification, six aridity indices, four continentality indices, Thornthwaite – Mather’s water balance, and OIV’s bio-climatic indices

La sostenibilità ambientale attiene alla necessità di una migliore utilizzazione delle risorse ambientali attualmente disponibili, affinché anche le generazioni future

Cell extracts were obtained either in non-reducing (lanes 1, 2) or in reducing (lanes 3, 4) conditions, by means of 25 mM DTT addition prior to loading. An anti-b actin antibody