• Non ci sono risultati.

Quality assessment and external validation of non-invasive scores for disease staging in non-alcoholic fatty liver disease: an international multicenter study

N/A
N/A
Protected

Academic year: 2021

Condividi "Quality assessment and external validation of non-invasive scores for disease staging in non-alcoholic fatty liver disease: an international multicenter study"

Copied!
1
0
0

Testo completo

(1)

International Biometric Society

QUALITYASSESSMENTANDEXTERNALVALIDATION OFNON-INVASIVE SCORESFORDISEASE STAGINGINNON-ALCOHOLICFATTYLIVERDISEASE: ANINTERNATIONALMULTICENTERSTUDY.

Castiglione A1, Ciccone G1, Musso G2, Baldi I3

1University of Turin and Cancer Epidemiology Unit, "Città della Salute e della Scienza"

Hospital, Torino, Italy,2Gradenigo Hospital of Turin, 3University of Padua

Introduction: Non-alcoholic fatty liver disease (NAFLD) includes a broad range of conditions, from simple steatosis to non-alcoholic steato-hepatitis (NASH), fibrosis and cirrhosis. Because the management of patients varies according to the stage of NAFLD, the correct staging of patients is essential. To avoid liver biopsy, actually the gold standard test for diagnosing NASH and staging fibrosis, several non-invasive scores have been developed. Objective: Study aim was the assessment of the methodological quality of current scores for diagnosing and staging NAFLD and the evaluation of their diagnostic accuracy in an

external validation set.

Methods: Quality of studies was independently assessed by two raters using the QUADAS-2 criteria and performance of scores was validated in a wide external population from 23 different centers (N=3357). Differences in scores performance between development and validation set could be explained both by incorrect regression coefficients effect and by case-mix differences between development and validation sets. To disentangle these two effects, a benchmark approach was used. For each score, in addition to the simple score performance in validation set, we estimated, score performance under the condition that model predictions are statistically correct (case-mix corrected benchmark) and score performance that can be obtained by refitting the model in the validation set (refitted benchmark)

Results: Overall, methodological quality of the 17 original studies was rather low (with high agreement between raters, with Cohen's kappa statistic=0.84). Of the 7 more valid scores, performance in the validation set was poorer (c-statistic from 0.56 to 0.71) than in the development set (c-statistic from 0.82 to 0.90). However, in 3 out of 7 scores, case-mix corrected values were very close to that of the development set. This similarity suggested that most of the worsening in performance could be attributed to case–mix differences between development and validation sets. This result may reflect a different distribution in the validation set of some predictors omitted in the original scores. To a lesser extent, some, decrease in performance could be attributed to differences in coefficient estimates. Only one score for NASH (NICE) and two scores for fibrosis (NAFLD and BAARD scores), reached an higher study quality; at the same time these scores had the case-mix corrected performance closer to the performance in the development set.

Conclusion: Available non-invasive scores for staging NAFLD showed very limited external validity and clinical usefulness. Before application in clinical practice of non-invasive scores to avoid liver biopsy, further studies are necessary in order to find important variables omitted in the original scores, and to quantify of the usefulness of these updated scores, including the latent variables, in decision-making (for biopsy or treatment decisions). Finally, for such improved scores, a further assessment of their impact in terms of health service organization and costs may be advisable.

Riferimenti

Documenti correlati

In this work, a simplified control strategy is presented, where the conventional power-split logics, typical of the above-mentioned strategies, is here replaced with an

Lo mostrano nuovamente i dati numerici (messi opportunamente in rilievo da Quondam): se infatti nella Storia della letteratura italiana il nome di Dante ricorre 474 volte

Or, à y regarder de plus près, ce paradoxe n’est qu’apparent, car le retard avec lequel l’ADF est apparue en Italie fait que, dans ce pays, on a connu quasiment en même temps les

By decreasing the Adige River stage (Scenario RS05, Figure 10B), the water flowing into the aquifer decreases as decreases the recharge fluxes from the rivers to the aquifer.. On

Scopo dell’attacco era sabotare la centrifuga della centra- le, responsabile della produzione di acqua pesante, tramite

Thus, it is crucial to continuously assess the capabilities of deepfake text generators - both current transformer-based text generators (such as GPT-2 [ 13 ], GROVER[ 14 ] and CTRL[

The global cluster environment is related to the clustercentric distance and the distance to the nearest X-ray peaks, whereas the local envi- ronment is connected to the local