• Non ci sono risultati.

Assessing cutoff points for biomarker selection in -omics technologies

N/A
N/A
Protected

Academic year: 2021

Condividi "Assessing cutoff points for biomarker selection in -omics technologies"

Copied!
1
0
0

Testo completo

(1)

Assessing cutoff points for biomarker selection in

-omics technologies

Pietro Franceschi1,?, Ron Wehrens1

1.Biostatistics and Data Management,

Edmund Mach Foundation – Research and Innovation Centre Via Edmund Mach 1

38010 San Michele all’Adige (TN) Italy

?

Contact author: pietro.franceschi@iasma.it

Keywords: Metabolomics, Cutoff Values, Biomarker Selection.

Data from modern -omics technologies provide an holistic representation of the state of biologi-cal systems. However, considering the inherent complexity of the datasets, it is necessary to develop filters able to highlight only relevant information to be used for the biological interpretation.

Biomarker identification represents a paradigmatic example of the situation faced in data filter-ing. Biomarkers are variables (metabolites, proteins, genes, ...) which can be used to characterize specific subgroups in the data; in a two-class setting, for example, the biomarkers are those variables that allow discrimination between the classes.

A class tag can be used to distinguish many situations: it can be used to discriminate treated vs. non-treated, to mark different varieties of the same organism, etcetera. Most methods for biomarker identification formulate the problem in a classification setting, selecting as important the variables which give good predictive performance.

In all the biomarker selection strategies, it is necessary to define a cutoff value which identifies the subset of the features to be considered biomarkers, as with the rather arbitrary α level in statistical testing.

The choice of a reasonable and robust cutoff has important practical implications for the de-velopment of a biological model. Biomarkers identification, indeed, can be a long and expensive process so it is of paramount importance to be able of focus only on reliable biomarkers.

In the majority of cases the optimal cutoff point depends on the specific dataset under study, so there is a definitive interest in developing and comparing general strategies able to identify good cutoff points.

In this contribution, several strategies to face the problem will proposed and evaluated using spiked -omics datasets. Among them, particular focus will be on Higher Criticism (Donoho; 2008) and on the recently proposed Stability Based Biomarkers Selection (Wehrens; 2011).

References

D.Donoho, J.Jin (2008). Higher criticism thresholding: Optimal feature selection when useful features are rare and weak. PNAS 105(39), 14790 – 14795.

R. Wehrens, P. Franceschi, U. Vrhovsek, F. Mattivi (2011). Stability-based biomarker selection. Analytica Chimica Acta, doi:10.1016/j.aca.2011.01.039.

Riferimenti

Documenti correlati

CB: clinical benefit; ctDNA: circulating tumor DNA; Ipi: ipilimumab; MDSCSs: myeloid derived suppressor cells; NGS: next-generation sequencing; Nivo: nivolumab; NLR: neutrophil

Doitchinov introduced the notion of supertopological spaces in order to con- struct a unified theory of topological, proximity and uniform spaces, and he proved a certain

In particular, we have studied form factors and correlation func- tions for the SG and broken φ 4 theories on a twisted cylinder, and we have derived the scaling functions of the

Modalità di rilevamento che non richiedono l’uso dei Ground Control Points: la tecnologia Post Processed Kinematic (PPK) consente di raggiungere zone precedentemente non

Conversely, the observed large benefit of laparoscopic surgery plus postoperative pentoxifylline is somewhat unex- pected as, according to a Cochrane review including three RCTs (5),

a. All of these Ans.5. None of these Ans.7.. During the process of combustion ________is given out a. Both heat & light Ans. Ideal fuel has ____ Heating Value a. Heating value

independent of the path for given chemical reaction and reactant mixture composition b.. is highest for constant