• Non ci sono risultati.

Discovering candidates for gene network expansion by variable subsetting and ranking aggregation

N/A
N/A
Protected

Academic year: 2021

Condividi "Discovering candidates for gene network expansion by variable subsetting and ranking aggregation"

Copied!
1
0
0

Testo completo

(1)

Discovering Candidates for Gene Network Expansion by Variable

Subsetting and Ranking Aggregation

Luca Erculiani∗ Francesca Galante∗ Caterina Gallo∗ Francesco Asnicar∗ Luca Masera∗ Paolo Morettin∗ Nadir Sella∗ Thomas Tolio∗ Giulia Malacarne† Kristof Engelen†

Andrea Argentini‡ Valter Cavecchia§ Claudio Moser† Enrico Blanzieri∗

We present a method that produces a list of genes that are candidates for Network Expansion by Sub-setting and Ranking Aggregation (NESRA) and its application to gene regulatory networks. Our group has recently developed gene@home [3], a BOINC project [1] that permits to search for candidate genes for the expansion of a gene regulatory network us-ing gene expression data. The project adopts in-tensive variable-subsetting strategies enabled by the computational power provided by the volunteers who join the project by means of the BOINC client, and exploits the PC algorithm for discovering putative causal relationships within each subset of variables. The PC algorithm, whose name derives from the ini-tials of its authors [7] and PC* [2] are algorithms that discover causal relationships among variables. In particular, PC is based on the systematic testing for conditional independence of variables given subsets of other variables, comprehensively presented and eval-uated by Kalish and colleagues [4] who proposed it also for gene network reconstruction [5]. NESRA is an algorithm which runs as a postprocessor of the gene@home project that has: 1) a procedure that sys-tematically subsets the variables, runs the PC and ranks the genes; the subsetting is iterated several times and a ranked list of candidates is produced by counting the number of times a relationship is found; 2) several ranking steps are executed with different values of the dimension of the subsets and with different number of iterations producing several ranked lists; 3) the ranked lists are aggregated by us-ing a state-of-the-art rankus-ing aggregator. Here we show that a single ranking step is enough to outper-form PC and PC*, but with some dependency on the parameters. Moreover, we show that the output

University of Trento, Italy.

CRI, Fondazione Edmund Mach, San Michele all’Adige, Trento, Italy.

Ghent University and VIB, Ghent, Belgium.

§

CNR-IMEM, Trento, Italy.

ranking aggregation method is better that the aver-age performance of the single ranking steps. Evalua-tions are done by means of the gene@home project on Arabidopsis thaliana including a comparison against ARACNE [6] (Table 1).

Method k=5 k=10 k=20 k=55 NESRA 0.90 0.80 0.60 0.42 ARACNE 0.2 0.3 0.35 0.45 Table 1: A. thaliana, Expansion of the Flower Organ Specification Gene Regulatory Network. NESRA and ARACNE (default parameters) precision for different values k of the length of the gene list.

References

[1] D. P. Anderson. BOINC: A system for public-resource computing and storage. In Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing, GRID ’04, pages 4–10, Washing-ton, DC, USA, 2004. IEEE Computer Society. [2] D. Colombo and M. H. Maathuis.

Order-independent constraint-based causal structure learning. arXiv preprint arXiv:1211.3295, 2012. [3] gene@home. http://gene.disi.unitn.it/

test/, April 2015.

[4] M. Kalisch and P. B¨uhlmann. Estimating high-dimensional directed acyclic graphs with the PC-algorithm. J. Mach. Learn. Res., 8:613–636, May 2007.

[5] M. H. Maathuis et al. Predicting causal effects in large-scale systems from observational data. Nat. Methods, 7(4):247–248, Apr 2010.

[6] A. A. Margolin et al. ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinfor-matics, 7 Suppl 1:S7, 2006.

[7] P. Spirtes and C. Glymour. An algorithm for fast recovery of sparse causal graphs. Social Science Computer Review, 9:62–72, 1991.

Riferimenti

Documenti correlati

Quella di essere stato il mandante dell’assassinio di un certo Santi Termini, formalmente incensurato, del tentato omicidio di Pietro Lepre, “perseguitato allora e in

In vista del fatto che è stata constatata l'impossibilità di tradurre in atto le clausole del Trattato di pace con l'Italia relative al Territorio Libero di Trieste, i Governi del

Lo stesso Salvini, in concomitanza con la vicenda della Mare Jonio, ha firmato una direttiva con cui il Ministero degli Interni chiede, alle forze di polizia, una nuova stretta

A systematic study of the known sources in the web-based TeVCat catalog has been performed to search for possible γ -ray counterparts on the AGILE data collected

A fuoco sono in particolare le emozioni in ambito estetico, dove molte cose si danno per scontate - come ad esempio il fatto che l’arte si muova essenzialmente in ambito emozionale

È morto alle soglie del novantesimo compleanno Michel Butor, uno degli ultimi grandi vecchi della letteratura francese, nello pseudo esilio che si era scelto da parecchi decenni,

If your paper contains figures or text that require permission to reproduce, please confirm that you have obtained all relevant permissions and that the correct permission text

Figure 3. Sample XU-1: a) coarse grained matrix amphibole and talc in dolomite, with calcite as crack fillings and pseudomorphs, and a small rounded relic of matrix quartz; b) quartz