• Non ci sono risultati.

Targets of dimethyl fumarate: a novel predictive method based on concepts of network theory and machine learning

N/A
N/A
Protected

Academic year: 2021

Condividi "Targets of dimethyl fumarate: a novel predictive method based on concepts of network theory and machine learning"

Copied!
3
0
0

Testo completo

(1)

21 July 2021

AperTO - Archivio Istituzionale Open Access dell'Università di Torino

Original Citation:

Targets of dimethyl fumarate: a novel predictive method based on concepts of network theory and machine learning

Terms of use:

Open Access

(Article begins on next page)

Anyone can freely access the full text of works made available as "Open Access". Works made available under a

Creative Commons license can be used according to the terms and conditions of said license. Use of all other works

requires consent of the right holder (author or publisher) if not exempted from copyright protection by the applicable law.

Availability:

This is the author's manuscript

(2)

Dimethyl fumarate (DMF)-based drug products have been approved for use as oral treatments for psoriasis and relapsing-remitting multiple sclerosis in recent years. The adduction of DMF to certain cysteine residues in proteins (P), resulting in the appearance of the corresponding dimethyl succinated (DMSu)-P, is thought to underlie the effects of this drug.

Targets of Dimethyl Fumarate: a Novel

Predictive Method Based on Concepts of

Network Theory and Machine Learning

Arianna Carolina Rosa

1

, Elisa Benetti

1

, Jessica Audisio

1

, Gianluca Miglio

1,2

1

Department of Drug Science and Technology and

2

Competence Centre for Scientific

Computing C

3

S, University of Turin. E-mail: gianluca.miglio@unito.it

DMF

P

P

Effects

DMSu-P

1.

Hierarchical clustering

2.

Classification algorithms/models:

Classification and Regression Tree (CART)

Conditional Inference Tree (CIT)

k-Nearest Neighbors (KNN)

Linear Discriminant Analysis (LDA)

Neuronal Network (NNET)

Partial Least Square (PLS)

Random Forrest (RF)

Support Vector Machine (SVM)

Neighborhood compositions

INTRODUCTION

To date, only a few targets for these compounds have been discovered using proteomic methods. This study proposes a novel method by which to consistently classify (class prediction) cysteine sites in proteins in terms of their reactivity towards DMF.

METHODS

32 proteins with defined 3D structure

308 cysteine sites

35 modifiable cysteine (MC) sites

and

273 non-modifiable cysteine (NMC) sites

Literature search

Identification of DMSu-P

1

2

Residue Interaction Network

Dataset of cysteine neighbors

3

4

Data analysis

(3)

RESULTS

Targets of Dimethyl Fumarate: a Novel

Predictive Method Based on Concepts of

Network Theory and Machine Learning

P.279

Arianna Carolina Rosa

1

, Elisa Benetti

1

, Jessica Audisio

1

, Gianluca Miglio

1,2

1

Department of Drug Science and Technology and

2

Competence Centre for Scientific

Computing C

3

S, University of Turin. E-mail: gianluca.miglio@unito.it

A)

B)

A)

B)

C)

D)

Heatmap of neighborhood compositions across all the 380 cysteine sites found in the DMF-sensitive proteins.

Predictive power of neighborhood amino acid composition. Eight algorithms/models (Classification and Regression Tree, CART; Conditional Inference Tree, CIT; k-Nearest Neighbors, KNN; Linear Discriminant Analysis, LDA; Neuronal Network, NNET; Partial Least Square, PLS; Random Forrest, RF; Support Vector Machine, SVM) were tested to analyze data on neighborhood compositions of cysteine site included in the DMF-DS. Accuracy, sensitivity and specificity were computed to quantify algorithm/model performance.

CONCLUSION

Predicted reactivity of explicit examples. Probabilities (P) of a cysteine site found in human probable DNA dCdU-editing enzyme (PDB ID: 3VOW; reactivity toward DMF, panel A), human tyrosyl-tRNA synthetase, cytoplasmic (1NTG; reactivity toward fumarate, panel B) to be a modifiable site was computed by analyzing data on their neighborhood using three algorithms/models and compared to the measured reactivity.

Concepts of network theory, for protein structure analysis, can be combined with machine learning techniques to estimate the propensity of a cysteine residue in protein to be modified by DMF. When adopted together with other approaches for

target identification/validation, this method could provide helpful data by which to find pharmacological targets of DMF-based drug products.

REFERENCES

Blewett MM et al., Sci. Signal. 2016, 9, rs10.

Kuhn M and Johnson K, Applied predictive modelling. Springer-Verlag, New York 2013.

Doncheva NT, et al., Nat. Protoc. 2012, 7, 670.

Hierarchical clustering

1

2

Classification

Model validation

3

This study was supported by funds from the Università degli Studi di Torino, Ricerca Locale Ex 60% 2015 and 2016-2017 to GM, and

Riferimenti

Documenti correlati

The proportion of esca-diseased vines that did not show the visual symptoms of esca (hidden esca) in a vineyard in a given year was expressed as the ratio of the number of

Xeno- fobia, ostilità delle istituzioni poli- tiche, crisi della solidarietà sono espressioni che si rincorrono nel- le testimonianze di Paesi diversi, con una consonanza tale da far

The first one has a structure similar to the one in Figure 2.1, the fuzzing client is used to run the 500 test cases on a WebSocket server to check if the implementation of the

“very irritating this tendency to turn the father in some kind of… ill-grown child that is somehow… compared to the children, often subject to reprobation by a wife who instead

La costituzione fisica del muro lo differenzia dunque dal confine, sul quale in ogni caso esso si inscrive, e ciò comporta la necessità di studiarlo mediante una differente

In this TCGA cohort, we found multiple significant predisposing genes for the European ancestry and seven for the African ancestry, yet lack cancer cohorts with sufficient

Lungi dal poter essere liquidati, come troppo spesso è stato fatto, come un semplice scritto di circostanza, come un tentativo di uniformarsi alla moda corrente per ottenere il

We first discuss a prior summability condition for the posterior to accumulate around the densities in the model closest in the Kullback– Leibler sense to the data generating