• Non ci sono risultati.

Data Base and Data Mining Group of Politecnico di TorinoDBGThe data mining processElena Baralis

N/A
N/A
Protected

Academic year: 2021

Condividi "Data Base and Data Mining Group of Politecnico di TorinoDBGThe data mining processElena Baralis"

Copied!
5
0
0

Testo completo

(1)

Data Base and Data Mining Group of Politecnico di Torino

DBG

The data mining process

Elena Baralis

Politecnico di Torino

(2)

DB M G

2

Non trivial extraction of

implicit

previously unknown

potentially useful

information from available data

Extraction is automatic

performed by appropriate algorithms

Extracted information is represented by means of abstract models

denoted as pattern

Data mining

(3)

DB M G

3

Knowledge Discovery Process

data

selected

data preprocessed

data transformed

data pattern

knowledge

selection

preprocessing

transformation

data mining

interpretation

KDD = Knowledge Discovery from Data

(4)

DB M G

4

Descriptive methods

Extract interpretable models describing data

Example: client segmentation

Predictive methods

Exploit some known variables to predict

unknown or future values of (other) variables

Example: “spam” email detection

Analysis techniques

(5)

DB M G

5

Analysis techniques

Association rule extraction

Classification

Clustering

model

training data

unclassified data classified data

Riferimenti

Documenti correlati

Franco Scarselli Sistemi per basi di dati 2005-2006 15. Strumenti per il

From: Tan,Steinbach, Kumar, Introduction to Data Mining, McGraw Hill 2006.. DB M

 example: intrusion detection in network traffic analysis.

Database and data mining group, Politecnico di Torino.. DataBase an d Data Mining Group of Po litecnico d

Adattato da Golfarelli, Rizzi,”Data warehouse, teoria e pratica della progettazione”, McGraw Hill 2006 Standardizzazione nome: Elena.

Results: After adjustment for age, the contribution of fat oxidation to metabolic energy sources, normalized for fat-free mass, in fasting conditions was significantly correlated

In the amino acids for which we were able to examine both racemization and hydrolysis (aspartic acid/asparagine, glutamic acid/glutamine and alanine), it was clear that hydrolysis

Proximity graphs will continue to play a crucial role in applications such as the calibration problem in microarray analysis, as well as in other clustering