• Non ci sono risultati.

Execution of MapReduce applications

N/A
N/A
Protected

Academic year: 2021

Condividi "Execution of MapReduce applications"

Copied!
6
0
0

Testo completo

(1)

Execution of MapReduce applications

(2)

The BigData@Polito cluster has

A set of servers running Hadoop

An Access Gateway server used to interact with the Hadoop cluster

▪ Submit jobs/execute MapReduce applications

▪ Submit hdfs commands

▪ Analyze the log files from command line

(3)

Personal workstation/

LABINF PC

“jupyter.polito.it”

server (gateway)

Cluster node 1

Cluster node 2

Cluster node 30

BigData@Polito cluster

(4)

Execute a MapReduce Application on the cluster (i.e., submit a MapReduce job on the cluster)

Log into jupyter

https://jupyter.polito.it

Copy the jar file containing your application from your personal workstation (or the workstation of LABINF) in the “local” file system of the gateway

Drag and Drop from your PC to the jupyter web page

Copy the input data of your application from the “local” file

system of your personal workstation in the HDFS file system of the cluster

Drag and Drop from your PC to the jupyter web page and then use the HDFS command

Or use the HUE web interface

https://bigdatalab.polito.it/hue

(5)

Open a terminal in Jupyter

Use the hadoop command from the opened terminal to submit the job

▪ Specify the name of the jar file, the name of the input (HDFS) data, the name of output folder, the

parameters/arguments of the application

▪ Example

hadoop jar Exercise1-1.0.0.jar

it.polito.bigdata.hadoop.exercise1.DriverBigData 2 ex1_data ex1_out

(6)

Open a terminal in Jupyter

Use the hadoop command from the opened terminal to submit the job

▪ Specify the name of the jar file, the name of the input (HDFS) data, the name of output folder, the

parameters/arguments of the application

▪ Example

hadoop jar Exercise1-1.0.0.jar

it.polito.bigdata.hadoop.exercise1.DriverBigData 2 ex1_data ex1_out

This command execute an application on the cluster

• Exercise1-1.0.0.jar: jar file containing the code of the MapReduce application

• it.polito.bigdata.hadoop.exercise1.DriverBigData: driver class

• This application has three parameters

• Number of instances of the reducer

• Input folder

• Output folder

Riferimenti

Documenti correlati

Ad ogni consegna, da effettuare con pacco sigillato (ovvero con etichetta sigillo comprovante l’integrità del pacco) contenente tutto il materiale ordinato, dovrà essere allegata

There are many ways to elicit this information, but one we have found to be useful is first to ask the patient: “Out of the last 30 days, how many days did you have a head- ache of

They may also, according to some philosophers, expose fallacies in philosophical thinking that go to the heart of human nature by showing how the brain actually makes decisions..

A Palm OS handheld computer that is charged and has been HotSync’d at least once to the desktop (or laptop) computer that you will be using to download the software or a Pocket PC

Ad ogni consegna, da effettuare con pacco sigillato (ovvero con etichetta sigillo comprovante l’integrità del pacco) contenente tutto il materiale ordinato, dovrà essere allegata

Le attività in cui l’intelligenza artificiale e i Workbot in particolare, possono davvero fare la differenza sono quelle di pre-selezione attraverso l’analisi automatizzata

(a) For t = 1 find the unit tangent vector, the unit normal vector, and express the acceleration vector as linear combination of them.. Moreover, the curve passes through

Ad ogni consegna, da effettuare con pacco sigillato (ovvero con etichetta sigillo comprovante l’integrità del pacco) contenente tutto il materiale ordinato, dovrà essere allegata