• Non ci sono risultati.

Batch System as a Service

N/A
N/A
Protected

Academic year: 2022

Condividi "Batch System as a Service"

Copied!
31
0
0

Testo completo

(1)

Batch System as a Service

/////////////////////////////////////////////// Alessandro Italiano - Workshop di CCR sull'Infrastruttura Cloud - Napoli, 12/2014////////////////////////////////////////////////

(2)

The issue

Dynamically instantiate a computing cluster exploitable through the batch jobs submission

————————

- Provide all the main batch system functionalities such as queues, resource management, fairshare priority, policies,…

- Provide resource for batch job execution in a

dynamic way

(3)

BSaaS, first use case

I already have a batch system.

I just need to stretch/shrink computing

capabilities

(4)

We have evaluated two solution - CloudScheduler

- HTCondor rooster

(5)

CloudScheduler, one main limit

One can use a scheduling algorithm in either Condor Job Scheduler or Cloud Scheduler but one

consequence of the current design is that Cloud Scheduler can impact the scheduling decision made

by the Condor Job Scheduler(and vice versa). In the simplest case where Condor use FIFO algorithm it

may be possible that job are not run in expected

order under certain circumstances

(6)

HTCondor Rooster

condor_rooster is an optional daemon that may be added to the condor_master daemon’s

DAEMON_LIST. It is responsible for waking up hibernating machines when their

UNHIBERNATE expression becomes True

(7)

HTCondor Rooster

UNHIBERNATE = TotalIdleJobs > 100

ROOSTER_WAKEUP_CMD = /usr/share/submitEc2.py

#!/usr/bin/env python import boto

….

(8)

HTCondor Rooster, limitations

- VMs instantiate with the same ec2 credentials

- A Idle VM can be shotted down at the most

(9)

BSaaS, second use case

I need a batch facility just for five months

(10)

BSaaS, second use case

Requirements

- batch system service provisioning - batch system configuration

- dynamic computing host provisioning

(11)

BaaS: Meet the requirements

- batch system service provisioning - Heat orchestrates the batch system

resource provisioning

- batch system configuration

- dynamic computing host provisioning

(12)

- batch system service provisioning

- Heat can orchestrate the batch system resource provisioning

- batch system configuration

Puppet manages batch system installation and configuration

- dynamic computing host provisioning

BaaS: Meet the requirements

(13)

Ceilometer Alarming

(14)

Ceilometer Alarming

Ceilometer

(15)

Ceilometer Alarming

Ceilometer

Alarm on a single meter

(16)

Ceilometer Alarming

Ceilometer

Alarm on a single meter

Alarm on a combination of alarms

(17)

Ceilometer Alarming

Ceilometer

Alarm on a single meter

Alarm on a combination of alarms

Instance upload its own meter

(18)

Ceilometer Alarming

Ceilometer

Alarm on a single meter

Alarm on a combination of alarms

Instance upload its own meter

(19)

Ceilometer Alarming

Ceilometer

Alarm on a single meter

Alarm on a combination of alarms

Instance upload its own meter

http callback

(20)

- batch system service provisioning

- Heat can orchestrate the batch system resource provisioning

- batch system configuration

Puppet manages batch system installation and configuration

- dynamic computing host provisioning

BaaS: Meet the requirements

(21)

BaaS: Meet the requirements

BaaS

(22)

BaaS: Meet the requirements

Ceilometer

BaaS

(23)

BaaS: Meet the requirements

Ceilometer

Heat

BaaS

(24)

BaaS: Meet the requirements

Ceilometer

Heat

Puppet

BaaS

(25)

PRISMA: HPC as a

Service

(26)

Perché HPCaaS

• Uno degli use-case (eSeismic) di PRISMA richiede

l’esecuzione di applicazioni che richiedono potenza di calcolo

• E necessitano di brevi (certi) tempi di esecuzione

• Lo use-case di PRISMA può essere schematizzato come un SaaS.

• Eseguire un software ben noto in modo semplice, affidabile, scalabile e ripetitivo.

• L’orchestatore di livello PaaS interagisce con i servizi esterni

via REST API

(27)

Come è implementato

• Partire da oggetti già disponibili per ridurre il costo di sviluppo e di mantenimento:

JST: già usato in BioVeL per attività simili

• Sviluppato e mantenuto da Bari

Celery: per distribuire le task all’interno degli host virtuali per una specifica applicazione di eSiesmic (OpenQuake)

HTCondor: per le applicazioni in cui serve un batch system

Elastiq: per la gestione delle macchine virtuali

(28)

Schema logico

Orchestratore PaaS

JST REST Frontend

JST DB

Elastiq

(29)

Schema logico

Orchestratore PaaS

JST REST Frontend

JST DB Elastiq

JST

Backend

(30)

Schema logico

Orchestratore PaaS

JST REST Frontend

JST DB Elastiq

JST

Backend

(31)

Referenze

Celery:

• http://www.celeryproject.org

• https://github.com/celery/celery/wiki

Elastiq:

• https://github.com/dberzano/elastiq

Riferimenti

Documenti correlati

Sono stati effettuati esperimenti in Batch in fase preliminare per accertarsi dell’efficienza di rimozione dei singoli contaminanti e per lo studio della cinetica di

It is seen that for lower values of T a the induction time becomes larger, while the rate of temperature rise during explosion is still high and does

[r]

Under the assumptions made in the thermal theory of explosion written for a batch reactor model, it is impossible to avoid explosion in a perfectly isolated reactor in which

Under the assumptions made in the thermal theory of explosion written for a batch reactor model, it is possible to avoid explosion in a cooled reactor in which an exothermic

A combustor operates at an equivalence ratio of 0.25 with an air flowrate of 6.5 kg/s. The fuel is methane. the fuel mass flowrate.. 2. the air-fuel ratio at which the engine

A combustor operates at an equivalence ratio of 0.25 with an air flowrate of 6.5 kg/s. The fuel is methane. the fuel mass flowrate.. 2. the air-fuel ratio at which the

A combustor operates at an equivalence ratio of 0.25 with an air flowrate of 6.5 kg/s. The fuel is methane. the fuel mass flowrate.. 2. the air-fuel ratio at which the engine