• Non ci sono risultati.

Self-Organizing Event Space Partitioning for Content-Based Publish/Subscribe Systems ?

N/A
N/A
Protected

Academic year: 2021

Condividi "Self-Organizing Event Space Partitioning for Content-Based Publish/Subscribe Systems ?"

Copied!
2
0
0

Testo completo

(1)

Brief Announcement: Distributed

Self-Organizing Event Space Partitioning for Content-Based Publish/Subscribe Systems ?

Roberto Beraldi, Adriano Cerocchi, Fabio Papale, and Leonardo Querzoni University of Rome “La Sapienza”, Rome, Italy

[beraldi,cerocchi,querzoni]@dis.uniroma1.it

Context and motivations. Publish/subscribe systems have commonly been divided in two large families on the basis of their event-selection model [2]: topic- based and content-based systems. The former trade reduced subscription expres- siveness with simpler implementations and higher performance. Conversely, the latter allow to accurately map published data in a complex event schema on top of which expressive subscriptions can be defined, but incur the cost of more complex implementations that delivers reduced performance on large distributed settings. System developers are thus faced with a choice about which kind of sys- tem is best suited to the target application. A common solution to this dilemma lies in the event space partitioning [4] technique: the event schema is partitioned in a number of subspaces that are then statically mapped to topics. The parti- tioning must be globally known and subscribers are expected to subscribe those topics where subspaces that have a non-empty intersection with their content- based subscriptions have been mapped. Undesired events (false positives) can be filtered out at the receiver side. The event space partitioning granularity strongly affects the performance of such systems: if it is excessively coarse-grained too much resources are wasted to deliver false positives, while if it is too fine-grained the number of topics that will be generated, and that must be managed by the topic based system, could easily become huge. Current solutions [3] provide sub- optimal approximations that are calculated offline and then statically applied to the system.

Contribution. We propose a self-organizing algorithm that builds and dynam- ically adapts at run-time an event space partitioning that eventually provides subscribers with a desired level of performance (i.e. percentage of false posi- tives below a specified threshold) while striving to limit the number of topics that must be managed. The novelty of our solution lies in its ability to work on the basis of run-time performance indices measured by subscribers. Each sub- scriber monitors the ratio between the false positives (an event notified to the subscriber that does not match any of its content-based subscriptions) and the total number of events received for each topic it is subscribed to. If this ratio raises above a predefined global threshold T

F P

the subscriber starts a procedure for partitioning the subspace mapped to that topic. At the end of this proce- dure it updates its topic subscriptions and starts again monitoring performance

?

This work was partially supported by the BLEND and SOFIA European projects

and by the DOTS-LCCI Italian project.

(2)

S1 S2

X Y

/

/X1 /X2 /X3

/X1/Y1 /X1/Y2 /X1/Y3

/X1/Y1/X1 /X1/Y1/X2 /X1/Y1/X3

/X3/Y1 /X3/Y2 /X3/Y3

/X2 /X3/Y3/X3/Y2/X3/Y1 /X1/Y3/X1/Y2/X1/Y1/X1 /X1/Y1/X2 /X1/Y1/X3

e

Fig. 1. A partitioned event space with the corresponding partitioning tree.

statistics. Figure 1 shows an example based on a two dimensions event space with attributes X and Y ; the light dotted lines represent a possible partitioning on the event space where two subscriptions S1 and S2 has been defined; the right side of the figure offers a representation of the corresponding partitioning tree, the data structure hosted in a distributed fashion at the subscriber side to keep track of the current partitioning. Each node in the tree corresponds to a partition (or subspace). The root node corresponds to an initial single parti- tion that matches the whole event space. Each partition can be subdivided in multiple sub-partitions that are represented in the tree as children nodes. Two topics are defined for each node in the tree: a data topic and a control topic. The former is used to diffuse events while the latter is used to diffuse information about the partitioning. The proposed algorithm can be embedded within an ar- chitectural component that provides a content-based publish/subscribe interface and leverages services offered by a plain topic-based system. Simulation-based experiments show that (i) the proposed algorithm converges to a stable event space partitioning in a limited amount of time, (ii) it adapts the partitioning with bounded oscillations even in presence of abrupt changes in the workload, (iii) the obtained partitioning provides the desired level of performance and (iv) the associated cost (average number of subscribed topics per subscriber) is lower than the cost required by a static partitioning as long as the desired performance threshold is kept low (i.e. performance similar to a pure content-based system with no false positives). A detailed description of the algorithm and an extensive experimental evaluation are available in [1].

References

1. Beraldi, R., Cerocchi, A., Papale, F., Querzoni, L.: Distributed self-organizing event space partitioning for content-based publish/subscribe systems. Tech.

rep., Universit` a di Roma “La Sapienza”. http://www.dis.uniroma1.it/~midlab/

publications.php (2011)

2. Eugster, P., Felber, P., Guerraoui, R., Kermarrec, A.M.: The many faces of pub- lish/subscribe. ACM Computing Surveys 35(2), 114–131 (2003)

3. Opyrchal, L., Astley, M., Auerbach, J., Banavar, G., R.Strom, Sturman, D.: Ex- ploiting IP multicast in content-based publish-subscribe systems. In: Proceedings of Middleware 2000, IFIP/ACM International Conference on Distributed Systems Platforms. pp. 185–207 (4-7 April 2000)

4. Wang, Y., Qiu, L., Achlioptas, D., Das, G., Larson, P., Wang, H.J.: Subscription

partitioning and routing in content-based publish/subscribe systems. Tech. rep.,

Microsoft Research (2002)

Riferimenti

Documenti correlati

Il Minervini, come era consono agli studiosi della sua generazione, dopo aver insegnato in sedi lontane come le Università di Messina, Modena, Bari, e poi

Current state-of-the-art content-based systems ([3], [4], [6] and [11]) work over networks of brokers with acyclic and static topologies whose representation is stored into

Experimental results (reported in [1]) show clearly that (i) increasing the asso- ciativity produces a reduction in the TCP hops metric and (ii) the self-organizing algorithm allows

The algorithm can be split in four phases: triggering, tear-up link discovery, teardown link selection, and reconfiguration which includes the content-based routing tables

Results are organized as follows: first we evaluate the application-level routing performance (number of TCP hops per event notification) and the associativity metric of the system

We show how EDLs built via the simple subscription flooding approach (previously adopted in [5]) incur in a contin- uous degradation of accuracy in a dynamic scenario, and, for

DIPARTIMENTO DI FILOLOGIA, LETTERATURA E LINGUISTICA Corso di Laurea Magistrale in Lingua e letteratura italiana. Presidente Prof.ssa

Il deemed dividend approach si basa sulla presunzione secondo la quale gli utili prodotti dalla società controllata estera vengono considerati come distribuiti in capo ai