• Non ci sono risultati.

Information Retrieval 9

N/A
N/A
Protected

Academic year: 2021

Condividi "Information Retrieval 9"

Copied!
1
0
0

Testo completo

(1)

Information Retrieval

9 June 2014

Ex 1 [ranks 4] Construct the Cartesian Tree for the array A = [4, 2, 6, 8, 1], and specify the property that relates this tree with Range Minimum Queries over A.

Ex 2 [ranks 5+5]Given a posting list describe the use of skip pointers:

 Comment on the best skip-positioning in terms of time access, whenever all items have the same probability to be accessed.

 Comment on the best positioning of skips in terms of time access, whenever each item has its own probability to be accessed.

Ex 3 [points 4+3] Given the following matrix of pair-wise similarity between 5 items

Show the next cluster formed by the agglomerative clustering algorithm based on MIN and MAX similarity-functions, given that we have already formed the following clusters {(I1,I2), (I3,I4), (I5)}

Ex 4 [points 4] Design a cuckoo hash over two tables of size m=7, each, and hash functions h1(k) = k mod m, h2(k) = 3*k mod m. Show the content of the two tables by inserting the sequence of keys S={1, 5, 8, 3, 12, 10, 11, 13, 9} if possible.

Ex 5 [points 2+3] Given a graph of three pages, and directed edges (1,3) (2,3) (3,2).

1. Specify the formula of HITS on this graph, and indicate how can be computed a(3) and h(3) as a function of the other node’s values.

2. Assuming that the initial values for a() and h() are equal to 1, make one step of HITS calculation.

Riferimenti

Documenti correlati

We show that segregation according to information preferences leads to a seg- regation of information: If groups do not interact with each other, within each group, only the

As before, the choice  D 0:5 fits PL better than our 289 models; but, again, we redrew the graphs using the optimal value of  for each 290 turbulence level, and found no

Figure 3 shows more details on the precision of our solution by showing the number of global icebergs that should be detected (referred to as generated) and the number of items

Then we have studied the abundance gradients along the Galactic disk produced by our best cosmological model and their dependence upon several parameters: a threshold in the surface

In particular it linearizes the system in the following way: if the input signal at the gate increases, a larger drain current will flow through the inductor, its voltage drop

Suppose z is a real-valued function defined on a open disc in R 2 , centered at the point p 0 and that all partial derivatives through order two exist in this disc, and

Given the following knapsack problems, derive (if possible) cover inequalities violated by the optimal solution of the

Cartesian reference system belonging to the triangle