• Non ci sono risultati.

Information Retrieval 19 December 2018 Name: Surname: Matricola:

N/A
N/A
Protected

Academic year: 2021

Condividi "Information Retrieval 19 December 2018 Name: Surname: Matricola:"

Copied!
1
0
0

Testo completo

(1)

Information Retrieval

19 December 2018

Name: Surname: Matricola:

Ex 1 [points 5+5+3]

 Given a binary array B[1,n] describe the data structure that is built to solve the Rank_1 query taking o(n) bits in addition to the storage of B.

 Describe the solution based on champion lists to efficiently solve AND-queries over q terms and TF-IDF relevance score.

 Define Link Probability and Commonness as they are used by entity annotators, like TagMe.

Ex 2 [points 3+3+3]

Given the words {abaco, adagio, baco, alano}

 construct a 2-gram index,

 Show the algorithm that, based on the overlap distance, selects “good candidates” for the 1-edit distance problem over the two query strings P1=abco, P2=acabi.

Ex 3 [points 4+4]

Given the directed graph G consisting of nodes {A, B, C, D} and edges {(B,A), (A,C), (D,C), (A,B), (C, B), (D,A), (B,D)}.

 Compute two steps of PageRank by assuming uniform starting probability and teleportation step with parameter ¼, hence alpha = ¾ (hint: try to operate only on fractions).

 Compute one step of Personalized PageRank for the node A by assuming starting probability [1, 0, 0, 0] and teleportation step with parameter

½

(hint:

try to operate only on fractions).

Riferimenti

Documenti correlati

Show how to compute via Bloom Filter the intersection between two sets A and B, each one stored on a different machine, and estimate its communication cost (in

 Define what is a Spectral Bloom filter, the operations supported, and show the formula of the error rate (as a function of the space and number of keys) and prove its correctness.

[r]

 Execute one step of the Personalized PageRank algorithm with respect to the node 1, assuming alpha = ½ and uniform

 Define, in formulas only, how it is computed the authoritative score of node A and its hubness score as a function of the authoritative/hubness scores of its adjacent nodes.

 Compute two steps of PageRank by assuming uniform starting probability and parameter ¼ for the teleportation step (hint: try to operate only on fractions)..  Assume now that

Exercise [rank 5] Given the string S=baobab, compute the Suffix Array, and then derive the LCP array by using the linear-time algorithm seen in class, showing the amount of

Exercise [rank 5] Given the string S=abaa, compute its Arithmetic encoding based on empirical frequencies by emitting the final binary compressed string. (hint: please work with