• Non ci sono risultati.

Information Retrieval

N/A
N/A
Protected

Academic year: 2021

Condividi "Information Retrieval"

Copied!
1
0
0

Testo completo

(1)

Information Retrieval

15 January 2021 – time 45 minutes

Question #1 [rank 5]. Given the two posting lists:

T1 à 1, 2, 3, 5, 6, 7, 8, 10, 15

T2 à 9, 10, 11, 16, 18, 19, 20, 23, 25

Assume that they are divided into logical blocks of three items each, and they have skip pointers to the beginning of blocks. Assume that the last block of each list has a skip pointer of value +infty.

Show the pairs of elements which are compared by the algorithm that computes the intersection of the two lists and exploits the skip pointers. Stop as soon as a first match is found (namely at 10).

Question #2 [rank 5]. You are given the sets A={1, 4, 6, 8}, B = {2, 4, 6, 7, 8}, C = {4, 6, 11}, apply MinHashing technique based on a sketch of size 2 (integers – minima), and the following permutations p1(x) = 2*x mod 13 and p2(y) = 3*y mod 13. Estimate the Jaccard similarity among the three sets.

Question #3 [rank 5+5]. Let us given the dictionary of strings D = {bas, box, bus, cus}.

• Construct a data structure that efficiently solves the 1-error search

• Apply it to search for the pattern P=rox.

Question #4 [rank 5]. Given the four files { aba, aaba, aabb, baaa }, apply the Z-delta algorithm to compute the best compression of this group of files.

Question #5 [rank 3]. Assume that a client is executing the zsync algorithm and stores the file f_old = rane_matte, and receives from the server the hashes h1, h2, h3, h4 that it finds as h1 = att, h2=ran, h4=ane.

Thus the client answers to the server with the bit string 1101 and then it gets <5,3,EOF>.

Show which is the f_new file reconstructed by the client.

Question #6 [rank 2]. Assume that you are given n users, each with a set of movies U_i that (s)he has seen, drawn from a universe U of movies. Design a solution that, given a set of movies S, finds the users which share at least k movies with S. Comment the efficiency and efficacy of the proposed solution.

Riferimenti

Documenti correlati

Attn STUDENT SERVICE AREA UNIVERSITY of ROME “FORO ITALICO”.. HEADOFFICE

This is a contradiction because a positive integer is the sum of two squares if and only if all the prime factors congruent to 3 mod 4 have an even exponent in

Answer: As a basis of Im A, take the first, fourth and fifth columns, so that A has rank 3. Find a basis of the kernel of A and show that it really is a basis... Does the union of

If S is not a linearly independent set of vectors, remove as many vectors as necessary to find a basis of its linear span and write the remaining vectors in S as a linear combination

n “Thin” client containing only the Presentation Logic subsystem (session, text input, dialog, and display management services). n

I giocatori I e II muovono a turno: ogni gio- catore pu prendere, dalla riga (non vuota...) che vuole, qualsiasi numero positivo di fiammiferi. Il giocatore che prende l’ultimo

NOTE: In the solution of a given exercise you must (briefly) explain the line of your argument and show the main points of your calculations1. Solutions without adequate

1.Unconditional cancellation of public external debt payments by all lenders - bilateral, multilateral and private lenders - for all countries in need for at least the next four