• Non ci sono risultati.

Information Retrieval

N/A
N/A
Protected

Academic year: 2021

Condividi "Information Retrieval"

Copied!
1
0
0

Testo completo

(1)

Information Retrieval

5 February 2021 – time 45 minutes

Question #1 [rank 4+3]. Given the list of items S = (1, 1, 3, 2, 2, 4, 2, 1, 3), and two hash functions h1(x) = 2x mod 7 and h2 = 3x mod 7,

• Build a Spectral Bloom Filter on S, using h1 and h2.

• Comment on how to solve Query(2), and whether the answer is correct.

Question #2 [rank 5]. Given the following three adjacency lists, compress them via the Web graph algorithm by choosing always the best previous list to differentially encode the current one. (“Best” = the one that induces the most copies.)

10 à 3, 10, 11, 13, 15, 17 11 à 5, 10, 11, 13, 14

12 à 3, 10, 11, 13, 21, 25, 30

Question #3 [rank 5+5]. Given the following binary strings: A = 01000, A’ = 01001, B = 01010, B’ = 01101, C = 10, C’ = 11.

• Construct a two-level indexing solution in which each disk page contains two strings, and it is compressed via Front Coding, and the strings indexed in internal memory are in a Patricia Trie.

• Show how it is searched the string S = 01011.

Question #4 [rank 6]. You are given the two files:

F_old = “abxxab”, and F_new = “x_abab”,

Assume a block size B=2 chars, and hash function h(c1 c2) = (c1 + c2) mod 7, where we assume that {a, b, _, x} map to the values {1, 2, 3, 4}. Show the execution of the algorithm zsync based on that hash function.

Question #5 [rank 2]. Let us assume that you are given two terms t1 and t2, whose posting lists have length n1 and n2, respectively.

• Write the pseudocode to solve “t1 and (not t2)”. Use NEXT[] as operator to advance on a posing list, and use HEAD[] as operator to return the head of a posting list.

• Students of Informatica-Umanistica can use a wordy description of the algorithm.

Riferimenti

Documenti correlati

Southampton, United Kingdom, (5) Department of Nuclear Physical Chemistry, Institute of Nuclear Physics Polish Academy of Sciences, Krakòw, Poland, (6) Physics Department, University

Anche il Regolamento FIGC, così come il relativo Regolamento FIFA, disciplina l’attività degli Agenti di calciatori autorizzati dalla FIGC, ovvero di coloro che, in

(2018) performed an experimental design involving 26 children with ASD and tested the effectiveness of a multisystem aquatic therapy program employing cognitive-behavioral

Gli autori, partendo dall’assunzione empirica che circa il 25% dei pazienti, che richiedono interventi di psicoterapia, sono soggetti alessitimici e che questa

È interessante rilevare la presenza all’interno di questi funghi di sostanze come flavonoidi e vitamina C, che hanno la caratteristica di essere anti-radicali liberi,

For instance, the genes encoding leptin, thyroid hormone receptor, insulin growth factor-1, growth hormone receptor, are associated with the impacts of heat stress on the

After that, a list of n Node Endpoint Identifiers (EIDs) (Next Hop EIDs, 32 bit) and n time values (Start Times, 32 bit) uniquely identify the sequence of contacts that the bundle