• Non ci sono risultati.

Information Retrieval 29

N/A
N/A
Protected

Academic year: 2021

Condividi "Information Retrieval 29"

Copied!
1
0
0

Testo completo

(1)

Information Retrieval

29 July 2014

Ex 1 [ranks 3+2] Describe the dynamic indexing technique, assuming blocks of power-of-two number of documents, and compute the time complexity of each document insertion, assuming that the document processing takes constant time.

Ex 2 [ranks 4+4] Given a dictionary D of n strings of total length N describe which data-structure solution you’d use to solve the following query on a string-pattern P:

 Find the strings of D which differ from P of one single character.

 Find the strings of D which differ from P of two characters.

For each solution, detail the query algorithm and its time/space complexity.

Ex 3 [points 4+4] Given symbols and probabilities: p(a) = 0.5, p(b)=p(c) = 0.25.

 Compress the string abca via Arithmetic coding

 Describe the iterative algorithm which represents a real number in binary, and comment on its correctness

Ex 4 [points 3+3+3] Define what is a strongly-connected component (SCC) in an directed graph G, how SCCs of G can be computed in RAM, and how they can be computed efficiently semi-externally.

Riferimenti

Documenti correlati

[r]

Ex 1 [ranks 3+3+3] Given the words {abaco, adagio, baco, alano} construct a 2-gram index, and describe the algorithm that solves 1-edit distance over it. Finally test the

[r]

 Discuss its advantages in crawling with respect to the simple scheme that allocates URL to crawlers based on a static url_domain->crawler scheme.  Simulate its execution over

 Describe the algorithm that solves the top-k in auto-completion search, provided that k is not known at preprocessing time and possibly large (hint:. you need an additional

Ex 5 [points 3] Compute the probability of error for a Bloom Filter that uses a bit- array of m bits, k hash functions and indexes a dictionary of

Ex 2 [ranks 3] Given the string S=aaba, compute the length of its Arithmetic encoding without running the entire algorithm, and comment

With the extensive use of electronic content creation and distribution in electronic formats, therefore the question arises naturally whether there is still much need for