Inverted indices are sparse: most of the entries in each inverted list are zero, since a typical term occurs in only a small fraction of the documents. We therefore store only the non-zero values, as a set of (docid, frequency) tuples. To find the set of documents matching a query, we use a variant of the linear merge algorithm. An integral part of the algorithm is a scoring function, which combines the entries for the same document across multiple inverted lists.
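The merge described above can be illustrated with a minimal sketch. It is not the exact variant used here but a plain disjunctive (OR) linear merge, and it rests on several assumptions: each inverted list is sorted by docid, a document matches if it appears in at least one list, and the scoring function simply sums the frequencies. The names `merge_postings` and `sum_score` are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Tuple

Posting = Tuple[int, int]  # (docid, frequency); only non-zero entries are stored


def sum_score(freqs: List[int]) -> float:
    """Placeholder scoring function (an assumption): sum the frequencies."""
    return float(sum(freqs))


def merge_postings(
    lists: List[List[Posting]],
    score: Callable[[List[int]], float] = sum_score,
) -> List[Tuple[int, float]]:
    """Linearly merge docid-sorted inverted lists into (docid, score) pairs.

    Assumes every list is sorted by ascending docid. A document is emitted
    if it occurs in at least one list (OR semantics, an assumption).
    """
    cursors = [0] * len(lists)  # one read position per inverted list
    results: List[Tuple[int, float]] = []
    while True:
        # Docids at the current position of every list that is not exhausted.
        heads = [lst[c][0] for lst, c in zip(lists, cursors) if c < len(lst)]
        if not heads:
            break  # all lists consumed
        current = min(heads)
        # Collect the frequencies for `current` and advance those lists.
        freqs: List[int] = []
        for i, lst in enumerate(lists):
            c = cursors[i]
            if c < len(lst) and lst[c][0] == current:
                freqs.append(lst[c][1])
                cursors[i] = c + 1
        results.append((current, score(freqs)))
    return results
```

Each posting is visited exactly once, so the work grows with the total number of stored tuples rather than with the size of the document collection. A conjunctive (AND) query could be sketched the same way by emitting a document only when `len(freqs) == len(lists)`.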
1 Jun 2024