<rmuir> maxDoc() doesnt reflect deletes
<rmuir> docFreq() doesnt reflect deletes
<rmuir> the numDocs() reflects delete
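
To make the remarks above concrete, here is a toy model of the three statistics. It is a sketch, not Lucene code: the class `DeleteStatsSketch` and its fields are hypothetical, and it only assumes that deletes are tracked as a bitset of live documents while postings are left untouched.

```java
import java.util.BitSet;

/** Toy model of an index with deletions; hypothetical, not Lucene code. */
final class DeleteStatsSketch {
    private final int maxDoc;      // number of doc ids ever assigned, deleted or not
    private final BitSet liveDocs; // bit set = document has not been deleted
    private final BitSet postings; // docs that contained the term at index time

    DeleteStatsSketch(int maxDoc, int[] docsWithTerm) {
        this.maxDoc = maxDoc;
        this.liveDocs = new BitSet(maxDoc);
        this.liveDocs.set(0, maxDoc); // every document starts out live
        this.postings = new BitSet(maxDoc);
        for (int doc : docsWithTerm) {
            postings.set(doc);
        }
    }

    /** Marks a document as deleted; the postings are left as they were. */
    void delete(int doc) {
        liveDocs.clear(doc);
    }

    int maxDoc()  { return maxDoc; }                 // ignores deletes
    int numDocs() { return liveDocs.cardinality(); } // counts live documents only
    int docFreq() { return postings.cardinality(); } // ignores deletes

    public static void main(String[] args) {
        DeleteStatsSketch index = new DeleteStatsSketch(5, new int[] {0, 2, 3});
        index.delete(2); // doc 2 contains the term and is now deleted
        System.out.println("maxDoc=" + index.maxDoc()
                + " numDocs=" + index.numDocs()
                + " docFreq=" + index.docFreq());
        // prints "maxDoc=5 numDocs=4 docFreq=3"
    }
}
```

In this model only numDocs() changes after the delete, so a similarity that mixes docFreq() with numDocs() is combining one statistic that includes deleted documents with one that does not.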
sumOfNorms can be used as a "sum of lengths", provided the norm reflects the length (and not 1/sqrt(#tokens), as is the default).
ReaderUtil.getTopLevelContext(context); in MockBM25Similarity.avgDocumentLength()
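
The sumOfNorms remark can be illustrated with a small sketch. It assumes a hypothetical norm that stores the token count itself and compares it with the 1/sqrt(#tokens) form; the class and method names are made up for illustration and are not the MockBM25Similarity code.

```java
import java.util.function.IntToDoubleFunction;

/** Sketch: when does a sum of norms equal a sum of document lengths? */
final class SumOfNormsSketch {
    /** Hypothetical length-preserving norm: the token count itself. */
    static double lengthNorm(int numTokens) {
        return numTokens;
    }

    /** The default-style norm mentioned above: 1/sqrt(#tokens). */
    static double inverseSqrtNorm(int numTokens) {
        return 1.0 / Math.sqrt(numTokens);
    }

    /** Sums the given norm over all document lengths. */
    static double sumOfNorms(int[] docLengths, IntToDoubleFunction norm) {
        double sum = 0.0;
        for (int length : docLengths) {
            sum += norm.applyAsDouble(length);
        }
        return sum;
    }

    /** Average document length, valid only if the norm encodes the length. */
    static double avgDocumentLength(double sumOfNorms, int docCount) {
        return sumOfNorms / docCount;
    }

    public static void main(String[] args) {
        int[] docLengths = {4, 9, 16};
        double lengthSum = sumOfNorms(docLengths, SumOfNormsSketch::lengthNorm);
        double inverseSum = sumOfNorms(docLengths, SumOfNormsSketch::inverseSqrtNorm);
        // With the length-preserving norm the sum is the total token count (29).
        System.out.println(avgDocumentLength(lengthSum, docLengths.length));
        // With 1/sqrt(#tokens) the same division does not give a length.
        System.out.println(avgDocumentLength(inverseSum, docLengths.length));
    }
}
```

The first value is the true average length (29 tokens over 3 documents); the second is an average of inverse square roots and cannot be turned back into an average length without the per-document values.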
Similarity.computeWeight() (soon to be computeStats): we are seek'ed to the term, so statistics should be computed there.
score + boost: I do not consider this a boost, but rather a sum of similarity scores, of which one happens to come from outside (e.g. PageRank).
score * boost
score = tf(boost * freq) * idf
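
A minimal sketch of how the boost formulas noted above differ in effect. The tf function here is assumed to be sqrt(freq) purely for illustration, idf is passed in as a plain number, and all names are hypothetical rather than taken from any Similarity implementation.

```java
/** Sketch comparing three ways of combining a boost with a tf-idf score. */
final class BoostCombinationSketch {
    /** Assumed term-frequency function (square root), for illustration only. */
    static double tf(double freq) {
        return Math.sqrt(freq);
    }

    /** Unboosted score: tf(freq) * idf. */
    static double baseScore(double freq, double idf) {
        return tf(freq) * idf;
    }

    /** score + boost: the boost acts as an independent score that is added. */
    static double additive(double freq, double idf, double boost) {
        return baseScore(freq, idf) + boost;
    }

    /** score * boost: the boost scales the whole score. */
    static double multiplicative(double freq, double idf, double boost) {
        return baseScore(freq, idf) * boost;
    }

    /** score = tf(boost * freq) * idf: the boost is applied inside tf. */
    static double insideTf(double freq, double idf, double boost) {
        return tf(boost * freq) * idf;
    }

    public static void main(String[] args) {
        double freq = 4.0, idf = 2.0, boost = 4.0;
        System.out.println(baseScore(freq, idf));             // 4.0
        System.out.println(additive(freq, idf, boost));       // 8.0
        System.out.println(multiplicative(freq, idf, boost)); // 16.0
        System.out.println(insideTf(freq, idf, boost));       // 8.0
    }
}
```

With a square-root tf, boosting inside tf is the same as multiplying the score by sqrt(boost), so the boost is dampened by whatever saturation the tf function applies. A different tf function would dampen it differently.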