The Index Structure
The index structure formed after indexing is shown below :
|
Stored |
Index |
Comment |
||
|
boost |
YES |
Indexer |
|
|
|
digest |
YES |
Indexer |
|
|
|
lang |
YES |
language-identifier |
|
|
|
segment |
YES |
Indexer |
|
|
|
tstamp |
YES |
Tokenized |
Indexer |
|
|
anchor |
NO |
Tokenized |
index-basic |
|
|
title |
YES |
Tokenized |
index-basic |
also by index-more |
|
site |
NO |
index-basic |
|
|
|
host |
NO |
Tokenized |
index-basic |
hostname |
|
url |
YES |
Tokenized |
index-basic |
|
|
content |
NO |
Tokenized |
index-basic |
content |
|
lastModified |
YES |
index-more |
|
|
|
date |
NO |
index-more |
|
|
|
contentLength |
YES |
index-more |
|
|
|
type |
NO |
index-more |
contentType,primaryType,subType (all mime-types) |
|
|
primaryType |
YES |
index-more |
primaryType (mime-type) |
|
|
subType |
YES |
index-more |
subType (mime-type) |
|
|
domain |
NO |
Tokenized |
index-domain |
|
|
tld |
YES |
UnTokenized / NotStored(based on conf) |
tld |
|
|
category |
NO |
index-url-category |
||
|
subcollection |
YES |
Tokenized |
subcollection |
see subcollection plugin |
Jira Issues about indexing and IndexingFilterPlugins are
The index plugins to include are :
index-(basic | more | extra | domain | url-category) | tld | subcollection