Contains information about a tokenization process.
uint DocumentIndexOffset
Document index offset, used to identify sequences of tokens in a document.
async Task Tokenize(object Value, TokenizationProcess Process)
Tokenizes an object.
Tokenizes a set of human readable texts.
Grade Supports(Type Type)
How well the tokenizer can tokenize objects of type Type.
HumanReadableTextsTokenizer()
Tokenizes a set of human readable texts.
async Task Tokenize(object Value, TokenizationProcess Process)
Tokenizes an object.
static async Task Tokenize(IEnumerable<HumanReadableText> HumanReadableTexts, Contract Contract, TokenizationProcess Process)
Tokenizes a set of human readable texts.
Class representing human-readable text.
Interface for full-text-search tokenizers.
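
The members listed above suggest a two-step pattern: Supports(Type) reports a Grade for a given type, and Tokenize(object, TokenizationProcess) does the work. The following is a minimal sketch of how a caller might choose between several tokenizers by grade and then invoke the chosen one. Every type in it is a hypothetical stand-in declared locally for illustration. The Grade values, the Tokens list on TokenizationProcess, the WhitespaceStringTokenizer class and the TokenizerSelection helper are assumptions, not the library's actual definitions.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;

// Hypothetical stand-in for the library's Grade type; the value names are assumed.
public enum Grade { NotAtAll, Barely, Ok, Excellent, Perfect }

// Hypothetical stand-in for TokenizationProcess.
public class TokenizationProcess
{
    // Mirrors the documented member; unused in this sketch.
    public uint DocumentIndexOffset;

    // Assumed for illustration only: a simple list collecting emitted tokens.
    public List<string> Tokens = new List<string>();
}

// Hypothetical stand-in for the tokenizer interface, using the two documented signatures.
public interface ITokenizer
{
    Grade Supports(Type Type);
    Task Tokenize(object Value, TokenizationProcess Process);
}

// Illustrative tokenizer: splits strings on spaces and lower-cases each word.
public class WhitespaceStringTokenizer : ITokenizer
{
    public Grade Supports(Type Type)
    {
        return Type == typeof(string) ? Grade.Ok : Grade.NotAtAll;
    }

    public Task Tokenize(object Value, TokenizationProcess Process)
    {
        string text = (string)Value;
        string[] words = text.Split(new[] { ' ' }, StringSplitOptions.RemoveEmptyEntries);

        foreach (string word in words)
            Process.Tokens.Add(word.ToLowerInvariant());

        return Task.CompletedTask;
    }
}

// Illustrative helper: returns the candidate with the highest grade for the type,
// or null when no candidate supports it at all.
public static class TokenizerSelection
{
    public static ITokenizer PickBest(IEnumerable<ITokenizer> Candidates, Type Type)
    {
        ITokenizer best = null;
        Grade bestGrade = Grade.NotAtAll;

        foreach (ITokenizer candidate in Candidates)
        {
            Grade grade = candidate.Supports(Type);
            if (grade > bestGrade)
            {
                best = candidate;
                bestGrade = grade;
            }
        }

        return best;
    }
}
```

In this sketch a caller would pass the runtime type of the value to PickBest, check the result for null, and then await Tokenize on the selected tokenizer with a fresh TokenizationProcess. How the real library discovers and ranks tokenizers is not stated in this listing.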