Represents a token and a corresponding occurrence count.
Contains information about a tokenization process.
uint DocumentIndexOffset
Document Index Offset. Used to identify sequences of tokens in a document.
Dictionary< string, List< uint > > TokenCounts
Accumulated token counts.
TokenizationProcess()
Contains information about a tokenization process.
TokenCount[] ToArray()
Generates an array of token counts.