Machinese Phrase Tagger
Machinese Phrase Tagger is a set of program components that performs basic linguistic analysis tasks at very high speed and provides relevant information about words and concepts to volume-intensive applications.
Machinese Phrase Tagger splits raw text into understandable word units and provides the possible base forms and classes for words. It also disambiguates i.e. selects the correct form and class for each word that can have more than one interpretation and identifies the head words of a sentence. For example, the word "thought" can be either a form of the noun "thought" or the verb "to think".
Machinese Phrase Tagger contains a custom lexicon mechanism, which enables developers to add their own words to the parser. These words can be, for example, domain-specific vocabularies, multi-word terms, names and places etc. This way developers can influence how the parser analyses texts.