Contextual tagger utilizing deterministic finite state transducer
First Claim
1. A computer system for correcting part of speech tags of words of sentences in a text, comprising:
- means for receiving an initially tagged input sentence; and
- a contextual part of speech tagger for correcting part-of-speech tags of the words of said initially tagged input sentence, said tagger including a deterministic finite state transducer for tagging said words in accordance with context and in a single pass.
Abstract
A system for assigning part-of-speech tags to English text includes an improved contextual tagger that uses a deterministic finite-state transducer to improve tagging speed, so that the sentences of large documents can be accurately tagged as to parts of speech to permit fast grammar checking, spell checking, information retrieval, text indexing and optical character recognition. The subject system first acquires a set of rules by examining a training corpus of tagged text. These rules are then transformed into a deterministic finite-state transducer through the use of non-deterministic transducers, a composer and a determiniser. To tag an input sentence, the sentence is initially tagged by assigning each word its most likely part-of-speech tag regardless of the surrounding words in the sentence. The deterministic finite-state transducer is then applied to the resulting sequence of part-of-speech tags, using the surrounding words as context, to obtain the final part-of-speech tags. The time the subject system requires to compute the part-of-speech tags is proportional to the number of words in the input sentence and independent of the number of rules applied.
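The two tagging stages described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the lexicon entries, the tag names, the default tag and the single contextual rule ("change VBN to VBD when the previous tag is NP") are all assumptions chosen for illustration. The transducer here is written by hand, whereas the abstract describes deriving it from learned rules with a composer and a determiniser.

```python
# Hypothetical lexicon mapping each word to its most likely tag.
# The words and tags are illustrative assumptions, not data from the patent.
LEXICON = {
    "Chapman": "NP",
    "killed": "VBN",
    "John": "NP",
    "Lennon": "NP",
}
DEFAULT_TAG = "NN"  # assumed fallback tag for unknown words


def initial_tags(words):
    """Stage 1: give each word its most likely tag, ignoring context."""
    return [LEXICON.get(word, DEFAULT_TAG) for word in words]


# Hand-built deterministic transducer for one assumed rule:
# "change VBN to VBD when the previous tag is NP".
# Each entry maps (state, input tag) to (output tag, next state).
# Pairs not listed copy the input tag and return to START.
START, AFTER_NP = 0, 1
TRANSITIONS = {
    (START, "NP"): ("NP", AFTER_NP),
    (AFTER_NP, "NP"): ("NP", AFTER_NP),
    (AFTER_NP, "VBN"): ("VBD", START),
}


def apply_transducer(tags):
    """Stage 2: correct the tags in one left-to-right pass.

    Each tag costs one dictionary lookup, so the running time grows with
    the sentence length and not with the number of encoded rules.
    """
    state = START
    corrected = []
    for tag in tags:
        out_tag, state = TRANSITIONS.get((state, tag), (tag, START))
        corrected.append(out_tag)
    return corrected


def tag_sentence(words):
    """Run both stages and pair each word with its final tag."""
    return list(zip(words, apply_transducer(initial_tags(words))))


if __name__ == "__main__":
    print(tag_sentence(["Chapman", "killed", "John", "Lennon"]))
```

Because exactly one transition applies for every state and input tag, the pass never backtracks. Encoding more rules would enlarge the transition table but would leave the per-word work at a single lookup.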
267 Citations
9 Claims
Specification