METHOD AND SYSTEM FOR ANALYZING TEXT
Abstract
An apparatus for providing a control input signal for an industrial process or technical system having one or more controllable elements includes elements for generating a semantic space for a text corpus, and elements for generating a norm from one or more reference words or texts, the or each reference word or text being associated with a defined respective value on a scale, and the norm being calculated as a reference point or set of reference points in the semantic space for the or each reference word or text with its associated respective scale value. The apparatus further includes elements for reading at least one target word included in the text corpus, elements for predicting a value of a variable associated with the target word based on the semantic space and the norm, and elements for providing the predicted value in a control input signal to the industrial process or technical system. A method for predicting a value of a variable associated with a target word is also disclosed, together with an associated system and computer readable medium.
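To make the idea of a norm more concrete, here is a minimal sketch. It assumes a tiny hand-made two-dimensional semantic space, and it assumes one particular way of forming the norm: a single reference point computed as the scale-weighted sum of the reference words' unit vectors, with the prediction taken as the cosine similarity between the target word and that point. The vectors, the scale values, and the function names `build_norm` and `predict_value` are hypothetical and are not taken from the patent.

```python
import math


def normalize(vector):
    """Scale a vector to unit length."""
    length = math.sqrt(sum(x * x for x in vector))
    return [x / length for x in vector]


def build_norm(space, reference_values):
    """Assumed norm: scale-weighted sum of reference unit vectors, normalized."""
    dims = len(next(iter(space.values())))
    point = [0.0] * dims
    for word, value in reference_values.items():
        unit = normalize(space[word])
        for i in range(dims):
            point[i] += value * unit[i]
    return normalize(point)


def predict_value(space, norm, target):
    """Assumed prediction: cosine similarity between target word and norm."""
    unit = normalize(space[target])
    return sum(a * b for a, b in zip(unit, norm))


# Hypothetical semantic space (word -> coordinates), for illustration only.
space = {
    "good": [0.9, 0.1],
    "bad": [-0.8, 0.2],
    "excellent": [0.95, 0.05],
    "terrible": [-0.9, 0.1],
}

# Reference words with defined values on a scale from -1 to 1 (assumed).
reference_values = {"good": 1.0, "bad": -1.0}

norm = build_norm(space, reference_values)
for target in ("excellent", "terrible"):
    print(target, round(predict_value(space, norm, target), 3))
```

In this toy space a word lying near "good" receives a positive predicted value and a word lying near "bad" a negative one. A norm made of several reference points, which the abstract also allows, would need a different combination rule.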
251 Citations
22 Claims
1. A method for predicting a value of a variable associated with a target word or set of words, performed on at least one computer and comprising the steps of:
- collecting a text corpus comprising a set of words that include the target word,
- generating a representation of the text corpus,
- creating a semantic space for the set of words, based on the representation of the text corpus,
- defining, for a location in the semantic space, a value of the variable,
- estimating, for the target word, a value of the variable, based on the semantic space and the defined variable value of the location in the semantic space, and
- calculating a predicted value of the target word, on basis of the semantic space, the defined variable value of the location in the semantic space and the estimated variable value of the target word.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. An apparatus (300) for providing a control input signal (412) for an industrial process or technical system (400) having one or more controllable elements (421-42n), the apparatus being characterized by
- means (320; 909) for generating a semantic space (892) for a text corpus (302; 908);
- means (330; 914) for generating a norm (896) from one or more reference words or texts (332; 893), the or each reference word or text being associated with a defined respective value on a scale, and the norm being calculated as a reference point or set of reference points in the semantic space for the or each reference word or text with its associated respective scale value;
- means (340) for reading at least one target word;
- means (340) for predicting a value (350) of a variable associated with the target word based on the semantic space and the norm; and
- means (340) for providing the predicted value in a control input signal (412) to said industrial process or technical system (400).
Dependent claims: 11, 12
13. A system for predicting a value of a variable associated with a target word or set of words, comprising at least one computer and configured to:
- collect a text corpus comprising a set of words that include the target word,
- generate a representation of the text corpus,
- create a semantic space for the set of words, based on the representation of the text corpus,
- define, for a location in the semantic space, of a subset of the words, a value of the variable,
- estimate, for the target word, a value of the variable, based on the semantic space and the defined variable value of the location in the semantic space, and
- calculate a predicted value of the target word, on basis of the semantic space, the defined variable value of the location in the semantic space and the estimated variable value of the target word.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21
22. A computer readable medium having stored thereon a computer program having software instructions which when run on a computer cause the computer to perform the steps of:
- collecting a text corpus comprising a set of words that include the target word,
- generating a representation of the text corpus,
- creating a semantic space for the set of words, based on the representation of the text corpus,
- defining, for a location in the semantic space, a value of the variable,
- estimating, for the target word, a value of the variable, based on the semantic space and the defined variable value of the location in the semantic space, and
- calculating a predicted value of the target word, on basis of the semantic space, the defined variable value of the location in the semantic space and the estimated variable value of the target word.
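The steps recited in the independent claims can be sketched end to end on a toy corpus. The sketch below rests on several assumptions that the claims do not state: the representation is a word-by-document count matrix, the raw count vectors serve directly as the semantic space (dimensionality reduction, which such spaces often use, is omitted), and the estimating and calculating steps are collapsed into one similarity-weighted average of the defined values. The corpus, the scale values, and the function names are hypothetical.

```python
import math
from collections import Counter


def build_representation(corpus):
    """Word-by-document counts: each word maps to one count per document."""
    vocab = sorted({word for doc in corpus for word in doc.split()})
    counts = [Counter(doc.split()) for doc in corpus]
    return {word: [float(c[word]) for c in counts] for word in vocab}


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def predict(space, defined_values, target):
    """Similarity-weighted average of the defined values (assumed rule)."""
    weights = {
        word: max(cosine(space[target], space[word]), 0.0)
        for word in defined_values
    }
    total = sum(weights.values())
    if total == 0.0:
        return None  # target shares no context with any reference word
    return sum(weights[w] * defined_values[w] for w in defined_values) / total


# Step 1: collect a (hypothetical) text corpus that contains the target words.
corpus = [
    "sunny warm pleasant day",
    "warm pleasant breeze",
    "cold stormy miserable night",
    "stormy miserable breeze",
]

# Steps 2 and 3: representation, used here unchanged as the semantic space.
space = build_representation(corpus)

# Step 4: define variable values for locations in the space (assumed scale 0..1).
defined_values = {"pleasant": 1.0, "miserable": 0.0}

# Steps 5 and 6, collapsed: estimate a value for each target word.
for target in ("warm", "stormy", "breeze"):
    print(target, predict(space, defined_values, target))
```

In this corpus "breeze" occurs once alongside each reference word, so its estimate lands midway between the two defined values, while "warm" and "stormy" take the value of the single reference word they co-occur with.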
Specification