Method of and system for processing a text
First Claim
1. A computer-implemented method for generating a summary of a digital text, the method executable on a server, the server coupled to a communication network, the method comprising:
- acquiring by the server, an indication of the digital text to be processed, the digital text comprising a plurality of sentences;
parsing by the server, each of plurality of sentences into one or more concept phrases, each of the one or more concept phrases having at least one word;
the parsing being executed by applying at least one parsing parameter;
executing, by the server, a first analysis to generate a context-independent relation (CIR) value for a given concept phrase of the one or more concept phrases, the CIR value representing a first ratio of a co-inclusion of;
(i) at least one word of the given concept phrase and (ii) at least one word of each of the remaining concept phrases of the one or more concept phrases;
executing, by the server, a second analysis to generate a context-dependent relation (CDR) value for the given concept phrase, the CDR value representing a second ratio of;
(i) a number of sentences where the given concept phrase co-occurs with another concept phrase of the one or more concept phrases to (ii) a total number of the plurality of sentences containing the other concept phrase within the digital text;
determining by the server, a total CIR weight and a total CDR weight for each of the concept phrases;
determining by the server, for each of the concept phrase, a concept meaning value, based at least in part on its respective total CIR weight and the total CDR weight;
determining by the server, for a given sentence of the plurality of sentences, a sentence meaning value, based at least in part of the concept meaning value of each concept phrase contained in the given sentence;
ranking by the server, each sentence based at least on the determined sentence meaning value; and
,generating by the server, the summary of the digital text, the summary of the digital text comprising at least one sentence extracted from the digital text based on its determined ranking.
3 Assignments
0 Petitions
Accused Products
Abstract
There is disclosed a computer-implemented method for generating a summary of a digital text. The method can be executable on a server. The server being coupled to a communication network. Embodiments of the methods disclosed herein generate a summary of the digital text by selecting sentences from the digital text based on a calculated sentence value. The sentence value is calculated by relying on the digital text itself without use of ontology dictionaries. Embodiments of the present method determine the sentence value by firstly breaking the sentence into one or more concept phrases and then determining, for a given sentence of the digital text: (i) a non-contextual value for its concept phrases and (ii) a contextual value for its concept phrases.
17 Citations
20 Claims
-
1. A computer-implemented method for generating a summary of a digital text, the method executable on a server, the server coupled to a communication network, the method comprising:
-
acquiring by the server, an indication of the digital text to be processed, the digital text comprising a plurality of sentences; parsing by the server, each of plurality of sentences into one or more concept phrases, each of the one or more concept phrases having at least one word;
the parsing being executed by applying at least one parsing parameter;executing, by the server, a first analysis to generate a context-independent relation (CIR) value for a given concept phrase of the one or more concept phrases, the CIR value representing a first ratio of a co-inclusion of;
(i) at least one word of the given concept phrase and (ii) at least one word of each of the remaining concept phrases of the one or more concept phrases;executing, by the server, a second analysis to generate a context-dependent relation (CDR) value for the given concept phrase, the CDR value representing a second ratio of;
(i) a number of sentences where the given concept phrase co-occurs with another concept phrase of the one or more concept phrases to (ii) a total number of the plurality of sentences containing the other concept phrase within the digital text;determining by the server, a total CIR weight and a total CDR weight for each of the concept phrases; determining by the server, for each of the concept phrase, a concept meaning value, based at least in part on its respective total CIR weight and the total CDR weight; determining by the server, for a given sentence of the plurality of sentences, a sentence meaning value, based at least in part of the concept meaning value of each concept phrase contained in the given sentence; ranking by the server, each sentence based at least on the determined sentence meaning value; and
,generating by the server, the summary of the digital text, the summary of the digital text comprising at least one sentence extracted from the digital text based on its determined ranking. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-implemented method for generating a summary of a digital text, the method executable on a server, the server coupled to a communication network, the method comprising:
-
acquiring by the server, an indication of the digital text to be processed, the digital text comprising a plurality of sentences; parsing by the server, each of plurality of sentences into one or more concept phrases, each of the one or more concept phrases having at least one word;
the parsing being executed by applying at least one parsing parameter;executing, by the server, a first analysis to generate a context-independent relation (CIR) value for a given concept phrase of the one or more concept phrases, the CIR value representing a first ratio of a co-inclusion of;
(i) at least one word of the given concept phrase and (ii) at least one word of each of the remaining concept phrases of the one or more concept phrases;executing, by the server, a second analysis to generate a context-dependent relation (CDR) value for the given concept phrase, the CDR value representing a second ratio of;
(i) a number of sentences where the given concept phrase co-occurs with another concept phrase of the one or more concept phrases to (ii) a total number of the plurality of sentences containing the other concept phrase within the digital text;determining by the server, a total CIR weight and a total CDR weight for each of the concept phrases; determining by the server, for each of the concept phrase, a concept meaning value, based at least in part on its respective total CIR weight and the total CDR weight; determining by the server, for a given sentence of the plurality of sentences, a sentence meaning value, based at least in part of the concept meaning value of each concept phrase contained in the given sentence; ranking by the server, each sentence based at least on the determined sentence meaning value; and
,assigning, by the server, a topic category to the digital text, the topic category being based on at least one higher ranked concept phrase.
-
-
20. A server comprising:
-
a communication interface for communication with an electronic device via a communication network, a processor operationally connected with the communication interface, the processor configured to; acquire an indication of a digital text to be processed in order to generate a summary thereof, the digital text comprising a plurality of sentences; parse each of plurality of sentences into one or more concept phrases, each of the one or more concept phrases having at least one word by applying at least one parsing parameter; execute a first analysis to generate a context-independent relation (CIR) value for a given concept phrase of the one or more concept phrases, the CIR value representing a first ratio of a co-inclusion of;
(i) at least one word of the given concept phrase and (ii) at least one word of each of the remaining concept phrases of the one or more concept phrases;execute a second analysis to generate a context-dependent relation (CDR) value for the given concept phrase, the CDR value representing a second ratio of;
(i) a number of sentences where the given concept phrase co-occurs with another concept phrase of the one or more concept phrases to (ii) a total number of the plurality of sentences containing the other concept phrase within the digital text;determine a total CIR weight and a total CDR weight for each of the concept phrases; determine, for each of the concept phrase, a concept meaning value, based at least in part on its respective total CIR weight and the total CDR weight; determine, for a given sentence of the plurality of sentences, a sentence meaning value, based at least in part of the concept meaning value of each concept phrase contained in the given sentence; rank each sentence based at least on the determined sentence meaning value; and generate the summary of the digital text, the summary of the digital text comprising at least one sentence extracted from the digital text based on its determined ranking.
-
Specification