LANGUAGE ANALYSIS BASED ON WORD-SELECTION, AND LANGUAGE ANALYSIS APPARATUS

US 20160005421A1
Filed: 02/25/2014
Published: 01/07/2016
Est. Priority Date: 02/26/2013
Status: Active Grant

First Claim

Patent Images

1-18. -18. (canceled)

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The invention relates to a method for wording-based speech analysis. In order to provide a method that allows automated analysis of largely arbitrary features of a person from whom a voice file that needs to be analysed comes, the invention detaches itself from the known concept of evaluating static keyword lists for the personality type. The method according to the invention comprises the preparation of a computer system by formation of a reference sample that allows the comparison that is necessary for feature recognition with other persons. The preparation of the computer system involves the recording and storage of a further voice file in addition to the voice files of the reference sample, the analysis of the additionally recorded voice file and the output of the recognized features using at least one output unit connected to the computer system. Furthermore, the invention relates to a speech analysis device for carrying out the method.

29 Citations

View as Search Results

33 Claims

1-18. -18. (canceled)

19. A method for automated language analysis based on word-selection, comprising the steps:
- a) preparing a computer system (1.30) byaa) storing a plurality of reference language files (1.10) in a memory unit (1.20) of the computer system (1.30) in order to form a reference sample (1.40), wherein each reference language file (1.10) comprises a minimum number of 100 words, and each reference language file (1.10) originates from a different person having known characteristics,ab) storing a dictionary file (2.20) containing a multiplicity of different categories (2.10) in a memory unit (1.20) of the computer system (1.30), wherein all the words in the dictionary file (2.20) are classified in at least one of the categories (2.10),ac) making an individual comparison of each reference language file (1.10) in the reference sample (1.40) with the dictionary file (2.20) by calculating the percentage frequency (3.40) of the words in each reference language file (1.10) that are contained in each category (2.10) of the dictionary file (2.20), andad) storing a set of rules (5.40) in a memory unit (1.20) of the computer system (1.30), which set of rules uses statistical and/or algorithmic methods to calculate associations at least between the percentage frequencies (3.40) calculated in step ac) in one or more categories (2.10) and at least one known characteristic (4.20) of the people from whom the reference language files (1.10) originate.b) following preparation of the computer system in accordance with steps aa)-ad), recording and storing a language file (6.10), in addition to the reference language files (1.10) of the reference sample (1.40), in a memory unit (1.20) of the computer system (1.30), wherein each language file (6.10) and each reference language file is one of a text file or an audio file that is converted into a text file by a transcription,c) analyzing the language file (6.10) additionally recorded and stored in step b), byca) making an individual comparison of the language file (6.10) with the dictionary file (2.20) by calculating the percentage frequency (7.30) of the words in the language file (6.10) that are contained in each category (2.10) of the dictionary file (2.20), andcb) using the set of rules (5.40) to process the percentage frequencies (7.30) calculated in step ca), which set of rules uses statistical and/or algorithmic methods to examine the percentage frequencies (7.30) calculated in step ca) for similarities with the percentage frequencies (3.40) calculated in step ac), and classifies the language file (6.10) according to the established similarities, and associates said file with at least one known characteristic belonging to the people from whom the reference language files (1.10) originate,d) creating an output file (8.20), which contains characteristics (4.20) associated with the language file (6.10) in step cb), ande) outputting the output file (8.20),f)fa) expanding the reference sample (1.40) in step aa) by adding as reference language files (1.10), each language file (6.10) recorded in step b),fb) providing a feedback through an input, which allows an evaluation of the correctness of the analysis of step c), andfc) updating and re-saving the set of rules (5.40) taking into account the enlarged database from step ad).
- View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
- - 20. The method as claimed in claim 19, wherein the language file (6.10) is added to the reference sample (1.40) in step fa) only if the language file (6.10) has a specified minimum number of 100 words.
  - 21. The method as claimed in claim 19, further comprising the steps of:
    - ga) at least once during recording of the additional language file (6.10) in step b), buffering a partial file of the language file (6.10) in the memory unit (1.20) of the computer system (1.30),gb) analyzing the buffered partial file bygba) making an individual comparison of the partial file with the dictionary file (2.20) by calculating the percentage frequency (7.30) of the words in the partial file that are contained in each category (2.10) of the dictionary file (2.20),gbb) using the set of rules (5.40) to process the percentage frequencies (7.30) calculated in step gba), which set of rules (5.40) uses statistical and/or algorithmic methods to examine the percentage frequencies (7.30) calculated in step gba) for similarities with the percentage frequencies (3.40) calculated in step ac), and classifies the partial file according to the established similarities and associates said file with at least one known characteristic belonging to the different people from whom the reference language files (1.10) originate,gc) creating an interim output file, which contains characteristics (4.20) associated with the partial file in step gbb), andgd) outputting the interim output file.
  - 22. The method as claimed in claim 21, wherein the output file and/or the interim output file contains personality traits and/or characteristics relating to the psychological state of the person.
  - 23. The method as claimed in claim 19, wherein different dictionary files (2.20) are stored on the computer system (1.30) according to the intended use of the method.
  - 24. The method as claimed in claim 19, wherein a plurality of dictionary files (2.20) with different content are stored on the computer system (1.30) in step ab), which dictionary files can be used selectively.
  - 25. The method as claimed in claim 19, wherein step a) additionally comprises recording and storing at least one additional item of information (2.30) of each reference language file (1.10), and the set of rules (5.40) is designed such that using statistical and/or algorithmic methods it also determines associations between the at least one additional item of information (2.30) and the known characteristics (4.20) of the people from whom the reference language files (1.10) originate,step b) additionally comprises recording and storing the at least one additional item of information (2.30) for each language file (6.10), andstep c) comprises in addition to processing the percentage frequencies calculated in step ca), using the set of rules (5.40) to process the at least one additional item of information (2.30) of each recorded language file (6.10), wherein the set of rules (5.40) uses the statistical and/or algorithmic methods to examine the at least one additional item of information (2.30) of each language file for similarities with this at least one additional item of information (2.30) in the reference language files, and wherein the set of rules (5.40) classifies the language file (6.10), taking into account all the established similarities, and associates said file with the occurrence of at least one known characteristic (4.20) belonging to the different people from whom the reference language files (1.10) originate,wherein the reference language file and the language file (1.10, 6.10) are each transcribed from an audio file, and prosodic information is extracted from the audio files as the additional information and/or morphological and/or syntactic information (2.40, 2.50) is extracted from each reference language file and from each language file (1.10, 6.10) as the additional information.
  - 26. The method as claimed in claim 19, further comprising the steps of:
    - supplying the output file (8.20) containing the language file (6.10) associated characteristics (4.20) of the person to an automatic answering process,generating, by the answering process, a response depending on the characteristics (4.20) associated with the language file (6.10), using standard responses stored in response files (10.10), which contain mappings between the associated characteristics (4.20) of the person and the standard responses, andreproducing, by the electroacoustic transducer (12.10), the response as an audio file.
  - 27. The method as claimed in claim 26, further comprising the step of controlling at least one of the duration (11.30), the frequency (11.20), and the power (11.10) in the step of reproducing by a control module (11.40), whereby the electroacoustic transducer (12.10) converts electrical signals of the control module (11.40) to acoustic signals.
  - 28. A language analysis apparatus for automated language analysis based on word-selection, comprisinga computer system (1.30) having at least one memory unit (1.20),an input unit connected to the computer system (1.30),a program, which is stored in at least one memory unit (1.20), that is designed to execute on the computer system (1.30) computer executable steps for performing the method of claim 19.
  - 29. The language analysis apparatus as claimed in claim 28, wherein the computer system (1.30) comprises a plurality of memory units (1.20), and the language files (1.10, 6.10) and the dictionary file(s) (2.20) are stored in different memory units (1.20).
  - 30. The language analysis apparatus as claimed in claim 28, wherein the input unit comprises a voice recognition system.
  - 31. The language analysis apparatus as claimed in claim 28, further comprising at least one of a printer (9.20), a display unit (9.10), and an electroacoustic transducer (9.40) connected to the computer system in order to output the output file (8.20).
  - 32. The language analysis apparatus as claimed in claim 28, wherein:
    - the program, which is stored in the at least one memory unit (1.20), comprises a response module (10.20) and a control module (11.40),wherein the response module (10.20) includes computer executable steps for automatically creating a response file (10.10) as an audio file for the output file (8.20) based on the characteristics (4.20) of the person in the language file (6.10), the response module (10.20) includes a databank including standard responses as a response file, which contains mappings between the associated characteristics (4.20) of the person and the standard responses,wherein the control module (11.40) includes computer executable steps for controlling the output by an electroacoustic transducer (12.10) of the audio file according to the characteristics (4.20) contained in the output file (8.20), andthe electroacoustic transducer (12.10) for outputting the audio file is connected to the computer system (1.30).
  - 33. The language analysis apparatus as claimed in claim 32, wherein the control module is designed to control at least one of the duration, the frequency, and the power of the audio-file output according to the characteristics (4.20) contained in the output file (8.20), wherein the electroacoustic transducer (12.10) generates an acoustic signal based on an electric signal generated by the control module.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
VIER Precire GmbH
Original Assignee
PRECIRE Technologies GmbH (Hdi Haftpflichtverband Der Deutschen Industrie Versicherungsverein Auf Gegenseitigkeit)
Inventors
GRATZEL, Dirk C, GREB, Christian

Granted Patent

US 9,805,740 B2
Time in Patent Office

Days
Field of Search
US Class Current

1/1
CPC Class Codes

G10L 15/1822   Parsing for meaning underst...

G10L 15/183   using context dependencies,...

G10L 25/51   for comparison or discrimin...

LANGUAGE ANALYSIS BASED ON WORD-SELECTION, AND LANGUAGE ANALYSIS APPARATUS

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

29 Citations

33 Claims

Specification

Solutions

Use Cases

Quick Links

LANGUAGE ANALYSIS BASED ON WORD-SELECTION, AND LANGUAGE ANALYSIS APPARATUS

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

29 Citations

33 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links