×

ELECTRONIC TABLE OF CONTENTS ENTRY CLASSIFICATION AND LABELING SCHEME

  • US 20090144277A1
  • Filed: 12/03/2007
  • Published: 06/04/2009
  • Est. Priority Date: 12/03/2007
  • Status: Abandoned Application
First Claim
Patent Images

1. One or more computer-storage media having computer-executable instructions embodied thereon that, when executed, perform a method for classifying character strings of a table-of-contents (TOC) portion of an electronic document, the method comprising:

  • receiving textual data extracted from the electronic document, the textual data comprising one or more character strings of the TOC portion of the electronic document;

    extracting semantic information from the textual data of the identified TOC portion;

    executing a classification procedure to determine at least one appropriate classification for the one or more character strings of the TOC portion by analyzing the semantic information;

    appending one or more labels, selected from a predetermined set of TOC-architecture labels, to the one or more character strings according to the at least one appropriate classification; and

    storing the one or more labels in association with the one or more character strings.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×