Clustering based personalized web experience

US 20050081139A1
Filed: 10/08/2004
Published: 04/14/2005
Est. Priority Date: 10/10/2003
Status: Abandoned Application

Abstract

One embodiment of the present invention is a method for the customized presentation of one or more document streams. The method involves accepting or determining criteria characterizing information of interest to a user, and processing a stream of documents, wherein each document is tagged with one or more key content terms, and theme data is generated. The stream is filtered based on whether the criteria apply to each document, the documents in the filtered stream are clustered, and the clustered documents (including the theme data) are presented to the user via a visual user interface.

54 Claims

1. A personalization method, comprising:
- forming a personal profile for a user from the output of a first clustering algorithm applied to (1) a plurality of documents viewed by the user, and (2) one or more data streams comprising at least one of:
  
  data entered by the user;
  
  click stream data characterizing a series of web navigation actions by the user; and
  
  purchase data identifying one or more items that have been purchased by the user; and
  
  presenting content to the user as a function of selected data in the personal profile.
- - 2. The method of claim 1, further comprising:
    - providing a software agent on a user's computer; and
      
      capturing data from the plurality of documents and the one or more data streams with the software agent.
  - 3. The method of claim 2, wherein the one or more data streams are collected from communications between the user's computer and one or more remote computers.
  - 4. The method of claim 1, wherein the forming is performed by the user's computer.
  - 5. The method of claim 1, further comprising applying the first clustering algorithm at two or more times to update the personal profile.
  - 6. The method of claim 1, wherein the forming comprises:
    - asking the user a set of questions, receiving answers to the set of questions, and applying the first clustering algorithm to the answers.
  - 7. The method of claim 1, wherein the plurality of documents are electronic articles.
  - 8. The method of claim 1, further comprising filtering electronic documents as a function of selected data in the personal profile.
  - 9. The method of claim 8, wherein the presenting operates on the filtered electronic documents.
  - 10. The method of claim 8, wherein the filtering occurs responsively to a request for electronic documents by the user.
  - 11. The method of claim 8, wherein the filtering comprises searching the Internet for electronic documents as a function of selected data in the personal profile.
  - 12. The method of claim 8, further comprising applying a second clustering algorithm to the filtered electronic documents to produce one or more document clusters.
  - 13. The method of claim 12, wherein the first clustering algorithm and the second clustering algorithm are soft clustering algorithms.
  - 14. The method of claim 12, wherein the content presented is the one or more clusters.

15. A method for the customized presentation of one or more document streams, comprising:
- accepting one or more user-provided criteria;
  
  processing a stream of documents, the processing for each document in the stream including:
  
  tagging the document with one or more key content terms; and
  
  generating theme data for the document;
  
  filtering the stream based on whether the criteria apply to the key content terms for each document;
  
  clustering the filtered stream; and
  
  presenting the clustered stream, including theme data for at least one presented document, to a user via a graphical user interface.
- - 16. The method of claim 15, wherein the accepting and the presenting occur at a first computer and the processing, the filtering and the clustering occur at a second computer.
  - 17. The method of claim 15, wherein the accepting, the presenting, and the processing occur at a first computer and the filtering and the clustering occur at a second computer.
  - 18. The method of claim 15, wherein the documents are electronic articles.
  - 19. The method of claim 15, wherein accepting the user-provided criteria includes:
    - asking the user a set of questions;
      
      receiving answers to the set of questions; and
      
      applying a soft clustering algorithm to the user's answers.
  - 20. The method of claim 15, wherein the clustering includes applying a soft clustering algorithm.
  - 21. The method of claim 20, wherein each document is clustered into one or more document clusters.
  - 22. The method of claim 15, further comprising developing the user-provided criteria, wherein the developing includes applying a clustering algorithm to (1) a plurality of electronic documents viewed by the user, and (2) one or more data streams comprising at least one of:
    - data entered by the user;
      
      click stream data characterizing a series of web navigation actions by the user; and
      
      purchase data identifying one or more items that have been purchased by the user.
  - 23. The method of claim 22, wherein the developing occurs at a user's computer.
  - 24. The method of claim 22, wherein the clustering algorithm is a soft clustering algorithm.
  - 25. The method of claim 22, further comprising:
    - providing a software agent on a user's computer; and
      
      collecting the plurality of electronic documents and the one or more data streams with the software agent.
  - 26. The method of claim 25, wherein the one or more data streams are collected from communications between the user's computer and one or more remote computers.
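
The tagging and filtering steps recited in claim 15 can be illustrated with a minimal Python sketch. It is not taken from the specification: the function names `tag_document` and `filter_stream`, the stopword list, and the choice of "most frequent words" as key content terms are all assumptions made for illustration. Theme generation and clustering are deliberately left out.

```python
from collections import Counter

# Hypothetical stopword list; the claims do not say how key terms are chosen.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}


def tag_document(text, k=3):
    """Return up to k key content terms for one document.

    Assumption: key terms are the most frequent non-stopword tokens,
    with ties broken alphabetically so the result is deterministic.
    """
    words = [w.strip(".,;:!?").lower() for w in text.split()]
    counts = Counter(w for w in words if w and w not in STOPWORDS)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def filter_stream(stream, criteria, k=3):
    """Yield only documents whose key terms match at least one criterion.

    Assumption: a criterion "applies" when it equals one of the key terms.
    """
    wanted = {c.lower() for c in criteria}
    for text in stream:
        terms = tag_document(text, k)
        if wanted & set(terms):
            yield {"text": text, "key_terms": terms}
```

A caller would pass an iterable of document texts and the user-provided criteria, for example `list(filter_stream(docs, ["car"]))`, and hand the surviving documents to whatever clustering step follows.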

27. A method, comprising:
- accessing a plurality of electronic documents;
  
  attaching one or more key terms to each of the electronic documents to represent its content;
  
  creating a personal profile for a user;
  
  filtering the electronic documents as a function of the personal profile and the key terms;
  
  applying a first soft clustering algorithm to the filtered electronic documents to cluster the filtered electronic documents into two or more content-based categories; and
  
  presenting the two or more content-based categories to the user.
- - 28. The method of claim 27 wherein the two or more content-based categories contain substantially the same quantity of the electronic documents.
  - 29. The method of claim 27, further comprising:
    - updating the personal profile two or more times; and
      
      performing the accessing, the attaching, the filtering, the applying, and the presenting, two or more times.
  - 30. The method of claim 27, wherein the creating includes applying a second clustering algorithm to electronic data accessed by the user.
  - 31. The method of claim 30, wherein the second clustering algorithm is a soft clustering algorithm.

32. A clustering method, comprising:
- applying a first clustering algorithm to electronic data accessed by a user to form a user profile;
  
  filtering electronic documents as a function of the user profile to retain a set of user-appropriate electronic documents; and
  
  applying a second clustering algorithm to the set of user-appropriate electronic documents to produce one or more clusters.
- - 33. The method of claim 32, further comprising accessing the one or more clusters.
  - 34. The method of claim 32, wherein the first clustering algorithm and the second clustering algorithm are soft clustering algorithms.
  - 35. The method of claim 32, wherein the first clustering algorithm and the second clustering algorithm are the same clustering algorithm.

36. A system, comprising:
- a client computer, wherein the client computer accesses electronic documents and clusters data from the electronic documents to develop user criteria; and
  
  a remote computer, wherein the remote computer accepts the user criteria, processes a stream of documents, filters the stream of documents based on whether the user criteria apply to each document in the stream, clusters the filtered stream, and presents the clustered stream to the client computer.

37. A system, comprising a processor and a computer-readable medium encoded with programming instructions executable by the processor to:
- access electronic documents;
  
  tag each electronic document with one or more key content terms;
  
  generate theme data for each electronic document;
  
  filter the electronic documents based on whether preference criteria of a user apply to the key content terms of each electronic document;
  
  apply a first clustering algorithm to the electronic documents to produce clusters; and
  
  present the clusters, including theme data, to the user.
- - 38. The system of claim 37, wherein the programming instructions are further executable by the processor to apply a second clustering algorithm to electronic data accessed by the user to create the preference criteria.
  - 39. The system of claim 38, wherein the first clustering algorithm and the second clustering algorithm are the same soft clustering algorithm.

40. A method, comprising:
- a user at a computer accessing a plurality of electronic documents;
  
  the user at the computer generating one or more data streams comprising at least one of:
  
  data entered by the user;
  
  click stream data characterizing a series of web navigation actions by the user; and
  
  purchase data identifying one or more items that have been purchased by the user;
  
  the computer capturing data from the plurality of electronic documents and the one or more data streams with a software agent on the computer; and
  
  the computer displaying clusters of electronic articles, wherein the clusters are generated by applying a first clustering algorithm to filtered electronic articles, wherein the filtered electronic articles are generated by attaching tag data to electronic articles and filtering the electronic articles as a function of the tag data and a set of user criteria.
- - 41. The method of claim 40, further comprising the computer developing the set of user criteria by applying a second clustering algorithm to the captured data.
  - 42. The method of claim 41, wherein the first clustering algorithm and the second clustering algorithm are soft clustering algorithms.
  - 43. The method of claim 40, wherein the computer attaches the tag data to the electronic documents.
  - 44. The method of claim 40, wherein the computer filters the electronic documents.
  - 45. The method of claim 40, wherein the computer applies the first clustering algorithm.

46. An apparatus, comprising one or more processors and a memory encoded with programming instructions executable by the one or more processors to:
- accept one or more user-provided criteria;
  
  process a stream of documents, wherein to process each document in the stream includes:
  
  tagging the document with one or more key content terms; and
  
  generating theme data for the document;
  
  filter the stream based on whether the criteria apply to each document;
  
  cluster the filtered stream; and
  
  present the clustered stream, including the theme data, to the user via a graphical user interface.
- - 47. The apparatus of claim 46, further comprising one or more parts of a computer network carrying one or more signals encoding the programming instructions.
  - 48. The apparatus of claim 46, the programming instructions being further executable by the processor to develop the user-provided criteria, wherein to develop includes:
    - asking the user a set of questions;
      
      receiving answers to the set of questions; and
      
      applying a soft clustering algorithm to the user's answers.
  - 49. The apparatus of claim 46, the programming instructions being further executable by the processor to develop the user-provided criteria, wherein to develop includes applying a clustering algorithm to a plurality of electronic documents viewed by the user, and one or more data streams comprising at least one of:
    - data entered by the user;
      
      click stream data characterizing a series of Web navigation actions by the user; and
      
      purchase data identifying one or more items that have been purchased by the user.

50. A method of clustering a collection of documents, comprising:
- creating an ordered list of w unique words in the collection of electronic documents;
  
  initializing a set P of zero or more prototype vectors, each of a dimension w; and
  
  for each document d in the collection of electronic documents:
  
  a) generating a w-dimensional vector I_d of numbers that each characterize the frequency in d of the word in the corresponding position in the ordered list;
  
  b) for each prototype P_i:
  
  i) determining a degree of membership of document d in P_i; and
  
  ii) if the degree of membership is greater than a predetermined threshold ρ, updating prototype P_i as a function of document d.
- - 51. The method of claim 50, further comprising, after the processing for each document d is complete, selecting a plurality of key words representative of each prototype P_i.
  - 52. The method of claim 50, wherein the updating assigns $\vec{P}_i = \lambda(\vec{I}_d \wedge \vec{P}_i) + (1 - \lambda)\vec{P}_i$ for a predetermined $\lambda$, where $0 \le \lambda \le 1$.
  - 53. The method of claim 50, wherein the determining step for each document I_d and prototype P_i comprises calculating $\lVert \vec{I}_d \wedge \vec{P}_i \rVert$.
  - 54. The method of claim 50, wherein determining the degree of membership of I_d in P_i comprises calculating $\lVert \vec{I}_d \wedge \vec{P}_i \rVert / \lVert \vec{I}_d \rVert$.
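
The loop recited in claims 50, 52, and 54 can be sketched in Python as follows. Several points are assumptions not stated in the claims: the circumflex operator is read as a component-wise minimum (a fuzzy AND), the norm is taken to be the L1 norm, a new prototype is seeded from any document that matches no existing prototype, and the names `soft_cluster`, `membership`, `rho`, and `lam` are hypothetical.

```python
from collections import Counter


def build_vocabulary(docs):
    """Ordered list of the w unique words in the collection."""
    return sorted({word for doc in docs for word in doc.lower().split()})


def frequency_vector(doc, vocab):
    """w-dimensional vector I_d of word frequencies for one document."""
    counts = Counter(doc.lower().split())
    return [float(counts[word]) for word in vocab]


def fuzzy_and(a, b):
    # Assumption: the circumflex operator is a component-wise minimum.
    return [min(x, y) for x, y in zip(a, b)]


def norm1(v):
    # Assumption: the norm in claims 53 and 54 is the L1 norm.
    return sum(abs(x) for x in v)


def membership(i_d, p_i):
    """Degree of membership in the form of claim 54: |I_d ^ P_i| / |I_d|."""
    denom = norm1(i_d)
    return norm1(fuzzy_and(i_d, p_i)) / denom if denom else 0.0


def soft_cluster(docs, rho=0.5, lam=0.5):
    """Single pass over docs; a document may join several prototypes."""
    vocab = build_vocabulary(docs)
    prototypes = []   # the set P, initialized empty
    assignments = []  # per document: indices of prototypes it joined
    for doc in docs:
        i_d = frequency_vector(doc, vocab)
        matched = []
        for idx, p_i in enumerate(prototypes):
            if membership(i_d, p_i) > rho:
                # Update in the form of claim 52:
                # P_i = lam * (I_d ^ P_i) + (1 - lam) * P_i
                overlap = fuzzy_and(i_d, p_i)
                prototypes[idx] = [
                    lam * m + (1 - lam) * x for m, x in zip(overlap, p_i)
                ]
                matched.append(idx)
        if not matched:
            # Assumption (not in the claims): start a new prototype here.
            prototypes.append(list(i_d))
            matched.append(len(prototypes) - 1)
        assignments.append(matched)
    return vocab, prototypes, assignments
```

Because every prototype that clears the threshold is updated, a single document can belong to more than one cluster, which is what makes the scheme "soft". Raising `rho` yields more, tighter prototypes; lowering it merges more documents into fewer ones.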

Current Assignee: Humanizing Technologies, Inc.
Original Assignee: Humanizing Technologies, Inc.
Inventors: Kondadadi, Ravikumar; Witwer, George
Application Number: US10/961,314
Publication Number: US 20050081139A1
US Class Current: 715/234
CPC Class Codes:
G06F 16/34   Browsing; Visualisation the...
G06F 16/35   Clustering; Classification
G06F 16/9535   Search customisation based ...
