Food description processing methods and apparatuses
First Claim
Patent Images
1. A system comprising:
- a preexisting database of crowd-sourced food records, the crowd-sourced food records including descriptions of associated food items previously entered by a plurality of first users;
one or more processors; and
a food data pre-processing engine, which, when operated by the one or more processors, limits the number of crowd-sourced food records in the database by causing the system to;
receive a description of a food item entered by a second user;
compare the user entered description to the preexisting database, the comparison comprising;
division of the user entered description into a plurality of component words;
elimination of unnecessary ones of the plurality of component words;
standardization of remaining ones of the plurality of component words of the user entered description via removal of special characters or features therefrom; and
determination of a single distance between the standardized remaining ones of the plurality of component words and respective component words of a description field for each of the food items associated with the food records in the preexisting database;
when the distance indicates that the user entered description comprises a distinct food item having a description that is outside a threshold of similarity to an existing food record, store the standardized description of the food item in the preexisting database, wherein storage of the standardized description includes addition of the standardized description as a new food record to a cluster associated with the existing food record, and wherein the addition of the standardized description as a new food record to the cluster comprises (i) determination of a score associated with the standardized description, wherein the score is calculated based at least in part on a number of times the food record has been logged or a number of times the food record appeared in recent searches of the database, (ii) storage of the standardized description as a new food record of the cluster, and (iii) designation of the new food record as a new candidate record representative of the cluster based on a result of the determination of the score; and
when the distance indicates that the user entered description of the food item is within a threshold of similarity to an existing food record in the preexisting database, merge the standardized description with the existing food record.
9 Assignments
0 Petitions
Accused Products
Abstract
Disclosed embodiments include apparatuses, methods and storage media associated with standardization or normalization of a food description. In embodiments, an apparatus may include one or more processors and a food data pre-processing engine. The food data processing engine may, when operated by the one or more processors, cause the apparatus to receive a description of a food item, process the description to generate a standardized or normalized description for the food item, and store the standardized or normalized description in a food record in a food data storage arrangement of a fitness management system. Other embodiments may be described and claimed.
-
Citations
25 Claims
-
1. A system comprising:
-
a preexisting database of crowd-sourced food records, the crowd-sourced food records including descriptions of associated food items previously entered by a plurality of first users; one or more processors; and a food data pre-processing engine, which, when operated by the one or more processors, limits the number of crowd-sourced food records in the database by causing the system to; receive a description of a food item entered by a second user; compare the user entered description to the preexisting database, the comparison comprising; division of the user entered description into a plurality of component words; elimination of unnecessary ones of the plurality of component words; standardization of remaining ones of the plurality of component words of the user entered description via removal of special characters or features therefrom; and determination of a single distance between the standardized remaining ones of the plurality of component words and respective component words of a description field for each of the food items associated with the food records in the preexisting database; when the distance indicates that the user entered description comprises a distinct food item having a description that is outside a threshold of similarity to an existing food record, store the standardized description of the food item in the preexisting database, wherein storage of the standardized description includes addition of the standardized description as a new food record to a cluster associated with the existing food record, and wherein the addition of the standardized description as a new food record to the cluster comprises (i) determination of a score associated with the standardized description, wherein the score is calculated based at least in part on a number of times the food record has been logged or a number of times the food record appeared in recent searches of the database, (ii) storage of the standardized description as a new food record of the cluster, and (iii) designation of the new food record as a new candidate record representative of the cluster based on a result of the determination of the score; and when the distance indicates that the user entered description of the food item is within a threshold of similarity to an existing food record in the preexisting database, merge the standardized description with the existing food record. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of limiting a number of records in a preexisting database of crowd-sourced food records, the method comprising:
-
receiving, at a food data pre-processing engine of a fitness management system, a description of a food item entered by a user at a user device; processing, by the food data pre-processing engine, the description, wherein the processing includes comparing the user entered description to the preexisting database of crowd-sourced food records, the crowd-sourced food records including descriptions of food items previously entered by a plurality of first users, wherein the comparing includes; dividing the user entered description into a plurality of component words, eliminating unnecessary ones of the plurality of component words, standardizing remaining ones of the plurality of component words of the user entered description via removal of special characters or features therefrom, and determining a single distance between the standardized remaining ones of the plurality of component words of the user entered description and respective component words of food items in the database; determining, at the food data pre-processing engine, whether a previous record for the food item entered by the user exists within a food data storage arrangement of the fitness management system based on the distance, wherein the determination of whether a previous record for the food item entered by the user exists comprises determining whether the description of the food item is above a threshold of similarity to an existing food record; when it is determined that a previous record for the food item exists, using the description of the food item entered by the user to create a new record to be stored in a cluster of similar food records having descriptions for the same food item in the food data storage arrangement of the fitness management system, and storing the new record in the cluster of similar food records, wherein the act of storing comprises associating a score with the standardized description and subsequently designating the new record as a new candidate record of the cluster based on the score being the greatest score within the cluster, wherein the score is calculated based at least in part on a number of times the food record has been logged or a number of times the food record appeared in recent searches of the database; and when it is determined that a previous record for the food item does not exist, using the description of the food item entered by the user to create a new food record in the food data storage arrangement of the fitness management system. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage media having instructions embodied thereon for limiting a number of records in a preexisting database of crowd-sourced food records, which, when executed by a processor of a computing device, cause the computing device to:
-
receive a description of a food item entered by a user; compare the description entered by the user to a database comprising a plurality of crowd-sourced food records, the crowd-sourced food records including descriptions of food items previously entered by a plurality of other users, the comparison comprising; standardization of the description for the food item entered by the user; and determination of a similarity between the standardized description and respective ones of the descriptions for the food records in the database, wherein the similarity is based on a distance between the standardized description and text in a description field for the food record; when the standardized description is not determined to be within a threshold level of similarity to at least one of the descriptions in the database, store the standardized description as a new record in the database; and when the standardized description is determined to be within a threshold level of similarity to at least one of the descriptions in the database; determine a score associated with the standardized description, wherein the score is calculated based at least in part on a number of times the food record has been logged or a number of times the food record appeared in recent searches of the database; save the standardized description as a new food record of a cluster of records each associated with the at least one description in the database; and when the score is above a given threshold, designate the new food record as one of a plurality of candidate records of the cluster, the candidate records comprising one or more records having descriptions to which the standardized description is compared. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25)
-
Specification