Query Parser Derivation Computing Device and Method for Making a Query Parser for Parsing Unstructured Search Queries
First Claim
1. A method comprising:
- deriving, via a query parser derivation computing device, a query parser for parsing an unstructured geographic web-search query into a field-based format; and
utilizing, by the query parser, conditional random fields, learned by semi-supervised automated learning, to produce structured information from the unstructured geographic web-search query.
20 Assignments
0 Petitions
Accused Products
Abstract
A system and method is provided which may comprise parsing an unstructured geographic web-search query into a field-based format, by utilizing conditional random fields, learned by semi-supervised automated learning, to parse structured information from the unstructured geographic web-search query. The system and method may also comprise establishing semi-supervised conditional random fields utilizing one of a rule-based finite state machine model and a statistics-based conditional random field model. Systematic geographic parsing may be used with the one of the rule-based finite state machine model and the statistics-based conditional random field model. Parsing an unstructured local geographical web-based query in local domain may be done by applying a learned model parser to the query, using at least one class-based query log from a form-based query system. The learned model parser may comprise at least one class-level n-gram language model-based feature harvested from a structured query log.
114 Citations
20 Claims
-
1. A method comprising:
-
deriving, via a query parser derivation computing device, a query parser for parsing an unstructured geographic web-search query into a field-based format; and utilizing, by the query parser, conditional random fields, learned by semi-supervised automated learning, to produce structured information from the unstructured geographic web-search query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method comprising:
-
deriving a query parser, via a query parser derivation computing device; utilizing an unstructured local geographical web-based query in local domain by applying a learned model parser to the query. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A tangible machine readable medium storing instructions, the instructions, when executed by a computing device, causing the computing device to perform a method, comprising:
-
deriving a query parser by defining class-based language model features comprising at least one token appearing in a defined position in a model feature comprising a grouping of tokens of a given length n, where n is an integer, including GEO_UNI_FIRST, for the first token in a grouping of tokens of length one, relating to a geographic term, GEO_BI_FIRST, for the first token in a grouping of tokens of length two relating to a geographic term, GEO_BI_SECOND, for the second token in a grouping of tokens of length two relating to geographic term, GEO_TRI_FIRST, for the first token in a grouping of tokens of length three relating to a geographic term, GEO_TRI_SECOND, for the second token in a grouping of tokens of length three relating to a geographic term, GEO_TRI_THIRD, for the third token in a grouping of tokens of length three relating to geographic terms, and including ST_UNI_FIRST, for the first token in a grouping of tokens of length one, relating to a search term, ST_BI_FIRST, for the first token in a grouping of tokens of length two relating to a search term, ST_BI_SECOND, for the second token in a grouping of tokens of length two relating to a search term, ST_TRI_FIRST, for the first token in a grouping of tokens of length three relating to a search term, ST_TRI_SECOND, for the second token in a grouping of tokens of length three relating to a search term, ST_TRI_THIRD, for the third token in a grouping of tokens of length three relating to a search term, for tokens relating to one of a geographic term and a search term, for any grouping of length n where n is an integer; and utilizing the features as high-level language model features for deriving the parser.
-
Specification