EXTRACTING STRUCTURED DATA FROM WEB QUERIES
First Claim
1. In a computing environment, a method performed on at least one processor comprising, processing a web query into structured data, including classifying the query and segmenting the query into labeled segments, the structured data including a class, an intent set comprising one or more labeled segments corresponding to data sought by the query, and an attribute set comprising one or more labeled segments that each correspond to a modifier of the query.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is processing a web query into structured data, such as for use in formulating a database query. A classifier and segmental conditional random field (CRF) model classify and segment a query into labeled segments used to obtain the structured data. The structured data includes a class, an intent set corresponding to data sought by the query, and an attribute set corresponding to an attribute name and attribute value representing each modifier in the query. The structured data may be converted into a query language (e.g., SQL) query to query the structured data source; e.g., the class may be used to select a database, the attribute set used to find one or more columns and a row in the database, and the intent set matched to a column, whose row data is the result being sought. A web search engine may return the database query as part of its results.
-
Citations
20 Claims
- 1. In a computing environment, a method performed on at least one processor comprising, processing a web query into structured data, including classifying the query and segmenting the query into labeled segments, the structured data including a class, an intent set comprising one or more labeled segments corresponding to data sought by the query, and an attribute set comprising one or more labeled segments that each correspond to a modifier of the query.
- 9. In a computing environment, a system comprising, a semantic structure extractor, including a classifier that classifies a query into classification-related data, a segmental conditional random field model that segments the query into segment-related data including a label for each segment, a mechanism for selecting a class based upon the classification-related data and the segment-related data, the semantic structure extractor providing structured data comprising the class, an intent set comprising one or more labeled segments corresponding to data sought by the query, and an attribute set comprising one or more labeled segments that each correspond to a modifier of the query, in which each label of a segment the attribute set correspond to the class.
-
17. In a computing environment, a system comprising:
-
a web search engine that receives a web query; a semantic structure extractor that receives at least part of the web query from the web search engine and converts the web query into structured data including a class, an intent set comprising one or more labeled segments corresponding to data sought by the query, and an attribute set comprising one or more labeled segments that each correspond to a modifier of the query, the semantic structure extractor determining the class based upon terms in the query, and determining labels for the segments based upon the class and the terms in the query; and a mechanism that uses the structured data to query a source of structured data to provide results from the source of structured data to the web search engine. - View Dependent Claims (18, 19, 20)
-
Specification