WO2007134130A3 - Systems and methods for generating statistics from search engine query logs - Google Patents
Systems and methods for generating statistics from search engine query logs Download PDFInfo
- Publication number
- WO2007134130A3 WO2007134130A3 PCT/US2007/068602 US2007068602W WO2007134130A3 WO 2007134130 A3 WO2007134130 A3 WO 2007134130A3 US 2007068602 W US2007068602 W US 2007068602W WO 2007134130 A3 WO2007134130 A3 WO 2007134130A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- statistics
- systems
- methods
- search engine
- query logs
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/2425—Iterative querying; Query formulation based on the results of a preceding query
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3349—Reuse of stored results of previous queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Strategic Management (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A computer-implemented method includes calculating first statistics about a user- identified event within a first subset of a database of events; selecting a second subset of the database of events based on said first statistics; calculating second statistics about the user- identified event within the second subset of the database of events; merging the first and second statistics as statistics of the user-identified event within the entire database of events; and generating a result including at least a portion of the merged statistics of the user- identified event.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US74688606P | 2006-05-09 | 2006-05-09 | |
US60/746,886 | 2006-05-09 | ||
US11/746,049 | 2007-05-08 | ||
US11/746,049 US8126874B2 (en) | 2006-05-09 | 2007-05-08 | Systems and methods for generating statistics from search engine query logs |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007134130A2 WO2007134130A2 (en) | 2007-11-22 |
WO2007134130A3 true WO2007134130A3 (en) | 2008-10-09 |
Family
ID=38694684
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/068602 WO2007134130A2 (en) | 2006-05-09 | 2007-05-09 | Systems and methods for generating statistics from search engine query logs |
Country Status (2)
Country | Link |
---|---|
US (2) | US8126874B2 (en) |
WO (1) | WO2007134130A2 (en) |
Families Citing this family (65)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7610282B1 (en) * | 2007-03-30 | 2009-10-27 | Google Inc. | Rank-adjusted content items |
JP5255055B2 (en) * | 2007-05-21 | 2013-08-07 | グーグル・インコーポレーテッド | Query statistics provider |
WO2008151466A1 (en) * | 2007-06-14 | 2008-12-18 | Google Inc. | Dictionary word and phrase determination |
US8290921B2 (en) * | 2007-06-28 | 2012-10-16 | Microsoft Corporation | Identification of similar queries based on overall and partial similarity of time series |
KR100903506B1 (en) * | 2007-10-24 | 2009-06-17 | 엔에이치엔(주) | System and method for managing informaiton map |
KR100893129B1 (en) * | 2007-10-24 | 2009-04-15 | 엔에이치엔(주) | System for extracting recommended keyword of multimedia contents and method thereof |
GB0813123D0 (en) * | 2008-07-17 | 2008-08-27 | Symbian Software Ltd | Method of searching |
US8060580B2 (en) * | 2008-10-03 | 2011-11-15 | Seomoz, Inc. | Index rank optimization system and method |
US8214350B1 (en) * | 2009-01-02 | 2012-07-03 | Google Inc. | Pre-computed impression lists |
CN101477542B (en) * | 2009-01-22 | 2013-02-13 | 阿里巴巴集团控股有限公司 | Sampling analysis method, system and equipment |
US8504580B2 (en) * | 2009-03-03 | 2013-08-06 | Ilya Geller | Systems and methods for creating an artificial intelligence |
US8516013B2 (en) | 2009-03-03 | 2013-08-20 | Ilya Geller | Systems and methods for subtext searching data using synonym-enriched predicative phrases and substituted pronouns |
US8447789B2 (en) * | 2009-09-15 | 2013-05-21 | Ilya Geller | Systems and methods for creating structured data |
US8756244B2 (en) * | 2009-07-29 | 2014-06-17 | Teradata Us, Inc. | Metadata as comments for search problem determination and analysis |
US8255379B2 (en) | 2009-11-10 | 2012-08-28 | Microsoft Corporation | Custom local search |
WO2011090036A1 (en) * | 2010-01-19 | 2011-07-28 | 日本電気株式会社 | Trend information retrieval device, trend information retrieval method and recording medium |
US20110270819A1 (en) * | 2010-04-30 | 2011-11-03 | Microsoft Corporation | Context-aware query classification |
US20110314045A1 (en) * | 2010-06-21 | 2011-12-22 | Microsoft Corporation | Fast set intersection |
JP5121888B2 (en) * | 2010-06-30 | 2013-01-16 | ヤフー株式会社 | Apparatus and method for determining spam IP address and apparatus and method for determining spam query |
US9430502B1 (en) * | 2010-09-10 | 2016-08-30 | Tellabs Operations, Inc. | Method and apparatus for collecting and storing statistics data from network elements using scalable architecture |
US20130006914A1 (en) * | 2011-06-28 | 2013-01-03 | Microsoft Corporation | Exposing search history by category |
US8688499B1 (en) * | 2011-08-11 | 2014-04-01 | Google Inc. | System and method for generating business process models from mapped time sequenced operational and transaction data |
US9218629B2 (en) * | 2012-01-20 | 2015-12-22 | Blackberry Limited | Prioritizing and providing information about user contacts |
US20130232172A1 (en) * | 2012-03-01 | 2013-09-05 | Salesforce.Com, Inc. | Methods and systems for matching expressions |
US8620925B1 (en) | 2012-05-17 | 2013-12-31 | Google Inc. | System and method for identifying advertising opportunities |
US8682925B1 (en) | 2013-01-31 | 2014-03-25 | Splunk Inc. | Distributed high performance analytics store |
US8516008B1 (en) | 2012-05-18 | 2013-08-20 | Splunk Inc. | Flexible schema column store |
US10061807B2 (en) | 2012-05-18 | 2018-08-28 | Splunk Inc. | Collection query driven generation of inverted index for raw machine data |
US9201916B2 (en) * | 2012-06-13 | 2015-12-01 | Infosys Limited | Method, system, and computer-readable medium for providing a scalable bio-informatics sequence search on cloud |
WO2014047186A1 (en) * | 2012-09-18 | 2014-03-27 | Newtek Business Services, Inc. | Real-time data capture and distribution system for e-commerce payment transactions |
CN104077530A (en) | 2013-03-27 | 2014-10-01 | 国际商业机器公司 | Method and device used for evaluating safety of data access sentence |
US9373322B2 (en) * | 2013-04-10 | 2016-06-21 | Nuance Communications, Inc. | System and method for determining query intent |
US10169711B1 (en) * | 2013-06-27 | 2019-01-01 | Google Llc | Generalized engine for predicting actions |
US11386085B2 (en) | 2014-01-27 | 2022-07-12 | Microstrategy Incorporated | Deriving metrics from queries |
US10255320B1 (en) | 2014-01-27 | 2019-04-09 | Microstrategy Incorporated | Search integration |
US9952894B1 (en) * | 2014-01-27 | 2018-04-24 | Microstrategy Incorporated | Parallel query processing |
US10095759B1 (en) | 2014-01-27 | 2018-10-09 | Microstrategy Incorporated | Data engine integration and data refinement |
US11921715B2 (en) | 2014-01-27 | 2024-03-05 | Microstrategy Incorporated | Search integration |
US9818065B2 (en) * | 2014-03-12 | 2017-11-14 | Microsoft Technology Licensing, Llc | Attribution of activity in multi-user settings |
US10599659B2 (en) * | 2014-05-06 | 2020-03-24 | Oath Inc. | Method and system for evaluating user satisfaction with respect to a user session |
US9747331B2 (en) * | 2014-10-06 | 2017-08-29 | International Business Machines Corporation | Limiting scans of loosely ordered and/or grouped relations in a database |
RU2610280C2 (en) | 2014-10-31 | 2017-02-08 | Общество С Ограниченной Ответственностью "Яндекс" | Method for user authorization in a network and server used therein |
RU2580432C1 (en) | 2014-10-31 | 2016-04-10 | Общество С Ограниченной Ответственностью "Яндекс" | Method for processing a request from a potential unauthorised user to access resource and server used therein |
US10229150B2 (en) | 2015-04-23 | 2019-03-12 | Splunk Inc. | Systems and methods for concurrent summarization of indexed data |
US11048701B2 (en) * | 2016-09-13 | 2021-06-29 | International Business Machines Corporation | Query optimization in hybrid DBMS |
CN106649804A (en) * | 2016-12-29 | 2017-05-10 | 深圳市优必选科技有限公司 | Data processing method, data processing device and data processing system for data query server |
US10474674B2 (en) | 2017-01-31 | 2019-11-12 | Splunk Inc. | Using an inverted index in a pipelined search query to determine a set of event data that is further limited by filtering and/or processing of subsequent query pipestages |
US11379530B2 (en) | 2017-01-31 | 2022-07-05 | Splunk Inc. | Leveraging references values in inverted indexes to retrieve associated event records comprising raw machine data |
US10467433B2 (en) * | 2017-03-17 | 2019-11-05 | Mediasift Limited | Event processing system |
US10846318B1 (en) | 2017-04-18 | 2020-11-24 | Microstrategy Incorporated | Natural language visualizations |
US10423638B2 (en) | 2017-04-27 | 2019-09-24 | Google Llc | Cloud inference system |
US20180357278A1 (en) * | 2017-06-09 | 2018-12-13 | Linkedin Corporation | Processing aggregate queries in a graph database |
US10817757B2 (en) * | 2017-07-31 | 2020-10-27 | Splunk Inc. | Automated data preprocessing for machine learning |
US11403366B2 (en) * | 2018-09-30 | 2022-08-02 | Hewlett Packard Enterprise Development Lp | On-demand retrieval of information from databases |
US11195050B2 (en) | 2019-02-05 | 2021-12-07 | Microstrategy Incorporated | Machine learning to generate and evaluate visualizations |
CN112307360B (en) * | 2019-07-30 | 2023-08-25 | 百度在线网络技术(北京)有限公司 | Regional event detection method and device based on search engine and search engine |
US11620157B2 (en) | 2019-10-18 | 2023-04-04 | Splunk Inc. | Data ingestion pipeline anomaly detection |
US11615101B2 (en) | 2019-10-18 | 2023-03-28 | Splunk Inc. | Anomaly detection in data ingested to a data intake and query system |
US11614970B2 (en) | 2019-12-06 | 2023-03-28 | Microstrategy Incorporated | High-throughput parallel data transmission |
US11567965B2 (en) | 2020-01-23 | 2023-01-31 | Microstrategy Incorporated | Enhanced preparation and integration of data sets |
US11663176B2 (en) | 2020-07-31 | 2023-05-30 | Splunk Inc. | Data field extraction model training for a data intake and query system |
US11704490B2 (en) | 2020-07-31 | 2023-07-18 | Splunk Inc. | Log sourcetype inference model training for a data intake and query system |
CN112463570B (en) * | 2020-12-15 | 2024-04-09 | 航天信息股份有限公司 | Log statistics method, device and system |
US11687438B1 (en) | 2021-01-29 | 2023-06-27 | Splunk Inc. | Adaptive thresholding of data streamed to a data processing pipeline |
US20230229659A1 (en) * | 2022-01-20 | 2023-07-20 | Oracle International Corporation | Estimating query execution performance using a sampled counter |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5983216A (en) * | 1997-09-12 | 1999-11-09 | Infoseek Corporation | Performing automated document collection and selection by providing a meta-index with meta-index values indentifying corresponding document collections |
US20040215607A1 (en) * | 2003-04-25 | 2004-10-28 | Travis Robert L. | Method and system fo blending search engine results from disparate sources into one search result |
US20050010564A1 (en) * | 2003-05-19 | 2005-01-13 | Netezza Corporation | Limiting scans of loosely ordered and/or grouped relations using nearly ordered maps |
Family Cites Families (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6115393A (en) * | 1991-04-12 | 2000-09-05 | Concord Communications, Inc. | Network monitoring |
US5613113A (en) * | 1993-10-08 | 1997-03-18 | International Business Machines Corporation | Consistent recreation of events from activity logs |
CA2167790A1 (en) * | 1995-01-23 | 1996-07-24 | Donald S. Maier | Relational database system and method with high data availability during table data restructuring |
US6144967A (en) * | 1996-01-25 | 2000-11-07 | International Business Machines Corporation | Object oriented processing log analysis tool framework mechanism |
US5696964A (en) * | 1996-04-16 | 1997-12-09 | Nec Research Institute, Inc. | Multimedia database retrieval system which maintains a posterior probability distribution that each item in the database is a target of a search |
US6654933B1 (en) * | 1999-09-21 | 2003-11-25 | Kasenna, Inc. | System and method for media stream indexing |
US6108648A (en) * | 1997-07-18 | 2000-08-22 | Informix Software, Inc. | Optimizer with neural network estimator |
US6292830B1 (en) * | 1997-08-08 | 2001-09-18 | Iterations Llc | System for optimizing interaction among agents acting on multiple levels |
US6067541A (en) * | 1997-09-17 | 2000-05-23 | Microsoft Corporation | Monitoring document changes in a file system of documents with the document change information stored in a persistent log |
KR100268749B1 (en) * | 1998-02-26 | 2000-10-16 | 이 병 길 | Layered manganese dioxide for li secondary batteries and method for producing the same |
US6278993B1 (en) * | 1998-12-08 | 2001-08-21 | Yodlee.Com, Inc. | Method and apparatus for extending an on-line internet search beyond pre-referenced sources and returning data over a data-packet-network (DPN) using private search engines as proxy-engines |
US6898597B1 (en) * | 1999-11-09 | 2005-05-24 | Insweb Corporation | Event log |
US6477523B1 (en) * | 1999-12-03 | 2002-11-05 | Ncr Corporation | Selectivity prediction with compressed histograms in a parallel processing database system |
US20020152305A1 (en) * | 2000-03-03 | 2002-10-17 | Jackson Gregory J. | Systems and methods for resource utilization analysis in information management environments |
US20040230461A1 (en) * | 2000-03-30 | 2004-11-18 | Talib Iqbal A. | Methods and systems for enabling efficient retrieval of data from data collections |
US6687696B2 (en) * | 2000-07-26 | 2004-02-03 | Recommind Inc. | System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models |
US6766320B1 (en) * | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
US7617201B1 (en) * | 2001-06-20 | 2009-11-10 | Microstrategy, Incorporated | System and method for analyzing statistics in a reporting system |
US7139749B2 (en) * | 2002-03-19 | 2006-11-21 | International Business Machines Corporation | Method, system, and program for performance tuning a database query |
US7249118B2 (en) * | 2002-05-17 | 2007-07-24 | Aleri, Inc. | Database system and methods |
US6947927B2 (en) * | 2002-07-09 | 2005-09-20 | Microsoft Corporation | Method and apparatus for exploiting statistics on query expressions for optimization |
US7308643B1 (en) * | 2003-07-03 | 2007-12-11 | Google Inc. | Anchor tag indexing in a web crawler system |
US7240049B2 (en) * | 2003-11-12 | 2007-07-03 | Yahoo! Inc. | Systems and methods for search query processing using trend analysis |
US7383262B2 (en) * | 2004-06-29 | 2008-06-03 | Microsoft Corporation | Ranking database query results using probabilistic models from information retrieval |
US7580921B2 (en) * | 2004-07-26 | 2009-08-25 | Google Inc. | Phrase identification in an information retrieval system |
US7584175B2 (en) * | 2004-07-26 | 2009-09-01 | Google Inc. | Phrase-based generation of document descriptions |
US7426507B1 (en) * | 2004-07-26 | 2008-09-16 | Google, Inc. | Automatic taxonomy generation in search results using phrases |
US7567959B2 (en) * | 2004-07-26 | 2009-07-28 | Google Inc. | Multiple index based information retrieval system |
US20060074883A1 (en) * | 2004-10-05 | 2006-04-06 | Microsoft Corporation | Systems, methods, and interfaces for providing personalized search and information access |
US7809722B2 (en) * | 2005-05-09 | 2010-10-05 | Like.Com | System and method for enabling search and retrieval from image files based on recognized information |
US20070038889A1 (en) * | 2005-08-11 | 2007-02-15 | Wiggins Robert D | Methods and systems to access process control log information associated with process control systems |
US7668823B2 (en) * | 2007-04-03 | 2010-02-23 | Google Inc. | Identifying inadequate search content |
-
2007
- 2007-05-08 US US11/746,049 patent/US8126874B2/en not_active Expired - Fee Related
- 2007-05-09 WO PCT/US2007/068602 patent/WO2007134130A2/en active Application Filing
-
2012
- 2012-02-14 US US13/396,511 patent/US9262767B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5983216A (en) * | 1997-09-12 | 1999-11-09 | Infoseek Corporation | Performing automated document collection and selection by providing a meta-index with meta-index values indentifying corresponding document collections |
US20040215607A1 (en) * | 2003-04-25 | 2004-10-28 | Travis Robert L. | Method and system fo blending search engine results from disparate sources into one search result |
US20050010564A1 (en) * | 2003-05-19 | 2005-01-13 | Netezza Corporation | Limiting scans of loosely ordered and/or grouped relations using nearly ordered maps |
Also Published As
Publication number | Publication date |
---|---|
US8126874B2 (en) | 2012-02-28 |
US9262767B2 (en) | 2016-02-16 |
US20120215765A1 (en) | 2012-08-23 |
WO2007134130A2 (en) | 2007-11-22 |
US20110040733A1 (en) | 2011-02-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007134130A3 (en) | Systems and methods for generating statistics from search engine query logs | |
WO2007021360A3 (en) | Computer-implemented personal information manager method and system | |
WO2008030335A3 (en) | Enterprise performance management software system having action-based data capture | |
WO2004114160A3 (en) | Systems and processes for automated criteria and attribute generation, searching, auditing and reporting of data | |
WO2008060860A3 (en) | A method of improving a query to a database system | |
WO2008076486A3 (en) | Trip optimization system and method for a vehicle | |
WO2007108788A3 (en) | Method and system for answer extraction | |
WO2006017575A3 (en) | Commercial shape search engine | |
WO2005098595A3 (en) | Methods and systems for interfacing applications with a search engine | |
WO2006118814A3 (en) | Method for finding semantically related search engine queries | |
WO2011034502A8 (en) | Textual query based multimedia retrieval system | |
WO2005114541A3 (en) | Systems and methods for minimizing security logs | |
WO2007019470A3 (en) | Management of expert resources using seeker profiles | |
WO2010120929A3 (en) | Generating user-customized search results and building a semantics-enhanced search engine | |
WO2007124139A3 (en) | Computer systems and methods for automatic generation of models for a dataset | |
WO2008094289A3 (en) | A method of choosing advertisements to be shown to a search engine user | |
WO2008063818A3 (en) | Automatic system and method for vehicle diagnostic data retrieval using multiple data sources | |
EP1484695A3 (en) | Automatic task generator method and system | |
WO2006124910A3 (en) | System and method for automated management of an address database | |
WO2009123866A3 (en) | Method and system for organizing information | |
WO2007002750A3 (en) | Price determination for items of low demand | |
WO2006115698A3 (en) | Page-biased search | |
WO2007075417A3 (en) | System and method for analyzing communications using multi-dimensional hierarchical structures | |
WO2007073515A3 (en) | System and method for problem analysis | |
WO2000079436A3 (en) | Search engine interface |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07762074 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07762074 Country of ref document: EP Kind code of ref document: A2 |