WO2014085776A3 - Web search ranking - Google Patents
Web search ranking Download PDFInfo
- Publication number
- WO2014085776A3 WO2014085776A3 PCT/US2013/072502 US2013072502W WO2014085776A3 WO 2014085776 A3 WO2014085776 A3 WO 2014085776A3 US 2013072502 W US2013072502 W US 2013072502W WO 2014085776 A3 WO2014085776 A3 WO 2014085776A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- web search
- training samples
- ranking
- search ranking
- translation model
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
- G06F16/24578—Query processing with adaptation to user needs using ranking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/335—Filtering based on additional data, e.g. user or group profiles
- G06F16/337—Profile generation, learning or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Abstract
A computer-implemented method and system for Web search ranking are provided herein. The method includes generating a number of training samples from clickthrough data, wherein the training samples include positive query-document pairs and negative query-document pairs. The method also includes discriminatively training a translation model based on the training samples and ranking a number of documents for a Web search based on the translation model.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/688,209 US9104733B2 (en) | 2012-11-29 | 2012-11-29 | Web search ranking |
US13/688,209 | 2012-11-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2014085776A2 WO2014085776A2 (en) | 2014-06-05 |
WO2014085776A3 true WO2014085776A3 (en) | 2014-07-17 |
Family
ID=49765715
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2013/072502 WO2014085776A2 (en) | 2012-11-29 | 2013-11-29 | Web search ranking |
Country Status (2)
Country | Link |
---|---|
US (1) | US9104733B2 (en) |
WO (1) | WO2014085776A2 (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9398104B2 (en) * | 2012-12-20 | 2016-07-19 | Facebook, Inc. | Ranking test framework for search results on an online social network |
US9529858B2 (en) * | 2014-03-06 | 2016-12-27 | Yahoo! Inc. | Methods and systems for ranking items on a presentation area based on binary outcomes |
RU2608886C2 (en) | 2014-06-30 | 2017-01-25 | Общество С Ограниченной Ответственностью "Яндекс" | Search results ranking means |
US10007732B2 (en) | 2015-05-19 | 2018-06-26 | Microsoft Technology Licensing, Llc | Ranking content items based on preference scores |
US10140983B2 (en) * | 2015-08-28 | 2018-11-27 | International Business Machines Corporation | Building of n-gram language model for automatic speech recognition (ASR) |
US11120351B2 (en) * | 2015-09-21 | 2021-09-14 | International Business Machines Corporation | Generic term weighting based on query performance prediction |
US10437841B2 (en) * | 2016-10-10 | 2019-10-08 | Microsoft Technology Licensing, Llc | Digital assistant extension automatic ranking and selection |
US10565265B2 (en) * | 2016-10-12 | 2020-02-18 | Salesforce.Com, Inc. | Accounting for positional bias in a document retrieval system using machine learning |
CN107491518B (en) * | 2017-08-15 | 2020-08-04 | 北京百度网讯科技有限公司 | Search recall method and device, server and storage medium |
RU2731658C2 (en) | 2018-06-21 | 2020-09-07 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system of selection for ranking search results using machine learning algorithm |
US11403303B2 (en) * | 2018-09-07 | 2022-08-02 | Beijing Bytedance Network Technology Co., Ltd. | Method and device for generating ranking model |
RU2733481C2 (en) | 2018-12-13 | 2020-10-01 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for generating feature for ranging document |
RU2744029C1 (en) | 2018-12-29 | 2021-03-02 | Общество С Ограниченной Ответственностью "Яндекс" | System and method of forming training set for machine learning algorithm |
CN111597800B (en) * | 2019-02-19 | 2023-12-12 | 百度在线网络技术(北京)有限公司 | Method, device, equipment and storage medium for obtaining synonyms |
US11645290B2 (en) * | 2019-10-14 | 2023-05-09 | Airbnb, Inc. | Position debiased network site searches |
US11816159B2 (en) | 2020-06-01 | 2023-11-14 | Yandex Europe Ag | Method of and system for generating a training set for a machine learning algorithm (MLA) |
US11809420B2 (en) * | 2020-08-13 | 2023-11-07 | Sabre Glbl Inc. | Database search query enhancer |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090313286A1 (en) * | 2008-06-17 | 2009-12-17 | Microsoft Corporation | Generating training data from click logs |
US20120143789A1 (en) * | 2010-12-01 | 2012-06-07 | Microsoft Corporation | Click model that accounts for a user's intent when placing a quiery in a search engine |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7624006B2 (en) * | 2004-09-15 | 2009-11-24 | Microsoft Corporation | Conditional maximum likelihood estimation of naïve bayes probability models |
US20080208836A1 (en) * | 2007-02-23 | 2008-08-28 | Yahoo! Inc. | Regression framework for learning ranking functions using relative preferences |
US8452585B2 (en) | 2007-06-21 | 2013-05-28 | Microsoft Corporation | Discriminative syntactic word order model for machine translation |
US8051061B2 (en) * | 2007-07-20 | 2011-11-01 | Microsoft Corporation | Cross-lingual query suggestion |
US7788276B2 (en) | 2007-08-22 | 2010-08-31 | Yahoo! Inc. | Predictive stemming for web search with statistical machine translation models |
US7933847B2 (en) | 2007-10-17 | 2011-04-26 | Microsoft Corporation | Limited-memory quasi-newton optimization algorithm for L1-regularized objectives |
US7734633B2 (en) * | 2007-10-18 | 2010-06-08 | Microsoft Corporation | Listwise ranking |
US7844555B2 (en) * | 2007-11-13 | 2010-11-30 | Microsoft Corporation | Ranker selection for statistical natural language processing |
US8615388B2 (en) * | 2008-03-28 | 2013-12-24 | Microsoft Corporation | Intra-language statistical machine translation |
US8326785B2 (en) * | 2008-09-30 | 2012-12-04 | Microsoft Corporation | Joint ranking model for multilingual web search |
US8671093B2 (en) * | 2008-11-18 | 2014-03-11 | Yahoo! Inc. | Click model for search rankings |
US8255412B2 (en) * | 2008-12-17 | 2012-08-28 | Microsoft Corporation | Boosting algorithm for ranking model adaptation |
US8543580B2 (en) | 2008-12-23 | 2013-09-24 | Microsoft Corporation | Mining translations of web queries from web click-through data |
US8239334B2 (en) | 2008-12-24 | 2012-08-07 | Microsoft Corporation | Learning latent semantic space for ranking |
US8719298B2 (en) | 2009-05-21 | 2014-05-06 | Microsoft Corporation | Click-through prediction for news queries |
US20100318531A1 (en) * | 2009-06-10 | 2010-12-16 | Microsoft Corporation | Smoothing clickthrough data for web search ranking |
US20110029517A1 (en) * | 2009-07-31 | 2011-02-03 | Shihao Ji | Global and topical ranking of search results using user clicks |
US8682811B2 (en) * | 2009-12-30 | 2014-03-25 | Microsoft Corporation | User-driven index selection |
US8275656B2 (en) | 2010-03-11 | 2012-09-25 | Yahoo! Inc. | Maximum likelihood estimation under a covariance constraint for predictive modeling |
US8392343B2 (en) * | 2010-07-21 | 2013-03-05 | Yahoo! Inc. | Estimating probabilities of events in sponsored search using adaptive models |
US9092483B2 (en) * | 2010-10-19 | 2015-07-28 | Microsoft Technology Licensing, Llc | User query reformulation using random walks |
US20120109860A1 (en) * | 2010-11-03 | 2012-05-03 | Microsoft Corporation | Enhanced Training Data for Learning-To-Rank |
US8407041B2 (en) * | 2010-12-01 | 2013-03-26 | Microsoft Corporation | Integrative and discriminative technique for spoken utterance translation |
US8645289B2 (en) * | 2010-12-16 | 2014-02-04 | Microsoft Corporation | Structured cross-lingual relevance feedback for enhancing search results |
US8798984B2 (en) * | 2011-04-27 | 2014-08-05 | Xerox Corporation | Method and system for confidence-weighted learning of factored discriminative language models |
US9501759B2 (en) * | 2011-10-25 | 2016-11-22 | Microsoft Technology Licensing, Llc | Search query and document-related data translation |
US9009148B2 (en) * | 2011-12-19 | 2015-04-14 | Microsoft Technology Licensing, Llc | Clickthrough-based latent semantic model |
-
2012
- 2012-11-29 US US13/688,209 patent/US9104733B2/en active Active
-
2013
- 2013-11-29 WO PCT/US2013/072502 patent/WO2014085776A2/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090313286A1 (en) * | 2008-06-17 | 2009-12-17 | Microsoft Corporation | Generating training data from click logs |
US20120143789A1 (en) * | 2010-12-01 | 2012-06-07 | Microsoft Corporation | Click model that accounts for a user's intent when placing a quiery in a search engine |
Non-Patent Citations (2)
Title |
---|
SIMON CARTER ET AL: "Syntactic discriminative language model rerankers for statistical machine translation", MACHINE TRANSLATION, KLUWER ACADEMIC PUBLISHERS, DO, vol. 25, no. 4, 1 September 2011 (2011-09-01), pages 317 - 339, XP019978465, ISSN: 1573-0573, DOI: 10.1007/S10590-011-9108-7 * |
THORSTEN JOACHIMS: "Optimizing search engines using clickthrough data", PROCEEDINGS OF THE EIGHTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING , KDD '02, 1 January 2002 (2002-01-01), New York, New York, USA, pages 133, XP055038616, ISBN: 978-1-58-113567-1, DOI: 10.1145/775047.775067 * |
Also Published As
Publication number | Publication date |
---|---|
US9104733B2 (en) | 2015-08-11 |
US20140149429A1 (en) | 2014-05-29 |
WO2014085776A2 (en) | 2014-06-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2014085776A3 (en) | Web search ranking | |
AU2018200396B2 (en) | A method and system for extraction | |
WO2012106133A3 (en) | System for identifying textual relationships | |
WO2014049334A3 (en) | A document management system and method | |
EP2659399A4 (en) | System and method for providing contextual actions on a search results page | |
WO2012143603A3 (en) | Methods and apparatuses for facilitating gesture recognition | |
WO2014102548A3 (en) | Search system and corresponding method | |
WO2013134641A3 (en) | Recognizing speech in multiple languages | |
GB201307409D0 (en) | Systems and methods for providing data-driven document suggestions | |
WO2016109307A3 (en) | Discriminating ambiguous expressions to enhance user experience | |
WO2014004686A3 (en) | System and method for creating slideshows | |
WO2012134972A3 (en) | Systems and methods for paragraph-based document searching | |
WO2011035298A3 (en) | Methods and apparatus to perform choice modeling with substitutability data | |
WO2012115958A3 (en) | Automatic data cleaning for machine learning classifiers | |
WO2013131025A3 (en) | Product cycle analysis using social media data | |
WO2012166989A3 (en) | Emotion-based user identification for online experiences | |
MX2013007804A (en) | Data improvement system and method. | |
EP3051432A4 (en) | Semantic information acquisition method, keyword expansion method thereof, and search method and system | |
WO2014031683A3 (en) | Hierarchical based sequencing machine learning model | |
WO2012074704A3 (en) | Display of search ads in local language | |
BR112013010516A2 (en) | system and method for generating a geostatistical model of a relevant geological volume, which is limited by a process-based model of the relevant geological volume | |
WO2012057588A3 (en) | Apparatus and method for diagnosing learning ability | |
EP2680251A4 (en) | Search system, search method for search system, information processing device, search program, corresponding keyword management device and corresponding keyword management system | |
WO2014058575A3 (en) | Modeling data generating process | |
BR112014032104A2 (en) | method for identifying protein-drug interactions, and, computer product. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13805703 Country of ref document: EP Kind code of ref document: A2 |
|
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
122 | Ep: pct application non-entry in european phase |
Ref document number: 13805703 Country of ref document: EP Kind code of ref document: A2 |