WO2014085776A3 - Web search ranking - Google Patents

Web search ranking Download PDF

Info

Publication number
WO2014085776A3
WO2014085776A3 PCT/US2013/072502 US2013072502W WO2014085776A3 WO 2014085776 A3 WO2014085776 A3 WO 2014085776A3 US 2013072502 W US2013072502 W US 2013072502W WO 2014085776 A3 WO2014085776 A3 WO 2014085776A3
Authority
WO
WIPO (PCT)
Prior art keywords
web search
training samples
ranking
search ranking
translation model
Prior art date
Application number
PCT/US2013/072502
Other languages
French (fr)
Other versions
WO2014085776A2 (en
Inventor
Jianfeng Gao
Zhonghua QU
Gu Xu
Original Assignee
Microsoft Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corporation filed Critical Microsoft Corporation
Publication of WO2014085776A2 publication Critical patent/WO2014085776A2/en
Publication of WO2014085776A3 publication Critical patent/WO2014085776A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24578Query processing with adaptation to user needs using ranking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • G06F16/337Profile generation, learning or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

A computer-implemented method and system for Web search ranking are provided herein. The method includes generating a number of training samples from clickthrough data, wherein the training samples include positive query-document pairs and negative query-document pairs. The method also includes discriminatively training a translation model based on the training samples and ranking a number of documents for a Web search based on the translation model.
PCT/US2013/072502 2012-11-29 2013-11-29 Web search ranking WO2014085776A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/688,209 US9104733B2 (en) 2012-11-29 2012-11-29 Web search ranking
US13/688,209 2012-11-29

Publications (2)

Publication Number Publication Date
WO2014085776A2 WO2014085776A2 (en) 2014-06-05
WO2014085776A3 true WO2014085776A3 (en) 2014-07-17

Family

ID=49765715

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2013/072502 WO2014085776A2 (en) 2012-11-29 2013-11-29 Web search ranking

Country Status (2)

Country Link
US (1) US9104733B2 (en)
WO (1) WO2014085776A2 (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9398104B2 (en) * 2012-12-20 2016-07-19 Facebook, Inc. Ranking test framework for search results on an online social network
US9529858B2 (en) * 2014-03-06 2016-12-27 Yahoo! Inc. Methods and systems for ranking items on a presentation area based on binary outcomes
RU2608886C2 (en) 2014-06-30 2017-01-25 Общество С Ограниченной Ответственностью "Яндекс" Search results ranking means
US10007732B2 (en) 2015-05-19 2018-06-26 Microsoft Technology Licensing, Llc Ranking content items based on preference scores
US10140983B2 (en) * 2015-08-28 2018-11-27 International Business Machines Corporation Building of n-gram language model for automatic speech recognition (ASR)
US11120351B2 (en) * 2015-09-21 2021-09-14 International Business Machines Corporation Generic term weighting based on query performance prediction
US10437841B2 (en) * 2016-10-10 2019-10-08 Microsoft Technology Licensing, Llc Digital assistant extension automatic ranking and selection
US10565265B2 (en) * 2016-10-12 2020-02-18 Salesforce.Com, Inc. Accounting for positional bias in a document retrieval system using machine learning
CN107491518B (en) * 2017-08-15 2020-08-04 北京百度网讯科技有限公司 Search recall method and device, server and storage medium
RU2731658C2 (en) 2018-06-21 2020-09-07 Общество С Ограниченной Ответственностью "Яндекс" Method and system of selection for ranking search results using machine learning algorithm
US11403303B2 (en) * 2018-09-07 2022-08-02 Beijing Bytedance Network Technology Co., Ltd. Method and device for generating ranking model
RU2733481C2 (en) 2018-12-13 2020-10-01 Общество С Ограниченной Ответственностью "Яндекс" Method and system for generating feature for ranging document
RU2744029C1 (en) 2018-12-29 2021-03-02 Общество С Ограниченной Ответственностью "Яндекс" System and method of forming training set for machine learning algorithm
CN111597800B (en) * 2019-02-19 2023-12-12 百度在线网络技术(北京)有限公司 Method, device, equipment and storage medium for obtaining synonyms
US11645290B2 (en) * 2019-10-14 2023-05-09 Airbnb, Inc. Position debiased network site searches
US11816159B2 (en) 2020-06-01 2023-11-14 Yandex Europe Ag Method of and system for generating a training set for a machine learning algorithm (MLA)
US11809420B2 (en) * 2020-08-13 2023-11-07 Sabre Glbl Inc. Database search query enhancer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090313286A1 (en) * 2008-06-17 2009-12-17 Microsoft Corporation Generating training data from click logs
US20120143789A1 (en) * 2010-12-01 2012-06-07 Microsoft Corporation Click model that accounts for a user's intent when placing a quiery in a search engine

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7624006B2 (en) * 2004-09-15 2009-11-24 Microsoft Corporation Conditional maximum likelihood estimation of naïve bayes probability models
US20080208836A1 (en) * 2007-02-23 2008-08-28 Yahoo! Inc. Regression framework for learning ranking functions using relative preferences
US8452585B2 (en) 2007-06-21 2013-05-28 Microsoft Corporation Discriminative syntactic word order model for machine translation
US8051061B2 (en) * 2007-07-20 2011-11-01 Microsoft Corporation Cross-lingual query suggestion
US7788276B2 (en) 2007-08-22 2010-08-31 Yahoo! Inc. Predictive stemming for web search with statistical machine translation models
US7933847B2 (en) 2007-10-17 2011-04-26 Microsoft Corporation Limited-memory quasi-newton optimization algorithm for L1-regularized objectives
US7734633B2 (en) * 2007-10-18 2010-06-08 Microsoft Corporation Listwise ranking
US7844555B2 (en) * 2007-11-13 2010-11-30 Microsoft Corporation Ranker selection for statistical natural language processing
US8615388B2 (en) * 2008-03-28 2013-12-24 Microsoft Corporation Intra-language statistical machine translation
US8326785B2 (en) * 2008-09-30 2012-12-04 Microsoft Corporation Joint ranking model for multilingual web search
US8671093B2 (en) * 2008-11-18 2014-03-11 Yahoo! Inc. Click model for search rankings
US8255412B2 (en) * 2008-12-17 2012-08-28 Microsoft Corporation Boosting algorithm for ranking model adaptation
US8543580B2 (en) 2008-12-23 2013-09-24 Microsoft Corporation Mining translations of web queries from web click-through data
US8239334B2 (en) 2008-12-24 2012-08-07 Microsoft Corporation Learning latent semantic space for ranking
US8719298B2 (en) 2009-05-21 2014-05-06 Microsoft Corporation Click-through prediction for news queries
US20100318531A1 (en) * 2009-06-10 2010-12-16 Microsoft Corporation Smoothing clickthrough data for web search ranking
US20110029517A1 (en) * 2009-07-31 2011-02-03 Shihao Ji Global and topical ranking of search results using user clicks
US8682811B2 (en) * 2009-12-30 2014-03-25 Microsoft Corporation User-driven index selection
US8275656B2 (en) 2010-03-11 2012-09-25 Yahoo! Inc. Maximum likelihood estimation under a covariance constraint for predictive modeling
US8392343B2 (en) * 2010-07-21 2013-03-05 Yahoo! Inc. Estimating probabilities of events in sponsored search using adaptive models
US9092483B2 (en) * 2010-10-19 2015-07-28 Microsoft Technology Licensing, Llc User query reformulation using random walks
US20120109860A1 (en) * 2010-11-03 2012-05-03 Microsoft Corporation Enhanced Training Data for Learning-To-Rank
US8407041B2 (en) * 2010-12-01 2013-03-26 Microsoft Corporation Integrative and discriminative technique for spoken utterance translation
US8645289B2 (en) * 2010-12-16 2014-02-04 Microsoft Corporation Structured cross-lingual relevance feedback for enhancing search results
US8798984B2 (en) * 2011-04-27 2014-08-05 Xerox Corporation Method and system for confidence-weighted learning of factored discriminative language models
US9501759B2 (en) * 2011-10-25 2016-11-22 Microsoft Technology Licensing, Llc Search query and document-related data translation
US9009148B2 (en) * 2011-12-19 2015-04-14 Microsoft Technology Licensing, Llc Clickthrough-based latent semantic model

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090313286A1 (en) * 2008-06-17 2009-12-17 Microsoft Corporation Generating training data from click logs
US20120143789A1 (en) * 2010-12-01 2012-06-07 Microsoft Corporation Click model that accounts for a user's intent when placing a quiery in a search engine

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
SIMON CARTER ET AL: "Syntactic discriminative language model rerankers for statistical machine translation", MACHINE TRANSLATION, KLUWER ACADEMIC PUBLISHERS, DO, vol. 25, no. 4, 1 September 2011 (2011-09-01), pages 317 - 339, XP019978465, ISSN: 1573-0573, DOI: 10.1007/S10590-011-9108-7 *
THORSTEN JOACHIMS: "Optimizing search engines using clickthrough data", PROCEEDINGS OF THE EIGHTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING , KDD '02, 1 January 2002 (2002-01-01), New York, New York, USA, pages 133, XP055038616, ISBN: 978-1-58-113567-1, DOI: 10.1145/775047.775067 *

Also Published As

Publication number Publication date
US9104733B2 (en) 2015-08-11
US20140149429A1 (en) 2014-05-29
WO2014085776A2 (en) 2014-06-05

Similar Documents

Publication Publication Date Title
WO2014085776A3 (en) Web search ranking
AU2018200396B2 (en) A method and system for extraction
WO2012106133A3 (en) System for identifying textual relationships
WO2014049334A3 (en) A document management system and method
EP2659399A4 (en) System and method for providing contextual actions on a search results page
WO2012143603A3 (en) Methods and apparatuses for facilitating gesture recognition
WO2014102548A3 (en) Search system and corresponding method
WO2013134641A3 (en) Recognizing speech in multiple languages
GB201307409D0 (en) Systems and methods for providing data-driven document suggestions
WO2016109307A3 (en) Discriminating ambiguous expressions to enhance user experience
WO2014004686A3 (en) System and method for creating slideshows
WO2012134972A3 (en) Systems and methods for paragraph-based document searching
WO2011035298A3 (en) Methods and apparatus to perform choice modeling with substitutability data
WO2012115958A3 (en) Automatic data cleaning for machine learning classifiers
WO2013131025A3 (en) Product cycle analysis using social media data
WO2012166989A3 (en) Emotion-based user identification for online experiences
MX2013007804A (en) Data improvement system and method.
EP3051432A4 (en) Semantic information acquisition method, keyword expansion method thereof, and search method and system
WO2014031683A3 (en) Hierarchical based sequencing machine learning model
WO2012074704A3 (en) Display of search ads in local language
BR112013010516A2 (en) system and method for generating a geostatistical model of a relevant geological volume, which is limited by a process-based model of the relevant geological volume
WO2012057588A3 (en) Apparatus and method for diagnosing learning ability
EP2680251A4 (en) Search system, search method for search system, information processing device, search program, corresponding keyword management device and corresponding keyword management system
WO2014058575A3 (en) Modeling data generating process
BR112014032104A2 (en) method for identifying protein-drug interactions, and, computer product.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13805703

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
122 Ep: pct application non-entry in european phase

Ref document number: 13805703

Country of ref document: EP

Kind code of ref document: A2