WO2004097568A3 - Method and apparatus for machine learning a document relevance function - Google Patents

Method and apparatus for machine learning a document relevance function Download PDF

Info

Publication number
WO2004097568A3
WO2004097568A3 PCT/US2004/012813 US2004012813W WO2004097568A3 WO 2004097568 A3 WO2004097568 A3 WO 2004097568A3 US 2004012813 W US2004012813 W US 2004012813W WO 2004097568 A3 WO2004097568 A3 WO 2004097568A3
Authority
WO
WIPO (PCT)
Prior art keywords
documents
relevance
machine learning
training
relevance scores
Prior art date
Application number
PCT/US2004/012813
Other languages
French (fr)
Other versions
WO2004097568A2 (en
Inventor
David Cossock
Original Assignee
Overture Services Inc
David Cossock
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Overture Services Inc, David Cossock filed Critical Overture Services Inc
Priority to EP04750656A priority Critical patent/EP1623298A2/en
Priority to JP2006513331A priority patent/JP2006524869A/en
Publication of WO2004097568A2 publication Critical patent/WO2004097568A2/en
Publication of WO2004097568A3 publication Critical patent/WO2004097568A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • G06F16/337Profile generation, learning or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99937Sorting

Abstract

Provided is a method and computer program product for determining a document relevance function for estimating a relevance score of a document in a database with respect to a query. For each of a plurality of test queries, a respective set of result documents is collected. For each test query, a subset of the documents in the respective result set is selected, and a set of training relevance scores is assigned to documents in the subset. In one embodiment, at least some of the training relevance scores are assigned by human subjects who determine individual relevance scores for submitted documents with respect to the corresponding queries. Finally, a relevance function is determined based on the plurality of test queries, the subsets of documents, and the sets of training relevance scores.
PCT/US2004/012813 2003-04-25 2004-04-23 Method and apparatus for machine learning a document relevance function WO2004097568A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP04750656A EP1623298A2 (en) 2003-04-25 2004-04-23 Method and apparatus for machine learning a document relevance function
JP2006513331A JP2006524869A (en) 2003-04-25 2004-04-23 Method and apparatus for machine learning of document relevance functions

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/424,170 US7197497B2 (en) 2003-04-25 2003-04-25 Method and apparatus for machine learning a document relevance function
US10/424,170 2003-04-25

Publications (2)

Publication Number Publication Date
WO2004097568A2 WO2004097568A2 (en) 2004-11-11
WO2004097568A3 true WO2004097568A3 (en) 2006-01-05

Family

ID=33299288

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/012813 WO2004097568A2 (en) 2003-04-25 2004-04-23 Method and apparatus for machine learning a document relevance function

Country Status (6)

Country Link
US (1) US7197497B2 (en)
EP (1) EP1623298A2 (en)
JP (1) JP2006524869A (en)
KR (1) KR20060006945A (en)
CN (1) CN1826597A (en)
WO (1) WO2004097568A2 (en)

Families Citing this family (137)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6883135B1 (en) * 2000-01-28 2005-04-19 Microsoft Corporation Proxy server using a statistical model
AUPR605601A0 (en) * 2001-07-03 2001-07-26 Blackwood, Miles Pipeite of sandwich construction
US7792828B2 (en) 2003-06-25 2010-09-07 Jericho Systems Corporation Method and system for selecting content items to be presented to a viewer
US7610313B2 (en) * 2003-07-25 2009-10-27 Attenex Corporation System and method for performing efficient document scoring and clustering
US8548995B1 (en) * 2003-09-10 2013-10-01 Google Inc. Ranking of documents based on analysis of related documents
US7424467B2 (en) * 2004-01-26 2008-09-09 International Business Machines Corporation Architecture for an indexer with fixed width sort and variable width sort
US7499913B2 (en) * 2004-01-26 2009-03-03 International Business Machines Corporation Method for handling anchor text
US7293005B2 (en) 2004-01-26 2007-11-06 International Business Machines Corporation Pipelined architecture for global analysis and index building
US8296304B2 (en) 2004-01-26 2012-10-23 International Business Machines Corporation Method, system, and program for handling redirects in a search engine
US7191175B2 (en) 2004-02-13 2007-03-13 Attenex Corporation System and method for arranging concept clusters in thematic neighborhood relationships in a two-dimensional visual display space
US7584221B2 (en) * 2004-03-18 2009-09-01 Microsoft Corporation Field weighting in text searching
US7260573B1 (en) * 2004-05-17 2007-08-21 Google Inc. Personalizing anchor text scores in a search engine
US7461064B2 (en) 2004-09-24 2008-12-02 International Buiness Machines Corporation Method for searching documents for ranges of numeric values
US7606793B2 (en) * 2004-09-27 2009-10-20 Microsoft Corporation System and method for scoping searches using index keys
US7739277B2 (en) * 2004-09-30 2010-06-15 Microsoft Corporation System and method for incorporating anchor text into ranking search results
US7761448B2 (en) 2004-09-30 2010-07-20 Microsoft Corporation System and method for ranking search results using click distance
US7827181B2 (en) * 2004-09-30 2010-11-02 Microsoft Corporation Click distance determination
US7779001B2 (en) * 2004-10-29 2010-08-17 Microsoft Corporation Web page ranking with hierarchical considerations
US7716198B2 (en) 2004-12-21 2010-05-11 Microsoft Corporation Ranking search results using feature extraction
US7698331B2 (en) * 2005-01-18 2010-04-13 Yahoo! Inc. Matching and ranking of sponsored search listings incorporating web search technology and web content
US7356777B2 (en) 2005-01-26 2008-04-08 Attenex Corporation System and method for providing a dynamic user interface for a dense three-dimensional scene
US7404151B2 (en) 2005-01-26 2008-07-22 Attenex Corporation System and method for providing a dynamic user interface for a dense three-dimensional scene
US7921365B2 (en) 2005-02-15 2011-04-05 Microsoft Corporation System and method for browsing tabbed-heterogeneous windows
US9092523B2 (en) 2005-02-28 2015-07-28 Search Engine Technologies, Llc Methods of and systems for searching by incorporating user-entered information
US7792833B2 (en) * 2005-03-03 2010-09-07 Microsoft Corporation Ranking search results using language types
US20060200460A1 (en) * 2005-03-03 2006-09-07 Microsoft Corporation System and method for ranking search results using file types
US7680772B2 (en) * 2005-03-09 2010-03-16 Intuit Inc. Search quality detection
WO2006102122A2 (en) 2005-03-18 2006-09-28 Wink Technologies, Inc. Search engine that applies feedback from users to improve search results
US7546294B2 (en) * 2005-03-31 2009-06-09 Microsoft Corporation Automated relevance tuning
WO2006133252A2 (en) * 2005-06-08 2006-12-14 The Regents Of The University Of California Doubly ranked information retrieval and area search
US7627564B2 (en) * 2005-06-21 2009-12-01 Microsoft Corporation High scale adaptive search systems and methods
US8244722B1 (en) 2005-06-30 2012-08-14 Google Inc. Ranking documents
US20070005588A1 (en) * 2005-07-01 2007-01-04 Microsoft Corporation Determining relevance using queries as surrogate content
US8249344B2 (en) * 2005-07-01 2012-08-21 Microsoft Corporation Grammatical parsing of document visual structures
US8195654B1 (en) * 2005-07-13 2012-06-05 Google Inc. Prediction of human ratings or rankings of information retrieval quality
US8417693B2 (en) * 2005-07-14 2013-04-09 International Business Machines Corporation Enforcing native access control to indexed documents
US7599917B2 (en) * 2005-08-15 2009-10-06 Microsoft Corporation Ranking search results using biased click distance
CN101454776A (en) * 2005-10-04 2009-06-10 汤姆森环球资源公司 Systems, methods, and software for identifying relevant legal documents
US7630964B2 (en) * 2005-11-14 2009-12-08 Microsoft Corporation Determining relevance of documents to a query based on identifier distance
US20070150477A1 (en) * 2005-12-22 2007-06-28 International Business Machines Corporation Validating a uniform resource locator ('URL') in a document
US8250061B2 (en) * 2006-01-30 2012-08-21 Yahoo! Inc. Learning retrieval functions incorporating query differentiation for information retrieval
US8509563B2 (en) * 2006-02-02 2013-08-13 Microsoft Corporation Generation of documents from images
US8001121B2 (en) * 2006-02-27 2011-08-16 Microsoft Corporation Training a ranking function using propagated document relevance
US7451120B1 (en) 2006-03-20 2008-11-11 Google Inc. Detecting novel document content
US20070233679A1 (en) * 2006-04-03 2007-10-04 Microsoft Corporation Learning a document ranking function using query-level error measurements
US7647314B2 (en) * 2006-04-28 2010-01-12 Yahoo! Inc. System and method for indexing web content using click-through features
US7593934B2 (en) * 2006-07-28 2009-09-22 Microsoft Corporation Learning a document ranking using a loss function with a rank pair or a query parameter
US7647353B2 (en) * 2006-11-14 2010-01-12 Google Inc. Event searching
US8176055B1 (en) 2007-03-27 2012-05-08 Google Inc. Content entity management
US20080250008A1 (en) * 2007-04-04 2008-10-09 Microsoft Corporation Query Specialization
US8117137B2 (en) * 2007-04-19 2012-02-14 Microsoft Corporation Field-programmable gate array based accelerator system
US7853589B2 (en) * 2007-04-30 2010-12-14 Microsoft Corporation Web spam page classification using query-dependent data
US8073803B2 (en) * 2007-07-16 2011-12-06 Yahoo! Inc. Method for matching electronic advertisements to surrounding context based on their advertisement content
KR100899930B1 (en) * 2007-07-24 2009-05-28 엔에이치엔(주) System and Method for Generating Relating Data Class
US20090055436A1 (en) * 2007-08-20 2009-02-26 Olakunle Olaniyi Ayeni System and Method for Integrating on Demand/Pull and Push Flow of Goods-and-Services Meta-Data, Including Coupon and Advertising, with Mobile and Wireless Applications
US8645390B1 (en) 2007-08-31 2014-02-04 Google Inc. Reordering search query results in accordance with search context specific predicted performance functions
US7895198B2 (en) * 2007-09-28 2011-02-22 Yahoo! Inc. Gradient based optimization of a ranking measure
US7840569B2 (en) 2007-10-18 2010-11-23 Microsoft Corporation Enterprise relevancy ranking using a neural network
US20090106221A1 (en) * 2007-10-18 2009-04-23 Microsoft Corporation Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features
US9348912B2 (en) * 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US8375073B1 (en) 2007-11-12 2013-02-12 Google Inc. Identification and ranking of news stories of interest
US8005774B2 (en) * 2007-11-28 2011-08-23 Yahoo! Inc. Determining a relevance function based on a query error derived using a structured output learning technique
US8099417B2 (en) * 2007-12-12 2012-01-17 Microsoft Corporation Semi-supervised part-of-speech tagging
US8775416B2 (en) * 2008-01-09 2014-07-08 Yahoo!Inc. Adapting a context-independent relevance function for identifying relevant search results
US7984004B2 (en) * 2008-01-17 2011-07-19 Microsoft Corporation Query suggestion generation
US7996379B1 (en) 2008-02-01 2011-08-09 Google Inc. Document ranking using word relationships
US8650144B2 (en) * 2008-02-14 2014-02-11 Yahoo! Inc. Apparatus and methods for lossless compression of numerical attributes in rule based systems
US8812493B2 (en) * 2008-04-11 2014-08-19 Microsoft Corporation Search results ranking using editing distance and document information
KR100953488B1 (en) * 2008-04-16 2010-04-19 엔에이치엔(주) Method and system for generating rank learning model using error minimization
US8126839B2 (en) * 2008-06-19 2012-02-28 Yahoo! Inc. Methods and apparatuses for adapting a ranking function of a search engine for use with a specific domain
US8171021B2 (en) 2008-06-23 2012-05-01 Google Inc. Query identification and association
US8621424B2 (en) * 2008-06-30 2013-12-31 Yahoo! Inc. Compiler based code modification for use in document ranking
US8458170B2 (en) 2008-06-30 2013-06-04 Yahoo! Inc. Prefetching data for document ranking
US8131659B2 (en) * 2008-09-25 2012-03-06 Microsoft Corporation Field-programmable gate array based accelerator system
US8301638B2 (en) * 2008-09-25 2012-10-30 Microsoft Corporation Automated feature selection based on rankboost for ranking
US8073727B2 (en) * 2008-10-23 2011-12-06 Sap Ag System and method for hierarchical weighting of model parameters
US8671093B2 (en) * 2008-11-18 2014-03-11 Yahoo! Inc. Click model for search rankings
CN101477542B (en) * 2009-01-22 2013-02-13 阿里巴巴集团控股有限公司 Sampling analysis method, system and equipment
US20100268709A1 (en) * 2009-04-21 2010-10-21 Yahoo! Inc., A Delaware Corporation System, method, or apparatus for calibrating a relevance score
US20100293175A1 (en) * 2009-05-12 2010-11-18 Srinivas Vadrevu Feature normalization and adaptation to build a universal ranking function
US20100332550A1 (en) * 2009-06-26 2010-12-30 Microsoft Corporation Platform For Configurable Logging Instrumentation
US20100332531A1 (en) * 2009-06-26 2010-12-30 Microsoft Corporation Batched Transfer of Arbitrarily Distributed Data
US8515957B2 (en) 2009-07-28 2013-08-20 Fti Consulting, Inc. System and method for displaying relationships between electronically stored information to provide classification suggestions via injection
US8082247B2 (en) * 2009-07-30 2011-12-20 Microsoft Corporation Best-bet recommendations
US20110029516A1 (en) * 2009-07-30 2011-02-03 Microsoft Corporation Web-Used Pattern Insight Platform
EP2471009A1 (en) 2009-08-24 2012-07-04 FTI Technology LLC Generating a reference set for use during document review
US20110264609A1 (en) * 2010-04-22 2011-10-27 Microsoft Corporation Probabilistic gradient boosted machines
US8572496B2 (en) * 2010-04-27 2013-10-29 Go Daddy Operating Company, LLC Embedding variable fields in individual email messages sent via a web-based graphical user interface
US8738635B2 (en) 2010-06-01 2014-05-27 Microsoft Corporation Detection of junk in search result ranking
US8375061B2 (en) * 2010-06-08 2013-02-12 International Business Machines Corporation Graphical models for representing text documents for computer analysis
US9002773B2 (en) 2010-09-24 2015-04-07 International Business Machines Corporation Decision-support application and system for problem solving using a question-answering system
US9208231B1 (en) * 2010-12-01 2015-12-08 Google Inc. Identifying languages relevant to resources
WO2012121729A1 (en) * 2011-03-10 2012-09-13 Textwise Llc Method and system for information modeling and applications thereof
US8666914B1 (en) * 2011-05-23 2014-03-04 A9.Com, Inc. Ranking non-product documents
US9477756B1 (en) * 2012-01-16 2016-10-25 Amazon Technologies, Inc. Classifying structured documents
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US8972328B2 (en) 2012-06-19 2015-03-03 Microsoft Corporation Determining document classification probabilistically through classification rule analysis
US9213770B1 (en) * 2012-08-14 2015-12-15 Amazon Technologies, Inc. De-biased estimated duplication rate
US9122681B2 (en) 2013-03-15 2015-09-01 Gordon Villy Cormack Systems and methods for classifying electronic information using advanced active learning techniques
US9996624B2 (en) * 2014-06-30 2018-06-12 Google Llc Surfacing in-depth articles in search results
US9565147B2 (en) 2014-06-30 2017-02-07 Go Daddy Operating Company, LLC System and methods for multiple email services having a common domain
US10764265B2 (en) * 2014-09-24 2020-09-01 Ent. Services Development Corporation Lp Assigning a document to partial membership in communities
US10621189B2 (en) 2015-06-05 2020-04-14 Apple Inc. In-application history search
US10509833B2 (en) 2015-06-05 2019-12-17 Apple Inc. Proximity search scoring
US10592572B2 (en) 2015-06-05 2020-03-17 Apple Inc. Application view index and search
US10755032B2 (en) 2015-06-05 2020-08-25 Apple Inc. Indexing web pages with deep links
US10509834B2 (en) 2015-06-05 2019-12-17 Apple Inc. Federated search results scoring
US10242001B2 (en) 2015-06-19 2019-03-26 Gordon V. Cormack Systems and methods for conducting and terminating a technology-assisted review
RU2632133C2 (en) * 2015-09-29 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Method (versions) and system (versions) for creating prediction model and determining prediction model accuracy
EP3188038B1 (en) * 2015-12-31 2020-11-04 Dassault Systèmes Evaluation of a training set
US10372714B2 (en) 2016-02-05 2019-08-06 International Business Machines Corporation Automated determination of document utility for a document corpus
US11068546B2 (en) 2016-06-02 2021-07-20 Nuix North America Inc. Computer-implemented system and method for analyzing clusters of coded documents
US10755182B2 (en) * 2016-08-11 2020-08-25 International Business Machines Corporation System and method for ground truth evaluation
US10621492B2 (en) * 2016-10-21 2020-04-14 International Business Machines Corporation Multiple record linkage algorithm selector
AU2018200643A1 (en) * 2017-03-09 2018-09-27 Accenture Global Solutions Limited Smart advisory for distributed and composite testing teams based on production data and analytics
CN108572900B (en) * 2017-03-09 2021-07-13 北京京东尚科信息技术有限公司 Blank pit position monitoring method, system, electronic equipment and storage medium
US10372426B2 (en) * 2017-11-06 2019-08-06 International Business Machines Corporation Cognitive redundant coding corpus determination system
US11409749B2 (en) * 2017-11-09 2022-08-09 Microsoft Technology Licensing, Llc Machine reading comprehension system for answering queries related to a document
RU2693324C2 (en) 2017-11-24 2019-07-02 Общество С Ограниченной Ответственностью "Яндекс" Method and a server for converting a categorical factor value into its numerical representation
US11341138B2 (en) * 2017-12-06 2022-05-24 International Business Machines Corporation Method and system for query performance prediction
US10831770B2 (en) * 2017-12-12 2020-11-10 International Business Machines Corporation System and method for estimating query performance in document retrieval
US10915538B2 (en) * 2018-03-23 2021-02-09 Home Depot Product Authority, Llc Ranking and presenting search engine results based on category-specific ranking models
US11093512B2 (en) * 2018-04-30 2021-08-17 International Business Machines Corporation Automated selection of search ranker
RU2721159C1 (en) 2018-12-13 2020-05-18 Общество С Ограниченной Ответственностью "Яндекс" Method and server for generating meta-attribute for ranging documents
RU2744028C2 (en) * 2018-12-26 2021-03-02 Общество С Ограниченной Ответственностью "Яндекс" Method and system for storing multiple documents
US11403300B2 (en) 2019-02-15 2022-08-02 Wipro Limited Method and system for improving relevancy and ranking of search result
US11429897B1 (en) 2019-04-26 2022-08-30 Bank Of America Corporation Identifying relationships between sentences using machine learning
US11783005B2 (en) 2019-04-26 2023-10-10 Bank Of America Corporation Classifying and mapping sentences using machine learning
US11205506B2 (en) 2019-05-22 2021-12-21 International Business Machines Corporation Verifying natural language processing in health care
CN110598272B (en) * 2019-08-22 2022-11-22 合肥工业大学 Heuristic generation method and device for multi-unmanned platform information interaction topology
US11423231B2 (en) 2019-08-27 2022-08-23 Bank Of America Corporation Removing outliers from training data for machine learning
US11449559B2 (en) 2019-08-27 2022-09-20 Bank Of America Corporation Identifying similar sentences for machine learning
US11526804B2 (en) 2019-08-27 2022-12-13 Bank Of America Corporation Machine learning model training for reviewing documents
US11556711B2 (en) 2019-08-27 2023-01-17 Bank Of America Corporation Analyzing documents using machine learning
US11436235B2 (en) 2019-09-23 2022-09-06 Ntent Pipeline for document scoring
CN112231621B (en) * 2020-10-13 2021-09-24 电子科技大学 Method for reducing element detection limit based on BP-adaboost
US11893981B1 (en) 2023-07-11 2024-02-06 Seekr Technologies Inc. Search system and method having civility score

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696962A (en) * 1993-06-24 1997-12-09 Xerox Corporation Method for computerized information retrieval using shallow linguistic analysis
US6026388A (en) * 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US20020114394A1 (en) * 2000-12-06 2002-08-22 Kai-Kuang Ma System and method for motion vector generation and analysis of digital video clips

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6119114A (en) * 1996-09-17 2000-09-12 Smadja; Frank Method and apparatus for dynamic relevance ranking
US5909510A (en) * 1997-05-19 1999-06-01 Xerox Corporation Method and apparatus for document classification from degraded images
US6651057B1 (en) * 1999-09-03 2003-11-18 Bbnt Solutions Llc Method and apparatus for score normalization for information retrieval applications
US6430559B1 (en) * 1999-11-02 2002-08-06 Claritech Corporation Method and apparatus for profile score threshold setting and updating
US20030074353A1 (en) * 1999-12-20 2003-04-17 Berkan Riza C. Answer retrieval technique
US7062485B1 (en) * 2000-09-01 2006-06-13 Huaichuan Hubert Jin Method and apparatus for score normalization for information retrieval applications
US6701317B1 (en) * 2000-09-19 2004-03-02 Overture Services, Inc. Web page connectivity server construction
US7010527B2 (en) * 2001-08-13 2006-03-07 Oracle International Corp. Linguistically aware link analysis method and system
US7158983B2 (en) * 2002-09-23 2007-01-02 Battelle Memorial Institute Text analysis technique
US7917483B2 (en) * 2003-04-24 2011-03-29 Affini, Inc. Search engine and method with improved relevancy, scope, and timeliness

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696962A (en) * 1993-06-24 1997-12-09 Xerox Corporation Method for computerized information retrieval using shallow linguistic analysis
US6026388A (en) * 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US20020114394A1 (en) * 2000-12-06 2002-08-22 Kai-Kuang Ma System and method for motion vector generation and analysis of digital video clips

Also Published As

Publication number Publication date
WO2004097568A2 (en) 2004-11-11
KR20060006945A (en) 2006-01-20
EP1623298A2 (en) 2006-02-08
US7197497B2 (en) 2007-03-27
US20040215606A1 (en) 2004-10-28
JP2006524869A (en) 2006-11-02
CN1826597A (en) 2006-08-30

Similar Documents

Publication Publication Date Title
WO2004097568A3 (en) Method and apparatus for machine learning a document relevance function
Voorhees The TREC question answering track
Light et al. Accumulating evidence: Procedures for resolving contradictions among different research studies
WO2002069094A3 (en) Human capital management performance capability matching system and methods
WO2010074887A3 (en) Interactively ranking image search results using color layout relevance
CN108198604A (en) A kind of nutrition dietary based on personal characteristics recommends method
EP1574972A3 (en) Machine-learned approach to determining document relevance for search over large electronic collections of documents
CN104361102A (en) Expert recommendation method and system based on group matching
Harwood et al. Imagery use in elite youth sport participants: Reinforcing the applied significance of achievement goal theory
CN106960003A (en) Plagiarize the query generation method of the retrieval of the source based on machine learning in detection
CN116501843A (en) Efficient network retrieval enhancement answer method and system for human preference
Radlinski et al. Detecting duplicate web documents using clickthrough data
CN112927782B (en) Heart health state early warning system based on text emotion analysis
KHALID CLUSTER ANALYSIS-A STANDARD SETTING TECHNIQUE IN MEASUREMENT AND TESTING.
CN106422258A (en) Shadowboxing footwork learning and scoring method
Song et al. ECNU at 2015 eHealth Task 2: User-centred Health Information Retrieval.
Chen et al. Transrank: A novel algorithm for transfer of rank learning
Alstot et al. Effects of interventions based in behavior analysis on motor skill acquisition: A meta-analysis
Ibrahim Evolutionary algorithms and machine learning techniques for information retrieval
Rivera et al. Classifying the physical activity indicator using machine learning and direct measurements: a feasibility study
Shin et al. Improving information retrieval in MEDLINE by modulating MeSH term weights
Sato Predicting Triple Scoring with Crowdsourcing-specific Features-The fiddlehead Triple Scorer at WSDM Cup 2017
Stanescu et al. Automatic assessment of narrative answers using information retrieval techniques
Bentzen et al. Wealth distribution and mobility in Denmark: a longitudinal study
Amati et al. Merging XML indices

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2006513331

Country of ref document: JP

Ref document number: 1020057020306

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2004750656

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 20048174686

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 1020057020306

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2004750656

Country of ref document: EP