WO2007149623A3 - Full text query and search systems and method of use - Google Patents

Full text query and search systems and method of use Download PDF

Info

Publication number
WO2007149623A3
WO2007149623A3 PCT/US2007/067439 US2007067439W WO2007149623A3 WO 2007149623 A3 WO2007149623 A3 WO 2007149623A3 US 2007067439 W US2007067439 W US 2007067439W WO 2007149623 A3 WO2007149623 A3 WO 2007149623A3
Authority
WO
WIPO (PCT)
Prior art keywords
information
measure
itoms
hits
shared
Prior art date
Application number
PCT/US2007/067439
Other languages
French (fr)
Other versions
WO2007149623A2 (en
Inventor
Yuanhua Tom Tang
Qianjin Hu
Yonghong Grace Yang
Chunnuan Chen
Minghua Mei
Original Assignee
Infovell Inc
Yuanhua Tom Tang
Qianjin Hu
Yonghong Grace Yang
Chunnuan Chen
Minghua Mei
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Infovell Inc, Yuanhua Tom Tang, Qianjin Hu, Yonghong Grace Yang, Chunnuan Chen, Minghua Mei filed Critical Infovell Inc
Priority to EP07761298A priority Critical patent/EP2013788A4/en
Publication of WO2007149623A2 publication Critical patent/WO2007149623A2/en
Publication of WO2007149623A3 publication Critical patent/WO2007149623A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution

Abstract

Roughly described, a database searching method for searching a database, in which hits are ranked in dependence upon an information measure of itoms shared by both the hit and the query. The information measure can be a Shannon information score, or another measure which indicates the information value of the shared itoms. An itom can be a word or other token, or a multi-word phrase, and can overlap with each other. Synonyms can be substituted for itoms in the query, with the information measure of substituted itoms being derated in accordance with a predetermined measure of the synonyms' similarity. Indirect searching methods are described in which hit from other search engines are re-ranked in dependence upon the information measures of shared itoms. Structured and completely unstructured databases may be searched, with hits being demarcated dynamically. Hits may be clustered based upon distances in an information- measure- weighted distance space.
PCT/US2007/067439 2006-04-25 2007-04-25 Full text query and search systems and method of use WO2007149623A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07761298A EP2013788A4 (en) 2006-04-25 2007-04-25 Full text query and search systems and method of use

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US74560506P 2006-04-25 2006-04-25
US74560406P 2006-04-25 2006-04-25
US60/745,604 2006-04-25
US60/745,605 2006-04-25

Publications (2)

Publication Number Publication Date
WO2007149623A2 WO2007149623A2 (en) 2007-12-27
WO2007149623A3 true WO2007149623A3 (en) 2009-02-12

Family

ID=38834185

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/067439 WO2007149623A2 (en) 2006-04-25 2007-04-25 Full text query and search systems and method of use

Country Status (2)

Country Link
EP (1) EP2013788A4 (en)
WO (1) WO2007149623A2 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9348912B2 (en) 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US8364679B2 (en) 2009-09-17 2013-01-29 Cpa Global Patent Research Limited Method, system, and apparatus for delivering query results from an electronic document collection
TWI486797B (en) * 2010-03-09 2015-06-01 Alibaba Group Holding Ltd Methods and devices for sorting search results
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US10692015B2 (en) * 2016-07-15 2020-06-23 Io-Tahoe Llc Primary key-foreign key relationship determination through machine learning
CN106789895B (en) * 2016-11-18 2020-03-27 东软集团股份有限公司 Compressed text detection method and device
US11604841B2 (en) 2017-12-20 2023-03-14 International Business Machines Corporation Mechanistic mathematical model search engine
US10394555B1 (en) 2018-12-17 2019-08-27 Bakhtgerey Sinchev Computing network architecture for reducing a computing operation time and memory usage associated with determining, from a set of data elements, a subset of at least two data elements, associated with a target computing operation result
CN110413734B (en) * 2019-07-25 2023-02-17 万达信息股份有限公司 Intelligent search system and method for medical service
CN111079036B (en) * 2019-11-25 2023-11-07 罗靖涛 Field type searching method
CN111222040B (en) * 2019-12-30 2023-06-13 航天信息股份有限公司企业服务分公司 Scheme self-matching processing method and system based on training requirements
US11900272B2 (en) 2020-05-13 2024-02-13 Factset Research System Inc. Method and system for mapping labels in standardized tables using machine learning
CN113327572B (en) * 2021-06-02 2024-02-09 清华大学深圳国际研究生院 Controllable emotion voice synthesis method and system based on emotion type label
US11546142B1 (en) 2021-12-22 2023-01-03 Bakhtgerey Sinchev Cryptography key generation method for encryption and decryption
CN116595973B (en) * 2023-05-19 2023-10-03 广东职教桥数据科技有限公司 Post function identification method based on natural language processing classification technology

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5761497A (en) * 1993-11-22 1998-06-02 Reed Elsevier, Inc. Associative text search and retrieval system that calculates ranking scores and window scores
US5812998A (en) * 1993-09-30 1998-09-22 Omron Corporation Similarity searching of sub-structured databases
US20020111941A1 (en) * 2000-12-19 2002-08-15 Xerox Corporation Apparatus and method for information retrieval
US6633817B1 (en) * 1999-12-29 2003-10-14 Incyte Genomics, Inc. Sequence database search with sequence search trees
US20040024583A1 (en) * 2000-03-20 2004-02-05 Freeman Robert J Natural-language processing system using a large corpus
US20060026147A1 (en) * 2004-07-30 2006-02-02 Cone Julian M Adaptive search engine

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1825395A4 (en) * 2004-10-25 2010-07-07 Yuanhua Tang Full text query and search systems and methods of use

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5812998A (en) * 1993-09-30 1998-09-22 Omron Corporation Similarity searching of sub-structured databases
US5761497A (en) * 1993-11-22 1998-06-02 Reed Elsevier, Inc. Associative text search and retrieval system that calculates ranking scores and window scores
US6633817B1 (en) * 1999-12-29 2003-10-14 Incyte Genomics, Inc. Sequence database search with sequence search trees
US20040024583A1 (en) * 2000-03-20 2004-02-05 Freeman Robert J Natural-language processing system using a large corpus
US20020111941A1 (en) * 2000-12-19 2002-08-15 Xerox Corporation Apparatus and method for information retrieval
US20060026147A1 (en) * 2004-07-30 2006-02-02 Cone Julian M Adaptive search engine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2013788A4 *

Also Published As

Publication number Publication date
EP2013788A2 (en) 2009-01-14
WO2007149623A2 (en) 2007-12-27
EP2013788A4 (en) 2012-04-25

Similar Documents

Publication Publication Date Title
WO2007149623A3 (en) Full text query and search systems and method of use
WO2006047654A3 (en) Full text query and search systems and methods of use
WO2005010691A3 (en) Disambiguation of search phrases using interpretation clusters
WO2008009017A3 (en) Method and system for qualifying keywords in query strings
US8122043B2 (en) System and method for using an exemplar document to retrieve relevant documents from an inverted index of a large corpus
NZ578672A (en) Information-retrieval systems, methods, and software with concept-based searching and ranking
WO2005017682A3 (en) Product placement engine and method
WO2006118814A3 (en) Method for finding semantically related search engine queries
WO2005032235A3 (en) Increasing a number of relevant advertisements using a relaxed match
WO2008073502A3 (en) Viewport-relative scoring for location search queries
WO2007130716A3 (en) Methods and apparatus for computerized searching
BRPI0501320A (en) Suggested Related Terms for a Multisense Query
WO2007101194A3 (en) System and method for identifying related queries for languages with multiple writing systems
WO2008051750A3 (en) Associating geographic-related information with objects
WO2007016232A3 (en) Processor for fast phrase searching
WO2007095599A3 (en) Survey based qualification of keyword searches
WO2006009635A3 (en) Apparatus, method and sytem of artificial intelligence for data searching applications
WO2008027503A3 (en) Semantic search engine
WO2002089004A3 (en) Search data management
Crimp et al. Refining query expansion terms using query context
CA et al. Thesaurus-based retrieval of case law
Lee et al. SiteQ/J: A Question Answering System for Japanese.
Mejova et al. TREC Blog and TREC Chem: A View from the Corn Fields.
Dalton et al. UMass CIIR at TAC KBP 2013 Entity Linking: Query Expansion using Urban Dictionary.
Selvi et al. An approach to improve precision and recall for ad-hoc information retrieval using sbir algorithm

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780023220.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07761298

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2007761298

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE