CN102081668A - Information retrieval optimizing method based on domain ontology - Google Patents

Information retrieval optimizing method based on domain ontology Download PDF

Info

Publication number
CN102081668A
CN102081668A CN 201110025219 CN201110025219A CN102081668A CN 102081668 A CN102081668 A CN 102081668A CN 201110025219 CN201110025219 CN 201110025219 CN 201110025219 A CN201110025219 A CN 201110025219A CN 102081668 A CN102081668 A CN 102081668A
Authority
CN
China
Prior art keywords
notion
inquiry
abstract
formula
query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 201110025219
Other languages
Chinese (zh)
Other versions
CN102081668B (en
Inventor
熊晶
王爱民
徐建良
王继鹏
张长青
郭涛
梁燕军
孙华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN2011100252193A priority Critical patent/CN102081668B/en
Publication of CN102081668A publication Critical patent/CN102081668A/en
Application granted granted Critical
Publication of CN102081668B publication Critical patent/CN102081668B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides an information retrieval optimizing method based on domain ontology, comprising the steps of: obtaining query key words submitted by users via a retrieval interface of a retrieval system; performing lexeme expansion for the query key words submitted by users via domain ontology inference in a desired domain of users and according to the built domain ontology, so as to obtain one or more sets of new query strings; submitting the expanded query strings to one or more search engines for retrieval; performing repetition removal and re-sequencing integration for the results fed back by the various search engines; displaying the final result to the user via the retrieval interface. In the invention, the efficiency of the information retrieval relevant to the domain is improved by the lexeme advantage of the domain ontology.

Description

Information retrieval optimization method based on domain body
Technical field
The present invention relates to a kind of network technology, be based on the information retrieval method of search engine specifically.
Background technology
People are to use research tool from the main means that network obtains information, as Google, Baidu, Yahoo etc.The principle of work of search engine comprises three processes substantially: (1) gathers information from the internet, by regularly the information of all website and webpage on the internet being grasped with Web Spider.(2) organize your messages and set up index data base and analyze collecting the webpage of returning by analyzing the directory system program, extract keyword that related web page place website links, type of coding, content of pages comprise, keyword position, rise time, size, with the information such as linking relationship of other webpage, calculate according to certain degree of correlation algorithm, obtain each webpage at the degree of correlation (or importance) that reaches each keyword in the super chain in the content of pages, set up the web page index database with these relevant informations then.(3) in index data base searching order, accept inquiry when the user after keyword search is imported at the interface of search engine, from the web page index database, find all related web pages that meet this keyword by the search system program, according to ready-made degree of correlation numerical ordering, the degree of correlation is high more, and rank is forward more.At last,, organize and return to the user contents such as the chained address of Search Results, content of pages summaries by page generation system.
Present search engine is based on the search engine of keyword matching mostly.Yet these search engines seldom have the semantic reasoning ability.Though Google has adopted some natural language processing techniques, for example, the synonym expansion, but it can not resolve the semantic relation between the notion, caused the reduction of precision ratio so to a certain extent, made that the inquiry return results is not a user institute satisfactory information.On the other hand, user's inquiry depends on certain professional domain to a great extent, as marine field.For example; suppose that the user wants to search for the information of marine field relevant " DIP (Dissolved inorganic phosphorus dissolves Phos) "; its Query Result as shown in Figure 4; usually can obtain a large amount of other fields " DIP " information; as " the Dual Inline Package " of microelectronic, i.e. dual-in-line package technology.Because these are and the incoherent garbage of user's purpose that the user obviously is unsatisfied to such result.
" body (Ontology) " conduct " the clear and definite formalization normalized illustration of shared ideas model " is by taking out the model that the related notion of some phenomenons obtains in the objective world, and the implication of conceptual model performance is independent of concrete ambient condition.What body embodied is the knowledge of common approval, reflection be the concept set of generally acknowledging in the association area, so body provides common understanding and description to domain knowledge, can be used to better share, exchange and reuse.Constitute body notion and between relation through explication, use body can eliminate phenomenons such as polysemy, many speech one justice and the meaning of a word be ambiguous, thereby finish domain knowledge clear, definite, complete definition and description.The target of body research is to obtain a Knowledge Representation Method, makes machine to share and process information as the mankind.At present, ontology is widely used in fields such as the representation of knowledge, information retrieval.
Summary of the invention
In order to overcome the existing deficiency of search engine on semantic retrieval, the invention provides a kind of information retrieval optimization method based on domain body.
Technical scheme of the present invention is: a kind of information retrieval optimization method based on domain body, and its step is as follows:
(1), obtains the key word of the inquiry that the user submits to by the search interface of searching system;
(2) in the field of user expectation, according to the domain body of having set up, the key word of the inquiry that the user is submitted to carries out semantic extension by ontology inference, obtains one or more groups new inquiry string;
(3) inquiry string after will expanding is submitted to one or more search engines and is retrieved;
(4) return results to each search engine goes heavily, sorts and integrate;
(5) net result is shown to the user by search interface.
In the above-mentioned steps (2) based on the semantic extension mode of domain body comprise in the following mode a kind of, two kinds or all:
1. based on the optimization method of is-a relation
Is-a relation (inheritance) has shown the classification of notion, and promptly the example of father's notion equals the summation of sub-notion example.Therefore added some constraints in that son is conceptive, sub-notion is also referred to as the particularization of father's notion.The probability that a notion father notion direct with it or sub-notion occur in same document is higher.Therefore, when search during, can utilize the father's notion P of A or sub-notion C to improve the precision ratio of search as constraint about the document of certain notion A.So the inquiry that a notion can be optimized to notion itself and its father's notion or sub-notion is right.
2. based on the optimization method of part-of relation
Part-of represents whole-part relations, is used for describing the mutual relationship between a notion and its part notion.The ingredient of a notion also therewith the field under the notion be closely related.Therefore, also be associated usually with part notion document matching with its global concept.So the inquiry that a notion can be optimized to notion itself and part notion thereof is right.
3. based on the optimization method of equivalent-class relation
Equivalent-class (equivalence class) relation is used for the synonym phenomenon of process field knowledge.Utilize the equivalent-class relation, the notion in the user inquiring can be mapped to the synonym of equal value with it.Like this, can improve the precision ratio of information retrieval.And, the common householder method of equivalent-class relation as preceding two kinds of optimization methods.
Between the internal notion of described inquiry be " with " or " or " logical relation, " with " can improve the inquiry accuracy rate, " or " can improve recall ratio.
In the above-mentioned steps (4), to the return results of each search engine go heavily, ordering integrates, the algorithm that can adopt is as follows:
(1) URL to Search Results handles, and intercepting " # " URL character string before is as final chained address; If there is MD5 (URL A)=MD5 (URL B), then think URL AAnd URL BCorresponding page is a duplicate pages, goes heavily;
(2) sort algorithm is considered two aspects:
1. the semantic distance Dist (C of each notion in the inquiry string i, C j), C wherein iWith C jBe two notions in the inquiry string,
Dist ( C i , C j ) = Σ k = 1 n ω e k + N C i + N C j N C i + N C j + 2 × N LCA × ϵ Formula 1
In the formula 1,
Figure BSA00000424876200042
Link node C in the expression body tree i, C jShortest path in the Weighted distance sum on each limit;
Figure BSA00000424876200043
With Represent node C respectively iAnd C jWeighted distance to minimum common ancestor's node; N LCARepresent the Weighted distance of minimum common ancestor to root node; ε is a constant, determines according to weighting coefficient.
The semantic weight of different relations is with reference to table 1 between the notion.
Table 1 semantic distance weight table
Figure BSA00000424876200051
In the table 1,
Figure BSA00000424876200052
The expression blank operation, single operation is represented in its combination with row; E represents the equivalent-class relation; G represents the is-a relation, and direction is pointed to father's notion by sub-notion; S represents the is-a relation, and direction is pointed to sub-notion by father's notion; P represents the part-of relation.
Because the semantic distance of notion semantic similarity and notion is inverse function each other, when semantic distance was 0, semantic similarity was 1.Therefore can be with C i, C jSimilarity between the two is reduced to:
Sim ( C i , C j ) = 1 Dist ( C i , C j ) + 1 Formula 2
2. inquiry string and Search Results the record degree of correlation Rank (Query, Abstract).
Rank ( Query , Abstract ) = Σ i = 1 n Rank ( C i , Abstract ) Formula 3
In the formula 3, Rank (C i, be the degree of correlation between each notion and the Search Results summary Abstract among the inquiry string Query Abstract), n is the number of notion among the Query.
Rank ( C i , Abstract ) = m × Σ j = 1 m ln len ( Abstract ) Index ( C i , j , Abstract ) Formula 4
In the formula 4, m=Time (C i, Abstract) be notion C iThe number of times that in summary Abstract, occurs; The length of len (Abstract) expression summary Abstract; Index (C i, j Abstract) is notion C iThe position that the j time occurs in summary Abstract.
3. to original query key word K iAnd the inquiry string Query of expansion, obtain K respectively iSemantic similarity with each notion among the Query
Figure BSA00000424876200061
Then can calculate the matching degree R of result for retrieval.
R=α Sim (K i, C j(Query, Abstract) formula 5 for)+β Rank
In the formula 5, α and β are constant, represent the semantic relevancy of etendue critical word and the weight of the summary degree of correlation thereof respectively.α ∈ (0,1) wherein, β ∈ (0,1), and alpha+beta=1.
4. the order of successively decreasing according to R numerical value is finished the ordering of result for retrieval.
The present invention is recall ratio and the precision ratio that utilizes the relevant information retrieval in the semantic advantage raising field of body.On the basis of the method, user's key word of the inquiry can be utilized domain body carry out semantic extension, obtain one or more groups new query string, then it is submitted to the Web search engine, and Search Results sorted and put in order, finally be shown to the user.Because these new query strings have been considered the relation between the field concept, as hypernym, hyponym, synonym etc., can improve the recall ratio of retrieval; Simultaneously because that body is the field is relevant, make result for retrieval be limited under within the scope in field, can screen out a large amount of and information field independence, thereby improve the precision ratio of retrieval.
Description of drawings
Fig. 1 is a marine ecology domain body fragment;
Fig. 2 is the optimization information searching system OASIS workflow diagram that the present invention is based on domain body;
Fig. 3 is the search interface of OASIS of the present invention;
Fig. 4 is the Search Results homepage sectional drawing that retrieval " DIP " obtains in Google;
Fig. 5 is to be the summary degree of correlation of example calculating with " InorganicNutrient+DIP ";
Fig. 6 is the Search Results sectional drawing that retrieval " DIP " obtains in OASIS of the present invention.
Embodiment
Below by a marine ecology field specific embodiment the present invention is described in further detail.
The present invention proposes a kind of information retrieval optimization method based on domain body, is example with the marine ecology field, in conjunction with the accompanying drawings, specifically describes as follows.
The workflow diagram of committed step of the present invention is an example with the marine ecology field as shown in Figure 2, and when submit queries " DIP ", concrete implementation step is:
1. server is set up a marine ecology body (Ontology), and with the storage of ocean.ont form, its body fragment as shown in Figure 1;
2. pass through search interface shown in Figure 3 at user side, submit to key word of the inquiry " DIP " to inquire about (Portal);
3. server obtains the key word of the inquiry that the user submits to, utilize HozoAPI to carry out semantic reasoning to the ocean.ont body and realize optimizing (Query Optimizer), at notion " DIP ", can get access to relative notion has: based on the notion InorganicNutrient of is-a relation, based on notion Phytoplankton, the Seawater of part-of relation.Obtain three groups of new inquiry strings " InorganicNutrient+DIP ", " DIP+Phytoplankton " and " DIP+Seawater " by the relation between these notions and the notion;
4. these three groups of character strings are sent to Web search engine (Web SearchEngine) respectively, (World Wide Web) obtains three groups of retrieval sets from WWW, get preceding 30 records of each result for retrieval, obtain result set Result_1 respectively, Result_2 and Result_3;
5. server is with Result_1, and Result_2 and Result_3 merge, and resequences after finishing retry, obtains net result collection Result.Main algorithm is as follows:
(1) URL to Search Results handles, and intercepting " # " URL character string before is as final chained address.If there is MD5 (URL A)=MD5 (URL B), then think URL AAnd URL BCorresponding page is a duplicate pages.
(2) sort algorithm is considered two aspects:
1. the semantic distance Dist (C of each notion in the inquiry string i, C j), C wherein iWith C jBe two notions in the inquiry string.
Utilize formula 1:
Figure BSA00000424876200081
Calculate C iWith C jSemantic distance, and by formula 2: Calculate C iWith C jSemantic similarity.
2. utilize formula 3:
Figure BSA00000424876200083
Calculate the degree of correlation of inquiry string and Search Results record.
3. to original query key word K iAnd the inquiry string Query of expansion, obtain K respectively iSemantic similarity with each notion among the Query
Figure BSA00000424876200084
And utilize formula 5:R=α Sim (K i, C j(Query Abstract) calculates matching degree to)+β Rank, finishes the ordering of result for retrieval by its result's the order of successively decreasing.
Be that the example explanation describes now with inquiry string " InorganicNutrient+DIP ".Two notions are respectively with C INAnd C DIPExpression.
By Fig. 1 associative list 1 as can be known
Figure BSA00000424876200091
Figure BSA00000424876200092
Figure BSA00000424876200093
N LCA=2, get ε=1.Then calculate by formula 1
Calculate by formula 2 Sim ( C IN , C DIP ) = 1 Dist ( C IN , C DIP ) + 1 = 0.27
(Query, correlation parameter Abstract) as shown in Figure 5 to calculate Rank.
Utilize formula 5, get α=0.6, β=0.4:
R URL 1 = 0.6 × 0.27 + 0.4 × 4.192 = 1.839
R URL 2 = 0.6 × 0.27 + 0.4 × 1.253 = 0.663
Therefore
Figure BSA00000424876200098
Come The prostatitis.
6. Result is shown to the user by search interface.As shown in Figure 6.
Said process is the specialty retrieval optimization method that is defaulted as marine ecology domain-specific searching system OASIS and interface 3 with.Also can adopt this professional searching system for other field, but will adopt the association area body.Certainly for comprehensive search engine, then can on search interface, increase field keyword column by user's input, to determine the field of user expectation retrieval according to the field keyword of user's input, for the user strange situation is divided in the field, can the preliminary election association area on the search interface of search engine select during by user search, to determine domain body and to carry out the meaning of a word expansion of association area.For not selecting or do not import the field keyword, then adopt all spectra body when determining domain body.

Claims (8)

1. information retrieval optimization method based on domain body, its step is as follows:
(1), obtains the key word of the inquiry that the user submits to by the search interface of searching system;
(2) in the field of user expectation, according to the domain body of having set up, the key word of the inquiry that the user is submitted to carries out semantic extension by the domain body reasoning, obtains one or more groups new inquiry string;
(3) inquiry string after will expanding is submitted to one or more search engines and is retrieved;
(4) return results to each search engine goes heavy and the ordering integration;
(5) net result is shown to the user by search interface.
2. the method for claim 1 is characterized in that describedly carrying out semantic extension by ontology inference, be adopt in the following method one or both or all:
1. based on is-a optimized relation method
Father's notion P or the sub-notion C of the notion A that obtains based on described key word of the inquiry, the inquiry that is optimized to notion A itself and its father's notion P is right, or the inquiry of notion A itself and its sub-notion C is right;
2. based on the optimization method of part-of relation
The inquiry that will be optimized to this notion itself and its part notion formation based on the notion that key word of the inquiry obtains is right;
3. the optimization that concerns based on equivalent-class
It is right to be optimized to the inquiry that this notion and the synonym of equal value with it constitute based on the notion that key word of the inquiry obtains.
3. method as claimed in claim 2, it is characterized in that between the internal notion of described inquiry for " with " or " or " logical relation.
4. as described method one of in the claim 1 to 3, it is characterized in that: described go heavily to be meant for Search Results URL handle, intercepting " # " URL character string before is as final chained address, for URL AAnd URL BIf there is MD5 (URL A)=MD5 (URL B), then think URL AAnd URL BCorresponding page is a duplicate pages, removes one of them chained address.
5. method as claimed in claim 4 is characterized in that: described ordering is to utilize the semantic similarity of notion in conjunction with the summary sort algorithm, and the result after going is heavily sorted.
6. method as claimed in claim 5 is characterized in that described sort method comprises:
1. calculate the semantic distance Dist (C of each notion in the inquiry string by formula 1 i, C j),
Dist ( C i , C j ) = Σ k = 1 n ω e k + N C i + N C j N C i + N C j + 2 × N LCA × ϵ Formula 1
C wherein iWith C jBe two notions in the inquiry string, Link node C in the expression body tree i, C jShortest path in the Weighted distance sum on each limit; With
Figure FSA00000424876100024
Represent node C respectively iAnd C jWeighted distance to minimum common ancestor's node; N LCARepresent the Weighted distance of minimum common ancestor to root node; ε is a constant, determines according to weighting coefficient,
When semantic distance was 0, semantic similarity was 1, with C i, C jSimilarity between the two is reduced to formula 2:
Sim ( C i , C j ) = 1 Dist ( C i , C j ) + 1 Formula 2
2. by formula 3 determine the degree of correlation Rank that inquiry strings and Search Results write down (Query, Abstract)
Rank ( Query , Abstract ) = Σ i = 1 n Rank ( C i , Abstract ) Formula 3
In the formula 3, Rank (C i, be the degree of correlation between each notion and the Search Results summary Abstract among the inquiry string Query Abstract), n is the number of notion among the Query
Rank ( C i , Abstract ) = m × Σ j = 1 m ln len ( Abstract ) Index ( C i , j , Abstract ) Formula 4
In the formula 4, m=Time (C i, Abstract) be notion C iThe number of times that in summary Abstract, occurs; The length of len (Abstract) expression summary Abstract; Index (C i, j Abstract) is notion C iThe position that the j time occurs in summary Abstract,
3. to original query key word K iAnd the inquiry string Query of expansion, obtain K respectively iSemantic similarity with each notion among the Query Calculate the matching degree R of result for retrieval by formula 5.
R=α Sim (K i, C j(Query, Abstract) formula 5 for)+β Rank
In the formula 5, α and β are constant, represent the semantic relevancy of etendue critical word and the weight of the summary degree of correlation thereof respectively, α ∈ (0,1) wherein, and β ∈ (0,1), and alpha+beta=1,
4. the order of successively decreasing according to R numerical value is finished the ordering of result for retrieval.
7. as described method one of in the claim 1 to 3, it is characterized in that: described search interface is the special interface at a certain field.
8. as described method one of in the claim 1 to 3, it is characterized in that: described search interface has field option or field key word to fill in the zone, field option of selecting according to the user in described step (2) or field key word load corresponding domain body and carry out semantic extension.
CN2011100252193A 2011-01-24 2011-01-24 Information retrieval optimizing method based on domain ontology Expired - Fee Related CN102081668B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011100252193A CN102081668B (en) 2011-01-24 2011-01-24 Information retrieval optimizing method based on domain ontology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011100252193A CN102081668B (en) 2011-01-24 2011-01-24 Information retrieval optimizing method based on domain ontology

Publications (2)

Publication Number Publication Date
CN102081668A true CN102081668A (en) 2011-06-01
CN102081668B CN102081668B (en) 2012-07-25

Family

ID=44087629

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011100252193A Expired - Fee Related CN102081668B (en) 2011-01-24 2011-01-24 Information retrieval optimizing method based on domain ontology

Country Status (1)

Country Link
CN (1) CN102081668B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779161A (en) * 2012-06-14 2012-11-14 杜小勇 Semantic labeling method based on resource description framework (RDF) knowledge base
CN102799677A (en) * 2012-07-20 2012-11-28 河海大学 Water conservation domain information retrieval system and method based on semanteme
CN103324644A (en) * 2012-03-23 2013-09-25 日电(中国)有限公司 Query result diversification method
CN103577581A (en) * 2013-11-08 2014-02-12 南京绿色科技研究院有限公司 Method for forecasting price trend of agricultural products
CN103927358A (en) * 2014-04-15 2014-07-16 清华大学 Text search method and system
CN104239513A (en) * 2014-09-16 2014-12-24 西安电子科技大学 Semantic retrieval method oriented to field data
CN104933159A (en) * 2015-06-26 2015-09-23 南京邮电大学 Semantic query method based on medicine body library
WO2015154679A1 (en) * 2014-04-08 2015-10-15 北京奇虎科技有限公司 Method and device for ranking search results of multiple search engines
CN105631007A (en) * 2015-12-29 2016-06-01 云南电网有限责任公司电力科学研究院 Industry technical information collecting method and system
CN106033428A (en) * 2015-03-11 2016-10-19 北大方正集团有限公司 A uniform resource locator selecting method and a uniform resource locator selecting device
CN107562831A (en) * 2017-08-23 2018-01-09 中国软件与技术服务股份有限公司 A kind of accurate lookup method based on full-text search
CN109740947A (en) * 2019-01-08 2019-05-10 上海市研发公共服务平台管理中心 Expert's method for digging, system, storage medium and electric terminal based on patent data
CN110457490A (en) * 2019-08-15 2019-11-15 桂林电子科技大学 A kind of semantic work stream index construction and search method based on domain body

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060047649A1 (en) * 2003-12-29 2006-03-02 Ping Liang Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation
CN101719145A (en) * 2009-11-17 2010-06-02 北京大学 Individuation searching method based on book domain ontology

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060047649A1 (en) * 2003-12-29 2006-03-02 Ping Liang Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation
CN101719145A (en) * 2009-11-17 2010-06-02 北京大学 Individuation searching method based on book domain ontology

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《情报学报》 20100228 朱恒民等 基于领域本体实现全网信息的智能搜索方法研究 9~15 1-4,7,8 第29卷, 第1期 *

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324644B (en) * 2012-03-23 2016-05-11 日电(中国)有限公司 A kind of Query Result variation method and device
CN103324644A (en) * 2012-03-23 2013-09-25 日电(中国)有限公司 Query result diversification method
CN102779161A (en) * 2012-06-14 2012-11-14 杜小勇 Semantic labeling method based on resource description framework (RDF) knowledge base
CN102779161B (en) * 2012-06-14 2015-03-04 杜小勇 Semantic labeling method based on resource description framework (RDF) knowledge base
CN102799677B (en) * 2012-07-20 2014-11-12 河海大学 Water conservation domain information retrieval system and method based on semanteme
CN102799677A (en) * 2012-07-20 2012-11-28 河海大学 Water conservation domain information retrieval system and method based on semanteme
CN103577581B (en) * 2013-11-08 2016-09-28 南京绿色科技研究院有限公司 Agricultural product price trend forecasting method
CN103577581A (en) * 2013-11-08 2014-02-12 南京绿色科技研究院有限公司 Method for forecasting price trend of agricultural products
WO2015154679A1 (en) * 2014-04-08 2015-10-15 北京奇虎科技有限公司 Method and device for ranking search results of multiple search engines
CN103927358A (en) * 2014-04-15 2014-07-16 清华大学 Text search method and system
CN103927358B (en) * 2014-04-15 2017-02-15 清华大学 text search method and system
CN104239513B (en) * 2014-09-16 2019-03-08 西安电子科技大学 A kind of semantic retrieving method of domain-oriented data
CN104239513A (en) * 2014-09-16 2014-12-24 西安电子科技大学 Semantic retrieval method oriented to field data
CN106033428A (en) * 2015-03-11 2016-10-19 北大方正集团有限公司 A uniform resource locator selecting method and a uniform resource locator selecting device
CN106033428B (en) * 2015-03-11 2019-08-30 北大方正集团有限公司 The selection method of uniform resource locator and the selection device of uniform resource locator
CN104933159A (en) * 2015-06-26 2015-09-23 南京邮电大学 Semantic query method based on medicine body library
CN104933159B (en) * 2015-06-26 2019-01-18 南京邮电大学 A kind of semantic query method based on drug ontology library
CN105631007A (en) * 2015-12-29 2016-06-01 云南电网有限责任公司电力科学研究院 Industry technical information collecting method and system
CN107562831A (en) * 2017-08-23 2018-01-09 中国软件与技术服务股份有限公司 A kind of accurate lookup method based on full-text search
CN109740947A (en) * 2019-01-08 2019-05-10 上海市研发公共服务平台管理中心 Expert's method for digging, system, storage medium and electric terminal based on patent data
CN110457490A (en) * 2019-08-15 2019-11-15 桂林电子科技大学 A kind of semantic work stream index construction and search method based on domain body
CN110457490B (en) * 2019-08-15 2021-06-18 桂林电子科技大学 Semantic workflow index construction and retrieval method based on domain ontology

Also Published As

Publication number Publication date
CN102081668B (en) 2012-07-25

Similar Documents

Publication Publication Date Title
CN102081668B (en) Information retrieval optimizing method based on domain ontology
CN108846029B (en) Information correlation analysis method based on knowledge graph
CN101630314B (en) Semantic query expansion method based on domain knowledge
CN102779193B (en) Self-adaptive personalized information retrieval system and method
Jindal et al. A review of ranking approaches for semantic search on web
CN102902806B (en) A kind of method and system utilizing search engine to carry out query expansion
CN103886099B (en) Semantic retrieval system and method of vague concepts
CN103823906A (en) Multi-dimension searching sequencing optimization algorithm and tool based on microblog data
CN102799677A (en) Water conservation domain information retrieval system and method based on semanteme
US8700624B1 (en) Collaborative search apps platform for web search
CN103020074A (en) Object-level search technique based on main body
CN104636403B (en) Handle the method and device of inquiry request
Sharma et al. Web search result optimization by mining the search engine query logs
CN101814085A (en) WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests
Murugudu et al. Efficiently harvesting deep web interfaces based on adaptive learning using two-phase data crawler framework
Chopra et al. A survey on improving the efficiency of different web structure mining algorithms
Kumar et al. Smart information retrieval using query transformation based on ontology and semantic-association
Yadav et al. Wavelet tree based hybrid geo-textual indexing technique for geographical search
Kataria et al. A novel approach for rank optimization using search engine transaction logs
Zeraatkar et al. Improvement of Page Ranking Algorithm by Negative Score of Spam Pages.
Zubi Ranking webpages using web structure mining concepts
Puspitaningrum et al. Wiki-MetaSemantik: A Wikipedia-derived query expansion approach based on network properties
Bama et al. Improved pagerank algorithm for web structure mining
AnigboguKenechukwu et al. A Cohesive Page Ranking and Depth-First Crawling Scheme For Improved Search Results
Peng et al. A focused web crawler face stock information of financial field

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120725

Termination date: 20170124

CF01 Termination of patent right due to non-payment of annual fee