CN102081668A - Information retrieval optimizing method based on domain ontology - Google Patents
Information retrieval optimizing method based on domain ontology Download PDFInfo
- Publication number
- CN102081668A CN102081668A CN 201110025219 CN201110025219A CN102081668A CN 102081668 A CN102081668 A CN 102081668A CN 201110025219 CN201110025219 CN 201110025219 CN 201110025219 A CN201110025219 A CN 201110025219A CN 102081668 A CN102081668 A CN 102081668A
- Authority
- CN
- China
- Prior art keywords
- notion
- inquiry
- abstract
- formula
- query
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Abstract
The invention provides an information retrieval optimizing method based on domain ontology, comprising the steps of: obtaining query key words submitted by users via a retrieval interface of a retrieval system; performing lexeme expansion for the query key words submitted by users via domain ontology inference in a desired domain of users and according to the built domain ontology, so as to obtain one or more sets of new query strings; submitting the expanded query strings to one or more search engines for retrieval; performing repetition removal and re-sequencing integration for the results fed back by the various search engines; displaying the final result to the user via the retrieval interface. In the invention, the efficiency of the information retrieval relevant to the domain is improved by the lexeme advantage of the domain ontology.
Description
Technical field
The present invention relates to a kind of network technology, be based on the information retrieval method of search engine specifically.
Background technology
People are to use research tool from the main means that network obtains information, as Google, Baidu, Yahoo etc.The principle of work of search engine comprises three processes substantially: (1) gathers information from the internet, by regularly the information of all website and webpage on the internet being grasped with Web Spider.(2) organize your messages and set up index data base and analyze collecting the webpage of returning by analyzing the directory system program, extract keyword that related web page place website links, type of coding, content of pages comprise, keyword position, rise time, size, with the information such as linking relationship of other webpage, calculate according to certain degree of correlation algorithm, obtain each webpage at the degree of correlation (or importance) that reaches each keyword in the super chain in the content of pages, set up the web page index database with these relevant informations then.(3) in index data base searching order, accept inquiry when the user after keyword search is imported at the interface of search engine, from the web page index database, find all related web pages that meet this keyword by the search system program, according to ready-made degree of correlation numerical ordering, the degree of correlation is high more, and rank is forward more.At last,, organize and return to the user contents such as the chained address of Search Results, content of pages summaries by page generation system.
Present search engine is based on the search engine of keyword matching mostly.Yet these search engines seldom have the semantic reasoning ability.Though Google has adopted some natural language processing techniques, for example, the synonym expansion, but it can not resolve the semantic relation between the notion, caused the reduction of precision ratio so to a certain extent, made that the inquiry return results is not a user institute satisfactory information.On the other hand, user's inquiry depends on certain professional domain to a great extent, as marine field.For example; suppose that the user wants to search for the information of marine field relevant " DIP (Dissolved inorganic phosphorus dissolves Phos) "; its Query Result as shown in Figure 4; usually can obtain a large amount of other fields " DIP " information; as " the Dual Inline Package " of microelectronic, i.e. dual-in-line package technology.Because these are and the incoherent garbage of user's purpose that the user obviously is unsatisfied to such result.
" body (Ontology) " conduct " the clear and definite formalization normalized illustration of shared ideas model " is by taking out the model that the related notion of some phenomenons obtains in the objective world, and the implication of conceptual model performance is independent of concrete ambient condition.What body embodied is the knowledge of common approval, reflection be the concept set of generally acknowledging in the association area, so body provides common understanding and description to domain knowledge, can be used to better share, exchange and reuse.Constitute body notion and between relation through explication, use body can eliminate phenomenons such as polysemy, many speech one justice and the meaning of a word be ambiguous, thereby finish domain knowledge clear, definite, complete definition and description.The target of body research is to obtain a Knowledge Representation Method, makes machine to share and process information as the mankind.At present, ontology is widely used in fields such as the representation of knowledge, information retrieval.
Summary of the invention
In order to overcome the existing deficiency of search engine on semantic retrieval, the invention provides a kind of information retrieval optimization method based on domain body.
Technical scheme of the present invention is: a kind of information retrieval optimization method based on domain body, and its step is as follows:
(1), obtains the key word of the inquiry that the user submits to by the search interface of searching system;
(2) in the field of user expectation, according to the domain body of having set up, the key word of the inquiry that the user is submitted to carries out semantic extension by ontology inference, obtains one or more groups new inquiry string;
(3) inquiry string after will expanding is submitted to one or more search engines and is retrieved;
(4) return results to each search engine goes heavily, sorts and integrate;
(5) net result is shown to the user by search interface.
In the above-mentioned steps (2) based on the semantic extension mode of domain body comprise in the following mode a kind of, two kinds or all:
1. based on the optimization method of is-a relation
Is-a relation (inheritance) has shown the classification of notion, and promptly the example of father's notion equals the summation of sub-notion example.Therefore added some constraints in that son is conceptive, sub-notion is also referred to as the particularization of father's notion.The probability that a notion father notion direct with it or sub-notion occur in same document is higher.Therefore, when search during, can utilize the father's notion P of A or sub-notion C to improve the precision ratio of search as constraint about the document of certain notion A.So the inquiry that a notion can be optimized to notion itself and its father's notion or sub-notion is right.
2. based on the optimization method of part-of relation
Part-of represents whole-part relations, is used for describing the mutual relationship between a notion and its part notion.The ingredient of a notion also therewith the field under the notion be closely related.Therefore, also be associated usually with part notion document matching with its global concept.So the inquiry that a notion can be optimized to notion itself and part notion thereof is right.
3. based on the optimization method of equivalent-class relation
Equivalent-class (equivalence class) relation is used for the synonym phenomenon of process field knowledge.Utilize the equivalent-class relation, the notion in the user inquiring can be mapped to the synonym of equal value with it.Like this, can improve the precision ratio of information retrieval.And, the common householder method of equivalent-class relation as preceding two kinds of optimization methods.
Between the internal notion of described inquiry be " with " or " or " logical relation, " with " can improve the inquiry accuracy rate, " or " can improve recall ratio.
In the above-mentioned steps (4), to the return results of each search engine go heavily, ordering integrates, the algorithm that can adopt is as follows:
(1) URL to Search Results handles, and intercepting " # " URL character string before is as final chained address; If there is MD5 (URL
A)=MD5 (URL
B), then think URL
AAnd URL
BCorresponding page is a duplicate pages, goes heavily;
(2) sort algorithm is considered two aspects:
1. the semantic distance Dist (C of each notion in the inquiry string
i, C
j), C wherein
iWith C
jBe two notions in the inquiry string,
In the formula 1,
Link node C in the expression body tree
i, C
jShortest path in the Weighted distance sum on each limit;
With
Represent node C respectively
iAnd C
jWeighted distance to minimum common ancestor's node; N
LCARepresent the Weighted distance of minimum common ancestor to root node; ε is a constant, determines according to weighting coefficient.
The semantic weight of different relations is with reference to table 1 between the notion.
Table 1 semantic distance weight table
In the table 1,
The expression blank operation, single operation is represented in its combination with row; E represents the equivalent-class relation; G represents the is-a relation, and direction is pointed to father's notion by sub-notion; S represents the is-a relation, and direction is pointed to sub-notion by father's notion; P represents the part-of relation.
Because the semantic distance of notion semantic similarity and notion is inverse function each other, when semantic distance was 0, semantic similarity was 1.Therefore can be with C
i, C
jSimilarity between the two is reduced to:
2. inquiry string and Search Results the record degree of correlation Rank (Query, Abstract).
In the formula 3, Rank (C
i, be the degree of correlation between each notion and the Search Results summary Abstract among the inquiry string Query Abstract), n is the number of notion among the Query.
In the formula 4, m=Time (C
i, Abstract) be notion C
iThe number of times that in summary Abstract, occurs; The length of len (Abstract) expression summary Abstract; Index (C
i, j Abstract) is notion C
iThe position that the j time occurs in summary Abstract.
3. to original query key word K
iAnd the inquiry string Query of expansion, obtain K respectively
iSemantic similarity with each notion among the Query
Then can calculate the matching degree R of result for retrieval.
R=α Sim (K
i, C
j(Query, Abstract) formula 5 for)+β Rank
In the formula 5, α and β are constant, represent the semantic relevancy of etendue critical word and the weight of the summary degree of correlation thereof respectively.α ∈ (0,1) wherein, β ∈ (0,1), and alpha+beta=1.
4. the order of successively decreasing according to R numerical value is finished the ordering of result for retrieval.
The present invention is recall ratio and the precision ratio that utilizes the relevant information retrieval in the semantic advantage raising field of body.On the basis of the method, user's key word of the inquiry can be utilized domain body carry out semantic extension, obtain one or more groups new query string, then it is submitted to the Web search engine, and Search Results sorted and put in order, finally be shown to the user.Because these new query strings have been considered the relation between the field concept, as hypernym, hyponym, synonym etc., can improve the recall ratio of retrieval; Simultaneously because that body is the field is relevant, make result for retrieval be limited under within the scope in field, can screen out a large amount of and information field independence, thereby improve the precision ratio of retrieval.
Description of drawings
Fig. 1 is a marine ecology domain body fragment;
Fig. 2 is the optimization information searching system OASIS workflow diagram that the present invention is based on domain body;
Fig. 3 is the search interface of OASIS of the present invention;
Fig. 4 is the Search Results homepage sectional drawing that retrieval " DIP " obtains in Google;
Fig. 5 is to be the summary degree of correlation of example calculating with " InorganicNutrient+DIP ";
Fig. 6 is the Search Results sectional drawing that retrieval " DIP " obtains in OASIS of the present invention.
Embodiment
Below by a marine ecology field specific embodiment the present invention is described in further detail.
The present invention proposes a kind of information retrieval optimization method based on domain body, is example with the marine ecology field, in conjunction with the accompanying drawings, specifically describes as follows.
The workflow diagram of committed step of the present invention is an example with the marine ecology field as shown in Figure 2, and when submit queries " DIP ", concrete implementation step is:
1. server is set up a marine ecology body (Ontology), and with the storage of ocean.ont form, its body fragment as shown in Figure 1;
2. pass through search interface shown in Figure 3 at user side, submit to key word of the inquiry " DIP " to inquire about (Portal);
3. server obtains the key word of the inquiry that the user submits to, utilize HozoAPI to carry out semantic reasoning to the ocean.ont body and realize optimizing (Query Optimizer), at notion " DIP ", can get access to relative notion has: based on the notion InorganicNutrient of is-a relation, based on notion Phytoplankton, the Seawater of part-of relation.Obtain three groups of new inquiry strings " InorganicNutrient+DIP ", " DIP+Phytoplankton " and " DIP+Seawater " by the relation between these notions and the notion;
4. these three groups of character strings are sent to Web search engine (Web SearchEngine) respectively, (World Wide Web) obtains three groups of retrieval sets from WWW, get preceding 30 records of each result for retrieval, obtain result set Result_1 respectively, Result_2 and Result_3;
5. server is with Result_1, and Result_2 and Result_3 merge, and resequences after finishing retry, obtains net result collection Result.Main algorithm is as follows:
(1) URL to Search Results handles, and intercepting " # " URL character string before is as final chained address.If there is MD5 (URL
A)=MD5 (URL
B), then think URL
AAnd URL
BCorresponding page is a duplicate pages.
(2) sort algorithm is considered two aspects:
1. the semantic distance Dist (C of each notion in the inquiry string
i, C
j), C wherein
iWith C
jBe two notions in the inquiry string.
Utilize formula 1:
Calculate C
iWith C
jSemantic distance, and by formula 2:
Calculate C
iWith C
jSemantic similarity.
2. utilize formula 3:
Calculate the degree of correlation of inquiry string and Search Results record.
3. to original query key word K
iAnd the inquiry string Query of expansion, obtain K respectively
iSemantic similarity with each notion among the Query
And utilize formula 5:R=α Sim (K
i, C
j(Query Abstract) calculates matching degree to)+β Rank, finishes the ordering of result for retrieval by its result's the order of successively decreasing.
Be that the example explanation describes now with inquiry string " InorganicNutrient+DIP ".Two notions are respectively with C
INAnd C
DIPExpression.
Calculate by formula 2
(Query, correlation parameter Abstract) as shown in Figure 5 to calculate Rank.
Utilize formula 5, get α=0.6, β=0.4:
6. Result is shown to the user by search interface.As shown in Figure 6.
Said process is the specialty retrieval optimization method that is defaulted as marine ecology domain-specific searching system OASIS and interface 3 with.Also can adopt this professional searching system for other field, but will adopt the association area body.Certainly for comprehensive search engine, then can on search interface, increase field keyword column by user's input, to determine the field of user expectation retrieval according to the field keyword of user's input, for the user strange situation is divided in the field, can the preliminary election association area on the search interface of search engine select during by user search, to determine domain body and to carry out the meaning of a word expansion of association area.For not selecting or do not import the field keyword, then adopt all spectra body when determining domain body.
Claims (8)
1. information retrieval optimization method based on domain body, its step is as follows:
(1), obtains the key word of the inquiry that the user submits to by the search interface of searching system;
(2) in the field of user expectation, according to the domain body of having set up, the key word of the inquiry that the user is submitted to carries out semantic extension by the domain body reasoning, obtains one or more groups new inquiry string;
(3) inquiry string after will expanding is submitted to one or more search engines and is retrieved;
(4) return results to each search engine goes heavy and the ordering integration;
(5) net result is shown to the user by search interface.
2. the method for claim 1 is characterized in that describedly carrying out semantic extension by ontology inference, be adopt in the following method one or both or all:
1. based on is-a optimized relation method
Father's notion P or the sub-notion C of the notion A that obtains based on described key word of the inquiry, the inquiry that is optimized to notion A itself and its father's notion P is right, or the inquiry of notion A itself and its sub-notion C is right;
2. based on the optimization method of part-of relation
The inquiry that will be optimized to this notion itself and its part notion formation based on the notion that key word of the inquiry obtains is right;
3. the optimization that concerns based on equivalent-class
It is right to be optimized to the inquiry that this notion and the synonym of equal value with it constitute based on the notion that key word of the inquiry obtains.
3. method as claimed in claim 2, it is characterized in that between the internal notion of described inquiry for " with " or " or " logical relation.
4. as described method one of in the claim 1 to 3, it is characterized in that: described go heavily to be meant for Search Results URL handle, intercepting " # " URL character string before is as final chained address, for URL
AAnd URL
BIf there is MD5 (URL
A)=MD5 (URL
B), then think URL
AAnd URL
BCorresponding page is a duplicate pages, removes one of them chained address.
5. method as claimed in claim 4 is characterized in that: described ordering is to utilize the semantic similarity of notion in conjunction with the summary sort algorithm, and the result after going is heavily sorted.
6. method as claimed in claim 5 is characterized in that described sort method comprises:
1. calculate the semantic distance Dist (C of each notion in the inquiry string by formula 1
i, C
j),
C wherein
iWith C
jBe two notions in the inquiry string,
Link node C in the expression body tree
i, C
jShortest path in the Weighted distance sum on each limit;
With
Represent node C respectively
iAnd C
jWeighted distance to minimum common ancestor's node; N
LCARepresent the Weighted distance of minimum common ancestor to root node; ε is a constant, determines according to weighting coefficient,
When semantic distance was 0, semantic similarity was 1, with C
i, C
jSimilarity between the two is reduced to formula 2:
2. by formula 3 determine the degree of correlation Rank that inquiry strings and Search Results write down (Query, Abstract)
In the formula 3, Rank (C
i, be the degree of correlation between each notion and the Search Results summary Abstract among the inquiry string Query Abstract), n is the number of notion among the Query
In the formula 4, m=Time (C
i, Abstract) be notion C
iThe number of times that in summary Abstract, occurs; The length of len (Abstract) expression summary Abstract; Index (C
i, j Abstract) is notion C
iThe position that the j time occurs in summary Abstract,
3. to original query key word K
iAnd the inquiry string Query of expansion, obtain K respectively
iSemantic similarity with each notion among the Query
Calculate the matching degree R of result for retrieval by formula 5.
R=α Sim (K
i, C
j(Query, Abstract) formula 5 for)+β Rank
In the formula 5, α and β are constant, represent the semantic relevancy of etendue critical word and the weight of the summary degree of correlation thereof respectively, α ∈ (0,1) wherein, and β ∈ (0,1), and alpha+beta=1,
4. the order of successively decreasing according to R numerical value is finished the ordering of result for retrieval.
7. as described method one of in the claim 1 to 3, it is characterized in that: described search interface is the special interface at a certain field.
8. as described method one of in the claim 1 to 3, it is characterized in that: described search interface has field option or field key word to fill in the zone, field option of selecting according to the user in described step (2) or field key word load corresponding domain body and carry out semantic extension.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100252193A CN102081668B (en) | 2011-01-24 | 2011-01-24 | Information retrieval optimizing method based on domain ontology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100252193A CN102081668B (en) | 2011-01-24 | 2011-01-24 | Information retrieval optimizing method based on domain ontology |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102081668A true CN102081668A (en) | 2011-06-01 |
CN102081668B CN102081668B (en) | 2012-07-25 |
Family
ID=44087629
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011100252193A Expired - Fee Related CN102081668B (en) | 2011-01-24 | 2011-01-24 | Information retrieval optimizing method based on domain ontology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102081668B (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102779161A (en) * | 2012-06-14 | 2012-11-14 | 杜小勇 | Semantic labeling method based on resource description framework (RDF) knowledge base |
CN102799677A (en) * | 2012-07-20 | 2012-11-28 | 河海大学 | Water conservation domain information retrieval system and method based on semanteme |
CN103324644A (en) * | 2012-03-23 | 2013-09-25 | 日电(中国)有限公司 | Query result diversification method |
CN103577581A (en) * | 2013-11-08 | 2014-02-12 | 南京绿色科技研究院有限公司 | Method for forecasting price trend of agricultural products |
CN103927358A (en) * | 2014-04-15 | 2014-07-16 | 清华大学 | Text search method and system |
CN104239513A (en) * | 2014-09-16 | 2014-12-24 | 西安电子科技大学 | Semantic retrieval method oriented to field data |
CN104933159A (en) * | 2015-06-26 | 2015-09-23 | 南京邮电大学 | Semantic query method based on medicine body library |
WO2015154679A1 (en) * | 2014-04-08 | 2015-10-15 | 北京奇虎科技有限公司 | Method and device for ranking search results of multiple search engines |
CN105631007A (en) * | 2015-12-29 | 2016-06-01 | 云南电网有限责任公司电力科学研究院 | Industry technical information collecting method and system |
CN106033428A (en) * | 2015-03-11 | 2016-10-19 | 北大方正集团有限公司 | A uniform resource locator selecting method and a uniform resource locator selecting device |
CN107562831A (en) * | 2017-08-23 | 2018-01-09 | 中国软件与技术服务股份有限公司 | A kind of accurate lookup method based on full-text search |
CN109740947A (en) * | 2019-01-08 | 2019-05-10 | 上海市研发公共服务平台管理中心 | Expert's method for digging, system, storage medium and electric terminal based on patent data |
CN110457490A (en) * | 2019-08-15 | 2019-11-15 | 桂林电子科技大学 | A kind of semantic work stream index construction and search method based on domain body |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060047649A1 (en) * | 2003-12-29 | 2006-03-02 | Ping Liang | Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation |
CN101719145A (en) * | 2009-11-17 | 2010-06-02 | 北京大学 | Individuation searching method based on book domain ontology |
-
2011
- 2011-01-24 CN CN2011100252193A patent/CN102081668B/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060047649A1 (en) * | 2003-12-29 | 2006-03-02 | Ping Liang | Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation |
CN101719145A (en) * | 2009-11-17 | 2010-06-02 | 北京大学 | Individuation searching method based on book domain ontology |
Non-Patent Citations (1)
Title |
---|
《情报学报》 20100228 朱恒民等 基于领域本体实现全网信息的智能搜索方法研究 9~15 1-4,7,8 第29卷, 第1期 * |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103324644B (en) * | 2012-03-23 | 2016-05-11 | 日电(中国)有限公司 | A kind of Query Result variation method and device |
CN103324644A (en) * | 2012-03-23 | 2013-09-25 | 日电(中国)有限公司 | Query result diversification method |
CN102779161A (en) * | 2012-06-14 | 2012-11-14 | 杜小勇 | Semantic labeling method based on resource description framework (RDF) knowledge base |
CN102779161B (en) * | 2012-06-14 | 2015-03-04 | 杜小勇 | Semantic labeling method based on resource description framework (RDF) knowledge base |
CN102799677B (en) * | 2012-07-20 | 2014-11-12 | 河海大学 | Water conservation domain information retrieval system and method based on semanteme |
CN102799677A (en) * | 2012-07-20 | 2012-11-28 | 河海大学 | Water conservation domain information retrieval system and method based on semanteme |
CN103577581B (en) * | 2013-11-08 | 2016-09-28 | 南京绿色科技研究院有限公司 | Agricultural product price trend forecasting method |
CN103577581A (en) * | 2013-11-08 | 2014-02-12 | 南京绿色科技研究院有限公司 | Method for forecasting price trend of agricultural products |
WO2015154679A1 (en) * | 2014-04-08 | 2015-10-15 | 北京奇虎科技有限公司 | Method and device for ranking search results of multiple search engines |
CN103927358A (en) * | 2014-04-15 | 2014-07-16 | 清华大学 | Text search method and system |
CN103927358B (en) * | 2014-04-15 | 2017-02-15 | 清华大学 | text search method and system |
CN104239513B (en) * | 2014-09-16 | 2019-03-08 | 西安电子科技大学 | A kind of semantic retrieving method of domain-oriented data |
CN104239513A (en) * | 2014-09-16 | 2014-12-24 | 西安电子科技大学 | Semantic retrieval method oriented to field data |
CN106033428A (en) * | 2015-03-11 | 2016-10-19 | 北大方正集团有限公司 | A uniform resource locator selecting method and a uniform resource locator selecting device |
CN106033428B (en) * | 2015-03-11 | 2019-08-30 | 北大方正集团有限公司 | The selection method of uniform resource locator and the selection device of uniform resource locator |
CN104933159A (en) * | 2015-06-26 | 2015-09-23 | 南京邮电大学 | Semantic query method based on medicine body library |
CN104933159B (en) * | 2015-06-26 | 2019-01-18 | 南京邮电大学 | A kind of semantic query method based on drug ontology library |
CN105631007A (en) * | 2015-12-29 | 2016-06-01 | 云南电网有限责任公司电力科学研究院 | Industry technical information collecting method and system |
CN107562831A (en) * | 2017-08-23 | 2018-01-09 | 中国软件与技术服务股份有限公司 | A kind of accurate lookup method based on full-text search |
CN109740947A (en) * | 2019-01-08 | 2019-05-10 | 上海市研发公共服务平台管理中心 | Expert's method for digging, system, storage medium and electric terminal based on patent data |
CN110457490A (en) * | 2019-08-15 | 2019-11-15 | 桂林电子科技大学 | A kind of semantic work stream index construction and search method based on domain body |
CN110457490B (en) * | 2019-08-15 | 2021-06-18 | 桂林电子科技大学 | Semantic workflow index construction and retrieval method based on domain ontology |
Also Published As
Publication number | Publication date |
---|---|
CN102081668B (en) | 2012-07-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102081668B (en) | Information retrieval optimizing method based on domain ontology | |
CN108846029B (en) | Information correlation analysis method based on knowledge graph | |
CN101630314B (en) | Semantic query expansion method based on domain knowledge | |
CN102779193B (en) | Self-adaptive personalized information retrieval system and method | |
Jindal et al. | A review of ranking approaches for semantic search on web | |
CN102902806B (en) | A kind of method and system utilizing search engine to carry out query expansion | |
CN103886099B (en) | Semantic retrieval system and method of vague concepts | |
CN103823906A (en) | Multi-dimension searching sequencing optimization algorithm and tool based on microblog data | |
CN102799677A (en) | Water conservation domain information retrieval system and method based on semanteme | |
US8700624B1 (en) | Collaborative search apps platform for web search | |
CN103020074A (en) | Object-level search technique based on main body | |
CN104636403B (en) | Handle the method and device of inquiry request | |
Sharma et al. | Web search result optimization by mining the search engine query logs | |
CN101814085A (en) | WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests | |
Murugudu et al. | Efficiently harvesting deep web interfaces based on adaptive learning using two-phase data crawler framework | |
Chopra et al. | A survey on improving the efficiency of different web structure mining algorithms | |
Kumar et al. | Smart information retrieval using query transformation based on ontology and semantic-association | |
Yadav et al. | Wavelet tree based hybrid geo-textual indexing technique for geographical search | |
Kataria et al. | A novel approach for rank optimization using search engine transaction logs | |
Zeraatkar et al. | Improvement of Page Ranking Algorithm by Negative Score of Spam Pages. | |
Zubi | Ranking webpages using web structure mining concepts | |
Puspitaningrum et al. | Wiki-MetaSemantik: A Wikipedia-derived query expansion approach based on network properties | |
Bama et al. | Improved pagerank algorithm for web structure mining | |
AnigboguKenechukwu et al. | A Cohesive Page Ranking and Depth-First Crawling Scheme For Improved Search Results | |
Peng et al. | A focused web crawler face stock information of financial field |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20120725 Termination date: 20170124 |
|
CF01 | Termination of patent right due to non-payment of annual fee |