CN102033955B - Method for expanding user search results and server - Google Patents

Method for expanding user search results and server Download PDF

Info

Publication number
CN102033955B
CN102033955B CN 201010605956 CN201010605956A CN102033955B CN 102033955 B CN102033955 B CN 102033955B CN 201010605956 CN201010605956 CN 201010605956 CN 201010605956 A CN201010605956 A CN 201010605956A CN 102033955 B CN102033955 B CN 102033955B
Authority
CN
China
Prior art keywords
searching key
word
search results
correlation rule
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 201010605956
Other languages
Chinese (zh)
Other versions
CN102033955A (en
Inventor
常华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
(Beijing) Technology Co., Ltd.
Original Assignee
常华
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 常华 filed Critical 常华
Priority to CN 201010605956 priority Critical patent/CN102033955B/en
Publication of CN102033955A publication Critical patent/CN102033955A/en
Application granted granted Critical
Publication of CN102033955B publication Critical patent/CN102033955B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention relates to the technical field of information search, and discloses a method and equipment for expanding user search results. The method comprises the following steps of: acquiring search keywords input by a user on a search interface; acquiring associated words associated with the search keywords; querying an index database according to the search keywords, the associated words and the combination of the search keywords and the associated words to obtain search results respectively; performing repeat removal and sequencing on the search results; and sending the sequenced search results to a client so that the client can show the received search results to users. According to the technical scheme provided by the invention, richer search results can be intelligently provided for the users.

Description

The method of extending user Search Results and server
Technical field
The present invention relates to the information search technique field, more particularly, relate to a kind of method and server of extending user Search Results.
Background technology
The internet is the abundantest information source of present content and most effective information transfer channel, and the birth of internet search engine makes people to find own needed information with simple and rapid mode.Current commercial internet search engine relies on its advanced distributed computing architecture, has reached very complete information coverage rate, and refresh rate is preferably also arranged, and can obtain the newer information content.
The workflow of internet search engine mainly is divided into three parts; One, use crawlers to obtain the info web on the internet; Two, these information are set up index, three, the inquiry that the user asked is searched for by the index of having set up, and result document is provided.
In general, the inquiry that the user asked comprises one or more keyword.When comprising the keyword that one or more user imports in the result document that internet search engine provided, this result is considered to search hit, meets user's search expectation.Such as, the user imports keyword 360, can comprise 360 in the result document that obtains so; If input keyword QQ can comprise QQ in the result document that then obtains.That is to say that in the prior art, the result document that internet search engine provided just comprises some documents of the keyword of user's input.And in fact; Some does not comprise the keyword of user input but the document that comprises with other keywords of the relevant relation of said keyword also is that the user is needed; Such as, when the user search information relevant with 360, hope can obtain the relevant information that certain incidence relation arranged with 360 simultaneously.At present, existing internet search engine also can't satisfy user's this demand.
Summary of the invention
The embodiment of the invention provides a kind of method and server of extending user Search Results to the problems referred to above that exist in the prior art, can intelligently for the user abundanter Search Results be provided.
For this reason, the embodiment of the invention provides following technical scheme:
A kind of method of extending user Search Results comprises:
Obtain the searching key word of user in the search interface input;
Obtain the conjunctive word that is associated with said searching key word;
According to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word, obtain Search Results respectively;
Go heavy and ordering to said Search Results;
Search Results after the ordering is sent to client, so that client represents the Search Results of receiving to said user.
Preferably, the said conjunctive word that is associated with said searching key word that obtains comprises:
Search association rule database according to said searching key word;
If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word.
Preferably, said method also comprises:
Setting comprises keyword and the correlation rule of the conjunctive word that is associated with said keyword; And/or, generate the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword according to a plurality of searching key words that said user imports;
Said correlation rule is saved in the said association rule database.
Preferably, said method also comprises:
Add up all users' search behavior and/or Search Results;
Confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule according to statistics;
According to definite result the correlation rule in the said association rule database is safeguarded.
Preferably, saidly confirm that according to statistics the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule comprises:
Calculate the support and/or the degree of confidence of said correlation rule according to statistics;
If said support greater than the preset confidence threshold value, confirms then that said correlation rule be strong association greater than the support threshold value of setting and/or said degree of confidence; Otherwise be weak association.
A kind of server comprises:
The keyword acquiring unit is used to obtain the searching key word of user in the search interface input;
The conjunctive word acquiring unit is used to obtain the conjunctive word that is associated with said searching key word;
Query unit is used for respectively obtaining Search Results according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word;
The arrangement unit is used for going heavy and ordering to said Search Results;
Transmitting element is used for the Search Results after the ordering is sent to client, so that client represents the Search Results of receiving to said user.
Preferably; Said conjunctive word acquiring unit; Specifically be used for searching association rule database according to said searching key word; If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word.
Preferably, said server also comprises: rule is provided with unit and/or regular generation unit, and preserves the unit;
Said rule is provided with the unit, is used to be provided with the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said regular generation unit is used for a plurality of searching key words according to said user's input, generates the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said preservation unit is used for said correlation rule is saved in said association rule database.
Preferably, said server also comprises:
Statistic unit is used to add up all users' search behavior and/or Search Results;
The degree of association is confirmed the unit, the power of incidence relation between the keyword that is used for confirming said correlation rule according to statistics and corresponding conjunctive word;
The rule maintenance unit is used for according to confirming that the result safeguards the correlation rule of said association rule database.
Preferably, the said degree of association confirms that the unit comprises:
Computation subunit is used for calculating according to statistics the support and/or the degree of confidence of said correlation rule;
Analyze subelement, be used for during greater than the preset confidence threshold value, confirming that said correlation rule is for strong related greater than the support threshold value of setting and/or said degree of confidence in said support; Otherwise be weak association.
The method and the server of embodiment of the invention extending user Search Results; Searching key word to user's input; Excavated the conjunctive word that has incidence relation with said searching key word; And, obtain search result corresponding, thereby expanded Search Results respectively according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word; The document that will have incidence relation with the searching key word of user's input also offers the user in the lump, for the user provides abundanter Search Results.
Description of drawings
Fig. 1 is the process flow diagram of the method for embodiment of the invention extending user Search Results;
Fig. 2 is a kind of structural representation of embodiment of the invention server;
Fig. 3 is the another kind of structural representation of embodiment of the invention server.
Embodiment
In order to make those skilled in the art person understand the scheme of the embodiment of the invention better, the embodiment of the invention is done further to specify below in conjunction with accompanying drawing and embodiment.
The method and the server of embodiment of the invention extending user Search Results; Searching key word to user's input has excavated the conjunctive word that has incidence relation with said searching key word, that is to say; Intelligently user's search behavior and expectation are predicted; And, obtain search result corresponding, thereby expanded Search Results respectively according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word; The document that will have incidence relation with the searching key word of user's input also offers the user in the lump, for the user provides abundanter Search Results.
As shown in Figure 1, be the process flow diagram of the method for embodiment of the invention extending user Search Results, comprise following basic step:
Step 101 is obtained the searching key word of user in the search interface input.
Said searching key word can be a Chinese, English arbitrarily, can be independent speech, also can be phrase, and the keyword of user's input can be one or more.
In addition, user's input also can be the phrase that comprises one or more keys, has imported " 360 with QQ Great War " such as the user, then can therefrom extract " 360 ", " QQ " and " Great War " these several keywords.Concrete extracting mode can be handled according to extracting mode of the prior art, and this embodiment of the invention is not done qualification.
For above-mentioned situation, server can be searched for respectively and each keyword document matching, obtains search result corresponding.
Step 102 is obtained the conjunctive word that is associated with said searching key word.
In embodiments of the present invention; Can set up various correlation rules in advance; The conjunctive word that comprises keyword in the said correlation rule and be associated with said keyword; For the ease of these correlation rules are safeguarded, can also the various correlation rules of setting up be saved in the association rule database so that when needed to said correlation rule upgrade, increase or deletion etc.
Such as; Some have very strong ageing information, and As time goes on, these information no longer become focus; People also can descend to the attention rate of these information; Correspondingly, the correlation rule relevant with these information also need upgrade or delete, and is that the user provides some unwanted Search Results to avoid.
Correspondingly; Server is behind the said searching key word that receives the client transmission; Just can search said association rule database according to said searching key word; If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word.
Need to prove that the foundation of said correlation rule can have multiple mode, such as:
(1) sets up the correlation rule of the conjunctive word comprise keyword and to be associated through set-up mode, that is to say, set up said correlation rule through human-edited's mode with said keyword.
(2) front is mentioned; The keyword of user's input can be a plurality of; For the situation that a plurality of keywords are arranged, just can have incidence relation between these keywords, therefore; Can also generate the correlation rule of the conjunctive word that comprises keyword and be associated automatically by a plurality of searching key words of server according to said user's input with said keyword.Need to prove that said server can be a search engine server, the user that it was directed against also is meant the user that all use this search engine.
Certainly, can adopt above-mentioned dual mode to set up corresponding correlation rule simultaneously, and the situation that other modes can also be arranged and deposit, the embodiment of the invention are not done qualification to this yet.
Such as, in said association rule database, the correlation rule shown in the table 1 is arranged:
Table 1:
ID Rule
1 Potato=>dietary function
2 QQ=>360
3 { Zhang San, Li Si }=>lawsuit
4 Law of conservation of mass=>Luo Mengnuosuofu
5 Einstein=>relativity
6 Einstein=>Nobel Prize in physics
The front is mentioned, and the keyword of user's input can be one or more.For the situation of having only a keyword, when searching said association rule database, possibly obtain the one or more conjunctive words corresponding with this keyword.Such as, the user has imported searching key word " einstein " at search interface, then searches said association rule database, can obtain two conjunctive words being associated with keyword " einstein ", i.e. " relativity " and " Nobel Prize in physics ".For the situation that a plurality of keywords are arranged; When searching said association rule database; Can search according to said a plurality of keywords, such as, the user has imported searching key word " Zhang San " and " Li Si " at search interface; Then search said association rule database, can obtain the conjunctive word " lawsuit " that is associated with keyword " Zhang San " and " Li Si ".
In addition; Have when a plurality of at the keyword of user input, these keywords have certain incidence relation usually, therefore; Server also can extract these keywords; Generate corresponding correlation rule, if the record of this correlation rule not in the said association rule database then is saved in the correlation rule that generates in the said association rule database.Such as; The user has imported searching key word " 360 " and " network security " at search interface; Then server generates correlation rule { 360=>network security } according to the keyword of user's input; And this correlation rule not in the said association rule database, then server adds the correlation rule { 360=>network security } that generates in the said association rule database to.
Step 103 according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word, obtains Search Results respectively.
Such as, if the user imports QQ, the said association rule database of whois lookup obtains the conjunctive word 360 with the relevant relation of QQ, then to { QQ}, { { 360} searches for respectively, obtains search result corresponding for QQ, 360}.
Said Search Results can comprise the summary or the partial content of relevant documentation, can further include the URL of said document, is linked to relevant documentation so that the user can click URL.
Need to prove; Some internet web page information of regularly collecting have been preserved in the said index data base; Concrete collection mode can adopt prior art; Such as utilizing web crawlers program search internet web page, set up the corresponding index of different web pages information, deposit the index of setting up in said index data base.
Step 104 is gone heavy and ordering to said Search Results.
Because once search may obtain a plurality of Search Results, go heavy and ordering to these Search Results, the user is better experienced.
Said go heavily to be meant a plurality of identical result document are only kept one, concrete implementation procedure can not done qualification to this embodiment of the invention with reference to prior art.
When Search Results is sorted, can the Search Results of the said searching key word of correspondence be come the front, be corresponding said searching key word then and the Search Results of related contamination, be the Search Results of corresponding said conjunctive word at last.Certainly, also can adopt other orders.
In addition, when Search Results is sorted, can also take all factors into consideration other factors, such as, the time that can basis relevant with said Search Results source information document produces sorts to said Search Results, before the Search Results that the time is nearest comes; Can also basis relevant with the said Search Results source information document and the matching degree of said searching key word sort to said Search Results; Before the Search Results that matching degree is the highest came, the calculating of said matching degree can be carried out according to account form of the prior art.When considering that multiple factor sorts to said Search Results, can set different weights to different factors, according to the priority of each each Search Results of weight calculation, priority is high comes the front.
Step 105 sends to client with the Search Results after the ordering, so that client represents the Search Results of receiving to said user.
Need to prove, can the whole of the Search Results after the ordering or ordering be sent to client in preceding part.
It is thus clear that the method for embodiment of the invention extending user Search Results is to the searching key word of user's input; Excavated the conjunctive word that has incidence relation with said searching key word; That is to say, intelligently user's search behavior and expectation are predicted, and respectively according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word; Obtain search result corresponding; Thereby expanded Search Results, the document that will have incidence relation with the searching key word of user's input also offers the user in the lump, for the user provides abundanter Search Results.
The front is mentioned, and in instance of the present invention, described correlation rule can have multiple mode to set up, such as, the mode through being provided with, or set up corresponding correlation rule by server automatically according to a plurality of searching key words of user's input.And, can also these correlation rules be saved in the same association rule database.
In order further to guarantee the strong correlation of these correlation rules; Equally; Can carry out regular update to it by manual work; Can also analyze all users' search behavior and/or Search Results by server, automatically these correlation rules safeguarded, be elaborated in the face of this down according to analysis result.
At first, brief account two notions relevant: support, degree of confidence with said correlation rule.Said correlation rule, support, degree of confidence all are the notion in the data mining subject at first, wherein:
A correlation rule can be designated as:
A=>B (1)
Wherein, A representes keyword, and B representes the conjunctive word of A.
Support is defined as:
sup ( A = > B ) = n ( A ′ ′ B ) N - - - ( 2 )
Wherein, and n (A " B) expression A and the simultaneous number of times of B, N representes the number of all affairs.
Degree of confidence is defined as:
Conf ( A = > B ) = n ( A ′ ′ B ) n ( A ) - - - ( 3 )
Wherein, n (A) represents the number of times that A takes place.
Support and degree of confidence can be represented the power of incidence relation between a plurality of clauses and subclauses.
Need to prove, can find out that (A=>B) must equal sup, and (B=>A), ((value of B=>A) is then different for A=>B) and conf for conf for sup by last formula (2), (3).
Based on above-mentioned principle; In embodiments of the present invention; Can also may further comprise the steps: add up all users' search behavior and/or Search Results, confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule, the correlation rule in the said association rule database is safeguarded according to definite result according to statistics; Particularly, can be that correlation rule is upgraded, adds or operation such as deletion.
Between keyword and the corresponding conjunctive word confirmed according to statistics in the said correlation rule, during incidence relation strong and weak, can multiple implementation specifically be arranged, to specifying for example below this based on foregoing support and/or degree of confidence.
(1) confirms the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule according to all users' search behavior
For instance, when supposing to have several users to use search, the several query words below having imported:
1.360 Great War QQ;
2.QQ prosecute 360;
3.QQ。
Suppose that A is 360, B is QQ, then according to above-mentioned these search behaviors, can obtain:
N (A " B)=2, N=3, so, sup (A=>B)=2/3=0.667;
N (A)=2, thus conf (A=>B)=2/2=1.0;
Equally, can obtain conf (B=>A)=2/3=0.667.
(2) confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule according to all users' Search Results
In embodiments of the present invention, can user's one query be called affairs.
For the current incidence relation that has existed in the association rule database, 360 related QQ for example, to 360, QQ and 360, QQ} searches for the result document that obtains and follows the trail of statistics, supposes that statistics is following in certain period:
These result document add up to N=100;
The number that only comprises 360 document is: n (360)=10;
The number that only comprises the document of QQ is n (QQ)=20;
Not only comprised 360 but also the number that comprises the document of QQ be n (A " B)=70;
Then calculate and can obtain:
sup(360=>QQ)=70/100=0.7;
conf(360=>QQ)=70/(70+10)=0.875;
conf(QQ=>360)=70/(70+20)=0.778。
These three values are dynamic changes, if a certain period, these three values are all diminishing, and explains that 360 are weakening with the incidence relation of QQ, otherwise explain that then its incidence relation is in enhancing.
(3) comprehensive above-mentioned two kinds of statisticses, search behavior and the Search Results of promptly taking all factors into consideration the user are confirmed the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule
Such as, can give specific weight to the statistical value of said search behavior and Search Results respectively, when calculating said support and degree of confidence, carry out weighted mean according to weight separately, weight separately can be identical, also can be different.
Need to prove, to above-mentioned (1) kind statistical computation mode, in user's once search; The situation of three or the keyword more than three may appear comprising in the searching key word of user input, have imported " 360 with QQ lawsuit situation " such as the user, and keyword set is exactly { 360 so; QQ, the lawsuit situation }, at this moment; Can calculate respectively the support and the degree of confidence of each combination, comprise:
Sup (360=>{ QQ, lawsuit }), conf (360=>{ QQ, lawsuit });
Sup (QQ=>{ 360, lawsuit }), conf (QQ=>{ 360, lawsuit });
Sup (lawsuit=>QQ, 360}), conf (lawsuit=>QQ, 360});
Conf ({ QQ tells }=>360);
Conf ({ 360, tell }=>QQ);
Conf (QQ, 360}=>tell);
sup(360=>{QQ}),conf(360=>{QQ});
Sup (360=>{ lawsuit }), conf (360=>{ lawsuit });
Conf (lawsuit=>360);
Conf (lawsuit=>QQ);
conf(QQ=>360);
Sup (QQ=>lawsuit), conf (QQ=>lawsuit);
It is thus clear that, when the keyword number in certain search affairs is too many, can cause calculated amount too big, in the application of reality, can do a little restrictions, such as only calculating two degree of confidence and supports between the keyword.
Need to prove; After calculating said support and degree of confidence; Can confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule according to one of them, such as support threshold value and confidence threshold value are set respectively, after the support that calculates surpasses said support threshold value; Think strong association, otherwise think weak association; Equally, after the degree of confidence that calculates surpasses said confidence threshold value, think strong association.Certainly, also can take all factors into consideration this two values, after the support that calculates and degree of confidence are all above corresponding threshold, just think strong association.
In addition; When the correlation rule in the said association rule database being safeguarded according to definite result; Can determine whether the needs deletion according to the power of its incidence relation, add or revise the correlation rule in the said association rule database; Such as, the incidence relation in confirming certain correlation rule is deleted this correlation rule after belonging to weak association.
Need to prove; Above-mentioned only be to utilize in the embodiment of the invention support and/or degree of confidence judge incidence relation in the correlation rule power concrete for example; In the practical application, can also judge the power of said incidence relation, this embodiment of the invention is not done qualification through other modes.
It is thus clear that; The method of embodiment of the invention extending user Search Results; Intelligently user's search behavior and expectation are predicted that not only the document that will have incidence relation with the searching key word of user's input also offers the user in the lump, for the user provides abundanter Search Results; And, guaranteed the validity and the accuracy of the Search Results of expansion through automatic maintenance to correlation rule.
One of ordinary skill in the art will appreciate that all or part of step that realizes in the foregoing description method is to instruct relevant hardware to accomplish through program; Described program can be stored in the computer read/write memory medium; Described storage medium, as: ROM/RAM, magnetic disc, CD etc.
Correspondingly, the embodiment of the invention also provides a kind of server, and is as shown in Figure 2, is a kind of structural representation of this server.
In this embodiment, said server comprises:
Keyword acquiring unit 201 is used to obtain the searching key word of user in the search interface input;
Conjunctive word acquiring unit 202 is used to obtain the conjunctive word that is associated with said searching key word;
Query unit 203 is used for respectively obtaining Search Results according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word;
Arrangement unit 204 is used for going heavy and ordering to said Search Results;
Transmitting element 205 is used for the Search Results after the ordering is sent to client, so that client represents the Search Results of receiving to said user.
In embodiments of the present invention; Can set up various correlation rules in advance; The conjunctive word that comprises keyword in the said correlation rule and be associated with said keyword; For the ease of these correlation rules are safeguarded, can also the various correlation rules of setting up be saved in the association rule database so that when needed to said correlation rule upgrade, increase or deletion etc.
Correspondingly; Said conjunctive word acquiring unit 202; Specifically be used for searching association rule database 205 according to said searching key word; If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word.
Need to prove that said association rule database 205 can be inner at said server, also can be independent of outside the said server.
In addition, in embodiments of the present invention, said server also can further comprise: rule is provided with unit and/or regular generation unit, and preserves the unit, wherein:
Said rule is provided with the unit, is used to be provided with the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said regular generation unit is used for a plurality of searching key words according to said user's input, generates the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said preservation unit is used for said correlation rule is saved in said association rule database.
That is to say that said correlation rule can have multiple mode to generate, such as, through said rule the unit is set by manual work some correlation rules are set, can also generate some correlation rules automatically by said regular generation unit.In practical application, said server can include only said rule any in unit and the said regular generation unit is set, and also can comprise this two unit simultaneously.Certainly, the embodiment of the invention is not limited in above-mentioned these implementations, can also adopt other modes or above-mentioned variety of way and other modes and the mode of depositing generates said correlation rule, and this is enumerated no longer one by one.
It is thus clear that; The server of the embodiment of the invention; To the searching key word of user input, intelligently user's search behavior and expectation are predicted, and respectively according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word; Obtain search result corresponding; Thereby expanded Search Results, the document that will have incidence relation with the searching key word of user's input also offers the user in the lump, for the user provides abundanter Search Results.
As shown in Figure 3, be the another kind of structural representation of embodiment of the invention server.
Be that with difference embodiment illustrated in fig. 2 in this embodiment, said server also further comprises:
Statistic unit 206 is used to add up all users' search behavior and/or Search Results;
The degree of association is confirmed unit 207, the power of incidence relation between the keyword that is used for confirming said correlation rule according to statistics and corresponding conjunctive word;
Rule maintenance unit 208 is used for according to confirming that the result safeguards the correlation rule of said association rule database, and particularly, this maintenance can be deletion, interpolation or revise the correlation rule in the said association rule database.
In embodiments of the present invention, the said degree of association confirms that unit 207 can confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule in several ways, such as, can confirm according to support and/or degree of confidence.
Correspondingly, the said degree of association confirms that unit 207 comprises:
Computation subunit is used for calculating according to statistics the support and/or the degree of confidence of said correlation rule;
Analyze subelement, be used for during greater than the preset confidence threshold value, confirming that said correlation rule is for strong related greater than the support threshold value of setting and/or said degree of confidence in said support; Otherwise be weak association.
Certainly, the embodiment of the invention is not limited in above-mentioned this implementation, and in practical application, the said degree of association confirms that unit 207 can also confirm the power of said incidence relation through other modes, and this embodiment of the invention is not done qualification.
The server of the embodiment of the invention; Not only intelligently user's search behavior and expectation are predicted; The document that will have incidence relation with the searching key word of user's input also offers the user in the lump; For the user provides abundanter Search Results, and, guaranteed the validity and the accuracy of the Search Results of expansion through automatic maintenance to correlation rule.
Identical similar part is mutually referring to getting final product between each embodiment in this instructions, and each embodiment stresses all is the difference with other embodiment.Especially, for Apparatus and system embodiment, because it is basically similar in appearance to method embodiment, so describe fairly simplely, relevant part gets final product referring to the part explanation of method embodiment.System embodiment described above only is schematic; Wherein said unit as the separating component explanation can or can not be physically to separate also; The parts that show as the unit can be or can not be physical locations also; Promptly can be positioned at a place, perhaps also can be distributed on a plurality of NEs.Can realize the purpose of present embodiment scheme according to the needs selection some or all of module wherein of reality.Those of ordinary skills promptly can understand and implement under the situation of not paying creative work.
More than disclosedly be merely preferred implementation of the present invention; But the present invention is not limited thereto; Any those skilled in the art can think do not have a creationary variation, and, all should drop in protection scope of the present invention not breaking away from some improvement and the retouching of being done under the principle of the invention prerequisite.

Claims (2)

1. the method for an extending user Search Results is characterized in that, comprising:
Obtain the searching key word of user in the search interface input;
Obtain the conjunctive word that is associated with said searching key word;
According to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word, obtain Search Results respectively;
Go heavy and ordering to said Search Results;
Search Results after the ordering is sent to client, so that client represents the Search Results of receiving to said user;
The said conjunctive word that is associated with said searching key word that obtains comprises:
Search association rule database according to said searching key word;
If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word;
Said method also comprises:
Setting comprises keyword and the correlation rule of the conjunctive word that is associated with said keyword; And/or, generate the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword according to a plurality of searching key words that said user imports;
Said correlation rule is saved in the said association rule database;
The said conjunctive word that is associated with said searching key word that obtains comprises: search said association rule database according to said searching key word; If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word;
Said method also comprises:
After being saved in said correlation rule in the said association rule database, add up all users' search behavior and/or Search Results;
Confirm the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule according to statistics;
According to definite result the correlation rule in the said association rule database is safeguarded;
Saidly confirm that according to statistics the power of incidence relation between keyword and the corresponding conjunctive word in the said correlation rule comprises:
Calculate the support and/or the degree of confidence of said correlation rule according to statistics;
If said support greater than the preset confidence threshold value, confirms then that said correlation rule be strong association greater than the support threshold value of setting and/or said degree of confidence; Otherwise be weak association;
Wherein said support and degree of confidence are represented the power of incidence relation between a plurality of clauses and subclauses.
2. a server system is characterized in that, comprising:
The keyword acquiring unit is used to obtain the searching key word of user in the search interface input;
The conjunctive word acquiring unit is used to obtain the conjunctive word that is associated with said searching key word;
Query unit is used for respectively obtaining Search Results according to the query composition index data base of said searching key word, conjunctive word and said searching key word and conjunctive word;
The arrangement unit is used for going heavy and ordering to said Search Results;
Transmitting element is used for the Search Results after the ordering is sent to client, so that client represents the Search Results of receiving to said user;
Said conjunctive word acquiring unit; Specifically be used for searching association rule database according to said searching key word; If the correlation rule that comprises said searching key word is arranged in the said association rule database, then from said correlation rule, obtain the conjunctive word that is associated with said searching key word;
Said server system also comprises: rule is provided with unit and/or regular generation unit, and preserves the unit;
Said rule is provided with the unit, is used to be provided with the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said regular generation unit is used for a plurality of searching key words according to said user's input, generates the correlation rule of the conjunctive word that comprises keyword and be associated with said keyword;
Said preservation unit is used for said correlation rule is saved in said association rule database;
Said server system also comprises:
Statistic unit is used to add up all users' search behavior and/or Search Results;
The degree of association is confirmed the unit, the power of incidence relation between the keyword that is used for confirming said correlation rule according to statistics and corresponding conjunctive word;
The rule maintenance unit is used for according to confirming that the result safeguards the correlation rule of said association rule database;
The said degree of association confirms that the unit comprises:
Computation subunit is used for calculating according to statistics the support and/or the degree of confidence of said correlation rule;
Analyze subelement, be used for during greater than the preset confidence threshold value, confirming that said correlation rule is for strong related greater than the support threshold value of setting and/or said degree of confidence in said support; Otherwise be weak association;
Wherein said support and degree of confidence are represented the power of incidence relation between a plurality of clauses and subclauses.
CN 201010605956 2010-12-24 2010-12-24 Method for expanding user search results and server Expired - Fee Related CN102033955B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010605956 CN102033955B (en) 2010-12-24 2010-12-24 Method for expanding user search results and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010605956 CN102033955B (en) 2010-12-24 2010-12-24 Method for expanding user search results and server

Publications (2)

Publication Number Publication Date
CN102033955A CN102033955A (en) 2011-04-27
CN102033955B true CN102033955B (en) 2012-12-05

Family

ID=43886848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010605956 Expired - Fee Related CN102033955B (en) 2010-12-24 2010-12-24 Method for expanding user search results and server

Country Status (1)

Country Link
CN (1) CN102033955B (en)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102207962B (en) * 2011-05-25 2013-03-13 盛乐信息技术(上海)有限公司 Dynamic search engine and method
CN103324631B (en) * 2012-03-22 2018-05-29 深圳市世纪光速信息技术有限公司 The method and device of data search is provided
CN103377226B (en) * 2012-04-25 2016-08-03 中国移动通信集团公司 A kind of intelligent search method and system thereof
CN103885947B (en) * 2012-12-19 2018-08-10 北京百度网讯科技有限公司 A kind of method for digging of search need, intelligent search method and its device
CN103123649B (en) * 2013-01-29 2016-04-20 广州一找网络科技有限公司 A kind of message searching method based on microblog and system
CN103514269B (en) * 2013-09-12 2017-08-01 百度在线网络技术(北京)有限公司 Second query word associated with the first query word is determined based on natural search result
CN103744856B (en) * 2013-12-03 2016-09-21 北京奇虎科技有限公司 Linkage extended search method and device, system
CN103927354A (en) 2014-04-11 2014-07-16 百度在线网络技术(北京)有限公司 Interactive searching and recommending method and device
CN104376072A (en) * 2014-11-12 2015-02-25 天脉聚源(北京)传媒科技有限公司 Searching method and device
CN104484367A (en) * 2014-12-05 2015-04-01 广州招商速建互联网信息科技有限公司 Data mining and analyzing system
CN104484374B (en) * 2014-12-08 2018-11-16 百度在线网络技术(北京)有限公司 A kind of method and device creating network encyclopaedia entry
CN104537057B (en) * 2014-12-26 2016-06-29 奇飞翔艺(北京)软件有限公司 Data search method and client
CN104537072B (en) * 2014-12-30 2019-03-12 百度在线网络技术(北京)有限公司 Searching method and device
CN104778208A (en) * 2015-03-11 2015-07-15 赵业东 Method and system for optimally grasping search engine SEO (search engine optimization) website data
CN104915418B (en) * 2015-06-08 2018-09-04 小米科技有限责任公司 Recommendation of websites method and device
CN104899305A (en) * 2015-06-12 2015-09-09 百度在线网络技术(北京)有限公司 Recommendation method and device for mobile search
CN105279252B (en) * 2015-10-12 2017-12-26 广州神马移动信息科技有限公司 Excavate method, searching method, the search system of related term
CN108845992B (en) * 2015-10-30 2022-08-26 上海智臻智能网络科技股份有限公司 Computer readable storage medium and question-answer interaction method
CN105740476A (en) * 2016-03-18 2016-07-06 科润智能科技股份有限公司 Associated problem recommending method, device and system
CN108241646B (en) * 2016-12-23 2021-12-17 阿里巴巴集团控股有限公司 Search matching method and device and recommendation method and device
CN106599297A (en) * 2016-12-28 2017-04-26 北京百度网讯科技有限公司 Method and device for searching question-type search terms on basis of deep questions and answers
CN107066521A (en) * 2017-02-16 2017-08-18 国网上海市电力公司 Transmission line equipment association picture crawling method based on text keyword
CN107526797A (en) * 2017-08-17 2017-12-29 广州市西美信息科技有限公司 Substance search method and device
CN108763196A (en) * 2018-05-03 2018-11-06 上海海事大学 A kind of keyword extraction method based on PMI
CN109145016A (en) * 2018-09-10 2019-01-04 合肥科讯金服科技有限公司 A kind of finance internet big data searching system
CN109785926A (en) * 2018-12-21 2019-05-21 宜宝科技(北京)有限公司 Inspection report processing method and processing device for unit end
CN111324830B (en) * 2020-02-07 2021-09-07 北京星选科技有限公司 Object processing method and device, electronic equipment and computer readable storage medium
CN111625701B (en) * 2020-05-25 2024-01-26 Oppo广东移动通信有限公司 Searching method, searching device, server and storage medium
CN112100330B (en) * 2020-09-09 2023-09-26 杭州凡闻科技有限公司 Topic searching method and system based on artificial intelligence technology
CN112579912A (en) * 2020-12-29 2021-03-30 湖北亿咖通科技有限公司 Searching method, electronic equipment and computer storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987457A (en) * 1997-11-25 1999-11-16 Acceleration Software International Corporation Query refinement method for searching documents
CN101038596A (en) * 2007-04-29 2007-09-19 北京搜狗科技发展有限公司 Method and system for classifying website
CN101241512A (en) * 2008-03-10 2008-08-13 北京搜狗科技发展有限公司 Search method for redefining enquiry word and device therefor
CN101246499A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Network information search method and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101566988A (en) * 2008-04-24 2009-10-28 华为技术有限公司 Method, system and device for searching fuzzy semantics
CN101770499A (en) * 2009-01-07 2010-07-07 上海聚力传媒技术有限公司 Information retrieval method in search engine and corresponding search engine

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987457A (en) * 1997-11-25 1999-11-16 Acceleration Software International Corporation Query refinement method for searching documents
CN101038596A (en) * 2007-04-29 2007-09-19 北京搜狗科技发展有限公司 Method and system for classifying website
CN101241512A (en) * 2008-03-10 2008-08-13 北京搜狗科技发展有限公司 Search method for redefining enquiry word and device therefor
CN101246499A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Network information search method and system

Also Published As

Publication number Publication date
CN102033955A (en) 2011-04-27

Similar Documents

Publication Publication Date Title
CN102033955B (en) Method for expanding user search results and server
CN101446940B (en) Method and device of automatically generating a summary for document set
Cano Basave et al. Automatic labelling of topic models learned from twitter by summarisation
Shokouhi Central-rank-based collection selection in uncooperative distributed information retrieval
CN101782919B (en) Web form data output method, device and form processing system
Santos et al. Explicit search result diversification through sub-queries
CN100595759C (en) Method and device for enquire enquiry extending as well as related searching word stock
CN103136228A (en) Image search method and image search device
CN103186612B (en) A kind of method of classified vocabulary, system and implementation method
US20140289675A1 (en) System and Method of Mapping Products to Patents
CN102341800A (en) Search processing method and apparatus
CN101820592A (en) Method and device for mobile search
CN102737021B (en) Search engine and realization method thereof
US10565253B2 (en) Model generation method, word weighting method, device, apparatus, and computer storage medium
CN102567290B (en) Method, device and equipment for expanding short text to be processed
CN102722498A (en) Search engine and implementation method thereof
CN107807957A (en) entity library generating method and device
CN102710795A (en) Hotspot collecting method and device
EP3559888A1 (en) Systems and methods for intelligent prospect identification using online resources and neural network processing to classify organizations based on published materials
CN106709851A (en) Big data retrieval method and apparatus
CN103838754A (en) Information searching device and method
Keikha et al. Temper: A temporal relevance feedback method
CN103942268A (en) Method and device for combining search and application and application interface
CN103984757A (en) Method and system for inserting news information articles in search result page
CN104331493A (en) Method and device for generating trend interpretation data by virtue of computer

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MIAOCHUANG (BEIJING) TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: CHANG HUA

Effective date: 20141202

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100040 SHIJINGSHAN, BEIJING TO: 100192 HAIDIAN, BEIJING

TR01 Transfer of patent right

Effective date of registration: 20141202

Address after: 100192 Yongtai Innovation Park, middle Yongtai Road, Beijing, Haidian District B318

Patentee after: (Beijing) Technology Co., Ltd.

Address before: 100040, 4, building 31, 502, East District, Yongle District, Beijing, Shijingshan District

Patentee before: Chang Hua

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20121205

Termination date: 20201224

CF01 Termination of patent right due to non-payment of annual fee