CN104573019A - Information searching method and device - Google Patents

Information searching method and device Download PDF

Info

Publication number
CN104573019A
CN104573019A CN201510013397.2A CN201510013397A CN104573019A CN 104573019 A CN104573019 A CN 104573019A CN 201510013397 A CN201510013397 A CN 201510013397A CN 104573019 A CN104573019 A CN 104573019A
Authority
CN
China
Prior art keywords
term
stage
result
retrieval
search word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510013397.2A
Other languages
Chinese (zh)
Other versions
CN104573019B (en
Inventor
卢家广
曹冰
陈晨
韩中华
马晋
沈健
张晓婧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510013397.2A priority Critical patent/CN104573019B/en
Publication of CN104573019A publication Critical patent/CN104573019A/en
Application granted granted Critical
Publication of CN104573019B publication Critical patent/CN104573019B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions
    • G06F16/3323Query formulation using system suggestions using document space presentation or visualization, e.g. category, hierarchy or range presentation and selection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3337Translation of the query language, e.g. Chinese to English

Abstract

The invention provides an information searching method and device, the information searching method comprises following steps: obtaining the associated searching word of the searching word in the first stage according to the searching word in the first stage, displaying the associated searching word, the associated searching word comprises the searching word, which has different language compared with the searching word in the first stage; obtaining the searching word in the second stage, the searching word in the second stage is the searching word selected from the associated searching word; and obtaining the corresponding searching result according to the searching word in the second stage, displaying the searching result. The cross-language domain search can be realized by the method and the combinatorial search is achieved.

Description

Information retrieval method and device
Technical field
The present invention relates to search technique field, particularly relate to a kind of information retrieval method and device.
Background technology
User can obtain by search engine the information needed, and when using search engine, user can input term in the search box, the result for retrieval that search engine search is relevant to term, and is illustrated in search results pages.Generally, the relevant search for user recommends can also be comprised in search results pages, such as, when term is " support vector machine ", can comprise " support vector " in relevant search, " support vector machine introduction ", the term that " support vector machine principle " etc. are relevant, user can select a term again to initiate search in relevant search.
But, usually provide the term that inputs with the user term with voice domain in prior art in relevant search, and, search can only be initiated for the term of in relevant search.
Summary of the invention
The present invention is intended to solve one of technical matters in correlation technique at least to a certain extent.
For this reason, one object of the present invention is to propose a kind of information retrieval method, and the method can realize across language domains search, and can realize combinatorial search.
Another object of the present invention is to propose a kind of information indexing device.
For achieving the above object, the information retrieval method that first aspect present invention embodiment proposes, comprise: according to the term of first stage, obtain the associative search word of the term of described first stage, and show described associative search word, described associative search word comprises the term with the term of described first stage with different language; Obtain the term of subordinate phase, the term of described subordinate phase is the term selected in described associative search word; Term according to described subordinate phase obtains corresponding result for retrieval, and shows described result for retrieval.
The information retrieval method that first aspect present invention embodiment proposes, by obtaining the associative search word of the term of first stage, associative search word comprises the term with the term of first stage with different language, can realize searching for across language domains; By obtaining the term of subordinate phase, the term of subordinate phase is the term selected in associative search word, by selecting the search that can realize one or more term, can realize combinatorial search.
For achieving the above object, the information indexing device that second aspect present invention embodiment proposes, comprise: acquisition module, for the term according to the first stage, obtain the associative search word of the term of described first stage, and show described associative search word, described associative search word comprises the term with the term of described first stage with different language; Select module, for obtaining the term of subordinate phase, the term of described subordinate phase is the term selected in described associative search word; Display module, obtains corresponding result for retrieval for the term according to described subordinate phase, and shows described result for retrieval.
The information indexing device that second aspect present invention embodiment proposes, by obtaining the associative search word of the term of first stage, associative search word comprises the term with the term of first stage with different language, can realize searching for across language domains; By obtaining the term of subordinate phase, the term of subordinate phase is the term selected in associative search word, by selecting the search that can realize one or more term, can realize combinatorial search.
The aspect that the present invention adds and advantage will part provide in the following description, and part will become obvious from the following description, or be recognized by practice of the present invention.
Accompanying drawing explanation
The present invention above-mentioned and/or additional aspect and advantage will become obvious and easy understand from the following description of the accompanying drawings of embodiments, wherein:
Fig. 1 is the schematic flow sheet of the information retrieval method that one embodiment of the invention proposes;
Fig. 2 is the displaying schematic diagram after the retrieval of first stage in the embodiment of the present invention;
Fig. 3 is the displaying schematic diagram after the retrieval of a kind of subordinate phase in the embodiment of the present invention;
Fig. 4 is the displaying schematic diagram after the retrieval of another kind of subordinate phase in the embodiment of the present invention;
Fig. 5 is the schematic flow sheet of the information retrieval method that another embodiment of the present invention proposes;
Fig. 6 is the schematic flow sheet of the information retrieval method that another embodiment of the present invention proposes;
Fig. 7 is the schematic flow sheet of term Chinese and English conversion in the embodiment of the present invention;
Fig. 8 is the schematic diagram of the middle-and-high-ranking filter information of the embodiment of the present invention;
Fig. 9 is the structural representation of the information indexing device that another embodiment of the present invention proposes;
Figure 10 is the structural representation of the information indexing device that another embodiment of the present invention proposes.
Embodiment
Be described below in detail embodiments of the invention, the example of described embodiment is shown in the drawings, and wherein same or similar label represents same or similar element or has element that is identical or similar functions from start to finish.Being exemplary below by the embodiment be described with reference to the drawings, only for explaining the present invention, and can not limitation of the present invention being interpreted as.On the contrary, embodiments of the invention comprise fall into attached claims spirit and intension within the scope of all changes, amendment and equivalent.
Fig. 1 is the schematic flow sheet of the information retrieval method that one embodiment of the invention proposes, and the method can be applied in a search engine, and the method comprises:
S11: according to the term of first stage, obtains the associative search word of the term of described first stage, and shows described associative search word, and described associative search word comprises the term with the term of described first stage with different language.
The term of first stage can refer to the term that user inputs in the search box.
Be understandable that, term (query) also can be called query word, or, search word.
The term of first stage has first language, and first language is such as Chinese.
After search engine receives Chinese retrieval word, can obtain the associative search word of Chinese retrieval word, this associative search word comprises the term of second language, and second language is such as English.Such as, associative search word comprises the translator of English result of the Chinese retrieval word of user's input.In addition, associative search word can also comprise the Chinese retrieval word of user's input itself.
Such as, see Fig. 2, the term 21 of first stage is " ecommerce ", and associative search word 22 comprises " ecommerce ", " Electronic commerce ", " Electronic business " and " E-commerce ".
Certainly, be understandable that, when obtaining associative search word according to the term of first stage, result for retrieval that can also be corresponding with the term of first stage, and result for retrieval is shown together with associative search word, such as, see Fig. 2, the result for retrieval 23 that the term of first stage is corresponding can also be shown.
The flow process of the concrete acquisition of the term according to first stage associative search word can see subsequent embodiment.
S12: the term obtaining subordinate phase, the term of described subordinate phase is the term selected in described associative search word.
Optionally, the term of described acquisition subordinate phase, comprising:
Receive the check boxes of user by arranging, the term selected in described associative search word, the term of described selection is defined as the term of described subordinate phase, the term of described selection is one or at least two, the term of described selection comprises: the term of described first stage, and/or have the term of different language with the term of described first stage, described check boxes is corresponding with described associative search word when showing described associative search word to be arranged.
Wherein, can also arrange check boxes by corresponding each associative search word when showing associative search word, user can select term by clicking check boxes.Optionally, the term of first stage can be selected by default user.
Concrete, see Fig. 3, can select a term in associative search word, such as, the term selected in Fig. 3, namely the term 31 of subordinate phase comprises: " E-commerce ".Or,
See Fig. 4, can select multiple term in associative search word, such as, the term selected in Fig. 4, namely the term 41 of subordinate phase comprises: " ecommerce ", " Electronic commerce ", and " E-commerce ".
In prior art, when initiating again to search for by relevant search, in relevant search term can only be selected to search for, and in the present embodiment, by providing options for associative search word, multiple term can be selected in associative search word, thus combinatorial search can be realized at search word, make result for retrieval can comprise result for retrieval corresponding to any one term selected.
S13: the term according to described subordinate phase obtains corresponding result for retrieval, and shows described result for retrieval.
Wherein, the result for retrieval of acquisition can comprise the result for retrieval of the term of corresponding each selection, and such as, see Fig. 3, result for retrieval 32 comprises the result for retrieval of corresponding term " E-commerce ".Or,
Such as, see Fig. 4, result for retrieval 42 comprises the result for retrieval of corresponding term " Electronic commerce ", the result for retrieval of corresponding term " E-commerce ", the result for retrieval of corresponding term " ecommerce ".
In addition, when showing result for retrieval, general rise of prices of the stocks and other securities process can be carried out to corresponding term.Such as, see Fig. 3, " E-commerce " in result for retrieval is all carried out general rise of prices of the stocks and other securities process.Or see Fig. 4, " Electronic commerce " in Article 1 result for retrieval all carries out general rise of prices of the stocks and other securities process.
Above-mentionedly be retrieved as example with the retrieval carrying out the first stage once and subordinate phase, be understandable that, the retrieval of first stage and/or the retrieval of subordinate phase can repeatedly, or, only carry out the retrieval of first stage.Such as, see Fig. 5, in another embodiment, the flow process of information retrieval method can comprise:
S51: user carries out the search of first stage.
Such as, time first, user can input term in the search box, and clicks search button, and as " using Baidu.com " initiates search, this search can be called the search of first stage
In addition, the search of this first stage can repeat, and such as, user re-enters new term in the search box, and initiates the search to new term.
S52: the search entering subordinate phase from the search of first stage.
Such as, after user inputs term in the search box, as shown in above-described embodiment, can show multiple associative search word, user by selecting one or more term, thus can enter the search of subordinate phase from the search of first stage.
S53: user carries out the search of subordinate phase.
Such as, after user selects one or more term in multiple associative search word, then click the search that " using Baidu.com " then can initiate subordinate phase.
The search of this subordinate phase can be repeatedly, such as, first time can be selected " E-commerce ", initiate the search of subordinate phase, after obtaining result for retrieval corresponding to " E-commerce ", user can also reselect new term, such as, second time can select " Electronic commerce " and " E-commerce ", again initiates the search of new subordinate phase.
S54: the search entering the first stage from the search of subordinate phase.
After user carries out the search of subordinate phase, user can also carry out the search of first stage again.Such as, user by selecting " E-commerce " carries out the search of subordinate phase and after obtaining result for retrieval, user can also input new search word again in the search box, thus again carries out the search of first stage.
Certainly, when the search of first stage and/or subordinate phase, user can leave, and exits search.
In the present embodiment, by obtaining the associative search word of the term of first stage, associative search word comprises the term with the term of first stage with different language, can realize searching for across language domains; By obtaining the term of subordinate phase, the term of subordinate phase is the term selected in associative search word, by selecting the search that can realize one or more term, can realize combinatorial search; Therefore, the present embodiment can realize across language domains search, and can realize combinatorial search, thus improves search effect, promotes Consumer's Experience.
Fig. 6 is the schematic flow sheet of the information retrieval method that another embodiment of the present invention proposes, and the present embodiment is to be converted to English by term from Chinese, and the method comprises:
S601: obtain term.
Wherein, the term that search engine obtains can be the term of first stage, or, the term of subordinate phase.
S602: carry out the conversion of term Chinese and English.
Such as, when search engine receives the Chinese retrieval word of user's input, can translate this Chinese retrieval word, obtain English translation result.
Be understandable that, if the term obtained is English, the term of such as subordinate phase, this step can be ignored, or, keep English constant when carrying out the conversion of term Chinese and English.
When translating, the translation model and dictionary set up in advance can be obtained, according to described translation model and dictionary, adopt mechanical translation mode, obtain the translation result of the term of described reception, wherein, if when this information retrieval method is applied in these retrieval, described translation model is according to obtaining across the training of language language material in paper.
Concrete, see Fig. 7, the flow process of term Chinese and English conversion can comprise:
S71: the Chinese and English language material collecting paper.
By collecting the language material in paper, can obtain when the request of paper search intention and better translating effect.
S72: the language material according to collecting carries out off-line model training.
Concrete model training mode adopts the common flow process obtaining training pattern according to language material.
S73: obtain translation model by training.
S74: during translation on line, receives Chinese retrieval word.
S75: according to translation model and dictionary, adopts online machine translation.
Dictionary can specifically refer to bilingual dictionary, by combining translation model and dictionary, can obtain more extensive and suitable translation result.
S76: through translation, obtains English translation result.
Further, after obtaining translation result, can screen translation result, obtain the translation result after the screening of predetermined number.Concrete screening process can be as follows:
S603: term analysis is carried out to the translation result of the term obtained and term, obtains analysis result.
Each term of retrieval end can be indicated how to retrieve by term analysis, each term carrying out retrieving comprises original term, and, each translation result.
Whether term analysis can comprise: grammatical analysis, cuts word, analyze each word and can omit when follow-up retrieval, general rise of prices of the stocks and other securities information etc.
Such as, for a term " what is machine learning ", cut word analysis can obtain " < what; 1>; < is; 1>; < machine, 0>, < learn; 0> ", four elements, each element first row represents cuts word result, and when secondary series represents retrieval, whether this word can omit, if can omit, comprise other in webpage and cut word result but do not comprise this word and also can be called back.
S604: carry out term queue according to described analysis result and recall, obtain recalling result.
Retrieval end is according to the multiple term analysis results obtained, and carry out queue for each term analysis result and recall, obtain the result for retrieval of each term, result for retrieval also can be called recalls result.
Queue: the corresponding queue of each query analysis result, this queue is used for carrying out recalling result for retrieval to the back-end, and to be Myself the process wanting to do to recalling result, and such as result filters duplicate removal, sort result position adjustment etc.
Queue is recalled: process is recalled in the retrieval of single query.Queue utilizes query analysis result above, to index rear end inquiry inverted index, obtains result for retrieval, and the final score of the every bar result of comprehensive multifactor calculating, duplicate removal etc. is carried out to result simultaneously.Issue several query analysis result, just carry out several queue and recall process.
S605: according to described correlation values of recalling the term of result and described acquisition, carry out term mass filter, obtain associative search word.
There is the problem of quality in the English query itself converting out due to Chinese and English request, these quality problems can be embodied in the correlativity aspect with existing paper resource.If conversion query is out not high with the paper resource dependencies of itself, just can not provide the good result of corresponding query yet, so this English query be returned to user, this is do not have income and help to user in fact.For this reason, in the present embodiment, do a filtration to the conversion query being finally prompted to user, filter criteria can carry out after queue recalls, carrying out threshold filtering by the correlation values of recalling first bar result and original term in result according to conversion query out.Concrete threshold value can be arranged, e.g., and 500.Such as, the correlation values of what the first translation result was corresponding recall first bar result and original term in result is greater than 500, then associative search word comprises the first translation result.
Above-mentioned term corresponding to result of recalling correlation values being greater than threshold value is defined as associative search word, optionally, can also carry out descending sort according to correlation values, and the term of the preceding predetermined number of sequence is defined as associative search word.
After term mass filter, the first stage can be in or subordinate phase carries out different process according to current.
S606: judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase, if the retrieval request of first stage, then perform S609, if the retrieval request of subordinate phase, then perform S607.
Wherein, the term that S601 obtains can be carried in retrieval request, and retrieval request can be specifically URL(uniform resource locator) (Uniform Resource Locator, URL) request.
Optionally, the current retrieval request of described judgement is the retrieval request of first stage or the retrieval request of subordinate phase, comprising:
Obtain the parameter preset in described URL request, described parameter preset is used for identifying user search behavior;
According to described parameter preset, judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase.
Concrete, can judge according to the sc_tr parameter in URL request, when without sc_tr parameter or sc_tr=1, determine that current retrieval request is the retrieval request of first stage, or, work as sc_tr>1, determine that current retrieval request is the retrieval request of subordinate phase, in addition, can also determine which term user have selected by sc_tr parameter.
S607: the result of recalling obtained is recalled to queue and carries out merging rearrangement.
Carry out after queue recalls, the retrieval of the English query that original query and user choose all is called back, and just needs the sequence carrying out result after recalling.Can sort according to the correlativity of recalling result and original query, and stepping can be obtained to Relevance scores division.Results relevance score in same shelves be considered as the same, other factors then according to paper in current shelves sort, as quoted number of times.Quote number of times more, more show the quality of paper.
S608: according to the advanced filter information preset, be combined the result of recalling after rearrangement and screen, obtain result for retrieval.
Advanced filter information can be default, the tabulate statistics information of such as academics search result, adds up the paper publishing amount of current result for retrieval different field, time etc., and can search for click when front end represents.Such as, see any one result of page searching in Fig. 2-Fig. 4, this result of page searching can comprise " advanced filter " option, user can open this " advanced filter " option and select.See Fig. 8, after opening " advanced filter " option, be supplied to alternative content of user.
Result for retrieval due to the most multiple query merges and returns, and therefore needs the Sino-British query clicked according to user to carry out corresponding advanced filter information and merges.
The mixing of final many query result for retrieval represents the result for retrieval that effect can be shown in Figure 4.
S609: show result for retrieval and associative search word.
By Chinese and English query conversion above and query mass filter, can obtain the Chinese and English query combination being presented to user in front end, user can according to the intention demand of oneself, and single choice or multiselect need the Chinese and English term carrying out searching for.
If the term of initial acquisition is the term of first stage, then now show the result for retrieval that the term of first stage is corresponding, such as, the result for retrieval of corresponding Chinese retrieval word as shown in Figure 2.Or, if the term of initial acquisition is the term of subordinate phase, then now show the result for retrieval that the term of subordinate phase is corresponding, such as, the result for retrieval of the pure English term of correspondence as shown in Figure 3, or, the result for retrieval of Chinese and English mixing as shown in Figure 4.
S610: judge whether user have selected associative search word, if so, repeat S601 and subsequent step thereof, otherwise, perform S611.
Such as, can judge to have selected term according to user's check boxes whether clicked before associative search word.
S611: terminate.
In the present embodiment, by providing English term after input Chinese retrieval word, the information retrieval in Chinese Papers territory and English papers territory can be realized, realizing the interactive retrieval supporting Chinese and English intertranslation; One or more by selecting when selecting, the single or multiple terms needed can be selected flexibly, initiate retrieval or combined retrieval separately; By selecting Chinese, or, English, or select Chinese and English, pure Chinese retrieval can be realized, pure English retrieval, or Chinese and English mixing retrieval, improves and retrieves dirigibility, meet the needs of different users.
Fig. 9 is the structural representation of the information indexing device that another embodiment of the present invention proposes, and this device 90 comprises acquisition module 91, selects module 92 and display module 93.
Acquisition module 91, for the term according to the first stage, obtains the associative search word of the term of described first stage, and shows described associative search word, and described associative search word comprises the term with the term of described first stage with different language;
The term of first stage can refer to the term that user inputs in the search box.
Be understandable that, term (query) also can be called query word, or, search word.
The term of first stage has first language, and first language is such as Chinese.
After search engine receives Chinese retrieval word, can obtain the associative search word of Chinese retrieval word, this associative search word comprises the term of second language, and second language is such as English.Such as, associative search word comprises the translator of English result of the Chinese retrieval word of user's input.In addition, associative search word can also comprise the Chinese retrieval word of user's input itself.
Such as, see Fig. 2, the term 21 of first stage is " ecommerce ", and associative search word 22 comprises " ecommerce ", " Electronic commerce ", " Electronic business " and " E-commerce ".
Certainly, be understandable that, when obtaining associative search word according to the term of first stage, result for retrieval that can also be corresponding with the term of first stage, and result for retrieval is shown together with associative search word, such as, see Fig. 2, the result for retrieval 23 that the term of first stage is corresponding can also be shown.
Optionally, described associative search word also comprises: the term of described first stage, and see Figure 10, described acquisition module 91 comprises:
First module 911, for carrying out across language switching the term of described first stage, obtains the translation result of the term of described first stage;
Second unit 912, for screening the term of described first stage and described translation result, obtains described associative search word.
Optionally, if described information retrieval be used for these retrieval time, described first module 911 specifically for:
Obtain the translation model and dictionary set up in advance, according to described translation model and dictionary, adopt mechanical translation mode, obtain the translation result of the term of described first stage, wherein, described translation model is according to obtaining across the training of language language material in paper.
Concrete translation flow can be shown in Figure 7 flow process, do not repeat them here.
Optionally, described second unit 912 specifically for:
Term analysis is carried out to the term of described first stage and described translation result, obtains analysis result;
Each term of retrieval end can be indicated how to retrieve by term analysis, each term carrying out retrieving comprises original term, and, each translation result.
Whether term analysis can comprise: grammatical analysis, cuts word, analyze each word and can omit when follow-up retrieval, general rise of prices of the stocks and other securities information etc.
Such as, for a term " what is machine learning ", cut word analysis can obtain " < what; 1>; < is; 1>; < machine, 0>, < learn; 0> ", four elements, each element first row represents cuts word result, and when secondary series represents retrieval, whether this word can omit, if can omit, comprise other in webpage and cut word result but do not comprise this word and also can be called back.
Carry out term queue according to described analysis result to recall, obtain recalling result;
Retrieval end, according to the multiple term analysis results obtained, carries out queue for each term analysis result and recalls, obtain the result for retrieval of each term.
Queue: the corresponding queue of each query analysis result, this queue is used for carrying out recalling result for retrieval to the back-end, and to be Myself the process wanting to do to recalling result, and such as result filters duplicate removal, sort result position adjustment etc.
Queue is recalled: process is recalled in the retrieval of single query.Queue utilizes query analysis result above, to index rear end inquiry inverted index, obtains result for retrieval, and the final score of the every bar result of comprehensive multifactor calculating, duplicate removal etc. is carried out to result simultaneously.Issue several query analysis result, just carry out several queue and recall process.
According to described correlation values of recalling the term of result and described first stage, carry out term mass filter, obtain described associative search word.
There is the problem of quality in the English query itself converting out due to Chinese and English request, these quality problems can be embodied in the correlativity aspect with existing paper resource.If conversion query is out not high with the paper resource dependencies of itself, just can not provide the good result of corresponding query yet, so this English query be returned to user, this is do not have income and help to user in fact.For this reason, in the present embodiment, do a filtration to the conversion query being finally prompted to user, filter criteria can carry out after queue recalls, carrying out threshold filtering by the correlation values of recalling first bar result and original term in result according to conversion query out.Concrete threshold value can be arranged, e.g., and 500.Such as, the correlation values of what the first translation result was corresponding recall first bar result and original term in result is greater than 500, then associative search word comprises the first translation result.
Above-mentioned term corresponding to result of recalling correlation values being greater than threshold value is defined as associative search word, optionally, can also carry out descending sort according to correlation values, and the term of the preceding predetermined number of sequence is defined as associative search word.
After term mass filter, the first stage can be in or subordinate phase carries out different process according to current.
Select module 92 for obtaining the term of subordinate phase, the term of described subordinate phase is the term selected in described associative search word;
Optionally, described selection module 92 specifically for:
Receive the check boxes of user by arranging, the term selected in described associative search word, the term of described selection is defined as the term of described subordinate phase, the term of described selection is one or at least two, the term of described selection comprises: the term of described first stage, and/or have the term of different language with the term of described first stage, described check boxes is corresponding with described associative search word when showing described associative search word to be arranged.
Wherein, can also arrange check boxes by corresponding each associative search word when showing associative search word, user can select term by clicking check boxes.Optionally, the term of first stage can be selected by default user.
Concrete, see Fig. 3, can select a term in associative search word, such as, the term selected in Fig. 3, namely the term 31 of subordinate phase comprises: " E-commerce ".Or,
See Fig. 4, can select multiple term in associative search word, such as, the term selected in Fig. 4, namely the term 41 of subordinate phase comprises: " ecommerce ", " Electronic commerce ", and " E-commerce ".
In prior art, when initiating again to search for by relevant search, in relevant search term can only be selected to search for, and in the present embodiment, by providing options for associative search word, multiple term can be selected in associative search word, thus combinatorial search can be realized at search word, make result for retrieval can comprise result for retrieval corresponding to any one term selected.
Display module 93 obtains corresponding result for retrieval for the term according to described subordinate phase, and shows described result for retrieval.
Wherein, the result for retrieval of acquisition can comprise the result for retrieval of the term of corresponding each selection, and such as, see Fig. 3, result for retrieval 32 comprises the result for retrieval of corresponding term " E-commerce ".Or,
Such as, see Fig. 4, result for retrieval 42 comprises the result for retrieval of corresponding term " Electronic commerce ", the result for retrieval of corresponding term " E-commerce ", the result for retrieval of corresponding term " ecommerce ".
In addition, when showing result for retrieval, general rise of prices of the stocks and other securities process can be carried out to corresponding term.Such as, see Fig. 3, " E-commerce " in result for retrieval is all carried out general rise of prices of the stocks and other securities process.Or see Fig. 4, " Electronic commerce " in Article 1 result for retrieval all carries out general rise of prices of the stocks and other securities process.
Optionally, described display module 93 specifically for:
Carry out term queue according to the term of described subordinate phase to recall, and carry out merging rearrangement to recalling result;
According to the advanced filter information preset, be combined the result of recalling after rearrangement and screen, obtain the result for retrieval that the term of described subordinate phase is corresponding.
Optionally, see Figure 10, this device 90 also comprises:
Judge module 94, for judging that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase, so that when current retrieval request is the retrieval request of subordinate phase, term according to described subordinate phase obtains corresponding result for retrieval, or, when current retrieval request is the retrieval request of first stage, obtain described associative search word according to the term of described first stage, and the result for retrieval corresponding with the term of described first stage;
Wherein, described retrieval request is URL request, and the current retrieval request of described judgement is the retrieval request of first stage or the retrieval request of subordinate phase, comprising:
Obtain the parameter preset in described URL request, described parameter preset is used for identifying user search behavior;
According to described parameter preset, judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase.
Concrete, can judge according to the sc_tr parameter in URL request, when without sc_tr parameter or sc_tr=1, determine that current retrieval request is the retrieval request of first stage, or, work as sc_tr>1, determine that current retrieval request is the retrieval request of subordinate phase, in addition, can also determine which term user have selected by sc_tr parameter.
In the present embodiment, by obtaining the associative search word of the term of first stage, associative search word comprises the term with the term of first stage with different language, can realize searching for across language domains; By obtaining the term of subordinate phase, the term of subordinate phase is the term selected in associative search word, by selecting the search that can realize one or more term, can realize combinatorial search; Therefore, the present embodiment can realize across language domains search, and can realize combinatorial search, thus improves search effect, promotes Consumer's Experience.
It should be noted that, in describing the invention, term " first ", " second " etc. only for describing object, and can not be interpreted as instruction or hint relative importance.In addition, in describing the invention, except as otherwise noted, the implication of " multiple " is two or more.
Describe and can be understood in process flow diagram or in this any process otherwise described or method, represent and comprise one or more for realizing the module of the code of the executable instruction of the step of specific logical function or process, fragment or part, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can not according to order that is shown or that discuss, comprise according to involved function by the mode while of basic or by contrary order, carry out n-back test, this should understand by embodiments of the invention person of ordinary skill in the field.
Should be appreciated that each several part of the present invention can realize with hardware, software, firmware or their combination.In the above-described embodiment, multiple step or method can with to store in memory and the software performed by suitable instruction execution system or firmware realize.Such as, if realized with hardware, the same in another embodiment, can realize by any one in following technology well known in the art or their combination: the discrete logic with the logic gates for realizing logic function to data-signal, there is the special IC of suitable combinational logic gate circuit, programmable gate array (PGA), field programmable gate array (FPGA) etc.
Those skilled in the art are appreciated that realizing all or part of step that above-described embodiment method carries is that the hardware that can carry out instruction relevant by program completes, described program can be stored in a kind of computer-readable recording medium, this program perform time, step comprising embodiment of the method one or a combination set of.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing module, also can be that the independent physics of unit exists, also can be integrated in a module by two or more unit.Above-mentioned integrated module both can adopt the form of hardware to realize, and the form of software function module also can be adopted to realize.If described integrated module using the form of software function module realize and as independently production marketing or use time, also can be stored in a computer read/write memory medium.
The above-mentioned storage medium mentioned can be ROM (read-only memory), disk or CD etc.
In the description of this instructions, specific features, structure, material or feature that the description of reference term " embodiment ", " some embodiments ", " example ", " concrete example " or " some examples " etc. means to describe in conjunction with this embodiment or example are contained at least one embodiment of the present invention or example.In this manual, identical embodiment or example are not necessarily referred to the schematic representation of above-mentioned term.And the specific features of description, structure, material or feature can combine in an appropriate manner in any one or more embodiment or example.
Although illustrate and describe embodiments of the invention above, be understandable that, above-described embodiment is exemplary, can not be interpreted as limitation of the present invention, and those of ordinary skill in the art can change above-described embodiment within the scope of the invention, revises, replace and modification.

Claims (15)

1. an information retrieval method, is characterized in that, comprising:
According to the term of first stage, obtain the associative search word of the term of described first stage, and show described associative search word, described associative search word comprises the term with the term of described first stage with different language;
Obtain the term of subordinate phase, the term of described subordinate phase is the term selected in described associative search word;
Term according to described subordinate phase obtains corresponding result for retrieval, and shows described result for retrieval.
2. method according to claim 1, is characterized in that, described associative search word also comprises: the term of described first stage, and the associative search word of the described term of described acquisition, comprising:
The term of described first stage is carried out across language switching, obtains the translation result of the term of described first stage;
The term of described first stage and described translation result are screened, obtains described associative search word.
3. method according to claim 2, is characterized in that, if when described information retrieval is used for these retrieval, the described term to the described first stage carries out across language switching, obtains the translation result of the term of described first stage, comprising:
Obtain the translation model and dictionary set up in advance, according to described translation model and dictionary, adopt mechanical translation mode, obtain the translation result of the term of described first stage, wherein, described translation model is according to obtaining across the training of language language material in paper.
4. method according to claim 2, is characterized in that, the described term to the described first stage and described translation result screen, and obtain described associative search word, comprising:
Term analysis is carried out to the term of described first stage and described translation result, obtains analysis result;
Carry out term queue according to described analysis result to recall, obtain recalling result;
According to described correlation values of recalling the term of result and described first stage, carry out term mass filter, obtain described associative search word.
5. method according to claim 1, is characterized in that, described associative search word also comprises the term of described first stage, and the term of described acquisition subordinate phase, comprising:
Receive the check boxes of user by arranging, the term selected in described associative search word, the term of described selection is defined as the term of described subordinate phase, the term of described selection is one or at least two, the term of described selection comprises: the term of described first stage, and/or have the term of different language with the term of described first stage, described check boxes is corresponding with described associative search word when showing described associative search word to be arranged.
6. method according to claim 1, is characterized in that, the described term according to described subordinate phase obtains corresponding result for retrieval, comprising:
Carry out term queue according to the term of described subordinate phase to recall, and carry out merging rearrangement to recalling result;
According to the advanced filter information preset, be combined the result of recalling after rearrangement and screen, obtain the result for retrieval that the term of described subordinate phase is corresponding.
7. method according to claim 1, is characterized in that, also comprises:
Judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase, so that when current retrieval request is the retrieval request of subordinate phase, term according to described subordinate phase obtains corresponding result for retrieval, or, when current retrieval request is the retrieval request of first stage, described associative search word is obtained according to the term of described first stage, and the result for retrieval corresponding with the term of described first stage;
Wherein, described retrieval request is URL request, and the current retrieval request of described judgement is the retrieval request of first stage or the retrieval request of subordinate phase, comprising:
Obtain the parameter preset in described URL request, described parameter preset is used for identifying user search behavior;
According to described parameter preset, judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase.
8. the method according to any one of claim 1-7, is characterized in that, the term of described first stage is Chinese retrieval word, and the term of described subordinate phase is English term.
9. an information indexing device, is characterized in that, comprising:
Acquisition module, for the term according to the first stage, obtain the associative search word of the term of described first stage, and show described associative search word, described associative search word comprises the term with the term of described first stage with different language;
Select module, for obtaining the term of subordinate phase, the term of described subordinate phase is the term selected in described associative search word;
Display module, obtains corresponding result for retrieval for the term according to described subordinate phase, and shows described result for retrieval.
10. device according to claim 9, is characterized in that, described associative search word also comprises: the term of described first stage, and described acquisition module comprises:
First module, for carrying out across language switching the term of described first stage, obtains the translation result of the term of described first stage;
Second unit, for screening the term of described first stage and described translation result, obtains described associative search word.
11. devices according to claim 10, is characterized in that, if described information retrieval be used for these retrieval time, described first module specifically for:
Obtain the translation model and dictionary set up in advance, according to described translation model and dictionary, adopt mechanical translation mode, obtain the translation result of the term of described first stage, wherein, described translation model is according to obtaining across the training of language language material in paper.
12. devices according to claim 10, is characterized in that, described second unit specifically for:
Term analysis is carried out to the term of described first stage and described translation result, obtains analysis result;
Carry out term queue according to described analysis result to recall, obtain recalling result;
According to described correlation values of recalling the term of result and described first stage, carry out term mass filter, obtain described associative search word.
13. devices according to claim 9, is characterized in that, described associative search word also comprises the term of described first stage, described selection module specifically for:
Receive the check boxes of user by arranging, the term selected in described associative search word, the term of described selection is defined as the term of described subordinate phase, the term of described selection is one or at least two, the term of described selection comprises: the term of described first stage, and/or have the term of different language with the term of described first stage, described check boxes is corresponding with described associative search word when showing described associative search word to be arranged.
14. devices according to claim 9, is characterized in that, described display module specifically for:
Carry out term queue according to the term of described subordinate phase to recall, and carry out merging rearrangement to recalling result;
According to the advanced filter information preset, be combined the result of recalling after rearrangement and screen, obtain the result for retrieval that the term of described subordinate phase is corresponding.
15. devices according to claim 9, is characterized in that, also comprise:
Judge module, for judging that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase, so that when current retrieval request is the retrieval request of subordinate phase, term according to described subordinate phase obtains corresponding result for retrieval, or, when current retrieval request is the retrieval request of first stage, obtain described associative search word according to the term of described first stage, and the result for retrieval corresponding with the term of described first stage;
Wherein, described retrieval request is URL request, and the current retrieval request of described judgement is the retrieval request of first stage or the retrieval request of subordinate phase, comprising:
Obtain the parameter preset in described URL request, described parameter preset is used for identifying user search behavior;
According to described parameter preset, judge that current retrieval request is the retrieval request of first stage or the retrieval request of subordinate phase.
CN201510013397.2A 2015-01-12 2015-01-12 Information retrieval method and device Active CN104573019B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510013397.2A CN104573019B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510013397.2A CN104573019B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Publications (2)

Publication Number Publication Date
CN104573019A true CN104573019A (en) 2015-04-29
CN104573019B CN104573019B (en) 2019-04-02

Family

ID=53089081

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510013397.2A Active CN104573019B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Country Status (1)

Country Link
CN (1) CN104573019B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105631052A (en) * 2016-03-01 2016-06-01 北京百度网讯科技有限公司 Artificial intelligence based retrieval method and artificial intelligence based retrieval device
CN106445953A (en) * 2015-08-07 2017-02-22 北京奇虎科技有限公司 Advertisement creative information retrieval method and system
CN106649388A (en) * 2015-11-02 2017-05-10 阿里巴巴集团控股有限公司 Information retrieval method and apparatus
CN106708808A (en) * 2016-12-14 2017-05-24 东软集团股份有限公司 Information mining method and information mining device
CN107632986A (en) * 2016-07-18 2018-01-26 阿里巴巴集团控股有限公司 Searching method and device
CN111125310A (en) * 2019-12-24 2020-05-08 北京百度网讯科技有限公司 Voice query method and device

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1492354A (en) * 2000-06-02 2004-04-28 钧 顾 Multilingual information searching method and multilingual information search engine system
CN1916905A (en) * 2006-09-04 2007-02-21 北京航空航天大学 Method for carrying out retrieval hint based on inverted list
CN101271461A (en) * 2007-03-19 2008-09-24 株式会社东芝 Cross-language retrieval request conversion and cross-language information retrieval method and system
CN101763402A (en) * 2009-12-30 2010-06-30 哈尔滨工业大学 Integrated retrieval method for multi-language information retrieval
CN102117284A (en) * 2009-12-30 2011-07-06 安世亚太科技(北京)有限公司 Method for retrieving cross-language knowledge
CN102651003A (en) * 2011-02-28 2012-08-29 北京百度网讯科技有限公司 Cross-language searching method and device
CN102654867A (en) * 2011-03-02 2012-09-05 北京百度网讯科技有限公司 Webpage sorting method and system in cross-language search
US8332205B2 (en) * 2009-01-09 2012-12-11 Microsoft Corporation Mining transliterations for out-of-vocabulary query terms
CN103294682A (en) * 2012-02-24 2013-09-11 摩根全球购物有限公司 Multi-language retrieving method, computer readable storage medium and network searching system
US8543563B1 (en) * 2012-05-24 2013-09-24 Xerox Corporation Domain adaptation for query translation
US8639701B1 (en) * 2010-11-23 2014-01-28 Google Inc. Language selection for information retrieval
CN103729386A (en) * 2012-10-16 2014-04-16 阿里巴巴集团控股有限公司 Information query system and method

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1492354A (en) * 2000-06-02 2004-04-28 钧 顾 Multilingual information searching method and multilingual information search engine system
CN1916905A (en) * 2006-09-04 2007-02-21 北京航空航天大学 Method for carrying out retrieval hint based on inverted list
CN101271461A (en) * 2007-03-19 2008-09-24 株式会社东芝 Cross-language retrieval request conversion and cross-language information retrieval method and system
US8332205B2 (en) * 2009-01-09 2012-12-11 Microsoft Corporation Mining transliterations for out-of-vocabulary query terms
CN101763402A (en) * 2009-12-30 2010-06-30 哈尔滨工业大学 Integrated retrieval method for multi-language information retrieval
CN102117284A (en) * 2009-12-30 2011-07-06 安世亚太科技(北京)有限公司 Method for retrieving cross-language knowledge
US8639701B1 (en) * 2010-11-23 2014-01-28 Google Inc. Language selection for information retrieval
CN102651003A (en) * 2011-02-28 2012-08-29 北京百度网讯科技有限公司 Cross-language searching method and device
CN102654867A (en) * 2011-03-02 2012-09-05 北京百度网讯科技有限公司 Webpage sorting method and system in cross-language search
CN103294682A (en) * 2012-02-24 2013-09-11 摩根全球购物有限公司 Multi-language retrieving method, computer readable storage medium and network searching system
US8543563B1 (en) * 2012-05-24 2013-09-24 Xerox Corporation Domain adaptation for query translation
CN103729386A (en) * 2012-10-16 2014-04-16 阿里巴巴集团控股有限公司 Information query system and method

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106445953A (en) * 2015-08-07 2017-02-22 北京奇虎科技有限公司 Advertisement creative information retrieval method and system
CN106649388A (en) * 2015-11-02 2017-05-10 阿里巴巴集团控股有限公司 Information retrieval method and apparatus
CN105631052A (en) * 2016-03-01 2016-06-01 北京百度网讯科技有限公司 Artificial intelligence based retrieval method and artificial intelligence based retrieval device
CN107632986A (en) * 2016-07-18 2018-01-26 阿里巴巴集团控股有限公司 Searching method and device
CN107632986B (en) * 2016-07-18 2021-04-06 阿里巴巴集团控股有限公司 Searching method and device
CN106708808A (en) * 2016-12-14 2017-05-24 东软集团股份有限公司 Information mining method and information mining device
CN106708808B (en) * 2016-12-14 2020-01-14 东软集团股份有限公司 Information mining method and device
CN111125310A (en) * 2019-12-24 2020-05-08 北京百度网讯科技有限公司 Voice query method and device

Also Published As

Publication number Publication date
CN104573019B (en) 2019-04-02

Similar Documents

Publication Publication Date Title
CN104573019A (en) Information searching method and device
CN105630871B (en) Search result display methods, device and search system
RU2671044C1 (en) Method and device for data storage
CN109543086A (en) A kind of network data acquisition and methods of exhibiting towards multi-data source
Carnerud et al. Four decades of research on quality: summarising, Trendspotting and looking ahead
CN102880694B (en) The browser client of load classification channel and method in new Shipping Options Page
US8560518B2 (en) Method and apparatus for building sales tools by mining data from websites
CN101408879A (en) Method and system for searching product based on search engine
CN104123366A (en) Search method and server
JP2013531289A (en) Use of model information group in search
US20040015485A1 (en) Method and apparatus for improved internet searching
CN105069077A (en) Search method and device
CN103942198B (en) For excavating the method and apparatus being intended to
Zopf Auto-hMDS: Automatic construction of a large heterogeneous multilingual multi-document summarization corpus
EP3355204A1 (en) Information search method and device
CN106156111A (en) Patent document search method, device and system
CN102819384A (en) Method and device for prompting display at input field
CN103136213A (en) Method and device for providing related words
CN104077314A (en) Method and system for adding browser into favorites and terminal equipment
CN102609539B (en) Search method and search system
CN103500181A (en) Internet information analyzing method and device
JP2010146366A (en) Information providing server
CN103294670A (en) Searching method and system based on word list
CN103198078A (en) Internet news event report trend analysis method and system
CN108572817A (en) The method, apparatus and medium of dynamic resource allocation based on business model

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant