Summary of the invention
The detection method that fundamental purpose of the present invention is to provide a kind of Website page to include quantity in a search engine and device, to solve prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.
To achieve these goals, according to an aspect of the present invention, the detection method that a kind of Website page includes quantity is in a search engine provided.
The detection method that this Website page includes quantity in a search engine comprises: the network address obtaining all pages of website to be detected; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up.
Further, after the network address of all pages obtaining website to be detected, the detection method that this Website page includes quantity in a search engine also comprises: detect the network address whether successfully having obtained all pages of website to be detected; If the network address of all pages successfully obtaining website to be detected detected, by the network address of all pages of website to be detected stored in the page network address list of website to be detected.
Further, the network address to be checked is following any class or multiclass network address: the first category network address, and the first category network address is the network address of the homepage of website to be detected; The second classification network address, the second classification network address is the second level domain network address of website to be detected; The 3rd classification network address, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Further, detect and comprise the whether searched engine of the page corresponding to the network address of the network address to be checked and include and comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, travel through all pages of website to be detected; The whether searched engine of all pages detecting website to be detected is included; If judge that the network address to be checked is not the first category network address, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, travel through the page that all second level domain network addresss of website to be detected are corresponding; The whether searched engine of the page detecting all second level domain network addresss of website to be detected corresponding is included; If judge that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; The whether searched engine of the page detecting the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding is included.
Further, the whether searched engine of all pages detecting website to be detected is included and is comprised: the network address searching for all pages of website to be detected respectively in a search engine, whether the Search Results judging in search engine points out the network address of all pages not finding website to be detected, if the Search Results judging in search engine does not point out the network address of all pages not finding website to be detected, determine that the searched engine of all pages of website to be detected is included, detect the whether searched engine of the page corresponding to all second level domain network addresss of website to be detected and include and comprise: the network address searching for the page corresponding to all second level domain network addresss of website to be detected respectively in a search engine, the network address of the page whether Search Results judging in search engine points out all second level domain network addresss of not finding website to be detected corresponding, if the network address of the page that the Search Results judging in search engine does not point out all second level domain network addresss of not finding website to be detected corresponding, determine that the searched engine of the page corresponding to all second level domain network addresss of website to be detected is included, detect the whether searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address to include and comprise: the network address searching for the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address respectively in a search engine, whether the Search Results judging in search engine points out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, if the Search Results judging in search engine does not point out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, determine that the searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is included.
Further, obtain the network address comprising the network address to be checked to comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; By the network address of all pages of website to be detected stored in the first network address list to be checked, or, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; By the network address of the page corresponding for all second level domain network addresss of website to be detected stored in the second network address list to be checked, or, judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; By the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Further, if detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is carried out cumulative comprising: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, traversal the first network address list to be checked; Add up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine, or judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Add up the number that the page corresponding to the network address in the second network address list to be checked is included in a search engine, or judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; Add up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
To achieve these goals, according to a further aspect in the invention, the pick-up unit that a kind of Website page includes quantity is in a search engine provided.
The pick-up unit that this Website page includes quantity in a search engine comprises: the first acquisition module, for obtaining the network address of all pages of website to be detected; Determination module, determines the network address to be checked in the network address for all pages from website to be detected; Second acquisition module, for obtaining the network address comprising the network address to be checked; Detection module, for traveling through the network address comprising the network address to be checked, detecting the whether searched engine of the page comprising the network address of the network address to be checked corresponding and including; Accumulator module, for when detecting that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
Further, determination module comprises: first determines submodule, and for the network address to be checked is defined as the first category network address, wherein, the first category network address is the network address of the homepage of website to be detected; Second determines submodule, and for the network address to be checked is defined as the second classification network address, wherein, the second classification network address is the second level domain network address of website to be detected; 3rd determines submodule, for the network address to be checked is defined as the 3rd classification network address, wherein, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Further, detection module comprises: the first judge module, for judging whether the network address to be checked is the first category network address; First traversal submodule, for when judging that the network address to be checked is the first category network address, travels through all pages of website to be detected; First detection sub-module, the whether searched engine of all pages for detecting website to be detected is included; Second judges submodule, for when judging that the network address to be checked is not the first category network address, judges whether the network address to be checked is the second classification network address; Second traversal submodule, for when judging that the network address to be checked is the second classification network address, travels through the page that all second level domain network addresss of website to be detected are corresponding; Second detection sub-module, the whether searched engine of the page corresponding to all second level domain network addresss for detecting website to be detected is included; 3rd traversal submodule, for when judging that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; 3rd detection sub-module, for detect all pages of website to be detected the network address in the whether searched engine of the page corresponding to the network address except the first category network address and the second classification network address include.
Further, the second acquisition module comprises: the 3rd judges submodule, for judging whether the network address to be checked is the first category network address; 4th determines submodule, and for when judging that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; First memory module, for the network address of all pages by website to be detected stored in the first network address list to be checked; 4th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th determines submodule, and for when judging that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; Second memory module, for the network address of the page corresponding to all second level domain network addresss by website to be detected stored in the second network address list to be checked; 5th judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th determines submodule, for when judging that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; 3rd memory module, for the network address in the network address of all pages by website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Further, accumulator module comprises: the 6th judges submodule, for judging whether the network address to be checked is the first category network address; 4th traversal submodule, for when judging that the network address to be checked is the first category network address, traversal the first network address list to be checked; First statistical module, for adding up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine; 7th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th traversal submodule, for when judging that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Second statistical module, the number the 8th be included in a search engine for adding up the page corresponding to the network address in the second network address list to be checked judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th traversal submodule, for when judging that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; 3rd statistical module, for adding up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
By the present invention, adopt the network address of all pages obtaining website to be detected; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up, solves prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.This invention adopts and submits the network address to be checked to search engine one by one, inquire about and feed back page collection situation in a search engine corresponding to each network address to be checked, and then reaching the accurate count Website page effect of including quantity in a search engine.
Embodiment
It should be noted that, when not conflicting, the embodiment in the application and the feature in embodiment can combine mutually.Below with reference to the accompanying drawings and describe the present invention in detail in conjunction with the embodiments.
The application's scheme is understood better in order to make those skilled in the art person, below in conjunction with the accompanying drawing in the embodiment of the present application, technical scheme in the embodiment of the present application is clearly and completely described, obviously, described embodiment is only the embodiment of the application's part, instead of whole embodiments.Based on the embodiment in the application, those of ordinary skill in the art are not making the every other embodiment obtained under creative work prerequisite, all should belong to the scope of the application's protection.
It should be noted that, term " first ", " second " etc. in the instructions of the application and claims and above-mentioned accompanying drawing are for distinguishing similar object, and need not be used for describing specific order or precedence.Should be appreciated that the data used like this can be exchanged, in the appropriate case so that the embodiment of the application described herein.In addition, term " comprises " and " having " and their any distortion, intention is to cover not exclusive comprising, such as, contain those steps or unit that the process of series of steps or unit, method, system, product or equipment is not necessarily limited to clearly list, but can comprise clearly do not list or for intrinsic other step of these processes, method, product or equipment or unit.
The present invention aims to provide detection method and the device that a kind of Website page includes quantity in a search engine.
Fig. 1 is the process flow diagram of including the detection method of quantity according to the Website page of the embodiment of the present invention in a search engine.As shown in Figure 1, the detection method that this Website page includes quantity in a search engine comprises following step S101 to step S105:
Step S101, obtains the network address of all pages of website to be detected.
Website to be detected is need the query web page website of including quantity in a search engine.Preferably, the network address that the detection method that the Website page of this embodiment includes quantity in a search engine obtains all pages of website to be detected utilizes search engine optimization decision data center (Search Engine Optimization Dissector, referred to as SEOD) system existing website crawler technology, crawl the network address of all pages of this website to be detected.The network address obtaining all pages of this website to be detected directly can also be provided the network address of all pages of whole website to be detected by the head of a station of this website to be detected.
Preferably, after the network address of all pages obtaining website to be detected, the detection method that this Website page includes quantity in a search engine also comprises: detect the network address whether successfully having obtained all pages of website to be detected; If the network address of all pages successfully obtaining website to be detected detected, by the network address of all pages of website to be detected stored in the page network address list of website to be detected.Table 1 is the network address list of all pages of abc website, and wherein, in table 1, the network address of the page can be distinguished by page level.
The network address list of all pages of table 1 abc website
Step S102, determines the network address to be checked from the network address of all pages of website to be detected.
The Website page network address to be checked of including in a search engine in the detection method of quantity of this embodiment can be following any class or multiclass network address: the first category network address, and the first category network address is the network address of the homepage of website to be detected; The second classification network address, the second classification network address is the second level domain network address of website to be detected; The 3rd classification network address, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.As shown in table 1, the first category network address is www.abc.com, and the first category network address can be a1.abc.com or a2.abc.com etc., and the first category network address can be a1.abc.com/d1 or b1.abc.com/e1 etc.
Step S103, obtains the network address comprising the network address to be checked.
Preferably, the Website page of this embodiment is included the network address that in the detection method of quantity, acquisition comprises the network address to be checked in a search engine and is comprised: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; By the network address of all pages of website to be detected stored in the first network address list to be checked, or, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; By the network address of the page corresponding for all second level domain network addresss of website to be detected stored in the second network address list to be checked, or, judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; By the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Table 2 first network address list to be checked
The Website page of this embodiment includes the detection method of quantity in a search engine by judging the classification of the network address to be checked, is deposited in corresponding tables of data the network address meeting the network address to be checked.Such as, if the network address to be checked that user exports is the first category network address, now user inquiry be website to be detected all pages in a search engine include quantity, then the first network address list to be checked is identical with the page network address list of website to be detected, is the network address of all pages of whole website to be detected.When such as the querying command of user's input is site:abc.com, the first network address list to be checked is as shown in table 2.
If the network address to be checked that user exports is the second classification network address, now user's inquiry be the page corresponding to all second level domain network addresss of website to be detected in a search engine include quantity, then the network address in the second network address list to be checked should be the network address of the page corresponding to all second level domain network addresss of website to be detected.When such as the querying command of user's input is second level domain site:a1.abc.com, the second network address list to be checked is as shown in table 3.Distinguishingly, as a1=www, the page corresponding to the network address under querying command representative inquiry www domain name in a search engine include quantity, namely all pages in whole website to be detected in a search engine include quantity.
Table 3 second network address list to be checked
Table 4 the 3rd network address list to be checked
If the network address to be checked that user exports is the 3rd classification network address, now user's inquiry be comprise the page corresponding to the network address of specific character string, the page corresponding to the network address in the network address of i.e. all pages of website to be detected except the first category network address and the second classification network address in a search engine include quantity, then the network address in the 3rd network address list to be checked should be the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.When such as the querying command of user's input is second level domain site:a1.abc.com inurl:b1, represent the page corresponding to the network address that comprises b1 character string under user wishes to inquire about a1 second level domain in a search engine include quantity, then the 3rd network address list to be checked is as shown in table 4.Distinguishingly, when the querying command of user's input is site:abc.abc.com inurl:b1, comprise in the all-network address of the whole website to be detected of querying command representative inquiry the page corresponding to the network address of character string b1 in a search engine include quantity.
Step S104, traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included.
Preferably, the detection method that the Website page of this embodiment includes quantity in a search engine detects and comprises the whether searched engine of the page corresponding to the network address of the network address to be checked and include and comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, travel through all pages of website to be detected; The whether searched engine of all pages detecting website to be detected is included; If judge that the network address to be checked is not the first category network address, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, travel through the page that all second level domain network addresss of website to be detected are corresponding; The whether searched engine of the page detecting all second level domain network addresss of website to be detected corresponding is included; If judge that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; The whether searched engine of the page detecting the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding is included.The Website page of this embodiment includes the detection method of quantity in a search engine by the all-network address in network address list to be checked, and the whether searched engine of the page that requester network address is corresponding is one by one included, and by the result of inquiry stored in tables of data.
The querying method whether single page in this embodiment is included in a search engine is, the network address to be checked (not being with http: // or site :) is inputted in the search box of search engine, Search Results comprises two parts: Part I, if the page corresponding to the network address to be checked searched engine is included, can display of search results brief introduction; If the not searched engine of the page corresponding to the network address to be checked is included, can point out and not find this network address to be checked.Part II is the Search Results comprising this network address to be checked character string.As shown in Figure 2, the Search Results schematic diagram that the not searched engine of the page corresponding to the network address to be checked is included as shown in Figure 3 for the Search Results schematic diagram that the searched engine of the page corresponding to the network address to be checked is included.If the page corresponding to the network address to be checked is included in a search engine, then in search result list, Search Results is designated as 1, otherwise is designated as 0.After having traveled through network address list to be checked, just can obtain including the list of result statistics.
Particularly, the whether searched engine of all pages detecting website to be detected is included and is comprised: the network address searching for all pages of website to be detected respectively in a search engine; Whether the Search Results judging in search engine points out the network address of all pages not finding website to be detected; If the Search Results judging in search engine does not point out the network address of all pages not finding website to be detected, determine that the searched engine of all pages of website to be detected is included.Table 5 includes the results list for the page corresponding to the first category network address.
The page that the table 5 first category network address is corresponding include the results list
The page that the table 6 second classification network address is corresponding include the results list
Particularly, detect the whether searched engine of the page corresponding to all second level domain network addresss of website to be detected to include and comprise: the network address searching for the page corresponding to all second level domain network addresss of website to be detected respectively in a search engine; The network address of the page whether Search Results judging in search engine points out all second level domain network addresss of not finding website to be detected corresponding; If the network address of the page that the Search Results judging in search engine does not point out all second level domain network addresss of not finding website to be detected corresponding, determine that the searched engine of the page corresponding to all second level domain network addresss of website to be detected is included.Table 6 be the page corresponding to the second classification network address include the results list.
Particularly, detect the whether searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address to include and comprise: the network address searching for the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address respectively in a search engine; Whether the Search Results judging in search engine points out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding; If the Search Results judging in search engine does not point out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, determine that the searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is included.As shown in table 7 be the page corresponding to the 3rd classification network address include the results list.
The page that table 7 the 3rd classification network address is corresponding include the results list
Step S105, if detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is carried out cumulative comprising: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, traversal the first network address list to be checked; Add up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine, shown in table 5, page quantity of including in a search engine corresponding to the first category network address is 9.Judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Add up the number that the page corresponding to the network address in the second network address list to be checked is included in a search engine, shown in table 6, page quantity of including in a search engine corresponding to the second classification network address is 5.Judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; Add up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine, shown in table 7, page quantity of including in a search engine corresponding to the 3rd classification network address is 4.
Preferably, obtain the page corresponding to the network address to be checked in a search engine include quantity after, the detection method that the Website page of this embodiment includes quantity in a search engine also comprises and the page corresponding for the network address to be checked obtained quantity of including in a search engine being exported, show with the form of form or icon, be convenient to user intuitively the analyzing web site page in a search engine include quantity.
The Website page of this embodiment includes the network address of all pages of the detection method employing acquisition website to be detected of quantity in a search engine; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up, solve prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine, reach the accurate count Website page effect of including quantity in a search engine.
As can be seen from the above description, the detection method that the Website page of the embodiment of the present invention includes quantity in a search engine achieves technique effect:
Embodiments of the invention can add up exactly Website page in a search engine include quantity.Adopt the mode of simulation browser program in embodiments of the invention, submit to the network address to be checked to judge that the whether searched engine of the page corresponding to the network address to be checked is included to search engine seriatim.This embodiment can carry out a secondary response to each network address to be checked, ensure that control errors is in page level.Error rate is reduced by the method for input inquiry order in a search engine relative to prior art, because a large amount of network addresss can be had in each server, if when having server not respond, the Query Result of the network address of whole server will be caused to make mistakes, the error caused be server level other.Meanwhile, embodiments of the invention can also the query function that provides of match search engine, by analysis and consult order, and then judges the classification of the network address to be checked.In addition, the function of Website page can be crawled by simulation search Engine-Network reptile because SEOD system itself has been provided with, it is hereby ensured the foundation of the network address list of all pages of network to be detected.
It should be noted that, can perform in the computer system of such as one group of computer executable instructions in the step shown in the process flow diagram of accompanying drawing, and, although show logical order in flow charts, but in some cases, can be different from the step shown or described by order execution herein.
The embodiment of the present invention additionally provides the pick-up unit that a kind of Website page includes quantity in a search engine.It should be noted that, the Website page that the pick-up unit that this Website page includes quantity in a search engine may be used for performing the embodiment of the present invention includes the detection method of quantity in a search engine.
Fig. 4 is the schematic diagram of including the pick-up unit of quantity according to the Website page of the embodiment of the present invention in a search engine.As shown in Figure 4, the pick-up unit that this Website page includes quantity in a search engine comprises: the first acquisition module 10, determination module 20, the second acquisition module 30, detection module 40 and accumulator module 50.
First acquisition module 10, for obtaining the network address of all pages of website to be detected.
Determination module 20, determines the network address to be checked in the network address for all pages from website to be detected.
Preferably, determination module 20 comprises: first determines submodule, and for the network address to be checked is defined as the first category network address, wherein, the first category network address is the network address of the homepage of website to be detected; Second determines submodule, and for the network address to be checked is defined as the second classification network address, wherein, the second classification network address is the second level domain network address of website to be detected; 3rd determines submodule, for the network address to be checked is defined as the 3rd classification network address, wherein, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Second acquisition module 30, for obtaining the network address comprising the network address to be checked.
Preferably, the second acquisition module 30 comprises: the 3rd judges submodule, for judging whether the network address to be checked is the first category network address; 4th determines submodule, and for when judging that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; First memory module, for the network address of all pages by website to be detected stored in the first network address list to be checked; 4th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th determines submodule, and for when judging that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; Second memory module, for the network address of the page corresponding to all second level domain network addresss by website to be detected stored in the second network address list to be checked; 5th judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th determines submodule, for when judging that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; 3rd memory module, for the network address in the network address of all pages by website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Detection module 40, for traveling through the network address comprising the network address to be checked, detecting the whether searched engine of the page comprising the network address of the network address to be checked corresponding and including.
Preferably, detection module 40 comprises: the first judge module, for judging whether the network address to be checked is the first category network address; First traversal submodule, for when judging that the network address to be checked is the first category network address, travels through all pages of website to be detected; First detection sub-module, the whether searched engine of all pages for detecting website to be detected is included; Second judges submodule, for when judging that the network address to be checked is not the first category network address, judges whether the network address to be checked is the second classification network address; Second traversal submodule, for when judging that the network address to be checked is the second classification network address, travels through the page that all second level domain network addresss of website to be detected are corresponding; Second detection sub-module, the whether searched engine of the page corresponding to all second level domain network addresss for detecting website to be detected is included; 3rd traversal submodule, for when judging that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; 3rd detection sub-module, for detect all pages of website to be detected the network address in the whether searched engine of the page corresponding to the network address except the first category network address and the second classification network address include.
Accumulator module 50, for when detecting that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
Preferably, accumulator module 50 comprises: the 6th judges submodule, for judging whether the network address to be checked is the first category network address; 4th traversal submodule, for when judging that the network address to be checked is the first category network address, traversal the first network address list to be checked; First statistical module, for adding up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine; 7th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th traversal submodule, for when judging that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Second statistical module, the number the 8th be included in a search engine for adding up the page corresponding to the network address in the second network address list to be checked judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th traversal submodule, for when judging that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; 3rd statistical module, for adding up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
The pick-up unit that the Website page of this embodiment includes quantity in a search engine comprises the first acquisition module 10, first determination module 20, second acquisition module 30, detection module 40, and accumulator module 50.The pick-up unit of being included quantity by the Website page of this embodiment in a search engine solves prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.
Obviously, those skilled in the art should be understood that, above-mentioned of the present invention each module or each step can realize with general calculation element, they can concentrate on single calculation element, or be distributed on network that multiple calculation element forms, alternatively, they can realize with the executable program code of calculation element, thus, they can be stored and be performed by calculation element in the storage device, or they are made into each integrated circuit modules respectively, or the multiple module in them or step are made into single integrated circuit module to realize.Like this, the present invention is not restricted to any specific hardware and software combination.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various modifications and variations.Within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.