CN104408156A - Method and device for detecting recording quantity of web pages in search engine - Google Patents

Method and device for detecting recording quantity of web pages in search engine Download PDF

Info

Publication number
CN104408156A
CN104408156A CN201410730102.9A CN201410730102A CN104408156A CN 104408156 A CN104408156 A CN 104408156A CN 201410730102 A CN201410730102 A CN 201410730102A CN 104408156 A CN104408156 A CN 104408156A
Authority
CN
China
Prior art keywords
network address
checked
website
detected
search engine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410730102.9A
Other languages
Chinese (zh)
Other versions
CN104408156B (en
Inventor
姜世豪
杨韬
王晓群
谭紫萱
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201410730102.9A priority Critical patent/CN104408156B/en
Publication of CN104408156A publication Critical patent/CN104408156A/en
Application granted granted Critical
Publication of CN104408156B publication Critical patent/CN104408156B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention discloses a method and a device for detecting recording quantity of web pages in a search engine. The method for detecting recording quantity of the web pages in the search engine comprises the steps of: obtaining network addresses of all pages of the website to be detected; determining the network address to be inquired from the network addresses of all pages of the website to be detected; obtaining the network addresses which contain the network address to be inquired; traversing the network addresses which contain the network addresses to be inquired, and detecting whether the pages corresponding to the network addresses which contains the network address to be inquired are recorded by a search engine; if the pages corresponding to the network addresses which contain the network address to be inquired are recorded by the search engine, accumulating the recording quantity of the web pages corresponding to the network addresses in the search engine. According to the method and the device for detecting recording quantity of web pages in the search engine, the problem that in the prior art, the inquire result about recording quantity of the new pages in the search engine is inaccurate is overcome.

Description

Website page includes detection method and the device of quantity in a search engine
Technical field
The present invention relates to internet arena, include detection method and the device of quantity in particular to a kind of Website page in a search engine.
Background technology
Along with the development of the Internet search technology, the flowing of access brought by search engine is in occupation of the major portion of website visiting flow.The flow in search engine source is divided into paid search flow and naturally searches for (i.e. non-payment search) flow.Wherein the input of paid search needs certain fund cost, and naturally to search for the flow brought be free, high-quality, stable.Therefore, the increasing head of a station (advertiser) pays close attention to the performance of oneself website in search naturally.
The size of the performance of naturally searching for and naturally search flow is directly determined by the rank of natural Search Results.Want to improve the rank performance of website in natural Search Results, first the structure optimizing website is needed, allow search engine can grab the website page as much as possible by reptile, with improve website in a search engine include quantity, again keyword disposition optimization is done to the page, and then improve the rank of website in natural Search Results.Therefore, the lifting of quantity included by search engine, is prerequisite and the basis of optimizing website.
Improve website in a search engine include quantity, the method realizations such as the hierarchical structure of optimizing webpage code and adjustment website can be gone by adopting the mode adapting to search engine crawler algorithm.In continuous adaptation and adjustment process, the including quantity and can change, meanwhile, because the algorithm of search engine also has lasting adjustment thereupon of website, therefore website main need pay close attention to website in a search engine include quantity, weigh the effect of optimization of own website.The data variation of the accurate grasp search engine amount of including is vital, and for this reason, search engine provides and a kind ofly carrys out by inputted search code the method that query web includes quantity.The method is by input inquiry order in search engine search box, and search engine is retrieved and returned numerical value to realize from server.But the network address that website has been included by search engine adopts the mode of distributed storage to store on a different server, and the numerical value returned by each server is added as net result.Be limited to network factors, often can not obtain the response of Servers-all, to such an extent as to final Query Result very different under different time, different network environments, have influence on the accurate evaluation of website being included to quantity.
For prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine, at present effective solution is not yet proposed.
Summary of the invention
The detection method that fundamental purpose of the present invention is to provide a kind of Website page to include quantity in a search engine and device, to solve prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.
To achieve these goals, according to an aspect of the present invention, the detection method that a kind of Website page includes quantity is in a search engine provided.
The detection method that this Website page includes quantity in a search engine comprises: the network address obtaining all pages of website to be detected; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up.
Further, after the network address of all pages obtaining website to be detected, the detection method that this Website page includes quantity in a search engine also comprises: detect the network address whether successfully having obtained all pages of website to be detected; If the network address of all pages successfully obtaining website to be detected detected, by the network address of all pages of website to be detected stored in the page network address list of website to be detected.
Further, the network address to be checked is following any class or multiclass network address: the first category network address, and the first category network address is the network address of the homepage of website to be detected; The second classification network address, the second classification network address is the second level domain network address of website to be detected; The 3rd classification network address, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Further, detect and comprise the whether searched engine of the page corresponding to the network address of the network address to be checked and include and comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, travel through all pages of website to be detected; The whether searched engine of all pages detecting website to be detected is included; If judge that the network address to be checked is not the first category network address, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, travel through the page that all second level domain network addresss of website to be detected are corresponding; The whether searched engine of the page detecting all second level domain network addresss of website to be detected corresponding is included; If judge that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; The whether searched engine of the page detecting the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding is included.
Further, the whether searched engine of all pages detecting website to be detected is included and is comprised: the network address searching for all pages of website to be detected respectively in a search engine, whether the Search Results judging in search engine points out the network address of all pages not finding website to be detected, if the Search Results judging in search engine does not point out the network address of all pages not finding website to be detected, determine that the searched engine of all pages of website to be detected is included, detect the whether searched engine of the page corresponding to all second level domain network addresss of website to be detected and include and comprise: the network address searching for the page corresponding to all second level domain network addresss of website to be detected respectively in a search engine, the network address of the page whether Search Results judging in search engine points out all second level domain network addresss of not finding website to be detected corresponding, if the network address of the page that the Search Results judging in search engine does not point out all second level domain network addresss of not finding website to be detected corresponding, determine that the searched engine of the page corresponding to all second level domain network addresss of website to be detected is included, detect the whether searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address to include and comprise: the network address searching for the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address respectively in a search engine, whether the Search Results judging in search engine points out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, if the Search Results judging in search engine does not point out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, determine that the searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is included.
Further, obtain the network address comprising the network address to be checked to comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; By the network address of all pages of website to be detected stored in the first network address list to be checked, or, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; By the network address of the page corresponding for all second level domain network addresss of website to be detected stored in the second network address list to be checked, or, judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; By the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Further, if detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is carried out cumulative comprising: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, traversal the first network address list to be checked; Add up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine, or judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Add up the number that the page corresponding to the network address in the second network address list to be checked is included in a search engine, or judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; Add up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
To achieve these goals, according to a further aspect in the invention, the pick-up unit that a kind of Website page includes quantity is in a search engine provided.
The pick-up unit that this Website page includes quantity in a search engine comprises: the first acquisition module, for obtaining the network address of all pages of website to be detected; Determination module, determines the network address to be checked in the network address for all pages from website to be detected; Second acquisition module, for obtaining the network address comprising the network address to be checked; Detection module, for traveling through the network address comprising the network address to be checked, detecting the whether searched engine of the page comprising the network address of the network address to be checked corresponding and including; Accumulator module, for when detecting that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
Further, determination module comprises: first determines submodule, and for the network address to be checked is defined as the first category network address, wherein, the first category network address is the network address of the homepage of website to be detected; Second determines submodule, and for the network address to be checked is defined as the second classification network address, wherein, the second classification network address is the second level domain network address of website to be detected; 3rd determines submodule, for the network address to be checked is defined as the 3rd classification network address, wherein, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Further, detection module comprises: the first judge module, for judging whether the network address to be checked is the first category network address; First traversal submodule, for when judging that the network address to be checked is the first category network address, travels through all pages of website to be detected; First detection sub-module, the whether searched engine of all pages for detecting website to be detected is included; Second judges submodule, for when judging that the network address to be checked is not the first category network address, judges whether the network address to be checked is the second classification network address; Second traversal submodule, for when judging that the network address to be checked is the second classification network address, travels through the page that all second level domain network addresss of website to be detected are corresponding; Second detection sub-module, the whether searched engine of the page corresponding to all second level domain network addresss for detecting website to be detected is included; 3rd traversal submodule, for when judging that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; 3rd detection sub-module, for detect all pages of website to be detected the network address in the whether searched engine of the page corresponding to the network address except the first category network address and the second classification network address include.
Further, the second acquisition module comprises: the 3rd judges submodule, for judging whether the network address to be checked is the first category network address; 4th determines submodule, and for when judging that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; First memory module, for the network address of all pages by website to be detected stored in the first network address list to be checked; 4th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th determines submodule, and for when judging that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; Second memory module, for the network address of the page corresponding to all second level domain network addresss by website to be detected stored in the second network address list to be checked; 5th judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th determines submodule, for when judging that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; 3rd memory module, for the network address in the network address of all pages by website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Further, accumulator module comprises: the 6th judges submodule, for judging whether the network address to be checked is the first category network address; 4th traversal submodule, for when judging that the network address to be checked is the first category network address, traversal the first network address list to be checked; First statistical module, for adding up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine; 7th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th traversal submodule, for when judging that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Second statistical module, the number the 8th be included in a search engine for adding up the page corresponding to the network address in the second network address list to be checked judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th traversal submodule, for when judging that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; 3rd statistical module, for adding up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
By the present invention, adopt the network address of all pages obtaining website to be detected; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up, solves prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.This invention adopts and submits the network address to be checked to search engine one by one, inquire about and feed back page collection situation in a search engine corresponding to each network address to be checked, and then reaching the accurate count Website page effect of including quantity in a search engine.
Accompanying drawing explanation
The accompanying drawing forming a application's part is used to provide a further understanding of the present invention, and schematic description and description of the present invention, for explaining the present invention, does not form inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the process flow diagram of including the detection method of quantity according to the Website page of the embodiment of the present invention in a search engine;
Fig. 2 is the Search Results schematic diagram of including according to the searched engine of the page that the network address to be checked of the embodiment of the present invention is corresponding;
Fig. 3 is the Search Results schematic diagram of including according to the not searched engine of the page that the network address to be checked of the embodiment of the present invention is corresponding; And
Fig. 4 is the schematic diagram of including the pick-up unit of quantity according to the Website page of the embodiment of the present invention in a search engine.
Embodiment
It should be noted that, when not conflicting, the embodiment in the application and the feature in embodiment can combine mutually.Below with reference to the accompanying drawings and describe the present invention in detail in conjunction with the embodiments.
The application's scheme is understood better in order to make those skilled in the art person, below in conjunction with the accompanying drawing in the embodiment of the present application, technical scheme in the embodiment of the present application is clearly and completely described, obviously, described embodiment is only the embodiment of the application's part, instead of whole embodiments.Based on the embodiment in the application, those of ordinary skill in the art are not making the every other embodiment obtained under creative work prerequisite, all should belong to the scope of the application's protection.
It should be noted that, term " first ", " second " etc. in the instructions of the application and claims and above-mentioned accompanying drawing are for distinguishing similar object, and need not be used for describing specific order or precedence.Should be appreciated that the data used like this can be exchanged, in the appropriate case so that the embodiment of the application described herein.In addition, term " comprises " and " having " and their any distortion, intention is to cover not exclusive comprising, such as, contain those steps or unit that the process of series of steps or unit, method, system, product or equipment is not necessarily limited to clearly list, but can comprise clearly do not list or for intrinsic other step of these processes, method, product or equipment or unit.
The present invention aims to provide detection method and the device that a kind of Website page includes quantity in a search engine.
Fig. 1 is the process flow diagram of including the detection method of quantity according to the Website page of the embodiment of the present invention in a search engine.As shown in Figure 1, the detection method that this Website page includes quantity in a search engine comprises following step S101 to step S105:
Step S101, obtains the network address of all pages of website to be detected.
Website to be detected is need the query web page website of including quantity in a search engine.Preferably, the network address that the detection method that the Website page of this embodiment includes quantity in a search engine obtains all pages of website to be detected utilizes search engine optimization decision data center (Search Engine Optimization Dissector, referred to as SEOD) system existing website crawler technology, crawl the network address of all pages of this website to be detected.The network address obtaining all pages of this website to be detected directly can also be provided the network address of all pages of whole website to be detected by the head of a station of this website to be detected.
Preferably, after the network address of all pages obtaining website to be detected, the detection method that this Website page includes quantity in a search engine also comprises: detect the network address whether successfully having obtained all pages of website to be detected; If the network address of all pages successfully obtaining website to be detected detected, by the network address of all pages of website to be detected stored in the page network address list of website to be detected.Table 1 is the network address list of all pages of abc website, and wherein, in table 1, the network address of the page can be distinguished by page level.
The network address list of all pages of table 1 abc website
Step S102, determines the network address to be checked from the network address of all pages of website to be detected.
The Website page network address to be checked of including in a search engine in the detection method of quantity of this embodiment can be following any class or multiclass network address: the first category network address, and the first category network address is the network address of the homepage of website to be detected; The second classification network address, the second classification network address is the second level domain network address of website to be detected; The 3rd classification network address, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.As shown in table 1, the first category network address is www.abc.com, and the first category network address can be a1.abc.com or a2.abc.com etc., and the first category network address can be a1.abc.com/d1 or b1.abc.com/e1 etc.
Step S103, obtains the network address comprising the network address to be checked.
Preferably, the Website page of this embodiment is included the network address that in the detection method of quantity, acquisition comprises the network address to be checked in a search engine and is comprised: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; By the network address of all pages of website to be detected stored in the first network address list to be checked, or, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; By the network address of the page corresponding for all second level domain network addresss of website to be detected stored in the second network address list to be checked, or, judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; By the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Table 2 first network address list to be checked
The Website page of this embodiment includes the detection method of quantity in a search engine by judging the classification of the network address to be checked, is deposited in corresponding tables of data the network address meeting the network address to be checked.Such as, if the network address to be checked that user exports is the first category network address, now user inquiry be website to be detected all pages in a search engine include quantity, then the first network address list to be checked is identical with the page network address list of website to be detected, is the network address of all pages of whole website to be detected.When such as the querying command of user's input is site:abc.com, the first network address list to be checked is as shown in table 2.
If the network address to be checked that user exports is the second classification network address, now user's inquiry be the page corresponding to all second level domain network addresss of website to be detected in a search engine include quantity, then the network address in the second network address list to be checked should be the network address of the page corresponding to all second level domain network addresss of website to be detected.When such as the querying command of user's input is second level domain site:a1.abc.com, the second network address list to be checked is as shown in table 3.Distinguishingly, as a1=www, the page corresponding to the network address under querying command representative inquiry www domain name in a search engine include quantity, namely all pages in whole website to be detected in a search engine include quantity.
Table 3 second network address list to be checked
Table 4 the 3rd network address list to be checked
If the network address to be checked that user exports is the 3rd classification network address, now user's inquiry be comprise the page corresponding to the network address of specific character string, the page corresponding to the network address in the network address of i.e. all pages of website to be detected except the first category network address and the second classification network address in a search engine include quantity, then the network address in the 3rd network address list to be checked should be the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.When such as the querying command of user's input is second level domain site:a1.abc.com inurl:b1, represent the page corresponding to the network address that comprises b1 character string under user wishes to inquire about a1 second level domain in a search engine include quantity, then the 3rd network address list to be checked is as shown in table 4.Distinguishingly, when the querying command of user's input is site:abc.abc.com inurl:b1, comprise in the all-network address of the whole website to be detected of querying command representative inquiry the page corresponding to the network address of character string b1 in a search engine include quantity.
Step S104, traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included.
Preferably, the detection method that the Website page of this embodiment includes quantity in a search engine detects and comprises the whether searched engine of the page corresponding to the network address of the network address to be checked and include and comprise: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, travel through all pages of website to be detected; The whether searched engine of all pages detecting website to be detected is included; If judge that the network address to be checked is not the first category network address, judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, travel through the page that all second level domain network addresss of website to be detected are corresponding; The whether searched engine of the page detecting all second level domain network addresss of website to be detected corresponding is included; If judge that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; The whether searched engine of the page detecting the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding is included.The Website page of this embodiment includes the detection method of quantity in a search engine by the all-network address in network address list to be checked, and the whether searched engine of the page that requester network address is corresponding is one by one included, and by the result of inquiry stored in tables of data.
The querying method whether single page in this embodiment is included in a search engine is, the network address to be checked (not being with http: // or site :) is inputted in the search box of search engine, Search Results comprises two parts: Part I, if the page corresponding to the network address to be checked searched engine is included, can display of search results brief introduction; If the not searched engine of the page corresponding to the network address to be checked is included, can point out and not find this network address to be checked.Part II is the Search Results comprising this network address to be checked character string.As shown in Figure 2, the Search Results schematic diagram that the not searched engine of the page corresponding to the network address to be checked is included as shown in Figure 3 for the Search Results schematic diagram that the searched engine of the page corresponding to the network address to be checked is included.If the page corresponding to the network address to be checked is included in a search engine, then in search result list, Search Results is designated as 1, otherwise is designated as 0.After having traveled through network address list to be checked, just can obtain including the list of result statistics.
Particularly, the whether searched engine of all pages detecting website to be detected is included and is comprised: the network address searching for all pages of website to be detected respectively in a search engine; Whether the Search Results judging in search engine points out the network address of all pages not finding website to be detected; If the Search Results judging in search engine does not point out the network address of all pages not finding website to be detected, determine that the searched engine of all pages of website to be detected is included.Table 5 includes the results list for the page corresponding to the first category network address.
The page that the table 5 first category network address is corresponding include the results list
The page that the table 6 second classification network address is corresponding include the results list
Particularly, detect the whether searched engine of the page corresponding to all second level domain network addresss of website to be detected to include and comprise: the network address searching for the page corresponding to all second level domain network addresss of website to be detected respectively in a search engine; The network address of the page whether Search Results judging in search engine points out all second level domain network addresss of not finding website to be detected corresponding; If the network address of the page that the Search Results judging in search engine does not point out all second level domain network addresss of not finding website to be detected corresponding, determine that the searched engine of the page corresponding to all second level domain network addresss of website to be detected is included.Table 6 be the page corresponding to the second classification network address include the results list.
Particularly, detect the whether searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address to include and comprise: the network address searching for the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address respectively in a search engine; Whether the Search Results judging in search engine points out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding; If the Search Results judging in search engine does not point out the network address of the page not finding the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address corresponding, determine that the searched engine of the page corresponding to the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is included.As shown in table 7 be the page corresponding to the 3rd classification network address include the results list.
The page that table 7 the 3rd classification network address is corresponding include the results list
Step S105, if detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is carried out cumulative comprising: judge whether the network address to be checked is the first category network address; If judge that the network address to be checked is the first category network address, traversal the first network address list to be checked; Add up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine, shown in table 5, page quantity of including in a search engine corresponding to the first category network address is 9.Judge whether the network address to be checked is the second classification network address; If judge that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Add up the number that the page corresponding to the network address in the second network address list to be checked is included in a search engine, shown in table 6, page quantity of including in a search engine corresponding to the second classification network address is 5.Judge whether the network address to be checked is the 3rd classification network address; If judge that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; Add up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine, shown in table 7, page quantity of including in a search engine corresponding to the 3rd classification network address is 4.
Preferably, obtain the page corresponding to the network address to be checked in a search engine include quantity after, the detection method that the Website page of this embodiment includes quantity in a search engine also comprises and the page corresponding for the network address to be checked obtained quantity of including in a search engine being exported, show with the form of form or icon, be convenient to user intuitively the analyzing web site page in a search engine include quantity.
The Website page of this embodiment includes the network address of all pages of the detection method employing acquisition website to be detected of quantity in a search engine; The network address to be checked is determined from the network address of all pages of website to be detected; Obtain the network address comprising the network address to be checked; Traversal comprises the network address of the network address to be checked, and the whether searched engine of the page detecting the network address that comprises the network address to be checked corresponding is included; If detect that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, webpage corresponding for the network address to be checked quantity of including in a search engine is added up, solve prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine, reach the accurate count Website page effect of including quantity in a search engine.
As can be seen from the above description, the detection method that the Website page of the embodiment of the present invention includes quantity in a search engine achieves technique effect:
Embodiments of the invention can add up exactly Website page in a search engine include quantity.Adopt the mode of simulation browser program in embodiments of the invention, submit to the network address to be checked to judge that the whether searched engine of the page corresponding to the network address to be checked is included to search engine seriatim.This embodiment can carry out a secondary response to each network address to be checked, ensure that control errors is in page level.Error rate is reduced by the method for input inquiry order in a search engine relative to prior art, because a large amount of network addresss can be had in each server, if when having server not respond, the Query Result of the network address of whole server will be caused to make mistakes, the error caused be server level other.Meanwhile, embodiments of the invention can also the query function that provides of match search engine, by analysis and consult order, and then judges the classification of the network address to be checked.In addition, the function of Website page can be crawled by simulation search Engine-Network reptile because SEOD system itself has been provided with, it is hereby ensured the foundation of the network address list of all pages of network to be detected.
It should be noted that, can perform in the computer system of such as one group of computer executable instructions in the step shown in the process flow diagram of accompanying drawing, and, although show logical order in flow charts, but in some cases, can be different from the step shown or described by order execution herein.
The embodiment of the present invention additionally provides the pick-up unit that a kind of Website page includes quantity in a search engine.It should be noted that, the Website page that the pick-up unit that this Website page includes quantity in a search engine may be used for performing the embodiment of the present invention includes the detection method of quantity in a search engine.
Fig. 4 is the schematic diagram of including the pick-up unit of quantity according to the Website page of the embodiment of the present invention in a search engine.As shown in Figure 4, the pick-up unit that this Website page includes quantity in a search engine comprises: the first acquisition module 10, determination module 20, the second acquisition module 30, detection module 40 and accumulator module 50.
First acquisition module 10, for obtaining the network address of all pages of website to be detected.
Determination module 20, determines the network address to be checked in the network address for all pages from website to be detected.
Preferably, determination module 20 comprises: first determines submodule, and for the network address to be checked is defined as the first category network address, wherein, the first category network address is the network address of the homepage of website to be detected; Second determines submodule, and for the network address to be checked is defined as the second classification network address, wherein, the second classification network address is the second level domain network address of website to be detected; 3rd determines submodule, for the network address to be checked is defined as the 3rd classification network address, wherein, the 3rd classification network address is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address.
Second acquisition module 30, for obtaining the network address comprising the network address to be checked.
Preferably, the second acquisition module 30 comprises: the 3rd judges submodule, for judging whether the network address to be checked is the first category network address; 4th determines submodule, and for when judging that the network address to be checked is the first category network address, the network address determining to comprise the network address to be checked is the network address of all pages of website to be detected; First memory module, for the network address of all pages by website to be detected stored in the first network address list to be checked; 4th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th determines submodule, and for when judging that the network address to be checked is the second classification network address, the all-network address determining to comprise the network address to be checked is the network address of the page corresponding to all second level domain network addresss of website to be detected; Second memory module, for the network address of the page corresponding to all second level domain network addresss by website to be detected stored in the second network address list to be checked; 5th judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th determines submodule, for when judging that the network address to be checked is the 3rd classification network address, the all-network address determining to comprise the network address to be checked is the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address; 3rd memory module, for the network address in the network address of all pages by website to be detected except the first category network address and the second classification network address stored in the 3rd network address list to be checked.
Detection module 40, for traveling through the network address comprising the network address to be checked, detecting the whether searched engine of the page comprising the network address of the network address to be checked corresponding and including.
Preferably, detection module 40 comprises: the first judge module, for judging whether the network address to be checked is the first category network address; First traversal submodule, for when judging that the network address to be checked is the first category network address, travels through all pages of website to be detected; First detection sub-module, the whether searched engine of all pages for detecting website to be detected is included; Second judges submodule, for when judging that the network address to be checked is not the first category network address, judges whether the network address to be checked is the second classification network address; Second traversal submodule, for when judging that the network address to be checked is the second classification network address, travels through the page that all second level domain network addresss of website to be detected are corresponding; Second detection sub-module, the whether searched engine of the page corresponding to all second level domain network addresss for detecting website to be detected is included; 3rd traversal submodule, for when judging that the network address to be checked is not the second classification network address, travel through the page that the network address in the network address of all pages of website to be detected except the first category network address and the second classification network address is corresponding; 3rd detection sub-module, for detect all pages of website to be detected the network address in the whether searched engine of the page corresponding to the network address except the first category network address and the second classification network address include.
Accumulator module 50, for when detecting that the searched engine of the page corresponding to the network address that comprises the network address to be checked is included, adds up webpage corresponding for the network address to be checked quantity of including in a search engine.
Preferably, accumulator module 50 comprises: the 6th judges submodule, for judging whether the network address to be checked is the first category network address; 4th traversal submodule, for when judging that the network address to be checked is the first category network address, traversal the first network address list to be checked; First statistical module, for adding up the number that the page corresponding to the network address in the first network address list to be checked is included in a search engine; 7th judges submodule, for judging whether the network address to be checked is the second classification network address; 5th traversal submodule, for when judging that the network address to be checked is the second classification network address, traversal the second network address list to be checked; Second statistical module, the number the 8th be included in a search engine for adding up the page corresponding to the network address in the second network address list to be checked judges submodule, for judging whether the network address to be checked is the 3rd classification network address; 6th traversal submodule, for when judging that the network address to be checked is the 3rd classification network address, traversal the 3rd network address list to be checked; 3rd statistical module, for adding up the number that the page corresponding to the network address in the 3rd network address list to be checked is included in a search engine.
The pick-up unit that the Website page of this embodiment includes quantity in a search engine comprises the first acquisition module 10, first determination module 20, second acquisition module 30, detection module 40, and accumulator module 50.The pick-up unit of being included quantity by the Website page of this embodiment in a search engine solves prior art to the Website page inaccurate problem of the Query Result of including quantity in a search engine.
Obviously, those skilled in the art should be understood that, above-mentioned of the present invention each module or each step can realize with general calculation element, they can concentrate on single calculation element, or be distributed on network that multiple calculation element forms, alternatively, they can realize with the executable program code of calculation element, thus, they can be stored and be performed by calculation element in the storage device, or they are made into each integrated circuit modules respectively, or the multiple module in them or step are made into single integrated circuit module to realize.Like this, the present invention is not restricted to any specific hardware and software combination.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various modifications and variations.Within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (12)

1. Website page includes a detection method for quantity in a search engine, it is characterized in that, comprising:
Obtain the network address of all pages of website to be detected;
The network address to be checked is determined from the network address of all pages of described website to be detected;
Obtain the network address comprising the described network address to be checked;
Traversal comprises the network address of the described network address to be checked, and the whether searched engine of the page detecting the network address that comprises the described network address to be checked corresponding is included;
If detect that the page corresponding to the network address that comprises the described network address to be checked is included by described search engine, the include quantity of webpage corresponding for the described network address to be checked in described search engine is added up.
2. Website page according to claim 1 includes the detection method of quantity in a search engine, it is characterized in that, after the network address of all pages obtaining website to be detected, described method also comprises:
Detect the network address whether successfully having obtained all pages of described website to be detected; And
If the network address of all pages successfully obtaining described website to be detected detected, by the page network address list of the network address of all pages of described website to be detected stored in described website to be detected.
3. Website page according to claim 1 includes the detection method of quantity in a search engine, it is characterized in that, the described network address to be checked is following any class or multiclass network address:
The first category network address, the described first category network address is the network address of the homepage of described website to be detected;
The second classification network address, the described second classification network address is the second level domain network address of described website to be detected; And
The 3rd classification network address, the described 3rd classification network address is the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address.
4. Website page according to claim 3 includes the detection method of quantity in a search engine, it is characterized in that, detects to comprise the whether searched engine of the page corresponding to the network address of the described network address to be checked and include and comprise:
Judge whether the described network address to be checked is the described first category network address;
If judge that the described network address to be checked is the described first category network address, travel through all pages of described website to be detected;
Whether all pages detecting described website to be detected are included by described search engine;
If judge that the described network address to be checked is not the described first category network address, judge whether the described network address to be checked is the described second classification network address;
If judge that the described network address to be checked is the described second classification network address, travel through the page that all second level domain network addresss of described website to be detected are corresponding;
Whether the page detecting all second level domain network addresss of described website to be detected corresponding is included by described search engine;
If judge that the described network address to be checked is not the described second classification network address, travel through the page that the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address is corresponding; And
Whether the page detecting the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address corresponding is included by described search engine.
5. Website page according to claim 4 includes the detection method of quantity in a search engine, it is characterized in that,
Whether all pages detecting described website to be detected are included by described search engine and are comprised:
The network address of all pages of described website to be detected is searched for respectively in described search engine;
Whether the Search Results judging in described search engine points out the network address of all pages not finding described website to be detected; And
If the Search Results judging in described search engine does not point out the network address of all pages not finding described website to be detected, determine that all pages of described website to be detected are included by described search engine,
Detect the page corresponding to all second level domain network addresss of described website to be detected whether to be included by described search engine and comprise:
The network address of the page corresponding to all second level domain network addresss of described website to be detected is searched for respectively in described search engine;
The network address of the page whether Search Results judging in described search engine points out all second level domain network addresss of not finding described website to be detected corresponding; And
If the network address of the page that the Search Results judging in described search engine does not point out all second level domain network addresss of not finding described website to be detected corresponding, determine that the page corresponding to all second level domain network addresss of described website to be detected is included by described search engine
Detect the page corresponding to the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address whether to be included by described search engine and comprise:
The network address of the page corresponding to the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address is searched for respectively in described search engine;
Whether the Search Results judging in described search engine points out the network address of the page not finding the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address corresponding; And
If the Search Results judging in described search engine does not point out the network address of the page not finding the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address corresponding, determine that the page corresponding to the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address is included by described search engine.
6. Website page according to claim 3 includes the detection method of quantity in a search engine, it is characterized in that, obtains the network address comprising the described network address to be checked and comprises:
Judge whether the described network address to be checked is the described first category network address;
If judge that the described network address to be checked is the described first category network address, the network address comprising the described network address to be checked described in determining is the network address of all pages of described website to be detected; And
By the network address of all pages of described website to be detected stored in the first network address list to be checked,
Or,
Judge whether the described network address to be checked is the described second classification network address;
If judge that the described network address to be checked is the described second classification network address, the all-network address comprising the described network address to be checked described in determining is the network address of the page corresponding to all second level domain network addresss of described website to be detected; And
By the network address of the page corresponding for all second level domain network addresss of described website to be detected stored in the second network address list to be checked,
Or,
Judge whether the described network address to be checked is the described 3rd classification network address;
If judge that the described network address to be checked is the described 3rd classification network address, the all-network address comprising the described network address to be checked described in determining is the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address; And
By the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address stored in the 3rd network address list to be checked.
7. Website page according to claim 6 includes the detection method of quantity in a search engine, it is characterized in that, if detect that the searched engine of the page corresponding to the network address that comprises the described network address to be checked is included, webpage corresponding for the described network address to be checked quantity of including in a search engine is carried out cumulative comprising:
Judge whether the described network address to be checked is the described first category network address;
If judge that the described network address to be checked is the described first category network address, travel through described first network address list to be checked; And
Add up the number that the page corresponding to the network address in described first network address list to be checked is included in a search engine,
Or
Judge whether the described network address to be checked is the described second classification network address;
If judge that the described network address to be checked is the described second classification network address, travel through described second network address list to be checked; And
Add up the number that the page corresponding to the network address in described second network address list to be checked is included in a search engine,
Or
Judge whether the described network address to be checked is the described 3rd classification network address;
If judge that the described network address to be checked is the described 3rd classification network address, travel through described 3rd network address list to be checked; And
Add up the number that the page corresponding to the network address in described 3rd network address list to be checked is included in a search engine.
8. Website page includes a pick-up unit for quantity in a search engine, it is characterized in that, comprising:
First acquisition module, for obtaining the network address of all pages of website to be detected;
Determination module, determines the network address to be checked in the network address for all pages from described website to be detected;
Second acquisition module, for obtaining the network address comprising the described network address to be checked;
Detection module, for traveling through the network address comprising the described network address to be checked, detecting the whether searched engine of the page comprising the network address of the described network address to be checked corresponding and including;
Accumulator module, for when detecting that the page corresponding to the network address that comprises the described network address to be checked is included by described search engine, adds up the quantity of including of webpage corresponding for the described network address to be checked in described search engine.
9. Website page according to claim 8 includes the pick-up unit of quantity in a search engine, it is characterized in that, described determination module comprises:
First determines submodule, and for the described network address to be checked is defined as the first category network address, wherein, the described first category network address is the network address of the homepage of described website to be detected;
Second determines submodule, and for the described network address to be checked is defined as the second classification network address, wherein, the described second classification network address is the second level domain network address of described website to be detected; And
3rd determines submodule, for the described network address to be checked is defined as the 3rd classification network address, wherein, the described 3rd classification network address is the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address.
10. Website page according to claim 9 includes the pick-up unit of quantity in a search engine, it is characterized in that, described detection module comprises:
First judge module, for judging whether the described network address to be checked is the described first category network address;
First traversal submodule, for when judging that the described network address to be checked is the described first category network address, travels through all pages of described website to be detected;
Whether the first detection sub-module, included by described search engine for all pages detecting described website to be detected;
Second judges submodule, for when judging that the described network address to be checked is not the described first category network address, judges whether the described network address to be checked is the described second classification network address;
Second traversal submodule, for when judging that the described network address to be checked is the described second classification network address, travels through the page that all second level domain network addresss of described website to be detected are corresponding;
Second detection sub-module, whether the page corresponding to all second level domain network addresss for detecting described website to be detected is included by described search engine;
3rd traversal submodule, for when judging that the described network address to be checked is not the described second classification network address, travel through the page that the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address is corresponding; And
3rd detection sub-module, for detect all pages of described website to be detected the network address in the page corresponding to the network address except the described first category network address and the described second classification network address whether included by described search engine.
11. Website pages according to claim 9 include the pick-up unit of quantity in a search engine, it is characterized in that, described second acquisition module comprises:
3rd judges submodule, for judging whether the described network address to be checked is the described first category network address;
4th determines submodule, and for when judging that the described network address to be checked is the described first category network address, the network address comprising the described network address to be checked described in determining is the network address of all pages of described website to be detected;
First memory module, for the network address of all pages by described website to be detected stored in the first network address list to be checked;
4th judges submodule, for judging whether the described network address to be checked is the described second classification network address;
5th determines submodule, for when judging that the described network address to be checked is the described second classification network address, the all-network address comprising the described network address to be checked described in determining is the network address of the page corresponding to all second level domain network addresss of described website to be detected;
Second memory module, for the network address of the page corresponding to all second level domain network addresss by described website to be detected stored in the second network address list to be checked;
5th judges submodule, for judging whether the described network address to be checked is the described 3rd classification network address;
6th determines submodule, for when judging that the described network address to be checked is the described 3rd classification network address, the all-network address comprising the described network address to be checked described in determining is the network address in the network address of all pages of described website to be detected except the described first category network address and the described second classification network address; And
3rd memory module, for the network address in the network address of all pages by described website to be detected except the described first category network address and the described second classification network address stored in the 3rd network address list to be checked.
12. Website pages according to claim 11 include the pick-up unit of quantity in a search engine, it is characterized in that, described accumulator module comprises:
6th judges submodule, for judging whether the described network address to be checked is the described first category network address;
4th traversal submodule, for when judging that the described network address to be checked is the described first category network address, travels through described first network address list to be checked;
First statistical module, for adding up the number that the page corresponding to the network address in described first network address list to be checked is included in a search engine;
7th judges submodule, for judging whether the described network address to be checked is the described second classification network address;
5th traversal submodule, for when judging that the described network address to be checked is the described second classification network address, travels through described second network address list to be checked;
Second statistical module, for adding up the number that the page corresponding to the network address in described second network address list to be checked is included in a search engine
8th judges submodule, for judging whether the described network address to be checked is the described 3rd classification network address;
6th traversal submodule, for when judging that the described network address to be checked is the described 3rd classification network address, travels through described 3rd network address list to be checked; And
3rd statistical module, for adding up the number that the page corresponding to the network address in described 3rd network address list to be checked is included in a search engine.
CN201410730102.9A 2014-12-03 2014-12-03 Website page includes the detection method and device of quantity in a search engine Active CN104408156B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410730102.9A CN104408156B (en) 2014-12-03 2014-12-03 Website page includes the detection method and device of quantity in a search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410730102.9A CN104408156B (en) 2014-12-03 2014-12-03 Website page includes the detection method and device of quantity in a search engine

Publications (2)

Publication Number Publication Date
CN104408156A true CN104408156A (en) 2015-03-11
CN104408156B CN104408156B (en) 2017-12-22

Family

ID=52645787

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410730102.9A Active CN104408156B (en) 2014-12-03 2014-12-03 Website page includes the detection method and device of quantity in a search engine

Country Status (1)

Country Link
CN (1) CN104408156B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108804585A (en) * 2018-05-25 2018-11-13 网宿科技股份有限公司 A kind of data processing method and device in CDN system
CN110287444A (en) * 2019-07-02 2019-09-27 郑州悉知信息科技股份有限公司 Website detection method, device and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100082661A1 (en) * 2008-09-23 2010-04-01 Microsoft Corporation Linking Search Queries to Rich Media Themes
CN103092937A (en) * 2013-01-08 2013-05-08 合一网络技术(北京)有限公司 Visualization webpage recording detection method
CN103218452A (en) * 2013-04-27 2013-07-24 人民搜索网络股份公司 Method and device for recognizing valid interlinkage in Hub webpage
CN103631828A (en) * 2012-08-28 2014-03-12 阿里巴巴集团控股有限公司 Method and device for determining access path and method and system for determining page churn rate
CN104090931A (en) * 2014-06-25 2014-10-08 华南理工大学 Information prediction and acquisition method based on webpage link parameter analysis

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100082661A1 (en) * 2008-09-23 2010-04-01 Microsoft Corporation Linking Search Queries to Rich Media Themes
CN103631828A (en) * 2012-08-28 2014-03-12 阿里巴巴集团控股有限公司 Method and device for determining access path and method and system for determining page churn rate
CN103092937A (en) * 2013-01-08 2013-05-08 合一网络技术(北京)有限公司 Visualization webpage recording detection method
CN103218452A (en) * 2013-04-27 2013-07-24 人民搜索网络股份公司 Method and device for recognizing valid interlinkage in Hub webpage
CN104090931A (en) * 2014-06-25 2014-10-08 华南理工大学 Information prediction and acquisition method based on webpage link parameter analysis

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108804585A (en) * 2018-05-25 2018-11-13 网宿科技股份有限公司 A kind of data processing method and device in CDN system
CN108804585B (en) * 2018-05-25 2021-11-02 网宿科技股份有限公司 Data processing method and device in CDN system
CN110287444A (en) * 2019-07-02 2019-09-27 郑州悉知信息科技股份有限公司 Website detection method, device and storage medium
CN110287444B (en) * 2019-07-02 2021-06-25 郑州悉知信息科技股份有限公司 Website detection method and device and storage medium

Also Published As

Publication number Publication date
CN104408156B (en) 2017-12-22

Similar Documents

Publication Publication Date Title
CN104123332B (en) The display methods and device of search result
US7392278B2 (en) Building and using subwebs for focused search
US9690846B2 (en) Intelligent navigation of a category system
US20060155751A1 (en) System and method for document analysis, processing and information extraction
Jain et al. Page ranking algorithms in web mining, limitations of existing methods and a new method for indexing web pages
JP5855773B2 (en) Determination of search result ranking based on confidence level values associated with sellers
CN102663048B (en) Method and device for providing search result
US8682881B1 (en) System and method for extracting structured data from classified websites
US8838643B2 (en) Context-aware parameterized action links for search results
CN102880624A (en) Website navigation tool system
US9411895B2 (en) Personalized deeplinks for search results
CN105765573A (en) Improvements in website traffic optimization
CN103839172B (en) Method of Commodity Recommendation and system
CN103970748A (en) Related keyword recommending method and device
CN103699603A (en) Information recommendation method and system based on user behaviors
Prajapati A survey paper on hyperlink-induced topic search (HITS) algorithms for web mining
US20170255653A1 (en) Method for categorizing images to be associated with content items based on keywords of search queries
US10095788B2 (en) Context-sensitive deeplinks
US7143085B2 (en) Optimization of server selection using euclidean analysis of search terms
CN107145497A (en) The method of the image of metadata selected and content matching based on image and content
US20130031091A1 (en) Action-based search results and action view pivoting
Dias et al. Automating the extraction of static content and dynamic behaviour from e-commerce websites
CN102955859B (en) Web page content revealing method and device
CN104462259A (en) Method and equipment for providing search result of time-efficient picture
CN105786810B (en) The method for building up and device of classification mapping relations

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Method and device for detecting recording quantity of web pages in search engine

Effective date of registration: 20190531

Granted publication date: 20171222

Pledgee: Shenzhen Black Horse World Investment Consulting Co., Ltd.

Pledgor: Beijing Guoshuang Technology Co.,Ltd.

Registration number: 2019990000503

CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Patentee after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A

Patentee before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.