CN104199844A - Newly-issued site recording method and device - Google Patents

Newly-issued site recording method and device Download PDF

Info

Publication number
CN104199844A
CN104199844A CN201410389303.7A CN201410389303A CN104199844A CN 104199844 A CN104199844 A CN 104199844A CN 201410389303 A CN201410389303 A CN 201410389303A CN 104199844 A CN104199844 A CN 104199844A
Authority
CN
China
Prior art keywords
webpage
site information
ageing
website
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410389303.7A
Other languages
Chinese (zh)
Inventor
王智广
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201410389303.7A priority Critical patent/CN104199844A/en
Publication of CN104199844A publication Critical patent/CN104199844A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9577Optimising the visualization of content, e.g. distillation of HTML documents

Abstract

The invention provides a newly-issued site recording method and device. The method comprises the steps that whether a webpage is a time-efficient webpage or not is judged; time-efficient content in the time-efficient webpage is analyzed and judged, and information, contained in the time-efficient webpage, of other sites is obtained, wherein the information of the other sites is different from the information of sites in the time-efficient webpage; whether the information of the other sites correspond to newly-issued sites or not is verified, and if yes, the sites corresponding to the information of the other sites serve as the newly-issued sites to be recorded. By means of the newly-issued site recording method, the new sites which cannot be recorded in a traditional webpage hyperlink mode can be effectively recorded in time, the hysteresis quality caused by manual participation is avoided, the time efficiency is improved, and convenience is provided for subsequent using of the new sites.

Description

Website recording method and the device of new issue
Technical field
The present invention relates to technical field of internet application, particularly relate to a kind of website recording method and device of new issue.
Background technology
In computer realm, website (site) refers to the set (generally referring to a LAN (Local Area Network)) of the computing machine that can realize very fast traffic rate physically with good connection, between website, be generally that website is that a kind of of actual physical distribution of online computing machine objectively responded by connecting at a slow speed to realize information communication (generally referring to wide area network).
Along with the development of Internet service, the renewal speed of internet is exceedingly fast, and every day, even a few hours or several minutes all may exist " birth " of new site, and while mentioning for search engine, including new site is a basic requirement.In correlation technique, when enabling, new site can link out by the super chain of other webpages.Yet, exist a part of website just by it is reported, posts, the mode such as a microblogging mentions that new site enables.News report as shown in Figure 1, the new site cloud.360.cn mentioning in report cannot find the super chain relation that comprises it in for a long time when reaching the standard grade, now cannot surpass chain mode by conventional web and include this new site, need to include by manual mode, and artificial participation has hysteresis quality.
Therefore, including those how in time, effectively cannot surpass the new site that chain mode is included by conventional web, becomes the technical matters of needing at present solution badly.
Summary of the invention
In view of the above problems, the present invention has been proposed to a kind of website recording method and corresponding device of the new issue that overcomes the problems referred to above or address the above problem are at least in part provided.
According to one aspect of the present invention, a kind of website recording method of new issue is provided, comprising: judge whether webpage is ageing webpage; Analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and wherein, described other site information are different from the site information of described ageing webpage; Whether described other site information of checking correspond to the website of new issue, if so, website corresponding to described other site information are included as the website of new issue.
Alternatively, describedly judge that whether webpage is ageing webpage, comprising: according to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
Alternatively, described analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and comprising: content of pages part and/or the super chain part of resolving described ageing webpage; Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
Alternatively, described analysis judgment is after the ageing content in ageing webpage, before obtaining other site information that wherein comprise, also comprise: when the webpage of described ageing webpage is a plurality of, according to the content of each webpage, determine the quality of each webpage, wherein, effective content that described webpage comprises is directly proportional to the quality of described webpage; Select quality to surpass a plurality of webpages of assign thresholds; In a plurality of webpages of selecting, obtain other site information that wherein comprise.
Alternatively, described ageing content comprise following one of at least:
Title;
Text;
Peer link.
Alternatively, whether described other site information of checking correspond to the website of new issue, comprising: resolve described other site information, obtain domain-name information; Judge whether domain name information is included before; If not, confirm that described website corresponding to other site information is the website of new issue; If so, confirm that described website corresponding to other site information is not the website of new issue.
Alternatively, whether described other site information of checking correspond to the website of new issue, comprising: resolve described other site information, search Internet protocol IP information; If described other site information have IP, and in search engine, do not include described other site information, the website that described website corresponding to other site information of checking is new issue.
According to another aspect of the present invention, also provide a kind of website of new issue to include device, comprising:
Judge module, is suitable for judging whether webpage is ageing webpage;
Acquisition module, being suitable for analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, wherein, described other site information are different from the site information of described ageing webpage;
Authentication module, is suitable for the website whether described other site information of checking correspond to new issue;
Including module, is yes if be suitable for the result of authentication module, website corresponding to described other site information is included as the website of new issue.
Alternatively, described judge module is also suitable for: according to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
Alternatively, described acquisition module is also suitable for: content of pages part and/or the super chain part of resolving described ageing webpage; Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
Alternatively, described acquisition module is also suitable for: when the webpage of described ageing webpage is a plurality of, determine the quality of each webpage according to the content of each webpage, wherein, effective content that described webpage comprises is directly proportional to the quality of described webpage; Select quality to surpass a plurality of webpages of assign thresholds; In a plurality of webpages of selecting, obtain other site information that wherein comprise.
Alternatively, described ageing content comprise following one of at least:
Title;
Text;
Peer link.
Alternatively, described authentication module is also suitable for: resolve described other site information, obtain domain-name information; Judge whether domain name information is included before; If not, confirm that described website corresponding to other site information is the website of new issue; If so, confirm that described website corresponding to other site information is not the website of new issue.
Alternatively, described authentication module is also suitable for: resolve described other site information, search Internet protocol IP information; If described other site information have IP, and in search engine, do not include described other site information, the website that described website corresponding to other site information of checking is new issue.
According to technical scheme of the present invention, by resolving the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, if so, website corresponding to other site information included as the website of new issue.As can be seen here, the embodiment of the present invention can be resolved the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, and without artificial other site information of finding in ageing webpage, also without website corresponding to other site information of artificial judgment, whether be the website of new issue, thereby solved, prior art mentions: due to just by news report, post, send out the modes such as microblogging and mention that new site enabled, thereby cannot surpass chain mode by conventional web and include this new site, need to include by manual mode, and artificial participation has this problem of hysteresis quality.Therefore, the embodiment of the present invention can be included those in time, effectively cannot surpass the new site that chain mode is included by conventional web, and the hysteresis quality of having avoided artificial participation to bring improves time efficiency, for follow-up use new site facilitates.And, for search engine, greatly promoted it and included the comprehensive and ageing of website, play the effect of search engine optimization.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
According to the detailed description to the specific embodiment of the invention by reference to the accompanying drawings below, those skilled in the art will understand above-mentioned and other objects, advantage and feature of the present invention more.
Accompanying drawing explanation
By reading below detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing is only for the object of preferred implementation is shown, and do not think limitation of the present invention.And in whole accompanying drawing, by identical reference symbol, represent identical parts.In the accompanying drawings:
Fig. 1 shows by news report and mentions the enabled schematic diagram of new site;
Fig. 2 shows the process flow diagram of the website recording method of new issue according to an embodiment of the invention;
Fig. 3 shows by sending out microblogging mode and mentions the enabled schematic diagram of new site;
Fig. 4 shows according to an embodiment of the invention the process flow diagram of website recording method based on checking the new issue of domain name;
Fig. 5 shows according to an embodiment of the invention the process flow diagram of website recording method based on searching the new issue of IP; And
Fig. 6 shows the structural representation that the new according to an embodiment of the invention website of issuing is included device.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in accompanying drawing, yet should be appreciated that and can realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order more thoroughly to understand the disclosure that these embodiment are provided, and can by the scope of the present disclosure complete convey to those skilled in the art.
For solving the problems of the technologies described above, the embodiment of the present invention provides a kind of website recording method of new issue, and Fig. 2 shows the process flow diagram of the website recording method of new issue according to an embodiment of the invention.As shown in Figure 2, the method at least comprises the following steps S202 to step S208.
Step S202, judge whether webpage is ageing webpage, if so, continue execution step S204; Otherwise, finish this flow process.
Step S204, analysis judgment are the ageing content in ageing webpage, obtain other site information that wherein comprise, and wherein, other site information are different from the site information of ageing webpage.
Step S206, verify whether other site information correspond to the website of new issue, if so, continue execution step S208; Otherwise, finish this flow process.
Step S208, website corresponding to other site information included as the website of new issue.
According to technical scheme of the present invention, by resolving the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, if so, website corresponding to other site information included as the website of new issue.As can be seen here, the embodiment of the present invention can be resolved the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, and without artificial other site information of finding in ageing webpage, also without website corresponding to other site information of artificial judgment, whether be the website of new issue, thereby solved, prior art mentions: due to just by news report, post, send out the modes such as microblogging and mention that new site enabled, thereby cannot surpass chain mode by conventional web and include this new site, need to include by manual mode, and artificial participation has this problem of hysteresis quality.Therefore, the embodiment of the present invention can be included those in time, effectively cannot surpass the new site that chain mode is included by conventional web, and the hysteresis quality of having avoided artificial participation to bring improves time efficiency, for follow-up use new site facilitates.And, for search engine, greatly promoted it and included the comprehensive and ageing of website, play the effect of search engine optimization.
The ageing webpage of above mentioning in step S202 refers to that issuing time is no more than appointment duration apart from current, and the search engine webpage of not including.Further, the technological means that step S202 can adopt is: according to the webpage issuing time of webpage and/or front chain info web, confirm whether webpage is ageing webpage.For example, current time is 9:00 in the morning, can be by webpage issuing time the morning webpage of 7:00 to 9:00 confirm as ageing webpage.Again for example, if the front chain webpage of webpage is to be utilized the webpage at the Search Results place that searched key word searches by search engine, in Search Results, be linked to this webpage, think that the searched engine of this webpage included, this webpage is not ageing webpage; Otherwise this webpage is ageing webpage.
After step S202 judgement webpage is ageing webpage, step S204 further resolves the ageing content in ageing webpage, and the ageing content here can be the ageing body matter of webpage, as text, title, peer link etc.The technological means that step S204 can adopt is: resolves content of pages part and/or the super chain part of ageing webpage, and then obtains the site information of mentioning in content of pages part, and/or, the site information of super chain part.For example, in Fig. 1, this webpage is ageing webpage, resolves content of pages part and/or the super chain part of this webpage, now gets the site information of mentioning in content of pages part, i.e. " cloud.360.cn ".Again for example, Fig. 3 is for mentioning the enabled schematic diagram of new site by sending out microblogging mode, and the webpage at this microblogging place is ageing webpage, resolves content of pages part and/or the super chain part of this webpage, now get the site information of super chain part, i.e. " cloud.360.cn ".Certainly, it will be appreciated by persons skilled in the art that the site information of obtaining in the embodiment of the present invention is not limited to a site information, can, according to many site information of ageing contents extraction of reality, all belong to protection scope of the present invention.
The quality of the website extracting due to the ageing webpage of low-quality rubbish is often lower, and the embodiment of the present invention can further identify low-quality ageing webpage.Be that the technological means that step S204 can adopt is: when the webpage of ageing webpage is a plurality of, according to the content of each webpage, determine the quality of each webpage, wherein, effective content that webpage comprises is directly proportional to the quality of webpage, and then select quality to surpass a plurality of webpages of assign thresholds, in a plurality of webpages of selecting, obtain other site information that wherein comprise subsequently.
Above, step S204 gets after other site information, step S206 further verifies whether other site information correspond to the website of new issue, can be by checking domain name or searching IP (Internet Protocol, Internet protocol) mode is verified, will describe this two kinds of modes below in detail.
Mode one, by checking that the mode of domain name verifies.
In mode one, can obtain domain-name information by resolving other site information, and then judge whether this domain-name information is included before, for example judge whether this domain-name information is included by reptile before.If be not included before this domain-name information, confirm that website corresponding to other site information is for the website of new issue; Otherwise, confirm that website corresponding to other site information is not the website of new issue, possible this website was once used, after become invalid or inefficacy website, now enable again.
Mode two, verifies by searching the mode of IP.
In mode two, can, by resolving other site information, search IP information.If other site information have IP, and in search engine, do not include other site information, verified that website corresponding to other site information was for the website of new issue.If other site information have IP, and included other site information in search engine, and verified that website corresponding to other site information was not the website of new issue, possible this website was once used, after become invalid or inefficacy website, now enable again.If other site information do not have IP, verify that website corresponding to other site information is not the new website of issuing.
More than introduced the multiple implementation of each link in the embodiment shown in Fig. 2, the website recording method of the new issue embodiment of the present invention being provided below by concrete preferred embodiment is described further.
Embodiment mono-
Fig. 4 shows according to an embodiment of the invention the process flow diagram of website recording method based on checking the new issue of domain name.As shown in Figure 4, the method comprises the following steps S402 to step S412.
Step S402, according to the webpage issuing time of webpage and/or front chain info web, confirm whether webpage is ageing webpage, if so, continue execution step S404; Otherwise, finish this flow process.The ageing webpage here refers to that issuing time is no more than appointment duration apart from current, and the search engine webpage of not including.For example, current time is 9:00 in the morning, can be by webpage issuing time the morning webpage of 7:00 to 9:00 confirm as ageing webpage.Again for example, if the front chain webpage of webpage is to be utilized the webpage at the Search Results place that searched key word searches by search engine, in Search Results, be linked to this webpage, think that the searched engine of this webpage included, this webpage is not ageing webpage; Otherwise this webpage is ageing webpage.
Step S404, analysis judgment are the ageing content in ageing webpage, obtain other site information that wherein comprise, and wherein, other site information are different from the site information of ageing webpage.The ageing content here can be the ageing body matter of webpage, as text, title, peer link etc.For example, resolve content of pages part and/or the super chain part of ageing webpage, and then obtain the site information of mentioning in content of pages part, and/or, the site information of super chain part.
Further, when if the webpage of ageing webpage is a plurality of, can determine according to the content of each webpage the quality of each webpage, wherein, effective content that webpage comprises is directly proportional to the quality of webpage, and then select quality to surpass a plurality of webpages of assign thresholds, in a plurality of webpages of selecting, obtain other site information that wherein comprise subsequently, thereby low-quality ageing webpage can be identified.
Step S406, by resolving other site information, obtain domain-name information.
Step S408, judged before whether this domain-name information and be included, if so, continued execution step S410; Otherwise, continue execution step S412.
Step S410, confirm that website corresponding to other site information is not the website of new issue, and finish this flow process.
Step S412, confirm that website corresponding to other site information is for the website of new issue, and website corresponding to other site information included as the website of new issue.
In embodiment mono-, can resolve the ageing content in ageing webpage, obtain other site information that wherein comprise, and the mode based on checking domain name verifies whether other site information correspond to the website of new issue, and without artificial other site information of finding in ageing webpage, also without website corresponding to other site information of artificial judgment, whether be the website of new issue, realize in time, effectively include those and cannot surpass the new site that chain mode is included by conventional web, the hysteresis quality of having avoided artificial participation to bring, improve time efficiency, for follow-up use new site facilitates.
Embodiment bis-
Fig. 5 shows according to an embodiment of the invention the process flow diagram of website recording method based on searching the new issue of IP.As shown in Figure 5, the method comprises the following steps S502 to step S514.
Step S502, according to the webpage issuing time of webpage and/or front chain info web, confirm whether webpage is ageing webpage, if so, continue execution step S504; Otherwise, finish this flow process.The ageing webpage here refers to that issuing time is no more than appointment duration apart from current, and the search engine webpage of not including.For example, current time is 9:00 in the morning, can be by webpage issuing time the morning webpage of 7:00 to 9:00 confirm as ageing webpage.Again for example, if the front chain webpage of webpage is to be utilized the webpage at the Search Results place that searched key word searches by search engine, in Search Results, be linked to this webpage, think that the searched engine of this webpage included, this webpage is not ageing webpage; Otherwise this webpage is ageing webpage.
Step S504, analysis judgment are the ageing content in ageing webpage, obtain other site information that wherein comprise, and wherein, other site information are different from the site information of ageing webpage.The ageing content here can be the ageing body matter of webpage, as text, title, peer link etc.For example, resolve content of pages part and/or the super chain part of ageing webpage, and then obtain the site information of mentioning in content of pages part, and/or, the site information of super chain part.
Further, when if the webpage of ageing webpage is a plurality of, can determine according to the content of each webpage the quality of each webpage, wherein, effective content that webpage comprises is directly proportional to the quality of webpage, and then select quality to surpass a plurality of webpages of assign thresholds, in a plurality of webpages of selecting, obtain other site information that wherein comprise subsequently, thereby low-quality ageing webpage can be identified.
Step S506, by resolving other site information, search IP information.
Step S508, judge whether other site information have IP, if so, continue execution step S510; Otherwise, continue execution step S512.
Step S510, judge in search engine, whether to include other site information, if so, continue execution step S512; Otherwise, continue execution step S514.
Step S512, confirm that website corresponding to other site information is not the website of new issue, and finish this flow process.
Step S514, confirm that website corresponding to other site information is for the website of new issue, and website corresponding to other site information included as the website of new issue.
In embodiment bis-, can resolve the ageing content in ageing webpage, obtain other site information that wherein comprise, and the mode based on searching IP verifies whether other site information correspond to the website of new issue, and without artificial other site information of finding in ageing webpage, also without website corresponding to other site information of artificial judgment, whether be the website of new issue, realize in time, effectively include those and cannot surpass the new site that chain mode is included by conventional web, the hysteresis quality of having avoided artificial participation to bring, improve time efficiency, for follow-up use new site facilitates.
It should be noted that, in practical application, above-mentioned all optional embodiments can adopt the mode combination in any of combination, form optional embodiment of the present invention, and this is no longer going to repeat them.
Based on same inventive concept, the embodiment of the present invention also provides a kind of website of new issue to include device, to realize the website recording method of above-mentioned new issue.
Fig. 6 shows the structural representation that the new according to an embodiment of the invention website of issuing is included device.Referring to Fig. 6, this device at least comprises: judge module 610, acquisition module 620, authentication module 630 and include module 640.
The website of now introducing the new issue of the embodiment of the present invention is included each composition or the function of device and annexation between each several part of device:
Judge module 610, is suitable for judging whether webpage is ageing webpage;
Acquisition module 620, is coupled with judge module 610, and being suitable for analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and wherein, other site information are different from the site information of ageing webpage;
Authentication module 630, is coupled with acquisition module 620, is suitable for verifying whether other site information correspond to the website of new issue;
Including module 640, be coupled with authentication module 630, is yes if be suitable for the result of authentication module 630, website corresponding to other site information is included as the website of new issue.
In one embodiment, judge module 610 can also be suitable for: according to the webpage issuing time of webpage and/or front chain info web, confirm whether webpage is ageing webpage.
In one embodiment, acquisition module 620 can also be suitable for: content of pages part and/or the super chain part of resolving ageing webpage; Obtain the site information of mentioning in content of pages part, and/or, the site information of super chain part.
In one embodiment, acquisition module 620 can also be suitable for: when the webpage of ageing webpage is a plurality of, determine the quality of each webpage according to the content of each webpage, wherein, effective content that webpage comprises is directly proportional to the quality of webpage; Select quality to surpass a plurality of webpages of assign thresholds; In a plurality of webpages of selecting, obtain other site information that wherein comprise.
In one embodiment, ageing content comprise following one of at least:
Title;
Text;
Peer link.
In one embodiment, authentication module 630 can also be suitable for: resolve other site information, obtain domain-name information; Judge whether domain-name information is included before; If not, confirm that website corresponding to other site information is for the website of new issue; If so, confirm that website corresponding to other site information is not the new website of issuing.
In one embodiment, authentication module 630 can also be suitable for: resolve other site information, search IP information; If other site information have IP, and in search engine, do not include other site information, verified that website corresponding to other site information was for the website of new issue.
According to the combination of above-mentioned any one preferred embodiment or a plurality of preferred embodiments, the embodiment of the present invention can reach following beneficial effect:
According to technical scheme of the present invention, by resolving the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, if so, website corresponding to other site information included as the website of new issue.As can be seen here, the embodiment of the present invention can be resolved the ageing content in ageing webpage, obtain other site information that wherein comprise, and whether website corresponding to other site information that checking is obtained is the website of new issue, and without artificial other site information of finding in ageing webpage, also without website corresponding to other site information of artificial judgment, whether be the website of new issue, thereby solved, prior art mentions: due to just by news report, post, send out the modes such as microblogging and mention that new site enabled, thereby cannot surpass chain mode by conventional web and include this new site, need to include by manual mode, and artificial participation has this problem of hysteresis quality.Therefore, the embodiment of the present invention can be included those in time, effectively cannot surpass the new site that chain mode is included by conventional web, and the hysteresis quality of having avoided artificial participation to bring improves time efficiency, for follow-up use new site facilitates.And, for search engine, greatly promoted it and included the comprehensive and ageing of website, play the effect of search engine optimization.
In the instructions that provided herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can not put into practice in the situation that there is no these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the above in the description of exemplary embodiment of the present invention, each feature of the present invention is grouped together into single embodiment, figure or sometimes in its description.Yet, the method for the disclosure should be construed to the following intention of reflection: the present invention for required protection requires than the more feature of feature of clearly recording in each claim.Or rather, as reflected in claims below, inventive aspect is to be less than all features of disclosed single embodiment above.Therefore, claims of following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can the module in the equipment in embodiment are adaptively changed and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and can put them into a plurality of submodules or subelement or sub-component in addition.At least some in such feature and/or process or unit are mutually repelling, and can adopt any combination to combine all processes or the unit of disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and disclosed any method like this or equipment.Unless clearly statement in addition, in this instructions (comprising claim, summary and the accompanying drawing followed) disclosed each feature can be by providing identical, be equal to or the alternative features of similar object replaces.
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included in other embodiment, the combination of the feature of different embodiment means within scope of the present invention and forms different embodiment.For example, in claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module moved on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize the some or all functions of including the some or all parts in device according to the website of the new issue of the embodiment of the present invention.The present invention for example can also be embodied as, for carrying out part or all equipment or device program (, computer program and computer program) of method as described herein.Realizing program of the present invention and can be stored on computer-readable medium like this, or can there is the form of one or more signal.Such signal can be downloaded and obtain from internet website, or provides on carrier signal, or provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation that do not depart from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed as element or step in the claims.Being positioned at word " " before element or " one " does not get rid of and has a plurality of such elements.The present invention can be by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to carry out imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title by these word explanations.
So far, those skilled in the art will recognize that, although detailed, illustrate and described a plurality of exemplary embodiment of the present invention herein, but, without departing from the spirit and scope of the present invention, still can directly determine or derive many other modification or the modification that meets the principle of the invention according to content disclosed by the invention.Therefore, scope of the present invention should be understood and regard as and cover all these other modification or modifications.
The present invention also provides the website recording method of A1, a kind of new issue, comprising:
Judge whether webpage is ageing webpage;
Analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and wherein, described other site information are different from the site information of described ageing webpage;
Whether described other site information of checking correspond to the website of new issue, if so, website corresponding to described other site information are included as the website of new issue.
A2, according to the method described in A1, wherein, describedly judge that whether webpage is ageing webpage, comprising:
According to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
A3, according to the method described in A1 or A2, wherein, described analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and comprising:
Resolve content of pages part and/or the super chain part of described ageing webpage;
Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
A4, according to the method described in A1 to A3 any one, wherein, described analysis judgment is after the ageing content in ageing webpage, before obtaining other site information that wherein comprise, also comprises:
When the webpage of described ageing webpage is a plurality of,
According to the content of each webpage, determine the quality of each webpage, wherein, effective content that described webpage comprises is directly proportional to the quality of described webpage;
Select quality to surpass a plurality of webpages of assign thresholds;
In a plurality of webpages of selecting, obtain other site information that wherein comprise.
A5, according to the method described in A1 to A4 any one, wherein, described ageing content comprise following one of at least:
Title;
Text;
Peer link.
A6, according to the method described in A1 to A5 any one, wherein, whether described other site information of checking correspond to the website of new issue, comprising:
Resolve described other site information, obtain domain-name information;
Judge whether domain name information is included before;
If not, confirm that described website corresponding to other site information is the website of new issue;
If so, confirm that described website corresponding to other site information is not the website of new issue.
A7, according to the method described in A1 to A6 any one, wherein, whether described other site information of checking correspond to the website of new issue, comprising:
Resolve described other site information, search Internet protocol IP information;
If described other site information have IP, and in search engine, do not include described other site information, the website that described website corresponding to other site information of checking is new issue.
The website of B8, a kind of new issue is included device, comprising:
Judge module, is suitable for judging whether webpage is ageing webpage;
Acquisition module, being suitable for analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, wherein, described other site information are different from the site information of described ageing webpage;
Authentication module, is suitable for the website whether described other site information of checking correspond to new issue;
Including module, is yes if be suitable for the result of authentication module, website corresponding to described other site information is included as the website of new issue.
B9, according to the device described in B8, wherein, described judge module is also suitable for:
According to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
B10, according to the device described in B8 or B9, wherein, described acquisition module is also suitable for:
Resolve content of pages part and/or the super chain part of described ageing webpage;
Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
B11, according to the device described in B8 to B10 any one, wherein, described acquisition module is also suitable for:
When the webpage of described ageing webpage is a plurality of,
According to the content of each webpage, determine the quality of each webpage, wherein, effective content that described webpage comprises is directly proportional to the quality of described webpage;
Select quality to surpass a plurality of webpages of assign thresholds;
In a plurality of webpages of selecting, obtain other site information that wherein comprise.
B12, according to the device described in B8 to B11 any one, wherein, described ageing content comprise following one of at least:
Title;
Text;
Peer link.
B13, according to the device described in B8 to B12 any one, wherein, described authentication module is also suitable for:
Resolve described other site information, obtain domain-name information;
Judge whether domain name information is included before;
If not, confirm that described website corresponding to other site information is the website of new issue;
If so, confirm that described website corresponding to other site information is not the website of new issue.
B14, according to the device described in B8 to B13 any one, wherein, described authentication module is also suitable for:
Resolve described other site information, search Internet protocol IP information;
If described other site information have IP, and in search engine, do not include described other site information, the website that described website corresponding to other site information of checking is new issue.

Claims (10)

1. a website recording method of newly issuing, comprising:
Judge whether webpage is ageing webpage;
Analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and wherein, described other site information are different from the site information of described ageing webpage;
Whether described other site information of checking correspond to the website of new issue, if so, website corresponding to described other site information are included as the website of new issue.
2. method according to claim 1, wherein, describedly judges that whether webpage is ageing webpage, comprising:
According to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
3. method according to claim 1 and 2, wherein, described analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, and comprising:
Resolve content of pages part and/or the super chain part of described ageing webpage;
Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
4. according to the method described in claims 1 to 3 any one, wherein, described analysis judgment is after the ageing content in ageing webpage, before obtaining other site information that wherein comprise, also comprises:
When the webpage of described ageing webpage is a plurality of,
According to the content of each webpage, determine the quality of each webpage, wherein, effective content that described webpage comprises is directly proportional to the quality of described webpage;
Select quality to surpass a plurality of webpages of assign thresholds;
In a plurality of webpages of selecting, obtain other site information that wherein comprise.
5. according to the method described in claim 1 to 4 any one, wherein, described ageing content comprise following one of at least:
Title;
Text;
Peer link.
6. according to the method described in claim 1 to 5 any one, wherein, whether described other site information of checking correspond to the website of new issue, comprising:
Resolve described other site information, obtain domain-name information;
Judge whether domain name information is included before;
If not, confirm that described website corresponding to other site information is the website of new issue;
If so, confirm that described website corresponding to other site information is not the website of new issue.
7. according to the method described in claim 1 to 6 any one, wherein, whether described other site information of checking correspond to the website of new issue, comprising:
Resolve described other site information, search Internet protocol IP information;
If described other site information have IP, and in search engine, do not include described other site information, the website that described website corresponding to other site information of checking is new issue.
8. the website of new issue is included a device, comprising:
Judge module, is suitable for judging whether webpage is ageing webpage;
Acquisition module, being suitable for analysis judgment is the ageing content in ageing webpage, obtains other site information that wherein comprise, wherein, described other site information are different from the site information of described ageing webpage;
Authentication module, is suitable for the website whether described other site information of checking correspond to new issue;
Including module, is yes if be suitable for the result of authentication module, website corresponding to described other site information is included as the website of new issue.
9. device according to claim 8, wherein, described judge module is also suitable for:
According to the webpage issuing time of described webpage and/or front chain info web, confirm whether described webpage is ageing webpage.
10. device according to claim 8 or claim 9, wherein, described acquisition module is also suitable for:
Resolve content of pages part and/or the super chain part of described ageing webpage;
Obtain the site information of mentioning in described content of pages part, and/or, the site information of described super chain part.
CN201410389303.7A 2014-08-08 2014-08-08 Newly-issued site recording method and device Pending CN104199844A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410389303.7A CN104199844A (en) 2014-08-08 2014-08-08 Newly-issued site recording method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410389303.7A CN104199844A (en) 2014-08-08 2014-08-08 Newly-issued site recording method and device

Publications (1)

Publication Number Publication Date
CN104199844A true CN104199844A (en) 2014-12-10

Family

ID=52085137

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410389303.7A Pending CN104199844A (en) 2014-08-08 2014-08-08 Newly-issued site recording method and device

Country Status (1)

Country Link
CN (1) CN104199844A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070073696A1 (en) * 2005-09-28 2007-03-29 Google, Inc. Online data verification of listing data
CN103092937A (en) * 2013-01-08 2013-05-08 合一网络技术(北京)有限公司 Visualization webpage recording detection method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070073696A1 (en) * 2005-09-28 2007-03-29 Google, Inc. Online data verification of listing data
CN103092937A (en) * 2013-01-08 2013-05-08 合一网络技术(北京)有限公司 Visualization webpage recording detection method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GUJUNSEO.COM: ""解读百度站长社区资料:时效性资源收录问题"", 《HTTP://WWW.CHINAZ.COM/WEB/2012/0531/254926.SHTML》 *
闫峻: ""新一代搜索引擎准确性收录技术的研究"", 《万方数据知识服务平台》 *

Similar Documents

Publication Publication Date Title
CN102739653B (en) Detection method and device aiming at webpage address
CN105430002A (en) Vulnerability detection method and device
KR20140014132A (en) Methods and systems for providing content provider-specified url keyword navigation
CN102833258A (en) Website access method and system
CN102833262A (en) Whois information-based phishing website gathering, identification method and system
CN102957664A (en) Method and device for identifying phishing websites
CN103685603A (en) Domain name system analyzing method and device
CN103152355A (en) Method and system for promoting dangerous website and client device
CN105376217B (en) A kind of malice jumps and the automatic judging method of malice nested class objectionable website
CN106302862B (en) A kind of collection method and system of DNS recursion server
CN103152354A (en) Method and system for promoting dangerous website and client device
CN103399871B (en) Obtain the device and method of an associated second-level domain information of Main Domain
CN105704171B (en) System and method for realizing CDN access
CN103036993A (en) Browser client-side and method of achieving website logging
CN103685606A (en) Associated domain name acquisition method, associated domain name acquisition system and web administrator permission validation method
CN105407186A (en) Method and device for acquiring subdomain names
CN104301311A (en) Method and device for filtering network data content through DNS
CN105516390A (en) Method and device for managing domain name
CN104021154A (en) Method and device for searching browser
CN105530218A (en) Link security detection method and client
CN107577590A (en) Method and device based on database service real-time calling virtual interface
CN103412944A (en) Internet addressing method and device
KR20140037751A (en) Methods and systems for providing content provider-specified url keyword navigation
CN104065736A (en) URL redirection method, device, and system
CN103618742A (en) Method and system for acquiring sub domain names and webmaster permission verification method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20141210