CN102999576A - Method and equipment for confirming page description information corresponding to target pages - Google Patents

Method and equipment for confirming page description information corresponding to target pages

Info

Publication number
CN102999576A
CN102999576A CN2012104528436A CN201210452843A CN102999576A CN 102999576 A CN102999576 A CN 102999576A CN 2012104528436 A CN2012104528436 A CN 2012104528436A CN 201210452843 A CN201210452843 A CN 201210452843A CN 102999576 A CN102999576 A CN 102999576A
Authority
CN
China
Prior art keywords
information
page
target pages
equipment
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012104528436A
Other languages
Chinese (zh)
Other versions
CN102999576B (en)
Inventor
唐振江
董冰峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210452843.6A
Publication of CN102999576A
Application granted
Publication of CN102999576B
Legal status: Active
Anticipated expiration

Abstract

The invention aims to provide a method and equipment for determining the page description information corresponding to a target page. The method comprises the following steps: determining classification-related information corresponding to a target page to be processed; and adjusting candidate description information corresponding to the target page according to the classification-related information, so as to obtain the page description information corresponding to the target page. Compared with the prior art, the method has the advantage that, because the candidate description information is adjusted according to the classification-related information determined for the target page, the resulting page description information is more accurate, which improves the efficiency with which users obtain information, improves their browsing and reading experience, and saves user-equipment resources.

Description

Method and apparatus for determining page description information corresponding to a target page
Technical field
The present invention relates to the field of Internet technology, and in particular to a technique for determining the page description information corresponding to a target page.
Background art
At present, with the development of Internet technology and the spread of Internet applications into people's study, work, and daily life, people increasingly obtain information over the network, for example by browsing pages or by searching for pages on a particular topic. Accordingly, if the page description information of a target page can be determined accurately, the efficiency with which users obtain information can be improved significantly, for example by providing more suitable page results to search users, or by pushing other, more relevant information to users browsing a page. In the prior art, however, the description information of a page is often determined only by segmenting the page into words and then counting word frequencies, and page description information obtained this way often contains large errors. For example, suppose a user interested in "composition" is browsing a composition-writing page. If the page contains a model essay about "pyramid-shaped dumpling" (zongzi), the prior art obtains "pyramid-shaped dumpling" rather than "composition" as the description information of the page. In particular, with the proliferation of search engine optimization and website optimization techniques, page description information obtained with the prior art is increasingly unreliable, which seriously harms the efficiency and experience of obtaining information.
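The word-frequency approach described above, and the way it fails on the composition page, can be illustrated with a minimal sketch. The function name `top_terms` and the sample page text are assumptions made up for this illustration; real Chinese pages would also need a word segmenter, which is replaced here by a simple regular expression over English text.

```python
import re
from collections import Counter


def top_terms(text: str, k: int = 1) -> list:
    """Return the k most frequent lowercase word tokens in `text`."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [term for term, _ in Counter(tokens).most_common(k)]


# Hypothetical body of a composition-writing page that holds one model essay.
page_text = (
    "Composition writing: a model essay. "
    "My grandmother makes a pyramid-shaped dumpling every year. "
    "Each dumpling is wrapped in reed leaves, "
    "and the whole family shares the dumpling feast."
)

# The most frequent term describes the essay's subject, not the page's purpose.
descriptor = top_terms(page_text)
```

Here `descriptor` names the essay's subject ("dumpling") even though a reader of the page is interested in composition writing, which is the error the background section attributes to pure word-frequency statistics.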
Summary of the invention
The object of the present invention is to provide a method and apparatus for determining the page description information corresponding to a target page.
According to one aspect of the present invention, a method for determining the page description information corresponding to a target page is provided, wherein the method comprises the following steps:
A. determining the classification-related information corresponding to a target page to be processed;
B. adjusting the candidate description information corresponding to the target page according to the classification-related information, so as to obtain the page description information corresponding to the target page.
According to another aspect of the present invention, information determination equipment for determining the page description information corresponding to a target page is also provided, wherein the information determination equipment comprises:
a classification device, configured to determine the classification-related information corresponding to a target page to be processed;
a determination device, configured to adjust the candidate description information corresponding to the target page according to the classification-related information, so as to obtain the page description information corresponding to the target page.
According to a further aspect of the present invention, computer equipment is also provided, which comprises the aforementioned information determination equipment for determining the page description information corresponding to a target page.
Compared with the prior art, the present invention adjusts the candidate description information corresponding to the target page according to the classification-related information determined for the target page, so as to obtain the page description information corresponding to the target page. This makes the page description information of the target page more accurate, which not only improves the efficiency with which users obtain information but also improves their browsing and reading experience and saves user-equipment resources. Moreover, the present invention can also determine the presentation information corresponding to the target page according to the page description information, which further improves both the efficiency of providing information and the efficiency with which users obtain it. Further, the present invention can also determine content erotic degree information of the target page and determine the presentation information corresponding to the target page according to the page description information in combination with that content erotic degree information, which further improves both the efficiency of providing information and the efficiency with which users obtain it, and correspondingly improves the user's browsing and reading experience. In addition, the present invention can also perform subsequent processing on search results according to the degree of matching between the search sequence and the page description information of the pages corresponding to the search results, which further shortens the time users spend on web searches, reduces their access traffic, improves the efficiency with which they obtain information, and improves their search and browsing experience.
Description of drawings
Other features, objects, and advantages of the present invention will become more apparent from the following detailed description of non-limiting embodiments, read with reference to the accompanying drawings:
Fig. 1 is a schematic diagram of equipment for determining the page description information corresponding to a target page according to one aspect of the present invention;
Fig. 2 is a schematic diagram of equipment for determining the page description information corresponding to a target page according to a preferred embodiment of the present invention;
Fig. 3 is a flow chart of a method for determining the page description information corresponding to a target page according to another aspect of the present invention;
Fig. 4 is a flow chart of a method for determining the page description information corresponding to a target page according to a preferred embodiment of the present invention.
The same or similar reference numerals denote the same or similar components in the drawings.
Detailed description of the embodiments
The present invention is described in further detail below with reference to the accompanying drawings.
Fig. 1 shows information determination equipment 1 for determining the page description information corresponding to a target page according to one aspect of the present invention, wherein the information determination equipment 1 comprises a classification device 11 and a determination device 12. Specifically, the classification device 11 determines the classification-related information corresponding to a target page to be processed; the determination device 12 adjusts the candidate description information corresponding to the target page according to the classification-related information, so as to obtain the page description information corresponding to the target page. Here, the information determination equipment 1 includes, but is not limited to, network equipment, user equipment, or equipment formed by integrating network equipment and user equipment over a network. The network equipment includes, but is not limited to, a network host, a single network server, a set of multiple network servers, or a set of computers based on cloud computing; alternatively, the information determination equipment 1 is implemented by user equipment. Here, the cloud consists of a large number of hosts or network servers based on cloud computing, where cloud computing is a form of distributed computing: a super virtual computer made up of a group of loosely coupled computers. The user equipment may be any electronic product capable of human-computer interaction with the user through a keyboard, mouse, touch pad, touch screen, handwriting device, or the like, such as a computer, mobile phone, PDA, pocket PC (PPC), or tablet computer. The network includes, but is not limited to, the Internet, wide area networks, metropolitan area networks, local area networks, VPNs, and wireless ad hoc networks. Those skilled in the art will understand that the above information determination equipment 1 is only an example; other existing or future network equipment or user equipment, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference. Here, the network equipment and the user equipment each include an electronic device that can automatically perform numerical computation and information processing according to instructions set or stored in advance, whose hardware includes, but is not limited to, microprocessors, application-specific integrated circuits (ASIC), field-programmable gate arrays (FPGA), digital signal processors (DSP), and embedded devices.
Specifically, the classification device 11 first obtains the target page to be processed through an application programming interface (API) provided by third-party equipment such as a browser or a search engine; or it obtains, by means of dynamic web page technologies such as ASP or JSP, the search sequence entered by the user through user equipment, submits the search sequence to a search engine, and receives the search results that the search engine feeds back for that search sequence as the target pages to be processed; or it obtains the target page to be processed through an agreed communication protocol such as http or https. The classification device 11 then determines the classification-related information corresponding to the target page. Here, the classification-related information includes, but is not limited to, at least any one of the following:
1) Virtual theme. Here, the virtual theme means that the page body content of the target page can reflect the access intent of the users who visit it. For example, suppose the body content of the target page "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a model essay about a rowing regatta, and the user browsing this page wishes to learn about composition writing; then the classification-related information corresponding to this target page is a virtual theme such as "composition". As another example, suppose the body content of the target page "download of fresh flower material" (http://sucai.redocn.com/category/260/) consists of pictures of fresh flowers, and the user browsing this page wishes to obtain fresh-flower material for artistic creation; then the classification-related information corresponding to this target page is a virtual theme such as "art material".
2) Exact matching object. Here, the exact matching object means that the target page contains content fully consistent with the user's need, and that this need cannot be substituted. For example, suppose the target page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing) contains information about hospitals and attending doctors for the disease "canker sore", and the user browsing this page wishes to find pages about treating "canker sore" rather than another disease such as "rhinitis"; then the classification-related information corresponding to this target page is the exact matching object. As another example, suppose the target page "IBM minicomputer IBMPOWER720" (http://www.xinhuigroup.com/Product/10026/11479.html) contains the product introduction and specification parameters of the IBM minicomputer IBM POWER720, and the user browsing this page wishes to find information about the IBM minicomputer IBMPOWER720 rather than pages about another model such as "IBM POWER 550"; then the classification-related information corresponding to this target page is the exact matching object.
3) Broad matching object. Here, the broad matching object means that the content of the target page is related to the user's need. For example, suppose the target page is "iphone5 pink outer casing protective sleeve with a heart pattern on the back" (http://www.vipshop.com/show-0-48369-0.html?), and the user browsing this page may also be interested in other accessories for iphone5 devices, such as an "apple data line", and in other brands of the same kind of product as the "iphone5", namely smartphones, such as "nokia" smartphones; then the classification-related information corresponding to this target page is the broad matching object.
4) Mismatch object. Here, the mismatch object means that the target page is not suited to carrying presentation information other than the content that the user came to this page to obtain. For example, when a user reads a news report such as "Expert says Obama is both foe and friend to China, and the return-to-Asia-Pacific strategy will deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), the user attends to the content of the report and pays no attention to other content on the page; then the classification-related information corresponding to this page is a mismatch object such as "news report".
Those skilled in the art will understand that the above classification-related information is only an example; other existing or future classification-related information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference.
For example, the user enters the address http://news.sina.com.cn/ in the browser address bar and presses the Enter key; the classification device 11 then obtains the web page corresponding to http://news.sina.com.cn/ through an application programming interface (API) provided by third-party equipment such as a news website. As another example, the user enters the keyword "iphone accessory" in a search box on user equipment such as a PC and clicks the search button; the classification device 11 then obtains the search sequence entered by the user from the user equipment by means of dynamic web page technologies such as JSP or ASP, submits a search request based on this search sequence to a search engine, and, through an API provided by the search engine, obtains one or more search results that the search engine matched to the keyword "iphone accessory", such as "iphone accessory [market price evaluation certified products crudely-made articles]" and "iphone accessory Apple Store (China)", as the target pages to be processed.
Those skilled in the art will understand that the above ways of obtaining the target page to be processed are only examples; other existing or future ways of obtaining the target page to be processed, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, the classification device 11 determines the classification-related information corresponding to the target page to be processed. Here, the ways in which the classification device 11 determines the classification-related information corresponding to the target page include, but are not limited to, at least any one of the following:
1) Determining the classification-related information corresponding to the target page according to the page body content of the target page. Specifically, the classification device 11 first extracts the page body content of the target page, for example by analyzing the page's HTML tags; or it divides the target page into blocks according to the VIPS (Vision-based Page Segmentation) algorithm, using visual features such as the page's foreground color, background color, font color and size, borders, the spacing between logical blocks, and element positions, so as to obtain the body content block of the target page. The classification device 11 then determines the classification-related information corresponding to the target page according to its page body content. For example, suppose the target page obtained by the classification device 11 is a news report such as "Expert says Obama is both foe and friend to China, and the return-to-Asia-Pacific strategy will deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml); the classification device 11 extracts the page body content of this target page, for example by analyzing the page's HTML tags, finds it to be the news report "Obama is both foe and friend to China, and the return-to-Asia-Pacific strategy will deepen", and therefore determines that the classification-related information corresponding to this target page is the mismatch object. As another example, suppose the target page obtained by the classification device 11 is the page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing), which concerns the treatment of a disease such as "canker sore"; this target page contains content fully consistent with the user's need, so the classification device 11 determines that the classification-related information corresponding to this target page is the exact matching object.
2) Determining the classification-related information corresponding to the target page according to the page access records of the users who access the target page. For example, a user browses a page such as "iphone accessories, Vipshop low-price flash sale! Digital accessory special show, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), and the access records show that this user is also interested in other accessories for iphone5 devices, such as an "apple data line", and in other brands of the same kind of product as the "iphone5", namely smartphones, such as "nokia" smartphones; the classification device 11 then determines that the classification-related information corresponding to this target page is the broad matching object.
Those skilled in the art will understand that the above ways of determining the classification-related information are only examples; other existing or future ways of determining the classification-related information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
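The HTML-tag analysis mentioned in way 1) above can be sketched with Python's standard `html.parser` module. This is only an illustrative sketch: the class name `BodyTextExtractor`, the function `extract_body_text`, and the set of tags treated as non-body content are assumptions, not something the text specifies, and this is not the VIPS algorithm.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collect text outside tags that usually hold non-body content."""

    # Assumed list of container tags whose text is not page body content.
    SKIP = {"head", "script", "style", "nav", "header", "footer", "aside"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside any skipped container
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_body_text(html: str) -> str:
    """Return the visible body text of `html`, joined with single spaces."""
    parser = BodyTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


sample = (
    "<html><head><title>T</title><style>p{}</style></head><body>"
    "<nav>Home | News</nav><p>Main <b>story</b> text.</p>"
    "<footer>Copyright</footer></body></html>"
)
body = extract_body_text(sample)
```

A tag blacklist like this is crude; it only shows where tag-based body extraction fits before classification, and a production extractor would need site-specific or visual cues.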
The determination device 12 adjusts the candidate description information corresponding to the target page according to the classification-related information, so as to obtain the page description information corresponding to the target page. Here, the candidate description information includes, but is not limited to, a description of the body content of the target page and a description of the classification-related information corresponding to the target page. Specifically, the determination device 12 first obtains the candidate description information corresponding to the target page, for example by performing word-frequency statistics on the page content of the target page, or by calling a page candidate description information application programming interface (API) provided by the third-party website to which the target page belongs. The determination device 12 then adjusts the candidate description information corresponding to the target page according to the classification-related information determined by the classification device 11, so as to obtain the page description information corresponding to the target page. Those skilled in the art will understand that the above candidate description information is only an example; other existing or future candidate description information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference. Here, the adjustment processing comprises at least any one of the following:
- when the classification-related information includes the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and using the corresponding matching query result as the page description information;
- when the classification-related information includes the exact matching object, using the candidate description information as the page description information;
- when the classification-related information includes the broad matching object, performing a matching query in a generalized object database according to the candidate description information, and using the candidate description information together with its corresponding matching query result as the page description information;
- when the classification-related information includes the mismatch object, emptying the candidate description information and using the result as the page description information.
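The four adjustment rules above can be sketched as a small dispatch function. This is a minimal sketch under stated assumptions: `PageClass`, `adjust`, and `dict_lookup` are hypothetical names, and the two toy dictionaries merely stand in for the virtual theme database and the generalized object database, whose real structure and query interface the text does not specify.

```python
from enum import Enum
from typing import Callable, Dict, List


class PageClass(Enum):
    VIRTUAL_THEME = "virtual theme"
    EXACT_MATCH = "exact matching object"
    BROAD_MATCH = "broad matching object"
    MISMATCH = "mismatch object"


Lookup = Callable[[List[str]], List[str]]


def dict_lookup(table: Dict[str, List[str]]) -> Lookup:
    """Build a matching-query function backed by an in-memory dict (assumption)."""
    def lookup(candidates: List[str]) -> List[str]:
        results: List[str] = []
        for candidate in candidates:
            results.extend(table.get(candidate, []))
        return results
    return lookup


def adjust(page_class: PageClass, candidates: List[str],
           theme_lookup: Lookup, object_lookup: Lookup) -> List[str]:
    """Turn candidate description information into page description information."""
    if page_class is PageClass.VIRTUAL_THEME:
        return theme_lookup(candidates)               # query result only
    if page_class is PageClass.EXACT_MATCH:
        return list(candidates)                       # candidates unchanged
    if page_class is PageClass.BROAD_MATCH:
        return list(candidates) + object_lookup(candidates)  # candidates plus result
    if page_class is PageClass.MISMATCH:
        return []                                     # emptied
    raise ValueError(f"unknown page class: {page_class!r}")


# Toy stand-ins for the two databases, loosely following the examples in the text.
theme_db = dict_lookup({"rowing regatta composition model essay": ["composition"]})
object_db = dict_lookup({
    "digital accessory special show": [
        "iphone digital accessories", "nokia digital accessories",
    ],
})
```

For instance, `adjust(PageClass.MISMATCH, ["news report"], theme_db, object_db)` yields an empty list, matching the rule that a mismatch page ends up with empty page description information.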
For example, suppose the classification device 11 determines that the classification-related information corresponding to the target page to be processed, "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), is the virtual theme. The determination device 12 first calls the page candidate description information API provided by qc99, the third-party website to which this target page belongs, and obtains candidate description information for the target page that includes "rowing regatta composition model essay". The determination device 12 then performs a matching query in the virtual theme database according to this candidate description information and obtains a matching query result such as "page body content: rowing regatta composition model essay - corresponding classification-related information: virtual theme (composition)", which is used as the page description information. Here, the virtual theme database stores a plurality of virtual themes; it may be located in the information determination equipment 1, or in a server connected to the information determination equipment 1 through a network.
As another example, suppose the classification device 11 determines that the classification-related information of the target page to be processed, the page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing) about treating a disease such as "canker sore", is the exact matching object. The determination device 12 first performs word-frequency statistics on the page content of this target page and obtains candidate description information that includes "treatment of the disease "canker sore" - corresponding classification-related information: exact matching object"; the determination device 12 then uses this candidate description information as the page description information.
As a further example, suppose the classification device 11 determines that the classification-related information of the target page to be processed, "iphone accessories, Vipshop low-price flash sale! Digital accessory special show, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), is the broad matching object. The determination device 12 first performs word-frequency statistics on the page content of this target page and obtains candidate description information that includes "digital accessory special show". The determination device 12 then performs a matching query in the generalized object database according to this candidate description information, obtains a matching query result such as "digital accessories for iphone (protective case accessories, chargers, etc.) - digital accessories for nokia - ...", and uses the candidate description information together with its corresponding matching query result as the page description information. Here, the generalized object database contains a set of generalized object categories, each of which can be further subdivided; it may be located in the information determination equipment 1, or in a server connected to the information determination equipment 1 through a network.
As yet another example, suppose the classification device 11 determines that the classification-related information of the target page to be processed, the news report "Expert says Obama is both foe and friend to China, and the return-to-Asia-Pacific strategy will deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), is the mismatch object. The determination device 12 first calls the page candidate description information API provided by sina, the third-party website to which this target page belongs, and obtains candidate description information that includes "news report - corresponding classification-related information: mismatch object". The determination device 12 then empties this candidate description information and uses the result as the page description information; that is, the page description information corresponding to this target page is empty.
Those skilled in the art will understand that the above ways of adjusting the candidate description information corresponding to the target page are only examples; other existing or future ways of adjusting the candidate description information corresponding to the target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will understand that the above ways of obtaining the page description information corresponding to the target page are only examples; other existing or future ways of obtaining the page description information corresponding to the target page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
The devices of the information determination equipment 1 work continuously with one another. Specifically, the classification device 11 continuously determines the classification-related information corresponding to target pages to be processed; the determination device 12 continuously adjusts the candidate description information corresponding to the target pages according to the classification-related information, so as to obtain the page description information corresponding to the target pages. Here, those skilled in the art will understand that "continuously" means that the devices of the information determination equipment 1 keep determining classification-related information and obtaining page description information, respectively, until the information determination equipment 1 stops determining classification-related information for a relatively long time.
Preferably, the information determination equipment 1 further comprises a model building device (not shown). Specifically, the model building device performs machine learning on a plurality of training pages labeled with classification information, so as to obtain a page classification model for classifying pages; the classification device 11 then determines the classification-related information according to the page classification model, based on the page-related information of the target page.
Particularly, the model apparatus for establishing carries out machine learning and processes according to a plurality of training pages through the mark classified information, to obtain to be used for the page classifications model of page classifications.For example, suppose through the mark classified information a plurality of training pages as follows:
I: rowing regatta composition model essay, http://www.qc99.com/xiaoxue/sinj/101176.Html, virtual theme
II: sina/reading/novel shop/world's masterpiece/"The Count of Monte Cristo", http://vip.book.sina.com.cn/book/index_81300.html, virtual theme
III: oral cavity, Beijing expert-good doctor is online, http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing, exact match object
IV: sina sports news, http://sports.sina.com.cn/, mismatch object
V: sina financial and economic news, http://finance.sina.com.cn/, mismatch object
VI: only product net digital accessory, http://www.vipshop.com/show-0-48369-0.html?, broad match object
VII: Dangdang.com personal care products, http://cosmetic.dangdang.com/, broad match object
The model building device then performs machine learning on these labeled training pages, for example by performing linear regression analysis or nonlinear regression analysis on the training set, to obtain a page classification model for page classification, such as a decision tree. Each node of the decision tree corresponds to one page classification, and each page classification comprises one or more of the training pages: the virtual theme classification comprises pages I and II, the exact match object classification comprises page III, the mismatch object classification comprises pages IV and V, and the broad match object classification comprises pages VI and VII.
Then the classification device 11 determines the classification relevance information according to the page classification model, based on the page-related information of the target page. Here, the page-related information includes but is not limited to the body content category of the page, the structural features of the page, and the like. For example, suppose the target page to be processed that the classification device 11 first obtains is "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html). According to the page classification model obtained by the model building device, and based on page-related information of this target page such as its body content information, the classification device 11 compares the body content category of the target page with the body content categories of the training pages included in each page classification of the model. If the body content category of the target page is determined to be the composition type, which is consistent with the content category of the training pages included in the virtual theme classification, the classification device 11 determines that the classification relevance information of this target page is virtual theme.
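The comparison of body content categories described above can be sketched as follows. This is a minimal illustration only, not the page classification model of the invention: it replaces the learned model (for example a decision tree) with a simple lookup table, and the body content categories assigned to the training pages ("composition", "novel", "medical", "news", "shopping"), the function names, and the fallback to "mismatch object" are all assumptions made for the example.

```python
from collections import Counter, defaultdict

# Labeled training pages: (page id, assumed body content category, classification).
# The classifications come from the example above; the categories are hypothetical.
TRAINING_PAGES = [
    ("I", "composition", "virtual theme"),
    ("II", "novel", "virtual theme"),
    ("III", "medical", "exact match object"),
    ("IV", "news", "mismatch object"),
    ("V", "news", "mismatch object"),
    ("VI", "shopping", "broad match object"),
    ("VII", "shopping", "broad match object"),
]


def build_model(training_pages):
    """Map each body content category to its most frequent classification."""
    votes = defaultdict(Counter)
    for _page_id, category, classification in training_pages:
        votes[category][classification] += 1
    return {cat: counts.most_common(1)[0][0] for cat, counts in votes.items()}


def classify(model, body_category, default="mismatch object"):
    """Return the classification whose training pages share the category.

    Falling back to "mismatch object" for unseen categories is an assumption.
    """
    return model.get(body_category, default)


model = build_model(TRAINING_PAGES)
print(classify(model, "composition"))
```

A target page whose body content is of the composition type is thus assigned the classification of training page I, in line with the example in the preceding paragraph.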
Preferably, the information determining equipment 1 further comprises a search processing device (not shown). Specifically, the search processing device first obtains one or more search results corresponding to a query sequence; then performs subsequent processing on the one or more search results according to matching degree information between the query sequence and the page description information of the pages corresponding to the search results; and then provides at least one of the processed search results to the application corresponding to the query sequence.
Specifically, the search processing device first obtains, through dynamic page technologies such as ASP or JSP, the query request in which a user inputs a query sequence in the search box of a search engine through user equipment; it then submits the query sequence to the search engine and receives the one or more search results corresponding to this query sequence that the search engine feeds back, thereby obtaining the one or more search results corresponding to the query sequence. For example, suppose a user uses a PC to input the keyword "iphone protecting sheathing accessory" in the search box of a search engine and then clicks the search button. The search processing device obtains the query sequence input by the user through dynamic page technologies such as ASP or JSP, submits a page search request to the search engine based on this query sequence, and receives the one or more search results corresponding to the query sequence "iphone protecting sheathing accessory" that the search engine feeds back, such as search result A "homepage-Mi is the digital accessory certified products of apple discount store the more", search result B "... 3C apple accessory iphone shell cell-phone cover wholesale and retail containment vessel", search result C "unique containment vessel iphone4s accessory recommending mobile phone Technology Times Sina website", and so on.
Those skilled in the art should understand that the above manner of obtaining the one or more search results corresponding to the query sequence is only an example; other existing or future manners of obtaining the one or more search results corresponding to the query sequence, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then the search processing device performs subsequent processing on the one or more search results according to the matching degree information between the query sequence and the page description information of the pages corresponding to the search results. Specifically, the search processing device first performs semantic analysis on the page description information corresponding to each search result and determines the matching degree information between that page description information and the query sequence according to the proportion of the words of the query sequence among the total words included in the page description information: for example, when the proportion is greater than 0.95, the matching degree information is determined to be a high match; when the proportion is between 0.7 and 0.95, the matching degree information is determined to be a moderate match; and when the proportion is less than 0.7, the matching degree information is determined to be a low match. The search processing device then performs subsequent processing on the one or more search results according to this matching degree information, for example by adjusting the order of the search results or by screening the search results. For example, continuing the above example, suppose the matching degree between the query sequence "iphone protecting sheathing accessory" and the page description information of the page corresponding to search result A is higher than that for search result B, which in turn is higher than that for search result C. The search processing device then determines, according to the matching degree information, that search results A, B and C are to be arranged in the order A, B, C; that is, when the user obtains the search results corresponding to the query sequence "iphone protecting sheathing accessory", search result A is located before search result B, and search result B is located before search result C. As another example, the search processing device may also screen search results A, B and C according to the matching degree information, for example by filtering the search results so that search result C, whose matching degree is low, is not provided to the user.
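The proportion-based matching degree and the subsequent ordering and screening can be sketched as follows. This is an illustrative sketch under stated assumptions: the semantic analysis is reduced to whitespace tokenization, the boundary values 0.95 and 0.7 are assigned to the moderate level (the text above does not say which level they belong to), and the function names and sample descriptions are hypothetical.

```python
def match_ratio(query, description):
    """Proportion of description words that also occur in the query sequence."""
    query_words = set(query.lower().split())
    description_words = description.lower().split()
    if not description_words:
        return 0.0
    hits = sum(1 for word in description_words if word in query_words)
    return hits / len(description_words)


def match_level(ratio):
    """Map a proportion to the three levels named in the text."""
    if ratio > 0.95:
        return "high"
    if ratio >= 0.7:  # boundary handling is an assumption
        return "moderate"
    return "low"


def rank_and_filter(query, descriptions, drop_low=True):
    """Order results by descending proportion; optionally drop low matches."""
    scored = [(match_ratio(query, text), name) for name, text in descriptions.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [
        name
        for ratio, name in scored
        if not (drop_low and match_level(ratio) == "low")
    ]


QUERY = "iphone protecting sheathing accessory"
# Hypothetical page description information for search results A, B and C.
DESCRIPTIONS = {
    "C": "sina technology news iphone",
    "A": "iphone protecting sheathing accessory",
    "B": "iphone accessory sheathing protecting wholesale",
}
print(rank_and_filter(QUERY, DESCRIPTIONS, drop_low=False))
print(rank_and_filter(QUERY, DESCRIPTIONS))
```

With these sample descriptions the proportions are 1.0 for A, 0.8 for B and 0.25 for C, so the first call orders the results A, B, C and the second call screens out C.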
Those skilled in the art should understand that the above manner of performing subsequent processing on the one or more search results is only an example; other existing or future manners of performing subsequent processing on the one or more search results, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then the search processing device provides at least one of the processed search results to the application corresponding to the query sequence, through dynamic web page technologies such as ASP, JSP or PHP, or through other agreed communication modes such as the http or https communication protocols, so that the application provides the processed search results to the user corresponding to the query sequence. Here, the application includes but is not limited to a search engine, a browser, and the like. For example, continuing the above example, the search processing device provides the processed search results A, B and C to the user in the order A, B, C according to the matching degree information, for the user to browse; alternatively, it does not provide to the user those of search results A, B and C whose matching degree information is below a predetermined threshold.
Fig. 2 is a schematic diagram of equipment for determining the page description information corresponding to a target page in accordance with a preferred embodiment of the present invention. The information determining equipment 1 comprises a classification device 11', a determining device 12' and a matching device 13'. Specifically, the classification device 11' determines the classification relevance information corresponding to the target page to be processed; the determining device 12' adjusts, according to the classification relevance information, the candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page; the matching device 13' determines, according to the page description information, the presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the classification device 11' and the determining device 12' are respectively the same as or similar to the corresponding devices shown in Fig. 1, so they are not described again here and are incorporated herein by reference.
Specifically, the matching device 13' determines, according to the page description information, the presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the presentation information includes but is not limited to content that is displayed in a page by means of a carrier such as a link, text, a picture, a video or an animation and that is used to convey information to the user; it includes but is not limited to the content information of the page description information, the page style information corresponding to the page description information, and the like. Specifically, the matching device 13' determines the presentation information corresponding to the page description information by querying, in a presentation information database, the presentation information corresponding to that description information; or by querying, in the presentation information database, the resource distribution content information of the presenting user of the target page corresponding to the page description information, or of a user associated with that presenting user. In either case the presentation information matches the page description information. Here, the presentation information database may be located in the information determining equipment 1, or in a database connected to the information determining equipment 1 through a network.
For example, suppose the classification device 11' determines that the classification relevance information of the target page to be processed, "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?), is broad match object, and the page description information that the determining device 12' determines for this target page comprises "iphone digital accessory (protecting sheathing accessory, charger, etc.)-nokia digital accessory-..." and the like. The matching device 13' can then use this page description information as the presentation information corresponding to this target page. As another example, continuing the above example, the matching device 13' can use as the presentation information both the content information of the page description information "iphone digital accessory (protecting sheathing accessory, charger, etc.)-nokia digital accessory-..." and the resource distribution content information of other presenting users corresponding to this page description information, such as "iphone sales information".
Those skilled in the art should understand that the above manner of determining the presentation information corresponding to the target page is only an example; other existing or future manners of determining the presentation information corresponding to the target page, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the information determining equipment 1 further comprises a sensitivity device (not shown). Specifically, the sensitivity device determines the content sensitivity information of the target page; the matching device 13' then determines, according to the page description information and in combination with the content sensitivity information, the presentation information corresponding to the target page, wherein the presentation information matches both the page description information and the content sensitivity information.
Specifically, the sensitivity device obtains the page content information of the target page, for example by parsing the HTML source code of the target page, and searches this page content information for predetermined sensitive content, to determine the content sensitivity information of the target page. Here, the content sensitivity information includes but is not limited to content suitable only for a particular group to browse, such as adult information, and content relating to accidents such as death, disease, injury, damage or other losses. For example, suppose the target page to be processed that the classification device 11' obtains is the news report "Chanel No. 5 perfume faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html). By parsing the HTML source code of this page, the sensitivity device finds that the page content information of this page contains words such as "sales ban" and "allergy", and therefore determines that the content sensitivity information of this target page is "sales ban" and "allergy".
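The sensitivity check described above can be sketched as follows. This is only a sketch under stated assumptions: the list of predetermined sensitive terms, the class and function names, and the sample HTML are hypothetical, and matching is done by plain substring search on the visible text, which is one possible reading of searching the page content for predetermined sensitive content.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


# Hypothetical list of predetermined sensitive terms.
SENSITIVE_TERMS = ("sales ban", "allergy", "adult")


def content_sensitivity(html, terms=SENSITIVE_TERMS):
    """Return the predetermined terms that occur in the visible page text."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return [term for term in terms if term in text]


SAMPLE_HTML = (
    "<html><body><h1>Perfume faces EU sales ban</h1>"
    "<p>An ingredient may cause allergy.</p>"
    "<script>var note = 'adult';</script></body></html>"
)
print(content_sensitivity(SAMPLE_HTML))
```

For the sample page the result contains "sales ban" and "allergy"; the word inside the script element is ignored because it is not part of the visible page content.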
Those skilled in the art should understand that the above content sensitivity information is only an example; other existing or future content sensitivity information, if applicable to the present invention, also falls within the scope of protection of the present invention and is incorporated herein by reference.
Those skilled in the art should understand that the above manner of determining the content sensitivity information is only an example; other existing or future manners of determining the content sensitivity information, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then the matching device 13' determines, according to the page description information and in combination with the content sensitivity information, the presentation information corresponding to the target page, wherein the presentation information matches both the page description information and the content sensitivity information. For example, continuing the above example, suppose the determining device 12' determines that the page description information of the target page "Chanel No. 5 perfume faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html) is empty, that is, the classification relevance information of this target page is mismatch object. According to this page description information, and in combination with the content sensitivity information "sales ban" and "allergy", the matching device 13' then determines that it is not suitable to provide presentation information on this page, or that the presentation information is perfume of other brands. As another example, suppose the determining device 12' determines that the page description information of the target page "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?) is "iphone digital accessory (protecting sheathing accessory, charger, etc.)-nokia digital accessory-...", and the sensitivity device determines that the content sensitivity information of this target page is that it contains content suitable only for a particular group to browse, such as adult information. According to this page description information, and in combination with the content sensitivity information, the matching device 13' then determines that the presentation information corresponding to this target page comprises this page description information together with a statement forbidding children from browsing this page. In both cases the presentation information matches the page description information and the content sensitivity information.
Those skilled in the art should understand that the above manner of determining the presentation information in combination with the content sensitivity information is only an example; other existing or future manners of determining the presentation information in combination with the content sensitivity information, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
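The two examples of combining the page description information with the content sensitivity information can be sketched as a small rule function. The rules below are an assumption that mirrors only those two examples (no presentation information for a page with an empty description or with terms such as "sales ban" and "allergy"; a notice for children when adult content is detected); the term sets, the function name and the notice wording are hypothetical.

```python
# Hypothetical term sets; a real system would use its own predetermined lists.
SUPPRESS_TERMS = {"sales ban", "allergy"}
ADULT_TERMS = {"adult"}


def choose_presentation(page_description, sensitivity_terms):
    """Return presentation information as a dict, or None to present nothing."""
    terms = set(sensitivity_terms)
    # First example: empty description or suppressing terms, present nothing.
    if not page_description or terms & SUPPRESS_TERMS:
        return None
    # Second example: keep the description but add a notice for children.
    if terms & ADULT_TERMS:
        return {
            "content": page_description,
            "notice": "Children are forbidden from browsing this page.",
        }
    return {"content": page_description, "notice": None}


DESCRIPTION = "iphone digital accessory (protecting sheathing accessory, charger, etc.)"
print(choose_presentation("", ["sales ban", "allergy"]))
print(choose_presentation(DESCRIPTION, ["adult"]))
```

The alternative mentioned in the first example, presenting perfume of other brands instead of nothing, would need a lookup in the presentation information database and is left out of this sketch.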
In a preferred embodiment (with reference to Fig. 2), the information determining equipment 1 comprises a classification device 11', a determining device 12', a matching device 13', a generating device (not shown) and a providing device (not shown), wherein the classification device 11' comprises an acquiring unit 111' (not shown) and a classification unit 112' (not shown). The preferred embodiment is described below with reference to Fig. 2. Specifically, the acquiring unit 111' obtains the accessed page that a user accesses, as the target page; the classification unit 112' determines the classification relevance information corresponding to the target page; the determining device 12' adjusts, according to the classification relevance information, the candidate description information corresponding to the target page, to obtain the page description information corresponding to the target page; the matching device 13' determines, according to the page description information, the presentation information corresponding to the target page, wherein the presentation information matches the page description information; the generating device updates the target page according to the presentation information, to generate a corresponding result page, wherein the result page comprises the presentation information; and the providing device provides the result page to the user. Here, the determining device 12' is the same as or similar to the corresponding device shown in Fig. 1, and the matching device 13' is the same as or similar to the corresponding device shown in Fig. 2, so they are not described again here and are incorporated herein by reference.
Specifically, the acquiring unit 111' first obtains the user's page access request and uses the page corresponding to the page access request as the target page; alternatively, it obtains the accessed page that the user accesses through an application programming interface (API) provided by a third party such as a browser or a search engine, and uses it as the target page. For example, a user inputs http://news.sina.com.cn/ in the address bar of a browser and presses the enter key, and the acquiring unit 111' obtains the user's page access request through the application programming interface (API) provided by the browser. The acquiring unit 111' then sends a corresponding page access request to the page server according to this page URL, obtains the page http://news.sina.com.cn/ corresponding to the page access request from the HTML response returned by the page server, and uses the page http://news.sina.com.cn/ as the target page. As another example, suppose a user inputs the keyword "iphone protecting sheathing accessory" in the search box of a search engine and then clicks the search button, and the acquiring unit 111' obtains the user's page access request through the application programming interface (API) provided by the search engine. The acquiring unit 111' then submits a page search request to the search engine based on this query sequence and receives the one or more search results corresponding to the query sequence "iphone protecting sheathing accessory" that the search engine feeds back, such as search result A "homepage-Mi is the digital accessory certified products of apple discount store the more", search result B "... 3C apple accessory iphone shell cell-phone cover wholesale and retail containment vessel", search result C "unique containment vessel iphone4s accessory recommending mobile phone Technology Times Sina website", and so on. The acquiring unit 111' then uses the search results page comprising these search results as the target page.
The classification unit 112' determines the classification relevance information corresponding to the target page. Here, the manner in which the classification unit 112' determines the classification relevance information corresponding to the target page is the same as the manner in which the classification device 11 in Fig. 1 determines it; for brevity, it is not described again here and is incorporated herein by reference.
Preferably, the classification unit 112' may also determine the classification relevance information corresponding to the target page in combination with the user operation information of the user;
wherein the user operation information comprises at least any one of the following:
- the page access session information of the user about the accessed page;
- the page access record information of the user;
- the page search record corresponding to the accessed page.
For example, consider the case where the user operation information comprises the page access session information of the user about the accessed page; here, the page access session information includes but is not limited to consecutive access operations on the accessed page by the same user. Suppose that, while browsing the page corresponding to the search result "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?), the user also queries and obtains other information that the user needs, such as the accessory "apple data line white"; the classification unit 112' then determines that the classification relevance information corresponding to this target page is broad match object. As another example, consider the case where the user operation information comprises the page access record information of the user. Suppose the acquiring unit 111' obtains a page access request submitted by the user for the accessed page "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), and the user often accesses pages about how to write; the classification unit 112' then determines that the classification relevance information corresponding to the accessed page "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a virtual theme, such as writing.
Those skilled in the art should understand that the above manner of determining the classification relevance information in combination with the user operation information of the user is only an example; other existing or future manners of determining the classification relevance information in combination with the user operation information of the user, if applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
The generating device updates the target page according to the presentation information, for example by embedding the presentation information in the target page, to generate a corresponding result page, wherein the result page comprises the presentation information. For example, suppose the matching device 13' determines that the presentation information corresponding to the target page "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?) comprises the page description information "iphone digital accessory (protecting sheathing accessory, charger, etc.)-nokia digital accessory-...". The generating device can then update this target page according to this presentation information, for example by embedding the presentation information in this target page, such as in the navigation block region of this target page, wherein the presentation information matches the page description information.
The providing device provides the result page to the user through dynamic web page technologies such as ASP, JSP or PHP, or through other agreed communication modes such as the http or https communication protocols.
Preferably, the information determining equipment 1 further comprises a position determining device (not shown). Specifically, the position determining device determines the target position information corresponding to the presentation information in the target page; the generating device then updates the target page according to the presentation information and in combination with the target position information, to generate the corresponding result page, wherein the result page comprises the presentation information at the position corresponding to the target position information.
Specifically, the position determining device determines the target position information corresponding to the presentation information in the target page. Here, the target position information indicates the position in the target page at which the presentation information is to be embedded, for example the position in the target page that the user browses most readily, or the navigation block region of the target page. Here, the manner in which the position determining device determines the target position information includes but is not limited to at least any one of the following:
1) according to the page layout information of described target pages, determine target position information, as with the white space in the target pages as page right side subfield as described in target position information, with cause easily in the target pages zone that the user notes as around the search column in the search page etc. as described in target position information.For example, suppose that the pending described target pages that acquiring unit 111 ' obtains is that " iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time " (http://www.vipshop.com/show-0-48369-0.html ?); and position determining means passes through such as the html tag analytic method; perhaps according to VIPS (Vision-based PageSegmentation; based on the page segmentation of vision) algorithm; this target pages is resolved, obtain the page style information of this target pages, such as page layout information, wherein, the page right side subfield of this target pages is white space, and then position determining means can be with the subfield zone, page right side in this target pages as described target position information.
2) according to the content of pages information of described target pages, with the location of content zone that is complementary with the content of described presentation information in the described target pages as described target position information.For example; do you suppose that the described target pages that acquiring unit 111 ' obtains is page http://www.vipshop.com/show-0-48369-0.html? the described presentation information that coalignment 13 ' is determined comprise content as " the digital accessory (protecting sheathing accessory; charger etc.) of the iphone-digital accessory of nokia-... "; position determining means is by resolving this target pages; comprise a plurality of channel content in this target pages such as " luxurious ornaments "; " only product group "; " only product still " etc.; then position determining means as described target position information, is about in this target pages " only product still " channel position zone as the described described target position information for the treatment of presentation information with the location of content zone that is complementary with the content of this presentation information in this target pages.
3) According to the page-related information of the target page, in combination with the user's page access records, determine the target position information of the presentation information in the target page. For example, suppose the target page to be processed that the acquiring unit 111' obtains is the page http://www.vipshop.com/show-0-48369-0.html?, the presentation information determined by the matching device 13' includes content such as "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...", and the user frequently clicks content links in the top region of this target page. The position determining device then, in light of the user's page access records, takes the position in the target page of the content that the user frequently accesses, namely the top region of the page, as the target position information of the presentation information in this target page.
Those skilled in the art will understand that the above ways of determining the target position information are merely examples; other ways of determining the target position information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
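The three ways of determining the target position information listed above can be illustrated with a minimal sketch. It assumes the page has already been segmented into regions (for instance by HTML tag analysis or VIPS); the `Region` type, its fields and the three function names are hypothetical and are not part of the described equipment.

```python
from dataclasses import dataclass

@dataclass
class Region:
    name: str         # e.g. "right sidebar" (hypothetical label from page segmentation)
    is_blank: bool    # layout analysis found no content in this region
    text: str         # text content of the region, if any
    user_clicks: int  # clicks by this user recorded for the region

def by_layout(regions):
    """Way 1: take the first blank region as the target position."""
    return next((r.name for r in regions if r.is_blank), None)

def by_content(regions, presentation_text):
    """Way 2: take the region sharing the most words with the presentation information."""
    words = set(presentation_text.lower().split())

    def overlap(region):
        return len(words & set(region.text.lower().split()))

    best = max(regions, key=overlap, default=None)
    return best.name if best is not None and overlap(best) > 0 else None

def by_access_record(regions):
    """Way 3: take the region this user clicks most often."""
    best = max(regions, key=lambda r: r.user_clicks, default=None)
    return best.name if best is not None and best.user_clicks > 0 else None
```

Word overlap is only a stand-in for whatever content matching the position determining device actually applies; it is used here because it is easy to follow.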
Then, the generating device updates the target page according to the presentation information in combination with the target position information, for example by embedding the presentation information at the target position of the target page, to generate the corresponding result page, wherein the result page contains the presentation information at the position corresponding to the target position information. For example, continuing the example above, suppose the position determining device determines that the target position information of the presentation information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? is the right-hand region of the page. The generating device then embeds this presentation information at the determined target position of the target page to generate the corresponding result page.
Preferably, the information determining equipment 1 further comprises a style determining device (not shown). Specifically, the style determining device determines the target style information of the presentation information in the target page; the generating device then updates the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page contains the presentation information rendered in accordance with the target style information.
Specifically, the style determining device determines the target style information of the presentation information in the target page. Here, the ways in which the style determining device determines this target style information include, but are not limited to, at least any one of the following:
1) According to the style-related information of the target page, determine the target style information of the presentation information in the target page. Specifically, the style determining device first determines the style-related information of the target page; it then either extracts one or more style settings from this information as the target style information of the presentation information, or directly uses the style-related information of the target page as the target style information of the presentation information. For example, suppose the acquiring unit 111' obtains the target page "Vipshop brand fashion discount store" (http://www.vipshop.com/show-0-48369-0.html?), and the presentation information determined by the matching device 13' includes content such as "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...". The style determining device first parses the target page, for instance by HTML tag analysis or by the VIPS (Vision-based Page Segmentation) algorithm, and finds that the style-related information of the target page includes block features such as a top navigation block, breadcrumb navigation, a body text block, a left-column content block, a right-column block of information links and a bottom content block, together with page style settings such as a grey font color and a pink page tone. The style determining device can then determine the target style information of the presentation information according to this style-related information, for example by setting the page tone and font color of the presentation information to match those of the target page, that is, a pink page tone and a grey font color.
2) According to the application category information of the presentation information, perform a matching query in a page style database to obtain the page style information corresponding to the application category information, and use it as the target style information, wherein the page style database contains mappings between application categories and page styles. Here, the application category information includes, but is not limited to, the industry category of the page corresponding to the first page access request, such as food, environmental protection, news, cosmetics, flowers, automobiles or novels. For example, suppose the application category information of the presentation information belongs to the food industry; the style determining device then performs a matching query in the page style database and obtains the corresponding page style information, which includes breadcrumb navigation, a text summary block, a green page background, a black page font color, and so on. As another example, suppose the application category information of the presentation information belongs to the cosmetics industry; the style determining device then performs a matching query in the page style database and obtains the corresponding page style information, which includes breadcrumb navigation, a text summary block, a warm-toned page background such as pink, a white page font color, and so on. Here, the page style database may reside in the information determining equipment 1 or in a server connected to the information determining equipment 1 via a network.
Those skilled in the art will understand that the above ways of determining the target style information of the presentation information in the target page are merely examples; other ways of determining this target style information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
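The two ways of determining the target style information can be sketched as simple lookups. The dictionary below is a hypothetical stand-in for the page style database, populated only with the food and cosmetics examples given above; the key names and function names are assumptions made for illustration.

```python
# Hypothetical page style database: application category -> page style.
PAGE_STYLE_DB = {
    "food": {"background": "green", "font_color": "black"},
    "cosmetics": {"background": "pink", "font_color": "white"},
}

def style_from_page(page_style, keys=("background", "font_color")):
    """Way 1: reuse selected style settings of the target page itself."""
    return {k: page_style[k] for k in keys if k in page_style}

def style_from_category(category, db=PAGE_STYLE_DB):
    """Way 2: look the application category up in the page style database."""
    return db.get(category)  # None when the category has no mapping
```

For a target page whose parsed style is `{"background": "pink", "font_color": "grey"}`, the first function returns those two settings unchanged, which corresponds to matching the presentation information to the page's own tone and font color.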
Then, the generating device updates the target page according to the presentation information in combination with the target style information, to generate the corresponding result page, wherein the result page contains the presentation information rendered in accordance with the target style information. For example, continuing the example above, suppose the style determining device determines that the target style information of the presentation information "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? includes breadcrumb navigation, a text summary block, a warm-toned page background such as pink, a white page font color, and so on. The generating device then embeds the presentation information in the target page in the display format given by this target style information, to generate the corresponding result page, wherein the result page contains the presentation information rendered in accordance with the target style information.
Those skilled in the art will understand that the above way of generating the result page in combination with the target style information is merely an example; other ways of generating the result page in combination with the target style information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
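As an illustration of the generation step, the following sketch embeds presentation information into a page at a target position and with a target style. It assumes the target position is given as a literal HTML marker string and the style as CSS property/value pairs; both representations, and the function names, are hypothetical simplifications rather than the generating device's actual interface.

```python
import html

def build_block(text, style):
    """Render the presentation information as a styled <div>."""
    css = "; ".join(f"{prop}: {value}" for prop, value in style.items())
    return (f'<div class="presentation" style="{html.escape(css, quote=True)}">'
            f'{html.escape(text)}</div>')

def embed(page_html, position_marker, text, style):
    """Insert the styled block right after the first occurrence of position_marker."""
    idx = page_html.find(position_marker)
    if idx < 0:
        return page_html  # position not found: leave the page unchanged
    cut = idx + len(position_marker)
    return page_html[:cut] + build_block(text, style) + page_html[cut:]
```

Escaping the text and the style string keeps the inserted block from breaking the markup of the result page. A production implementation would more likely manipulate a parsed DOM than search for a string.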
Fig. 3 shows a flowchart of a method for determining the page description information corresponding to a target page according to another aspect of the present invention.
Specifically, in step S1, the information determining equipment 1 determines the classification relevance information corresponding to the target page to be processed; in step S2, the information determining equipment 1 adjusts the candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page. Here, the information determining equipment 1 includes, but is not limited to, a network device, a user device, or a device formed by integrating a network device and a user device through a network. The network device includes, but is not limited to, a network host, a single network server, a set of multiple network servers, or a cloud-computing-based set of computers; alternatively, it may be implemented by a user device. Here, the cloud consists of a large number of hosts or network servers based on cloud computing, where cloud computing is a form of distributed computing: a super virtual computer made up of a group of loosely coupled computers. The user device may be any electronic product capable of human-computer interaction with a user via a keyboard, mouse, touchpad, touch screen, handwriting device or the like, such as a computer, mobile phone, PDA, palmtop computer (PPC) or tablet computer. The network includes, but is not limited to, the Internet, wide area networks, metropolitan area networks, local area networks, VPNs and wireless ad hoc networks. Those skilled in the art will understand that the above information determining equipment 1 is merely an example; other network devices or user devices, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference. Here, the network device and the user device each include an electronic device that can automatically perform numerical computation and information processing according to preset or stored instructions, whose hardware includes, but is not limited to, microprocessors, application-specific integrated circuits (ASIC), field-programmable gate arrays (FPGA), digital signal processors (DSP) and embedded devices.
Specifically, in step S1, the information determining equipment 1 first obtains the target page to be processed through an application programming interface (API) provided by third-party equipment such as a browser or a search engine; or, by means of dynamic web page technologies such as ASP or JSP, it obtains the query sequence entered by the user on the user device, submits this query sequence to a search engine, and receives the search results corresponding to the query sequence returned by the search engine as the target pages to be processed; or it obtains the target page to be processed through an agreed communication method such as HTTP or HTTPS. Then, in step S1, the information determining equipment 1 determines the classification relevance information corresponding to the target page. Here, the classification relevance information includes, but is not limited to, at least any one of the following:
1) Virtual theme. Here, a virtual theme means that the body content of the target page reflects the access intention of the users who visit the target page. For example, suppose the body content of the target page "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a model essay about a rowing regatta, and the user browsing this page wishes to learn about essay writing; the classification relevance information corresponding to this target page is then a virtual theme, such as "composition". As another example, suppose the body content of the target page "download of fresh flower material" (http://sucai.redocn.com/category/260/) consists of pictures of flowers, and the user browsing this page wishes to obtain flower-related material for artistic creation; the classification relevance information corresponding to this target page is then a virtual theme, such as "art material".
2) Exact match object. Here, an exact match object means that the target page contains content information that fully coincides with the user's need, and that this need is not substitutable. For example, suppose the target page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing) contains information on hospitals and attending doctors for the disease "canker sore", and the user browsing this page wishes to find information about treating "canker sore" rather than pages about other diseases such as "rhinitis"; the classification relevance information corresponding to this target page is then the exact match object. As another example, suppose the target page "IBM minicomputer IBM POWER720" (http://www.xinhuigroup.com/Product/10026/11479.html) contains the product description, specification parameters and similar information for the IBM minicomputer IBM POWER720, and the user browsing this page wishes to find information about the IBM POWER720 rather than pages about other models such as the "IBM POWER 550"; the classification relevance information corresponding to this target page is then the exact match object.
3) Broad match object. Here, a broad match object means that the content information of the target page is related to the user's need. For example, suppose the target page is "iphone5 pink protective case with a heart pattern on the back" (http://www.vipshop.com/show-0-48369-0.html?), and the user browsing this page may also be interested in other accessories for the iphone5, such as an "apple data cable", and in other brands of the same product category (smartphones) as the "iphone5", such as "nokia" smartphones; the classification relevance information corresponding to this target page is then the broad match object.
4) Mismatch object. Here, a mismatch object means that the target page is not suited to carrying presentation information beyond the content information that the user came to the page to obtain. For example, when a user browses a news report such as "Experts say Obama is both foe and friend to China; return-to-Asia-Pacific strategy to deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), the user pays attention only to the content of the report and not to other content information in the page; the classification relevance information corresponding to this page is then a mismatch object, such as "news report".
Those skilled in the art will understand that the above classification relevance information is merely an example; other classification relevance information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and is incorporated herein by reference.
For example, the user enters the URL http://news.sina.com.cn/ in the browser address bar and presses Enter; in step S1, the information determining equipment 1 obtains the web page corresponding to this URL through an application programming interface (API) provided by third-party equipment such as the news website. As another example, the user enters the keyword "iphone accessory" in a search box on a user device such as a PC and clicks the search button. The classification device 11 then obtains the query sequence entered by the user from the user device by means of dynamic web page technologies such as JSP or ASP, submits a search request based on this query sequence to a search engine, and, through the API provided by the search engine, obtains the one or more search results that the search engine matched against the keyword "iphone accessory", such as "iphone accessory [market price evaluation certified products crudely-made articles]" and "iphone accessory Apple Store (China)", as the target pages to be processed.
Those skilled in the art will understand that the above ways of obtaining the target page to be processed are merely examples; other ways of obtaining the target page to be processed, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S1, the information determining equipment 1 determines the classification relevance information corresponding to the target page to be processed. Here, the ways in which the information determining equipment 1 determines this classification relevance information in step S1 include, but are not limited to, at least any one of the following:
1) According to the body content of the target page, determine the classification relevance information corresponding to the target page. Specifically, in step S1, the information determining equipment 1 first extracts the body content of the target page, for instance by HTML tag analysis; or it partitions the target page into blocks according to the VIPS (Vision-based Page Segmentation) algorithm, using visual features such as the page's foreground color, background color, font color and size, borders, the spacing between logical blocks, and element positions, to obtain the body content block of the target page. Then, in step S1, the information determining equipment 1 determines the classification relevance information corresponding to the target page according to its body content. For example, suppose that in step S1 the target page first obtained by the information determining equipment 1 is a news report such as "Experts say Obama is both foe and friend to China; return-to-Asia-Pacific strategy to deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml). The information determining equipment 1 then extracts, for instance by HTML tag analysis, the body content of this target page, which is the news report "Obama is both foe and friend to China; return-to-Asia-Pacific strategy to deepen", and determines that the classification relevance information corresponding to this target page is the mismatch object. As another example, suppose that in step S1 the target page first obtained by the information determining equipment 1 is the page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing), which concerns the treatment of a disease such as "canker sore" and contains content information that fully coincides with the user's need. In step S1, the information determining equipment 1 then determines that the classification relevance information corresponding to this target page is the exact match object.
2) According to the page access records of the users who access the target page, determine the classification relevance information corresponding to the target page. For example, a user browses the page "iphone accessories, Vipshop low-price flash sale! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), and this user is also interested in other accessories for the iphone5, such as an "apple data cable", and in other brands of the same product category (smartphones) as the "iphone5", such as "nokia" smartphones. In step S1, the information determining equipment 1 then determines that the classification relevance information corresponding to this target page is the broad match object.
Those skilled in the art will understand that the above ways of determining the classification relevance information are merely examples; other ways of determining the classification relevance information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
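The body-extraction step of way 1) above can be illustrated with Python's standard `html.parser` module. This is a minimal sketch of HTML tag analysis, not the VIPS algorithm; the set of skipped tags is an assumption about where non-body content typically lives, and the class and function names are hypothetical.

```python
from html.parser import HTMLParser

class BodyTextExtractor(HTMLParser):
    """Collect visible text while skipping tags that rarely hold body content."""
    SKIP = {"script", "style", "nav", "footer"}  # assumed non-body containers

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # > 0 while inside a skipped element
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0 and data.strip():
            self.parts.append(data.strip())

def extract_body_text(page_html):
    parser = BodyTextExtractor()
    parser.feed(page_html)
    parser.close()
    return " ".join(parser.parts)
```

The extracted text would then be passed to whatever rule set or model assigns one of the four classes; that decision step is not shown here because the description does not specify it.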
In step S2, the information determining equipment 1 adjusts the candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page. Here, the candidate description information includes, but is not limited to, a description of the body content of the target page and a description of the classification relevance information corresponding to the target page. Specifically, in step S2, the information determining equipment 1 first obtains the candidate description information corresponding to the target page, for instance by computing word frequency statistics over the page content of the target page, or by calling a page candidate description application programming interface (API) provided by the third-party website to which the target page belongs. Then, in step S2, the information determining equipment 1 adjusts the candidate description information corresponding to the target page according to the classification relevance information determined in step S1, to obtain the page description information corresponding to the target page. Those skilled in the art will understand that the above candidate description information is merely an example; other candidate description information, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and is incorporated herein by reference. Here, the adjustment includes at least any one of the following:
- when the classification relevance information comprises the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and taking the corresponding matching query result as the page description information;
- when the classification relevance information comprises the exact match object, taking the candidate description information as the page description information;
- when the classification relevance information comprises the broad match object, performing a matching query in a broad match object database according to the candidate description information, and taking the candidate description information together with its corresponding matching query result as the page description information;
- when the classification relevance information comprises the mismatch object, clearing the candidate description information and taking the result as the page description information.
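The four adjustment rules above amount to a dispatch on the class of the target page. The sketch below models the two databases as plain dictionaries, which is an assumption made for illustration; the `Category` enum, the `adjust` function and the " - " separator used to join results are likewise hypothetical.

```python
from enum import Enum

class Category(Enum):
    VIRTUAL_THEME = "virtual theme"
    EXACT_MATCH = "exact match object"
    BROAD_MATCH = "broad match object"
    MISMATCH = "mismatch object"

def adjust(category, candidate, theme_db, broad_db):
    """Turn candidate description information into page description information."""
    if category is Category.VIRTUAL_THEME:
        # Replace the candidate with the matching query result.
        return theme_db.get(candidate, "")
    if category is Category.EXACT_MATCH:
        # Keep the candidate unchanged.
        return candidate
    if category is Category.BROAD_MATCH:
        # Keep the candidate and append the related objects found.
        related = broad_db.get(candidate, [])
        return " - ".join([candidate, *related])
    if category is Category.MISMATCH:
        # Clear the description entirely.
        return ""
    raise ValueError(f"unknown category: {category!r}")
```

Exact dictionary lookup stands in for the matching query; a real database query would presumably tolerate partial or fuzzy matches.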
For example, suppose that in step S1 the information determining equipment 1 determines that the classification relevance information corresponding to the target page to be processed, "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), is the virtual theme. In step S2, the information determining equipment 1 first calls the page candidate description application programming interface (API) provided by qc99, the third-party website to which this target page belongs, and obtains candidate description information for the target page that includes "rowing regatta composition model essay" and the like. It then performs a matching query in the virtual theme database according to this candidate description information and obtains a matching query result such as "page body content: rowing regatta composition model essay - corresponding classification relevance information: virtual theme (composition)", which is taken as the page description information. Here, the virtual theme database stores a plurality of virtual themes; it may reside in the information determining equipment 1 or in a server connected to the information determining equipment 1 via a network.
As another example, suppose that in step S1 the information determining equipment 1 determines that the classification relevance information of the target page to be processed, the page "Beijing oral cavity experts - Good Doctor Online" (http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing) concerning the treatment of a disease such as "canker sore", is the exact match object. In step S2, the information determining equipment 1 first computes word frequency statistics over the page content of this target page and obtains candidate description information that includes "treatment of the disease 'canker sore' - corresponding classification relevance information: exact match object" and the like; it then takes this candidate description information as the page description information.
As a further example, suppose that in step S1 the information determining equipment 1 determines that the classification relevance information of the target page to be processed, "iphone accessories, Vipshop low-price flash sale! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), is the broad match object. In step S2, the information determining equipment 1 first computes word frequency statistics over the page content of this target page and obtains candidate description information that includes "digital accessories special" and the like. It then performs a matching query in the broad match object database according to this candidate description information, obtains a matching query result such as "iphone digital accessories (protective case accessories, chargers, etc.) - nokia digital accessories - ...", and takes the candidate description information together with its corresponding matching query result as the page description information. Here, the broad match object database contains a set of categories of broad match objects, each of which may in turn be subdivided; it may reside in the information determining equipment 1 or in a server connected to the information determining equipment 1 via a network.
As yet another example, suppose that in step S1 the information determining equipment 1 determines that the classification relevance information of the target page to be processed, the news report "Experts say Obama is both foe and friend to China; return-to-Asia-Pacific strategy to deepen" (http://news.sina.com.cn/w/sd/2012-11-08/021925532469.shtml), is the mismatch object. In step S2, the information determining equipment 1 first calls the page candidate description application programming interface (API) provided by sina, the third-party website to which this target page belongs, and obtains candidate description information for the target page that includes "news report - corresponding classification relevance information: mismatch object". It then clears this candidate description information and takes the result as the page description information; that is, the page description information corresponding to this target page is empty.
Those skilled in the art will understand that the above ways of adjusting the candidate description information corresponding to the target page are merely examples; other ways of adjusting the candidate description information corresponding to the target page, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will understand that the above ways of obtaining the page description information corresponding to the target page are merely examples; other ways of obtaining the page description information corresponding to the target page, whether existing now or arising in the future, shall, where applicable to the present invention, also fall within the scope of protection of the present invention and are incorporated herein by reference.
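Step S2 mentions word frequency statistics as one way of obtaining the candidate description information. A minimal sketch follows, assuming whitespace-delimited text (Chinese page content would first need a word segmenter) and a small hypothetical stop-word list; the function name and the `top_n` parameter are illustrative.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "and", "for", "to", "in", "is"}

def candidate_description(text, top_n=3):
    """Return the top_n most frequent non-stop words of the page content."""
    words = [w for w in re.findall(r"[a-z0-9]+", text.lower())
             if w not in STOP_WORDS]
    counts = Counter(words)
    # Sort by descending frequency, breaking ties alphabetically
    # so that the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]
```

The returned keywords would serve as candidate description information, to be adjusted afterwards according to the class of the page.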
The steps performed by information determining device 1 operate continuously with respect to one another. Specifically, in step S1, information determining device 1 continuously determines the classification relevance information corresponding to the target page to be processed; in step S2, information determining device 1 continuously adjusts the candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page. Here, those skilled in the art should understand that "continuously" means that the steps of information determining device 1 keep determining classification relevance information and obtaining page description information, respectively, until information determining device 1 stops determining classification relevance information for an extended period.
Preferably, the method further comprises step S4 (not shown). Specifically, in step S4, information determining device 1 performs machine learning on a plurality of training pages labeled with classification information, to obtain a page classification model for page classification; in step S1, information determining device 1 then determines the classification relevance information according to the page classification model, based on page-related information of the target page.
Specifically, in step S4, information determining device 1 performs machine learning on a plurality of training pages labeled with classification information, to obtain a page classification model for page classification. For example, suppose the training pages labeled with classification information are as follows:
I: "rowing regatta composition model essay" http://www.qc99.com/xiaoxue/sinj/101176.Html, virtual theme
II: sina/reading/novel shop/world masterpieces/"The Count of Monte Cristo" http://vip.book.sina.com.cn/book/index_81300.html, virtual theme
III: Beijing oral-health experts - Good Doctor Online http://www.haodf.com/jibing/kouqiangkuiyang/daifu.htm?province=beijing, exact match object
IV: sina sports news http://sports.sina.com.cn/, mismatch object
V: sina financial news http://finance.sina.com.cn/, mismatch object
VI: Vipshop digital accessories http://www.vipshop.com/show-0-48369-0.html?, broad match object
VII: Dangdang personal care products http://cosmetic.dangdang.com/, broad match object
Then, in step S4, information determining device 1 performs machine learning on these training pages labeled with classification information, for example by applying linear regression analysis or nonlinear regression analysis to the training set, to obtain a page classification model for page classification, such as a decision tree. Each node of the decision tree corresponds to a page category, and each page category contains several of the training pages: for example, the virtual theme category contains pages I and II, the exact match object category contains page III, the mismatch object category contains pages IV and V, and the broad match object category contains pages VI and VII.
Then, in step S1, information determining device 1 determines the classification relevance information according to the page classification model, based on the page-related information of the target page. Here, the page-related information includes but is not limited to the page body content category, page structure features and the like. For example, suppose that in step S1 the target page to be processed that information determining device 1 first obtains is "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html). Using the page classification model obtained in step S4 and page-related information of this target page such as its page body content information, information determining device 1 compares the page body content category of the target page with the page body content categories of the training pages contained in each page category of the page classification model. If the page body content category of the target page is determined to be the composition type, which is consistent with the page content category of the training pages contained in the virtual theme category, then in step S1 information determining device 1 determines that the classification relevance information of this target page is virtual theme.
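The comparison described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the invention: the body-content types attached to the training pages and the function names `build_model` and `classify` are hypothetical, and a plain mapping from page category to body-content types stands in for the decision tree. Only the category labels and the composition-type example come from the text.

```python
from collections import defaultdict

# Labeled training pages: (title, body-content type, page category).
# The page categories follow the text; the body-content types are assumed.
TRAINING_PAGES = [
    ("rowing regatta composition model essay", "composition", "virtual theme"),
    ("The Count of Monte Cristo", "novel", "virtual theme"),
    ("Beijing oral-health experts", "doctor listing", "exact match object"),
    ("sina sports news", "news", "mismatch object"),
    ("sina financial news", "news", "mismatch object"),
    ("Vipshop digital accessories", "product listing", "broad match object"),
    ("Dangdang personal care products", "product listing", "broad match object"),
]


def build_model(pages):
    """Group the body-content types seen in training by page category."""
    model = defaultdict(set)
    for _title, body_type, category in pages:
        model[category].add(body_type)
    return dict(model)


def classify(model, body_type):
    """Return the category whose training pages share this body-content type, else None."""
    for category, body_types in model.items():
        if body_type in body_types:
            return category
    return None


model = build_model(TRAINING_PAGES)
print(classify(model, "composition"))
```

A target page whose body-content type was never seen in training yields `None` here; how such pages should be handled is not specified in the text.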
Preferably, the method further comprises step S5 (not shown). Specifically, in step S5, information determining device 1 first obtains one or more search results corresponding to a query sequence; then performs subsequent processing on the one or more search results according to matching degree information between the page description information of the page corresponding to each search result and the query sequence; and then provides at least one of the subsequently processed search results to the application corresponding to the query sequence.
Specifically, in step S5, information determining device 1 first uses dynamic page technologies such as ASP or JSP to obtain the mobile query request in which the user enters a query sequence into the search box of a search engine via user equipment, submits the query sequence to the search engine, and receives the one or more search results corresponding to the query sequence that the search engine returns, thereby obtaining the one or more search results corresponding to the query sequence. For example, suppose a user enters the keyword "iphone protecting sheathing accessory" into the search box of a search engine on a PC and then clicks the search button. In step S5, information determining device 1 obtains the query sequence entered by the user via dynamic page technologies such as ASP or JSP, submits a page search request based on the query sequence to the search engine, and receives the one or more search results corresponding to the query sequence "iphone protecting sheathing accessory" returned by the search engine, such as search result A "homepage-Mi is the digital accessory certified products of apple discount store the more", search result B "... 3C apple accessory iphone shell cell-phone cover wholesale and retail containment vessel", search result C "unique containment vessel iphone4s accessory recommending mobile phone Technology Times Sina website", etc.
Those skilled in the art will understand that the above ways of obtaining the one or more search results corresponding to the query sequence are merely examples; other existing or future ways of obtaining the one or more search results corresponding to the query sequence, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S5, information determining device 1 performs subsequent processing on the one or more search results according to matching degree information between the page description information of the page corresponding to each search result and the query sequence. Specifically, information determining device 1 first performs semantic analysis on the page description information corresponding to a search result and determines the matching degree information between that page description information and the query sequence from the proportion of the words corresponding to the query sequence among all the words contained in the page description information: if the proportion is greater than 0.95, the matching degree information is high match; if the proportion is between 0.7 and 0.95, it is medium match; and if the proportion is less than 0.7, it is low match. Information determining device 1 then performs subsequent processing on the one or more search results according to this matching degree information, such as adjusting the order of the search results or filtering them. For example, continuing the example above, suppose the matching degree between the page description information of the page corresponding to search result A and the query sequence "iphone protecting sheathing accessory" is higher than that of search result B, which in turn is higher than that of search result C. Then in step S5, information determining device 1 determines from the matching degree information that the order of search results A, B and C is A, B, C; that is, when the user obtains the search results corresponding to the query sequence "iphone protecting sheathing accessory", search result A precedes search result B, and search result B precedes search result C. As another example, in step S5 information determining device 1 may also filter search results A, B and C according to the matching degree information, for instance by not providing search result C, whose matching degree is low, to the user.
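The ratio-based matching degree and the two kinds of subsequent processing (reordering and filtering) can be sketched as below. This is an illustrative sketch only: whitespace tokenization replaces the semantic analysis mentioned in the text, the boundary values 0.95 and 0.7 are assigned to the medium level as an assumption (the text leaves them open), and the function names and sample descriptions are hypothetical.

```python
def match_ratio(query, description):
    """Proportion of the description's words that also occur in the query."""
    query_words = set(query.lower().split())
    desc_words = description.lower().split()
    if not desc_words:
        return 0.0
    hits = sum(1 for word in desc_words if word in query_words)
    return hits / len(desc_words)


def match_level(ratio):
    """Map a ratio to the three levels named in the text."""
    if ratio > 0.95:
        return "high"
    if ratio >= 0.7:  # assumption: both boundary values count as medium
        return "medium"
    return "low"


def rank_and_filter(query, results, drop_low=False):
    """Order result ids by descending ratio; optionally drop low matches.

    `results` maps a result id to the page description text of its page.
    """
    scored = [(match_ratio(query, desc), rid) for rid, desc in results.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [rid for ratio, rid in scored
            if not (drop_low and match_level(ratio) == "low")]


query = "iphone protecting sheathing accessory"
results = {  # hypothetical page descriptions
    "C": "mobile phone technology news iphone",
    "A": "iphone protecting sheathing accessory",
    "B": "iphone sheathing accessory wholesale",
}
print(rank_and_filter(query, results))
print(rank_and_filter(query, results, drop_low=True))
```

With these sample descriptions the ratios are 1.0 for A, 0.75 for B and 0.2 for C, so the first call orders the results A, B, C and the second omits C.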
Those skilled in the art will understand that the above ways of performing subsequent processing on the one or more search results are merely examples; other existing or future ways of performing subsequent processing on the one or more search results, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S5, information determining device 1 uses dynamic web page technologies such as ASP, JSP or PHP, or other agreed communication means such as the http or https communication protocols, to provide at least one of the subsequently processed search results to the application corresponding to the query sequence, so that the application can provide the processed search results to the user corresponding to the query sequence. Here, the application includes but is not limited to a search engine, a browser and the like. For example, continuing the example above, in step S5 information determining device 1 provides search results A, B and C, after subsequent processing, to the user in the order A, B, C determined by the matching degree information, for the user to browse; or it does not provide to the user those of search results A, B and C whose matching degree information is below a predetermined threshold.
Fig. 4 shows a flowchart of a method for determining the page description information corresponding to a target page in accordance with a preferred embodiment of the present invention.
Specifically, in step S1', information determining device 1 determines the classification relevance information corresponding to the target page to be processed; in step S2', information determining device 1 adjusts the candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page; in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, steps S1' and S2' are the same as or similar to the corresponding steps shown in Fig. 3; they are therefore not described again and are incorporated herein by reference.
Specifically, in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information. Here, the presentation information includes but is not limited to content that is displayed in the page in some carrier such as a link, text, picture, video or animation and is used to convey information to the user; it includes but is not limited to page description content information corresponding to the page description information, page style information corresponding to the page description information, and the like. Specifically, in step S3', information determining device 1 determines the presentation information corresponding to the page description information by querying a presentation information database for the presentation information corresponding to that description information; or it determines the presentation information corresponding to the page description information by querying the presentation information database for resource placement content information of the presenting user of the target page corresponding to the page description information, or of a user associated with that presenting user, wherein the presentation information matches the page description information. Here, the presentation information database may be located in information determining device 1 or in a database connected to information determining device 1 over a network.
For example, suppose that in step S1' information determining device 1 determines that the classification relevance information of the target page to be processed, such as "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?), is broad match object, and that in step S2' information determining device 1 determines that the page description information of this target page comprises "digital accessory (the protecting sheathing accessory of iphone, charger etc.)-the digital accessory of nokia-..." and the like. Then in step S3', information determining device 1 may use this page description information as the presentation information corresponding to this target page. As another example, continuing the example above, in step S3' information determining device 1 may use the content of the page description information "digital accessory (the protecting sheathing accessory of iphone, charger etc.)-the digital accessory of nokia-..." together with resource placement content information of other presenting users corresponding to this page description information, such as "iphone sell goods information", as the presentation information.
Those skilled in the art will understand that the above ways of determining the presentation information corresponding to the target page are merely examples; other existing or future ways of determining the presentation information corresponding to the target page, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the method further comprises step S6' (not shown). Specifically, in step S6', information determining device 1 determines content sensitivity information of the target page; in step S3', information determining device 1 then determines the presentation information corresponding to the target page according to the page description information in combination with the content sensitivity information, wherein the presentation information matches both the page description information and the content sensitivity information.
Specifically, in step S6', information determining device 1 obtains the page content information of the target page, for example by parsing the HTML source code of the target page, and queries this page content information for predetermined content sensitivity information, to determine the content sensitivity information of the target page. Here, the content sensitivity information includes but is not limited to content suitable only for certain groups, such as adult information, and content related to accidents such as death, disease, injury, damage or other losses. For example, suppose that in step S1' the target page to be processed that information determining device 1 obtains is the news report "Chanel No. 5 faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html). Then in step S6', information determining device 1 parses the HTML source code of this page, finds that the page content information of this page contains words such as "sales ban" and "allergy", and accordingly determines that the content sensitivity information of this target page is "sales ban" and "allergy".
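A minimal sketch of this step, using Python's standard `html.parser` module to extract the visible text and a plain substring lookup against a list of predetermined terms. The term list, the sample page and the function name `content_sensitivity` are assumptions made for illustration; the text does not say how the predetermined content sensitivity information is stored or matched.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of script and style elements."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


# Hypothetical predetermined content sensitivity terms.
SENSITIVE_TERMS = ["sales ban", "allergy", "injury", "adult"]


def content_sensitivity(html, terms=SENSITIVE_TERMS):
    """Return the predetermined terms that occur in the page's visible text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return [term for term in terms if term in text]


page = ("<html><head><style>p{}</style></head><body>"
        "<p>Perfume faces a sales ban over allergy concerns.</p>"
        "</body></html>")
print(content_sensitivity(page))
```

For the sample page this yields the two terms "sales ban" and "allergy", mirroring the example in the text.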
Those skilled in the art will understand that the above content sensitivity information is merely an example; other existing or future content sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and is incorporated herein by reference.
Those skilled in the art will understand that the above ways of determining the content sensitivity information are merely examples; other existing or future ways of determining the content sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S3', information determining device 1 determines the presentation information corresponding to the target page according to the page description information in combination with the content sensitivity information, wherein the presentation information matches both the page description information and the content sensitivity information. For example, continuing the example above, suppose that in step S2' information determining device 1 determines that the page description information of the target page "Chanel No. 5 faces a sales ban by the European Union" (http://news.163.com/12/1109/05/8FRIGU8300014AED.html) is empty, that is, the classification relevance information of this target page is mismatch object. Then in step S3', according to this page description information and in combination with the content sensitivity information "sales ban" and "allergy", information determining device 1 determines that it is not appropriate to provide presentation information on this page, or that the presentation information is perfume of other brands, wherein the presentation information matches both the page description information and the content sensitivity information. As another example, suppose that in step S2' information determining device 1 determines that the page description information of a target page such as "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?) is "iphone digital accessory (protecting sheathing accessory, charger etc.)-the digital accessory of nokia-...", and that in step S6' information determining device 1 determines that the content sensitivity information of this target page indicates content suitable only for certain groups, such as adult information. Then in step S3', according to this page description information and in combination with the content sensitivity information, information determining device 1 determines that the presentation information corresponding to this target page comprises this page description information together with a statement that children are prohibited from browsing this page, wherein the presentation information matches both the page description information and the content sensitivity information.
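The two examples above can be condensed into a small decision function. This is a simplified sketch under assumptions: the function name, the `"adult"` marker and the wording of the notice are hypothetical, and the alternative of presenting other-brand content for an empty description is not modeled.

```python
def presentation_info(page_description, sensitivity):
    """Return the presentation information for a page, or None if none is shown.

    `page_description` is the page description text (empty for a mismatch object);
    `sensitivity` is a collection of content sensitivity markers for the page.
    """
    if not page_description:
        # Empty description: no presentation information is provided on this page.
        return None
    if "adult" in sensitivity:
        # Keep the description but add a notice restricting the audience.
        return page_description + " [Children are prohibited from browsing this page]"
    return page_description


print(presentation_info("", ["sales ban", "allergy"]))
print(presentation_info("iphone digital accessory", ["adult"]))
```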
Those skilled in the art will understand that the above ways of determining the presentation information in combination with content sensitivity information are merely examples; other existing or future ways of determining the presentation information in combination with content sensitivity information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
In a preferred embodiment (see Fig. 4), the method comprises step S1', step S2', step S3', step S7' (not shown) and step S8' (not shown), wherein step S1' comprises step S11' (not shown) and step S12' (not shown). The preferred embodiment is described below with reference to Fig. 4. Specifically, in step S11', information determining device 1 obtains the page accessed by the user as the target page; in step S12', information determining device 1 determines the classification relevance information corresponding to the target page; in step S2', information determining device 1 adjusts the candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page; in step S3', information determining device 1 determines, according to the page description information, presentation information corresponding to the target page, wherein the presentation information matches the page description information; in step S7', information determining device 1 updates the target page according to the presentation information to generate a corresponding result page, wherein the result page contains the presentation information; in step S8', information determining device 1 provides the result page to the user. Here, step S2' is the same as or similar to the corresponding step shown in Fig. 3, and step S3' is the same as or similar to the corresponding step shown in Fig. 4; they are therefore not described again and are incorporated herein by reference.
Specifically, in step S11', information determining device 1 first obtains the user's page access request and uses the page corresponding to the page access request as the target page; or it obtains the page accessed by the user through an application programming interface (API) provided by a third-party device such as a browser or search engine, and uses that page as the target page. For example, suppose a user enters http://news.sina.com.cn/ in the browser address bar and presses the Enter key. In step S11', information determining device 1 obtains the user's page access request through the application programming interface (API) provided by the browser; then, according to the page URL, it sends a corresponding page access request to the page server, obtains the page http://news.sina.com.cn/ corresponding to the page access request from the HTML response returned by the page server, and uses the page http://news.sina.com.cn/ as the target page. As another example, suppose a user enters the keyword "iphone protecting sheathing accessory" into the search box of a search engine and then clicks the search button. In step S11', information determining device 1 obtains the user's page access request through the application programming interface (API) provided by the search engine, submits a page search request based on this query sequence to the search engine, and receives the one or more search results corresponding to the query sequence "iphone protecting sheathing accessory" returned by the search engine, such as search result A "homepage-Mi is the digital accessory certified products of apple discount store the more", search result B "... 3C apple accessory iphone shell cell-phone cover wholesale and retail containment vessel", search result C "unique containment vessel iphone4s accessory recommending mobile phone Technology Times Sina website", etc. Then, in step S11', information determining device 1 uses the search results page containing these search results as the target page.
In step S12', information determining device 1 determines the classification relevance information corresponding to the target page. Here, the way in which information determining device 1 determines the classification relevance information corresponding to the target page in step S12' is the same as the way in which it does so in step S1 of Fig. 3; for brevity, it is not described again and is incorporated herein by reference.
Preferably, in step S12', information determining device 1 may also determine the classification relevance information corresponding to the target page in combination with user operation information of the user;
wherein the user operation information comprises at least any one of the following:
- page access session information of the user regarding the accessed page;
- page access record information of the user;
- page search records corresponding to the accessed page.
For example, consider the case where the user operation information comprises page access session information of the user regarding the accessed page; here, the page access session information includes but is not limited to, for example, consecutive access operations by the same user on the accessed page. Suppose that while browsing the page corresponding to a search result such as "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?), the user also queries for other information that the user needs, such as the accessory "apple data line white". Then in step S12', information determining device 1 determines that the classification relevance information corresponding to this target page is broad match object. As another example, when the user operation information comprises the user's page access record information, suppose that in step S11' information determining device 1 obtains the page access request submitted by the user for an accessed page such as "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html), and that the user frequently visits pages on topics such as how to answer current-affairs politics exam questions. Then in step S12', information determining device 1 determines that the classification relevance information corresponding to the accessed page "rowing regatta composition model essay" (http://www.qc99.com/xiaoxue/sinj/101176.Html) is a virtual theme, such as writing.
Those skilled in the art will understand that the above ways of determining the classification relevance information in combination with the user's user operation information are merely examples; other existing or future ways of determining the classification relevance information in combination with the user's user operation information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
In step S7', information determining device 1 updates the target page according to the presentation information, for example by embedding the presentation information in the target page, to generate a corresponding result page, wherein the result page contains the presentation information. For example, suppose that in step S3' information determining device 1 determines that the presentation information corresponding to a target page such as "iphone accessory only product can snap up at a low price! Digital accessory special show is preferential in limited time" (http://www.vipshop.com/show-0-48369-0.html?) comprises the page description information "digital accessory (the protecting sheathing accessory of iphone, charger etc.)-the digital accessory of nokia-...". Then in step S7', information determining device 1 may update this target page according to this presentation information, for example by embedding the presentation information in the target page at its navigation block area, wherein the presentation information matches the page description information.
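Embedding the presentation information at a navigation block can be sketched as a string insertion into the page's HTML. The anchor markup `<div id="nav">`, the wrapper element and the function name are assumptions for illustration; a real page would need its navigation block located first, for example by the page segmentation methods mentioned in the description.

```python
from html import escape


def embed_presentation(page_html, presentation, anchor='<div id="nav">'):
    """Insert the presentation information right after the first anchor tag.

    Returns the page unchanged when the anchor is not present.
    """
    if anchor not in page_html:
        return page_html
    # Escape the text so it cannot break the page markup.
    snippet = '<p class="presentation">' + escape(presentation) + "</p>"
    return page_html.replace(anchor, anchor + snippet, 1)


target_page = '<body><div id="nav"></div><div id="main">...</div></body>'
result_page = embed_presentation(target_page, "digital accessory (iphone, charger etc.)")
print(result_page)
```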
In step S8', information determining device 1 provides the result page to the user using dynamic web page technologies such as ASP, JSP or PHP, or other agreed communication means such as the http or https communication protocols.
Preferably, the method further comprises step S9' (not shown). Specifically, in step S9', information determining device 1 determines target position information corresponding to the presentation information in the target page; in step S7', information determining device 1 then updates the target page according to the presentation information in combination with the target position information, to generate the corresponding result page, wherein the result page contains the presentation information at the position corresponding to the target position information.
Specifically, in step S9', information determining device 1 determines the target position information corresponding to the presentation information in the target page. Here, the target position information indicates the position in the target page at which the presentation information is to be embedded, for example the position in the target page that the user is most likely to browse, or the navigation block area of the target page. The ways in which information determining device 1 determines the target position information in step S9' include but are not limited to at least any one of the following:
1) Determining the target position information according to the page layout information of the target page, for example by taking a blank region of the target page, such as the right sidebar of the page, as the target position information, or by taking a region of the target page that readily attracts the user's attention, such as the area around the search box on a search page, as the target position information. For example, suppose that in step S11' the target page to be processed obtained by the information determining device 1 is "iPhone accessories, Vipshop low-price flash sale! Digital accessories special, limited-time offer" (http://www.vipshop.com/show-0-48369-0.html?), and that in step S9' the information determining device 1 parses this target page, for example by an HTML tag analysis method or according to the VIPS (Vision-based Page Segmentation) algorithm, and obtains the page style information of the target page, such as its page layout information, in which the right sidebar of the target page is a blank region; then, in step S9', the information determining device 1 may take the right sidebar region of this target page as the target position information.
2) According to the page content information of the target page, taking the content position region in the target page that matches the content of the presentation information as the target position information. For example, suppose that in step S11' the target page obtained by the information determining device 1 is the page http://www.vipshop.com/show-0-48369-0.html?; that in step S3' the presentation information determined by the information determining device 1 comprises content such as "digital accessories (iPhone protective cases, chargers, etc.) - Nokia digital accessories - ..."; and that in step S9' the information determining device 1 parses this target page and finds that it comprises a plurality of channels, such as "Luxury Accessories", "Vipshop Group" and "Vipshop Style"; then, in step S9', the information determining device 1 takes the content position region in the target page that matches the content of this presentation information as the target position information, that is, it takes the position region of the "Vipshop Style" channel in the target page as the target position information of the presentation information to be presented.
3) Determining the target position information corresponding to the presentation information in the target page according to the page-related information of the target page, in combination with the user's page access record information. For example, suppose that in step S11' the target page to be processed obtained by the information determining device 1 is the page http://www.vipshop.com/show-0-48369-0.html?; that in step S3' the presentation information determined by the information determining device 1 comprises content such as "digital accessories (iPhone protective cases, chargers, etc.) - Nokia digital accessories - ..."; and that the user frequently clicks the content links in the top region of this target page; then, in step S9', the information determining device 1, in combination with the user's page access record information, takes the position in the target page of the content that the user frequently accesses, namely the top region of the page, as the target position information corresponding to the presentation information in this target page.
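The three ways of determining the target position listed above can be sketched as follows. This is only an illustration under stated assumptions: the page is assumed to have already been segmented into named regions (for example by HTML tag analysis or VIPS), each region is a hypothetical dict with `text` and `blank` fields, and word overlap is used as a stand-in for whatever content matching the device actually performs.

```python
from collections import Counter


def position_by_layout(regions):
    """Way 1: pick the first blank region reported by the layout analysis."""
    for name, info in regions.items():
        if info.get("blank"):
            return name
    return None


def position_by_content(regions, presentation_text):
    """Way 2: pick the region whose text shares the most words
    with the presentation information (assumed matching measure)."""
    words = set(presentation_text.lower().split())
    best, best_overlap = None, 0
    for name, info in regions.items():
        overlap = len(words & set(info.get("text", "").lower().split()))
        if overlap > best_overlap:
            best, best_overlap = name, overlap
    return best


def position_by_access_record(click_log):
    """Way 3: pick the region the user clicked most often,
    given a list of region names taken from the access record."""
    if not click_log:
        return None
    return Counter(click_log).most_common(1)[0][0]


# Hypothetical segmentation of the example page.
regions = {
    "top_nav": {"text": "home brands sale", "blank": False},
    "channel_digital": {"text": "digital accessories iphone charger", "blank": False},
    "right_sidebar": {"text": "", "blank": True},
}
```

With these sample regions, the layout rule selects the blank right sidebar, the content rule selects the channel whose text overlaps the presentation information, and the access-record rule selects whichever region appears most often in the click log.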
Those skilled in the art will understand that the above ways of determining the target position information are merely examples; other existing or future ways of determining the target position information, where applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S7', the information determining device 1 updates the target page according to the presentation information, in combination with the target position information, for example by embedding the presentation information at the position in the target page indicated by the target position information, to generate the corresponding results page, wherein the results page comprises the presentation information at the position corresponding to the target position information. For example, continuing the example above, suppose that in step S9' the information determining device 1 determines that the target position information of the presentation information "digital accessories (iPhone protective cases, chargers, etc.) - Nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? is a region on the right side of the page; then, in step S7', the information determining device 1 embeds this presentation information at the target position of the target page that it has determined, to generate the corresponding results page.
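A minimal sketch of the embedding in step S7', assuming the target position has been resolved to the `id` attribute of an element in the target page. The function name `embed_presentation` and the `presentation-info` class are hypothetical, and the regular-expression match is a simplification; a production system would more likely use a real HTML parser.

```python
import re
from html import escape


def embed_presentation(page_html, region_id, presentation_text):
    """Insert the presentation information just inside the element
    whose id attribute equals region_id, and return the results page."""
    snippet = '<div class="presentation-info">' + escape(presentation_text) + "</div>"
    # Match the opening tag that carries id="region_id".
    pattern = re.compile(
        r'(<[a-zA-Z][^>]*\bid="' + re.escape(region_id) + r'"[^>]*>)'
    )
    result, count = pattern.subn(lambda m: m.group(1) + snippet, page_html, count=1)
    if count == 0:
        raise ValueError(f"region {region_id!r} not found in page")
    return result
```

The text is escaped before insertion so that characters such as `<` in the presentation information cannot break the markup of the target page.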
Preferably, the information determining device 1 also performs a step S10' (not shown). Specifically, in step S10', the information determining device 1 determines target style information corresponding to the presentation information in the target page; wherein, in step S7', the information determining device 1 updates the target page according to the presentation information, in combination with the target style information, to generate the corresponding results page, wherein the results page comprises the presentation information rendered in accordance with the target style information.
Specifically, in step S10', the information determining device 1 determines the target style information corresponding to the presentation information in the target page. Here, the ways in which the information determining device 1 determines this target style information in step S10' include, but are not limited to, at least any one of the following:
1) Determining the target style information corresponding to the presentation information in the target page according to the style-related information of the target page. Specifically, in step S10', the information determining device 1 first determines the style-related information of the target page; it then either extracts one or more style settings from this style-related information as the target style information of the presentation information, or directly takes the style-related information of the target page as the target style information of the presentation information. For example, suppose that in step S11' the information determining device 1 obtains the target page "Vipshop brand fashion discount store" (http://www.vipshop.com/show-0-48369-0.html?), and that in step S3' the presentation information determined by the information determining device 1 comprises content such as "digital accessories (iPhone protective cases, chargers, etc.) - Nokia digital accessories - ..."; then the style determining means may first parse the target page, for example by an HTML tag analysis method or according to the VIPS (Vision-based Page Segmentation) algorithm, and obtain the style-related information of the target page, which comprises block features such as the top navigation block, breadcrumb navigation, the body text block, the left-column content block, the right-column information link block and the page-bottom content block, as well as page style settings such as a grey font color and a pink page tone. Then, in step S10', the information determining device 1 may determine the target style information of the presentation information according to the style-related information of the target page, for example by setting the page tone, font color and so on of the presentation information to be consistent with those of this target page, that is, setting the page tone to pink and the font color to grey.
2) Performing a matching query in a page style database according to the application classification information of the presentation information, to obtain the page style information corresponding to the application classification information as the target style information, wherein the page style database comprises a mapping between application classifications and page styles. Here, the application classification information includes, but is not limited to, the industry classification of the page corresponding to the first page access request, such as food, environmental protection, news, cosmetics, flowers, automobiles, novels and so on. For example, suppose that the application classification information of the presentation information belongs to the food industry; then, in step S10', the information determining device 1 accesses the page style database and performs a matching query, and obtains page style information corresponding to the application classification information that comprises breadcrumb navigation, a text summary block, a green page background, a black page font color and so on. As another example, suppose that the application classification information of the presentation information belongs to the cosmetics industry; then, in step S10', the information determining device 1 accesses the page style database and performs a matching query, and obtains page style information corresponding to the application classification information that comprises breadcrumb navigation, a text summary block, a page background in a warm tone such as pink, a white page font color and so on. Here, the page style database may be located either in the information determining device 1 or in a server connected to the information determining device 1 via a network.
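The two ways of determining the target style described above can be sketched as follows. This is an illustrative sketch only: `STYLE_DATABASE` is a hypothetical in-memory stand-in for the page style database, its two entries simply mirror the food and cosmetics examples in the text, and the key names are assumptions.

```python
# Hypothetical stand-in for the page style database:
# a mapping from application classification to page style settings.
STYLE_DATABASE = {
    "food": {"background": "green", "font_color": "black"},
    "cosmetics": {"background": "pink", "font_color": "white"},
}


def style_from_page(page_style, keys=("background", "font_color")):
    """Way 1: reuse selected style settings extracted from the target page."""
    return {k: page_style[k] for k in keys if k in page_style}


def style_from_category(category, database=STYLE_DATABASE, default=None):
    """Way 2: look up the style mapped to the application classification."""
    return database.get(category, default)
```

Passing `keys` lets the caller extract only some of the page's style settings, which corresponds to the option of taking "one or more style settings" rather than the whole style-related information.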
Those skilled in the art will understand that the above ways of determining the target style information corresponding to the presentation information in the target page are merely examples; other existing or future ways of determining this target style information, where applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S7', the information determining device 1 updates the target page according to the presentation information, in combination with the target style information, to generate the corresponding results page, wherein the results page comprises the presentation information rendered in accordance with the target style information. For example, continuing the example above, suppose that in step S10' the information determining device 1 determines that the target style information corresponding to the presentation information "digital accessories (iPhone protective cases, chargers, etc.) - Nokia digital accessories - ..." in the target page http://www.vipshop.com/show-0-48369-0.html? comprises breadcrumb navigation, a text summary block, a page background in a warm tone such as pink, a white page font color and so on; then, in step S7', the information determining device 1 embeds this presentation information in the target page in the display format given by this target style information, to generate the corresponding results page, wherein the results page comprises the presentation information rendered in accordance with the target style information.
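As a sketch of how the target style might be applied when the presentation information is embedded, the following function renders the presentation information as an HTML fragment carrying inline CSS. The style keys `background` and `font_color` and the function name `render_styled` are assumptions made for illustration; the patent does not specify a style representation.

```python
from html import escape


def render_styled(presentation_text, style):
    """Wrap the presentation information in a div whose inline CSS
    reflects the target style information."""
    # Map the assumed style keys onto CSS property names.
    css_names = {"background": "background-color", "font_color": "color"}
    declarations = [
        f"{css_names[key]}: {value}"
        for key, value in style.items()
        if key in css_names
    ]
    style_attr = escape("; ".join(declarations), quote=True)
    return f'<div style="{style_attr}">{escape(presentation_text)}</div>'
```

Style keys that have no CSS counterpart in `css_names` are ignored, so block-level features such as breadcrumb navigation would need separate handling.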
Those skilled in the art will understand that the above way of generating the results page in combination with the target style information is merely an example; other existing or future ways of generating the results page in combination with target style information, where applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
It should be noted that the present invention may be implemented in software and/or in a combination of software and hardware, for example by using an application-specific integrated circuit (ASIC), a general-purpose computer or any other similar hardware device. In one embodiment, the software program of the present invention may be executed by a processor to implement the steps or functions described above. Likewise, the software program of the present invention (including associated data structures) may be stored in a computer-readable recording medium, for example RAM, a magnetic or optical drive, a floppy disk or a similar device. In addition, some steps or functions of the present invention may be implemented in hardware, for example as circuitry that cooperates with a processor to perform each step or function.
In addition, part of the present invention may be applied as a computer program product, for example computer program instructions which, when executed by a computer, can invoke or provide the method and/or technical solution according to the present invention through the operation of that computer. The program instructions that invoke the method of the present invention may be stored in a fixed or removable recording medium, and/or transmitted via a data stream in a broadcast or other signal-bearing medium, and/or stored in the working memory of a computer device that runs according to the program instructions. Here, one embodiment of the present invention comprises an apparatus that includes a memory for storing computer program instructions and a processor for executing the program instructions, wherein, when the computer program instructions are executed by the processor, the apparatus is triggered to run the methods and/or technical solutions according to the foregoing embodiments of the present invention.
It is obvious to those skilled in the art that the present invention is not limited to the details of the above exemplary embodiments, and that the present invention can be realized in other specific forms without departing from its spirit or essential characteristics. The embodiments should therefore be regarded in all respects as exemplary and non-restrictive; the scope of the present invention is defined by the appended claims rather than by the above description, and all changes that fall within the meaning and range of equivalents of the claims are therefore intended to be embraced by the present invention. Any reference sign in a claim should not be construed as limiting the claim concerned. Furthermore, the word "comprising" obviously does not exclude other units or steps, and the singular does not exclude the plural. A plurality of units or means recited in a device claim may also be implemented by a single unit or means through software or hardware. Words such as "first" and "second" are used to denote names and do not denote any particular order.

Claims (21)

1. A method for determining page description information corresponding to a target page, wherein the method comprises the following steps:
a. determining classification relevance information corresponding to a target page to be processed;
b. adjusting candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page.
2. The method according to claim 1, wherein the method further comprises:
- performing machine learning on a plurality of training pages labeled with classification information, to obtain a page classification model for page classification;
wherein step a comprises:
- determining the classification relevance information according to the page classification model, based on page-related information of the target page.
3. The method according to claim 1, wherein the method further comprises:
c. determining presentation information corresponding to the target page according to the page description information, wherein the presentation information matches the page description information.
4. The method according to claim 3, wherein step a comprises:
- obtaining an accessed page that a user accesses, as the target page;
a1. determining the classification relevance information corresponding to the target page;
wherein the method further comprises:
d. updating the target page according to the presentation information, to generate a corresponding results page, wherein the results page comprises the presentation information;
- providing the results page to the user.
5. The method according to claim 4, wherein step a1 comprises:
- determining the classification relevance information corresponding to the target page in combination with user operation information of the user;
wherein the user operation information comprises at least any one of the following:
- page access session information of the user concerning the accessed page;
- page access record information of the user;
- a page search record corresponding to the accessed page.
6. The method according to claim 4 or 5, wherein the method further comprises:
- determining target position information corresponding to the presentation information in the target page;
wherein step d comprises:
- updating the target page according to the presentation information, in combination with the target position information, to generate the corresponding results page, wherein the results page comprises the presentation information at the position corresponding to the target position information.
7. The method according to claim 4 or 5, wherein the method further comprises:
- determining target style information corresponding to the presentation information in the target page;
wherein step d comprises:
- updating the target page according to the presentation information, in combination with the target style information, to generate the corresponding results page, wherein the results page comprises the presentation information corresponding to the target style information.
8. The method according to claim 3, wherein the method further comprises:
- determining content sensitivity information of the target page;
wherein step c comprises:
- determining the presentation information corresponding to the target page according to the page description information, in combination with the content sensitivity information, wherein the presentation information matches the page description information and the content sensitivity information.
9. The method according to claim 1, wherein the classification relevance information comprises at least any one of the following:
- virtual theme;
- exact-match object;
- broad-match object;
- mismatch object;
wherein the corresponding adjustment comprises at least any one of the following:
- when the classification relevance information comprises the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and taking the corresponding matching query result as the page description information;
- when the classification relevance information comprises the exact-match object, taking the candidate description information as the page description information;
- when the classification relevance information comprises the broad-match object, performing a matching query in a generalized object database according to the candidate description information, and taking the candidate description information together with its corresponding matching query result as the page description information;
- when the classification relevance information comprises the mismatch object, clearing the candidate description information and taking the result as the page description information.
10. The method according to claim 1, wherein the method further comprises:
- obtaining one or more search results corresponding to a query sequence;
- performing subsequent processing on the one or more search results according to matching degree information between the page description information of the pages corresponding to the search results and the query sequence;
- providing at least one of the one or more search results, after the subsequent processing, to the application corresponding to the query sequence.
11. An information determining device for determining page description information corresponding to a target page, wherein the information determining device comprises:
classification means for determining classification relevance information corresponding to a target page to be processed;
determining means for adjusting candidate description information corresponding to the target page according to the classification relevance information, to obtain the page description information corresponding to the target page.
12. The information determining device according to claim 11, wherein the information determining device further comprises:
model building means for performing machine learning on a plurality of training pages labeled with classification information, to obtain a page classification model for page classification;
wherein the classification means is configured to:
- determine the classification relevance information according to the page classification model, based on page-related information of the target page.
13. The information determining device according to claim 11, wherein the information determining device further comprises:
matching means for determining presentation information corresponding to the target page according to the page description information, wherein the presentation information matches the page description information.
14. The information determining device according to claim 13, wherein the classification means comprises:
an acquiring unit for obtaining an accessed page that a user accesses, as the target page;
a classification unit for determining the classification relevance information corresponding to the target page;
wherein the information determining device further comprises:
generating means for updating the target page according to the presentation information, to generate a corresponding results page, wherein the results page comprises the presentation information;
providing means for providing the results page to the user.
15. The information determining device according to claim 14, wherein the classification unit is configured to:
- determine the classification relevance information corresponding to the target page in combination with user-related information of the user;
wherein the user-related information comprises at least any one of the following:
- page access session information of the user concerning the accessed page;
- page access record information of the user;
- a page search record corresponding to the accessed page.
16. The information determining device according to claim 14 or 15, wherein the information determining device further comprises:
position determining means for determining target position information corresponding to the presentation information in the target page;
wherein the generating means is configured to:
- update the target page according to the presentation information, in combination with the target position information, to generate the corresponding results page, wherein the results page comprises the presentation information at the position corresponding to the target position information.
17. The information determining device according to claim 14 or 15, wherein the information determining device further comprises:
style determining means for determining target style information corresponding to the presentation information in the target page;
wherein the generating means is configured to:
- update the target page according to the presentation information, in combination with the target style information, to generate the corresponding results page, wherein the results page comprises the presentation information corresponding to the target style information.
18. The information determining device according to claim 13, wherein the information determining device further comprises:
sensitivity means for determining content sensitivity information of the target page;
wherein the matching means is configured to:
- determine the presentation information corresponding to the target page according to the page description information, in combination with the content sensitivity information, wherein the presentation information matches the page description information and the content sensitivity information.
19. The information determining device according to claim 11, wherein the classification relevance information comprises at least any one of the following:
- virtual theme;
- exact-match object;
- broad-match object;
- mismatch object;
wherein the corresponding adjustment comprises at least any one of the following:
- when the classification relevance information comprises the virtual theme, performing a matching query in a virtual theme database according to the candidate description information, and taking the corresponding matching query result as the page description information;
- when the classification relevance information comprises the exact-match object, taking the candidate description information as the page description information;
- when the classification relevance information comprises the broad-match object, performing a matching query in a generalized object database according to the candidate description information, and taking the candidate description information together with its corresponding matching query result as the page description information;
- when the classification relevance information comprises the mismatch object, clearing the candidate description information and taking the result as the page description information.
20. The information determining device according to claim 11, wherein the information determining device further comprises search processing means for:
- obtaining one or more search results corresponding to a query sequence;
- performing subsequent processing on the one or more search results according to matching degree information between the page description information of the pages corresponding to the search results and the query sequence;
- providing at least one of the one or more search results, after the subsequent processing, to the application corresponding to the query sequence.
21. A computer device comprising the information determining device according to any one of claims 11 to 20.
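For illustration only, the adjustment recited in claims 9 and 19 and the search-result processing recited in claims 10 and 20 might be sketched as below. The sketch assumes a single classification label per page, uses plain dicts as hypothetical stand-ins for the virtual theme database and the generalized object database, takes word-level Jaccard overlap as an assumed matching degree, and treats sorting as the "subsequent processing"; none of these choices is stated in the claims.

```python
def adjust_description(classification, candidate, theme_db=None, broad_db=None):
    """Adjust the candidate description according to the classification."""
    theme_db = theme_db or {}
    broad_db = broad_db or {}
    if classification == "virtual_theme":
        # Use the virtual-theme lookup result as the page description.
        return theme_db.get(candidate, "")
    if classification == "exact_match":
        # Keep the candidate description unchanged.
        return candidate
    if classification == "broad_match":
        # Keep the candidate and append the generalized-object lookup results.
        related = broad_db.get(candidate, [])
        return " - ".join([candidate, *related])
    if classification == "mismatch":
        # Clear the candidate description.
        return ""
    raise ValueError(f"unknown classification: {classification!r}")


def match_degree(query, description):
    """Assumed matching degree: Jaccard overlap of lowercase words."""
    q, d = set(query.lower().split()), set(description.lower().split())
    if not q or not d:
        return 0.0
    return len(q & d) / len(q | d)


def rerank(query, results, top_n=1):
    """Sort search results by how well their page description matches
    the query, and return the top_n results."""
    ranked = sorted(
        results,
        key=lambda r: match_degree(query, r["page_description"]),
        reverse=True,
    )
    return ranked[:top_n]
```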
CN201210452843.6A 2012-11-13 2012-11-13 Method and apparatus for determining the page description information corresponding to a target page Active CN102999576B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210452843.6A CN102999576B (en) 2012-11-13 2012-11-13 Method and apparatus for determining the page description information corresponding to a target page

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210452843.6A CN102999576B (en) 2012-11-13 2012-11-13 Method and apparatus for determining the page description information corresponding to a target page

Publications (2)

Publication Number Publication Date
CN102999576A true CN102999576A (en) 2013-03-27
CN102999576B CN102999576B (en) 2016-08-17

Family

ID=47928144

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210452843.6A Active CN102999576B (en) 2012-11-13 2012-11-13 Method and apparatus for determining the page description information corresponding to a target page

Country Status (1)

Country Link
CN (1) CN102999576B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103345476A (en) * 2013-06-09 2013-10-09 北京百度网讯科技有限公司 Method and device for determining present information corresponding to destination page
CN103399764A (en) * 2013-07-24 2013-11-20 北京小米科技有限责任公司 Method, device and terminal for setting interface colors
CN103440326A (en) * 2013-09-02 2013-12-11 百度在线网络技术(北京)有限公司 Method and apparatus for providing representation information
CN106709073A (en) * 2013-12-30 2017-05-24 北京奇虎科技有限公司 Browser notification pushing method and browser terminal
CN109492216A (en) * 2018-09-19 2019-03-19 平安科技(深圳)有限公司 Water note identifies automatically and the measures and procedures for the examination and approval, device and computer readable storage medium
CN110489187A (en) * 2018-05-15 2019-11-22 腾讯科技(深圳)有限公司 Page furbishing method, device, storage medium and computer equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101251855A (en) * 2008-03-27 2008-08-27 腾讯科技(深圳)有限公司 Equipment, system and method for cleaning internet web page
CN101404031A (en) * 2008-11-12 2009-04-08 北京搜狗科技发展有限公司 Method and system for recognizing concept type web pages
US20110196737A1 (en) * 2010-02-05 2011-08-11 Microsoft Corporation Semantic advertising selection from lateral concepts and topics
CN102609407A (en) * 2012-02-16 2012-07-25 复旦大学 Fine-grained semantic detection method of harmful text contents in network
CN102750334A (en) * 2012-06-01 2012-10-24 北京市农林科学院农业科技信息研究所 Accurate agricultural information pushing method based on data mining (DM)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103345476A (en) * 2013-06-09 2013-10-09 北京百度网讯科技有限公司 Method and device for determining present information corresponding to destination page
CN103345476B (en) * 2013-06-09 2017-03-01 北京百度网讯科技有限公司 Method and apparatus for determining presentation information corresponding to target pages
CN103399764A (en) * 2013-07-24 2013-11-20 北京小米科技有限责任公司 Method, device and terminal for setting interface colors
CN103440326A (en) * 2013-09-02 2013-12-11 百度在线网络技术(北京)有限公司 Method and apparatus for providing representation information
CN106709073A (en) * 2013-12-30 2017-05-24 北京奇虎科技有限公司 Browser notification pushing method and browser terminal
CN110489187A (en) * 2018-05-15 2019-11-22 腾讯科技(深圳)有限公司 Page refreshing method, device, storage medium and computer equipment
CN110489187B (en) * 2018-05-15 2021-09-24 腾讯科技(深圳)有限公司 Page refreshing method and device, storage medium and computer equipment
CN109492216A (en) * 2018-09-19 2019-03-19 平安科技(深圳)有限公司 Water note identifies automatically and the measures and procedures for the examination and approval, device and computer readable storage medium

Also Published As

Publication number Publication date
CN102999576B (en) 2016-08-17

Similar Documents

Publication Publication Date Title
CN105808685B (en) Promotion information pushing method and device
CN103544178B (en) Method and apparatus for providing a reconstructed page corresponding to a target page
CN103295145B (en) Mobile phone advertising method based on user consumption feature vector
US10031954B2 (en) Method and system for presenting a search result in a search result card
CN102999595B (en) Method and equipment for providing an access page corresponding to page information
CN102999576A (en) Method and equipment for confirming page description information corresponding to target pages
EP2941724A1 (en) Method and apparatus for generating webpage content
US10489474B1 (en) Techniques to leverage machine learning for search engine optimization
CN103699619A (en) Method and device for providing search results
CN107918622A (en) Content recommendation and display methods, client, server and system
US10019419B2 (en) Method, server, browser, and system for recommending text information
CN103455524A (en) Method and device for displaying and acquiring entry information
CN103440260A (en) Method and equipment used for providing representation information
CN103886016B (en) Method and apparatus for determining spam text information in a page
CN106445971A (en) Application recommendation method and system
CN103678325A (en) Method and device for providing browsing page corresponding to initial page
CN103703483A (en) Information providing device, information providing method, information providing program, information display program, and computer-readable recording medium for storing information providing program
CN106371706A (en) Method and device for site selection of application shortcuts
CN102982135A (en) Method and device used for providing presented information
CN105138702B (en) Network searching method based on search engine and electronic equipment
CN107153697A (en) Product search method and device for a commodity transaction website
Van Looy Search engine optimization
JP6295577B2 (en) Server apparatus, program, and information providing method
Brumen et al. Use of mobile technologies in tourism: Natural health resorts study
CN106776634A (en) Network access method, device and terminal device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant