CN103559203A - Method, device and system for web page sorting - Google Patents

Method, device and system for web page sorting Download PDF

Info

Publication number
CN103559203A
CN103559203A CN201310464478.5A CN201310464478A CN103559203A CN 103559203 A CN103559203 A CN 103559203A CN 201310464478 A CN201310464478 A CN 201310464478A CN 103559203 A CN103559203 A CN 103559203A
Authority
CN
China
Prior art keywords
webpage
access duration
duration information
search engine
engine server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310464478.5A
Other languages
Chinese (zh)
Inventor
肖鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201310464478.5A priority Critical patent/CN103559203A/en
Publication of CN103559203A publication Critical patent/CN103559203A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention discloses a method, a device and a system for web page sorting. The method includes that access duration information of a web page is acquired, and the acquired access duration information is provided to a search engine server, so that the search engine server can refer to the access duration information of the web page in web page sorting. According to the technique scheme, since the access duration information of the web page can best reflect the real access web pages of a user, degree of accuracy of search results is increased to a great extent, and search experiences of the user is further improved.

Description

Web page sequencing method, device and system
Technical field
The present invention relates to technical field of the computer network, be specifically related to a kind of Web page sequencing method, device and system.
Background technology
Development along with the universal and internet of computing machine, people are more and more frequent to the use of network, computer network becomes requisite instrument in people's daily life gradually, and the various abundant information service that search engine can provide because of itself, information and the data of every aspect are provided to user, in daily life, be widely used, brought huge facility to the daily productive life of people.
Search engine web site is the class website that retrieval service is provided on internet specially, the search engine server of these websites is by modes such as web search software or network entry, the info web of a large amount of websites on internet is collected, after processing is processed, set up information database and index data base, user, by inputted search word (query) in the interface providing at search engine, obtains the Search Results that search engine returns for this search word.What Search Results was normally tactful according to a series of scoring and sort algorithm obtains.At first, mainly using the correlativity of webpage and search word as the foundation that determines search results ranking.
Correlativity determined by several factors conventionally, and one of them is the PageRank(webpage rank of webpage).PageRank weighs the importance of a webpage according to the internal links of other webpages.In simple terms, other webpages form the PageRank of this webpage to each link of certain particular webpage.But, consider the diversity of Search Results, except will considering correlativity, user's visit capacity is also an important judgment basis.
In the prior art, search engine server is mainly, by user, the clicking rate of webpage is carried out to the visit capacity of counting user to webpage.But, because clicking may not have after a webpage, user real checks that web site contents is (such as maloperation is closed after clicking certain web page interlinkage at once, or click after opening webpage is found its content to lose interest in and directly close), therefore thisly according to clicking rate, come counting user not accurate enough to the scheme of the visit capacity of webpage, the result presentation that can not well user really be liked out.
Now webpage quantity is on the internet considerably beyond the user's ability that can read and access, and this quantity is also at rapid growth, find the webpage of the true access of user, the degree of accuracy of raising Search Results that can be very large, and then improve user's search experience.
Summary of the invention
In view of the above problems, the present invention has been proposed to provide a kind of Web page sequencing method that overcomes the problems referred to above or address the above problem at least in part and corresponding device and system.
According to one aspect of the present invention, a kind of Web page sequencing method is provided, the method comprises:
Obtain the access duration information of webpage;
The access duration information of obtained webpage is offered to search engine server, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
Alternatively, the access duration information that obtains webpage described in comprises: use browser plug-in at browser end, to obtain the access duration information of webpage.
Alternatively, the access duration information that described use browser plug-in obtains webpage at browser end comprises: browser plug-in prison input-output operation event; When browser plug-in captures the input-output operation event under web page browsing state at every turn, a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
Alternatively, the described access duration information by obtained webpage offers search engine server and comprises: when browser plug-in gets the event that web page address changes or webpage is closed, the access duration information of cumulative webpage is offered to search engine server.
Alternatively, the access duration information of described webpage comprises: the access duration of the address of webpage and webpage;
Wherein, browser plug-in obtains the address of the webpage that browser window is corresponding according to current focus window.
Alternatively, the described access duration information by obtained webpage offers search engine server and comprises:
The access duration information of obtained webpage is directly sent to search engine server;
Or,
The access duration information of obtained webpage is sent to security server, then be transmitted to search engine server by security server.
According to a further aspect in the invention, provide a kind of webpage sorting device, this device comprises:
Duration acquiring unit, is suitable for obtaining the access duration information of webpage, and sends to transmitting element;
Transmitting element, the access duration information that is suitable for webpage that duration acquiring unit is sent offers search engine server, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
Alternatively, described duration acquiring unit, is suitable for monitoring input-output operation event, while capturing the input-output operation event under web page browsing state at every turn, and a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
Alternatively, described duration acquiring unit, is suitable for, when getting the event that web page address changes or webpage is closed, the access duration information of cumulative webpage being sent to transmitting element.
Alternatively, the access duration information of described webpage comprises: the access duration of the address of webpage and webpage;
Described duration acquiring unit, is suitable for according to the address of webpage corresponding to current focus window acquisition browser window.
Alternatively, described transmitting element, is suitable for the access duration information of obtained webpage directly to send to search engine server; Or, be suitable for the access duration information of obtained webpage to send to security server, then be transmitted to search engine server by security server.
According to another aspect of the invention, provide a kind of webpage sorting system, this system comprises: search engine server and a plurality of webpage sorting device as described in above-mentioned any one; Wherein,
Described transmitting element, is suitable for the access duration information of obtained webpage directly to send to search engine server;
Described search engine server, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.
Alternatively, described search engine server, is suitable for, according to the PageRank value of the access duration information adjusting webpage of webpage, according to the PageRank value of each webpage, sorting.
According to another aspect of the invention, provide a kind of webpage sorting system, wherein, this system comprises: search engine server, security server and a plurality of webpage sorting device as described in above-mentioned any one; Wherein,
Described transmitting element, is suitable for the access duration information of obtained webpage to send to security server;
Described security server, is suitable for the access duration information of the webpage of a plurality of webpage sorting devices transmissions to be transmitted to described search engine server;
Described search engine server, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.
Alternatively, described search engine server, is suitable for, according to the PageRank value of the access duration information adjusting webpage of webpage, according to the PageRank value of each webpage, sorting.
According to the access duration information that obtains webpage of the present invention, the access duration information of obtained webpage is offered to search engine server, for search engine server technical scheme with reference to the access duration information of webpage when carrying out webpage sorting, because the access duration information of webpage more can reflect the webpage of the true access of user, therefore improve to a great extent the accuracy of Search Results, and then improved user's search experience.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Accompanying drawing explanation
By reading below detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing is only for the object of preferred implementation is shown, and do not think limitation of the present invention.And in whole accompanying drawing, by identical reference symbol, represent identical parts.In the accompanying drawings:
Fig. 1 shows a kind of process flow diagram of Web page sequencing method according to an embodiment of the invention;
Fig. 2 shows a kind of according to an embodiment of the invention structural drawing of webpage sorting device;
Fig. 3 shows a kind of according to an embodiment of the invention composition schematic diagram of webpage sorting system;
Fig. 4 shows a kind of according to an embodiment of the invention composition schematic diagram of webpage sorting system.Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in accompanying drawing, yet should be appreciated that and can realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order more thoroughly to understand the disclosure that these embodiment are provided, and can by the scope of the present disclosure complete convey to those skilled in the art.
Fig. 1 shows a kind of process flow diagram of Web page sequencing method according to an embodiment of the invention.As shown in Figure 1, the method comprises:
Step S110, obtains the access duration information of webpage.
In one embodiment of the invention, use browser plug-in at browser end, to obtain the access duration information of webpage.By browser plug-in, know webpage state, know the input and output events such as mouse, keyboard behavior, with this, judge current web page accessed effective time.
Step S120, offers search engine server by the access duration information of obtained webpage, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
In this step, the access duration information of obtained webpage is offered to search engine server, make search engine server using the important evidence of these data as search engine sequence.
In scheme shown in Fig. 1, because the access duration information of webpage more can reflect the webpage of the true access of user, therefore the accuracy that has improved to a great extent Search Results, the webpage sorting that user really can be liked is to above and represent, thereby improved user's search experience.
In one embodiment of the invention, use browser plug-in at browser end, to obtain the access duration information of webpage.Browser is as the instrument of user's accessed web page, can get user's concrete web page access situation, when the page is opened, can use browser plug-in to know webpage state etc., can be by monitoring the input and output event of the terminal of running browser, as the events such as mouse and keyboard know whether this webpage is opened, and whether active (whether at front end) etc., and then analyze page events behavior.
Use browser plug-in can obtain the web page address of current browsing page, the variation of the variation of capture net page address, capture net page status (comprise start load, loaded, loaded unsuccessfully etc.) and the behavior etc. of closing of catching webpage.By browser plug-in, enter after browser, hook system event API, can monitor input and output event, as mouse and keyboard behavior (can be also other possible input and output behaviors), with this, confirm that current web page is in active state, and obtain web page address corresponding to browser window according to current focus window.A page, when active state, conventionally has mouse click, keyboard input or inputted search word event, or has the events such as mouse roller, according to these events, can know that whether user is at present at active state.The active procedure of mouse comprises residence time of mouse behavior, mouse and mouse current location etc.Browser plug-in records mouse event and cumulative time, by under the activation record of visitor's operating mouse in website, when Website page is closed, records cumulative quantity.
The Plugin Mechanism of the official of various browsers, as the npapi of the plugin of the BHO of IE, Chrome, Firefox etc. directly supports obtaining of above-mentioned event and state, therefore the Plugin Mechanism that those of ordinary skill in the art provides according to official is write the browser plug-in needing in the application and can be realized, and no longer elaborates here.
Specifically, in a specific embodiment of the present invention, obtain in the following way the access duration information of webpage:
(1) browser plug-in monitoring input and output events (as mouse behavior and keyboard behavior etc.);
(2) when browser plug-in captures the input and output event under web page browsing state at every turn, a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
(3), when browser plug-in gets the event that web page address changes or webpage is closed, the access duration information of cumulative webpage is offered to search engine server.
Wherein: the access duration information of webpage comprises: the access duration of the address of webpage and webpage.Browser plug-in obtains the address of the webpage that browser window is corresponding according to current focus window.
For example: Preset Time length is 100 milliseconds, the time point that browser captures the browse state of webpage is for the first time a, and the mouse of continuous three times or the keyboard behavior that capture afterwards under this web page browsing state occur in b, c and tri-time points of d successively.If a and b, b and c, and the interval of c and d is all greater than 100 milliseconds, the access duration of this webpage is cumulative 300 milliseconds so.If a and b, and the interval of c and d is greater than 100 milliseconds, and the interval of b and c is less than 100 milliseconds, the so cumulative c-b+200 millisecond of the access duration of this webpage.
In this specific embodiment, do not have to adopt and using the time point opening the page, switch the page, close the page and enter the state that exits focus and as the reason of the standard of time cumulation be, may there is deviation in these behaviors, cause the situation that occurs that web page access duration is very long.And adopt above-mentioned scheme can avoid this problem.
The access duration information of the webpage in one embodiment of the invention, browser end being obtained directly sends to search engine server.
In another embodiment of the present invention, the access duration information of the webpage that browser is obtained sends to security server, then is transmitted to search engine server by security server.
Above-mentioned two schemes can be selected according to real network framework situation.For example, when search engine server can be provided for receiving from the interface logic of the access duration information of browser end, the scheme that can adopt front a kind of browser end directly to send to search engine server, on the contrary a kind of by the scheme of security server transfer after adopting.
The access duration information that search engine server the receives webpage line item of going forward side by side, regulates the PageRank value of each webpage according to the access duration information of each webpage, so as by this market demand in webpage sorting process.Cardinal rule is wherein, the access duration of a webpage is longer, and its probability that is discharged to forward position is larger.
In one embodiment of the invention, search engine server is receiving the access duration information of the webpage of browser end statistics, during specific implementation, because search engine server is being safeguarded index data base, wherein preserved the information that grabs all webpages, while receiving user's searching request, according to the data in this index data base, to user, return to Search Results exactly, therefore, can be for each webpage arranges this parameter of access duration in index data base, search engine server regulates the weight of webpage according to this parameter, thereby regulates the sequence of webpage.
Utilize this method of the present invention, can preferentially provide web page resources high-quality, that really meet user search intent to user, thereby reduce the time that user browsed, checked webpage, improve user's retrieval usefulness.
Fig. 2 shows a kind of according to an embodiment of the invention structural drawing of webpage sorting device.As shown in Figure 2, this webpage sorting device 200 comprises: duration acquiring unit 201 and transmitting element 202.
Duration acquiring unit 201, is suitable for obtaining the access duration information of webpage, and sends to transmitting element 202;
Transmitting element 202, the access duration information that is suitable for webpage that duration acquiring unit is sent offers search engine server, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
In one embodiment of the invention, duration acquiring unit 201, be suitable for monitoring input and output event (as mouse behavior and keyboard behavior etc.), while capturing the input and output event under web page browsing state at every turn, a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
For example: Preset Time length is 100 milliseconds, the time point that browser first captures web page browsing state is a, and the mouse of continuous three times or the keyboard behavior that capture afterwards under this web page browsing state occur in b, c and tri-time points of d successively.If a and b, b and c, and the interval of c and d is all greater than 100 milliseconds, the access duration of this webpage is cumulative 300 milliseconds so.If a and b, and the interval of c and d is greater than 100 milliseconds, and the interval of b and c is less than 100 milliseconds, the so cumulative c-b+200 millisecond of the access duration of this webpage.
In one embodiment of the invention, duration acquiring unit 201, is suitable for, when getting the event that web page address changes or webpage is closed, the access duration information of cumulative webpage being sent to transmitting element 202.
In one embodiment of the invention, the access duration information of described webpage comprises: the access duration of the address of webpage and webpage; Duration acquiring unit 201, is suitable for according to the address of webpage corresponding to current focus window acquisition browser window.
In one embodiment of the invention, transmitting element 202, is suitable for the access duration information of obtained webpage directly to send to search engine server; Or, be suitable for the access duration information of obtained webpage to send to security server, then be transmitted to search engine server by security server.
Fig. 3 shows a kind of according to an embodiment of the invention composition schematic diagram of webpage sorting system.As shown in Figure 3, this system comprises: search engine server 400, security server 300 and a plurality of webpage sorting device 200 as shown in Figure 2.
Transmitting element 202 in webpage sorting device 200, is suitable for the access duration information of obtained webpage to send to security server 300;
Security server 300, is suitable for the access duration information of the webpage of a plurality of webpage sorting devices transmissions to be transmitted to described search engine server 400;
Search engine server 400, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.Particularly, search engine server 400 is suitable for regulating according to the access duration information of webpage the PageRank value of webpage, then sorts according to the PageRank value of each webpage.
Fig. 4 shows a kind of according to an embodiment of the invention composition schematic diagram of webpage sorting system.As shown in Figure 4, this system comprises: search engine server 400 and a plurality of device of webpage sorting as shown in Figure 2 200.
Transmitting element 202 in webpage sorting device 200, is suitable for the access duration information of obtained webpage directly to send to search engine server 400;
Search engine server 400, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.Particularly, search engine server 400 is suitable for regulating according to the access duration information of webpage the PageRank value of webpage, then sorts according to the PageRank value of each webpage.
The access duration information that search engine server in Fig. 3 and Fig. 4 400 the receives webpages line item of going forward side by side, regulates the PageRank value of each webpage according to the access duration information of each webpage, so as by this market demand in webpage sorting process.Cardinal rule is wherein, the access duration of a webpage is longer, and its probability that is discharged to forward position is larger.In one embodiment of the invention, search engine server 400 receives the access duration information of the webpage of browser end statistics, during specific implementation, because search engine server 400 is being safeguarded index data base, wherein preserved the information that grabs all webpages, while receiving user's searching request, according to the data in this index data base, to user, return to Search Results exactly, therefore, can be for each webpage arranges this parameter of access duration in index data base, search engine server 400 regulates the weight of webpage according to this parameter, thereby regulate the sequence of webpage.
In sum, according to the access duration information that obtains webpage of the present invention, the access duration information of obtained webpage is offered to search engine server, for search engine server technical scheme with reference to the access duration information of webpage when carrying out webpage sorting, because the access duration information of webpage more can reflect the webpage of the true access of user, therefore improve to a great extent the accuracy of Search Results, and then improved user's search experience.
It should be noted that:
The algorithm providing at this is intrinsic not relevant to any certain computer, virtual system or miscellaneous equipment with demonstration.Various general-purpose systems also can with based on using together with this teaching.According to description above, it is apparent constructing the desired structure of this type systematic.In addition, the present invention is not also for any certain programmed language.It should be understood that and can utilize various programming languages to realize content of the present invention described here, and the description of above language-specific being done is in order to disclose preferred forms of the present invention.
In the instructions that provided herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can not put into practice in the situation that there is no these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the above in the description of exemplary embodiment of the present invention, each feature of the present invention is grouped together into single embodiment, figure or sometimes in its description.Yet, the method for the disclosure should be construed to the following intention of reflection: the present invention for required protection requires than the more feature of feature of clearly recording in each claim.Or rather, as reflected in claims below, inventive aspect is to be less than all features of disclosed single embodiment above.Therefore, claims of following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can the module in the equipment in embodiment are adaptively changed and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and can put them into a plurality of submodules or subelement or sub-component in addition.At least some in such feature and/or process or unit are mutually repelling, and can adopt any combination to combine all processes or the unit of disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and disclosed any method like this or equipment.Unless clearly statement in addition, in this instructions (comprising claim, summary and the accompanying drawing followed) disclosed each feature can be by providing identical, be equal to or the alternative features of similar object replaces.
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included in other embodiment, the combination of the feature of different embodiment means within scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module moved on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize according to the some or all functions of the some or all parts in the webpage sorting device of the embodiment of the present invention and system.The present invention for example can also be embodied as, for carrying out part or all equipment or device program (, computer program and computer program) of method as described herein.Realizing program of the present invention and can be stored on computer-readable medium like this, or can there is the form of one or more signal.Such signal can be downloaded and obtain from internet website, or provides on carrier signal, or provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation that do not depart from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed as element or step in the claims.Being positioned at word " " before element or " one " does not get rid of and has a plurality of such elements.The present invention can be by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to carry out imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title by these word explanations.

Claims (15)

1. a Web page sequencing method, wherein, the method comprises:
Obtain the access duration information of webpage;
The access duration information of obtained webpage is offered to search engine server, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
2. the access duration information that obtains webpage described in the method for claim 1, wherein comprises:
Use browser plug-in at browser end, to obtain the access duration information of webpage.
3. method as claimed in claim 2, wherein, the access duration information that described use browser plug-in obtains webpage at browser end comprises:
Browser plug-in monitoring input-output operation event;
When browser plug-in captures the input-output operation event under web page browsing state at every turn, a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
4. method as claimed in claim 3, wherein, the described access duration information by obtained webpage offers search engine server and comprises:
When browser plug-in gets the event that web page address changes or webpage is closed, the access duration information of cumulative webpage is offered to search engine server.
5. method as claimed in claim 3, wherein, the access duration information of described webpage comprises: the access duration of the address of webpage and webpage;
Wherein, browser plug-in obtains the address of the webpage that browser window is corresponding according to current focus window.
6. the method as described in any one in claim 1 to 5, wherein, the described access duration information by obtained webpage offers search engine server and comprises:
The access duration information of obtained webpage is directly sent to search engine server;
Or,
The access duration information of obtained webpage is sent to security server, then be transmitted to search engine server by security server.
7. a webpage sorting device, wherein, this device comprises:
Duration acquiring unit, is suitable for obtaining the access duration information of webpage, and sends to transmitting element;
Transmitting element, the access duration information that is suitable for webpage that duration acquiring unit is sent offers search engine server, for search engine server access duration information with reference to webpage when carrying out webpage sorting.
8. device as claimed in claim 7, wherein,
Described duration acquiring unit, is suitable for monitoring input-output operation event, while capturing the input-output operation event under web page browsing state at every turn, and a cumulative time period on the access duration of current browsing page, and record the time point of this behavior;
Wherein: if between the time point of this behavior and the time point of last behavior interval greater than or equal Preset Time length, the cumulative described time period equals described Preset Time length, otherwise the cumulative described time period equals the time interval between the time point of this behavior and the time point of last behavior.
9. device as claimed in claim 8, wherein,
Described duration acquiring unit, is suitable for, when getting the event that web page address changes or webpage is closed, the access duration information of cumulative webpage being sent to transmitting element.
10. device as claimed in claim 8, wherein, the access duration information of described webpage comprises: the access duration of the address of webpage and webpage;
Described duration acquiring unit, is suitable for according to the address of webpage corresponding to current focus window acquisition browser window.
11. devices as described in any one in claim 7 to 10, wherein,
Described transmitting element, is suitable for the access duration information of obtained webpage directly to send to search engine server; Or, be suitable for the access duration information of obtained webpage to send to security server, then be transmitted to search engine server by security server.
12. 1 kinds of webpage sorting systems, wherein, this system comprises: search engine server and a plurality of webpage sorting device as described in any one in claim 7 to 10; Wherein,
Described transmitting element, is suitable for the access duration information of obtained webpage directly to send to search engine server;
Described search engine server, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.
13. systems as claimed in claim 12, is characterized in that,
Described search engine server, is suitable for, according to the PageRank value of the access duration information adjusting webpage of webpage, according to the PageRank value of each webpage, sorting.
14. 1 kinds of webpage sorting systems, wherein, this system comprises: search engine server, security server and a plurality of webpage sorting device as described in any one in claim 7 to 10; Wherein,
Described transmitting element, is suitable for the access duration information of obtained webpage to send to security server;
Described security server, is suitable for the access duration information of the webpage of a plurality of webpage sorting devices transmissions to be transmitted to described search engine server;
Described search engine server, is suitable for when carrying out webpage sorting the access duration information with reference to webpage.
15. systems as claimed in claim 14, is characterized in that,
Described search engine server, is suitable for, according to the PageRank value of the access duration information adjusting webpage of webpage, according to the PageRank value of each webpage, sorting.
CN201310464478.5A 2013-10-08 2013-10-08 Method, device and system for web page sorting Pending CN103559203A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310464478.5A CN103559203A (en) 2013-10-08 2013-10-08 Method, device and system for web page sorting

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310464478.5A CN103559203A (en) 2013-10-08 2013-10-08 Method, device and system for web page sorting

Publications (1)

Publication Number Publication Date
CN103559203A true CN103559203A (en) 2014-02-05

Family

ID=50013450

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310464478.5A Pending CN103559203A (en) 2013-10-08 2013-10-08 Method, device and system for web page sorting

Country Status (1)

Country Link
CN (1) CN103559203A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426432A (en) * 2015-11-02 2016-03-23 广东欧珀移动通信有限公司 Website ranking method and device
CN106874165A (en) * 2015-12-14 2017-06-20 北京国双科技有限公司 Page detection method and device
CN107153908A (en) * 2017-03-24 2017-09-12 国家计算机网络与信息安全管理中心 Mobile news App influence power ranking methods
CN107402864A (en) * 2017-06-07 2017-11-28 阿里巴巴集团控股有限公司 Access processing method, device and equipment, the computer-readable recording medium of duration
CN107590176A (en) * 2017-07-31 2018-01-16 北京奇艺世纪科技有限公司 A kind of preparation method of evaluation index, device and electronic equipment
CN107797906A (en) * 2017-10-09 2018-03-13 四川巧夺天工信息安全智能设备有限公司 A kind of method for monitoring a variety of browsing device net pages in real time and browsing record
CN108920696A (en) * 2017-12-04 2018-11-30 重庆第二师范学院 A kind of Web page sequencing method and system based on transition probability
CN111382380A (en) * 2018-12-27 2020-07-07 北京奇虎科技有限公司 Statistical method and device for page access duration

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020026589A1 (en) * 2000-08-08 2002-02-28 Mikio Fukasawa Computer monitoring system
US20040024756A1 (en) * 2002-08-05 2004-02-05 John Terrell Rickard Search engine for non-textual data
US20070011020A1 (en) * 2005-07-05 2007-01-11 Martin Anthony G Categorization of locations and documents in a computer network
CN101079049A (en) * 2006-11-15 2007-11-28 腾讯科技(深圳)有限公司 Search system and method
CN101382938A (en) * 2008-10-23 2009-03-11 浙江大学 Network video ordering method based on focusing time of users
CN101782909A (en) * 2009-01-19 2010-07-21 杨云国 Search engine based on operation intention of user
CN102227737A (en) * 2008-11-28 2011-10-26 Est软件公司 Web page searching system and method using access time and frequency
CN102231165A (en) * 2011-07-11 2011-11-02 浙江大学 Method for searching and sequencing personalized web pages based on user retention time analysis
CN102779136A (en) * 2011-05-13 2012-11-14 北京搜狗科技发展有限公司 Method and device for information search

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020026589A1 (en) * 2000-08-08 2002-02-28 Mikio Fukasawa Computer monitoring system
US20040024756A1 (en) * 2002-08-05 2004-02-05 John Terrell Rickard Search engine for non-textual data
US20070011020A1 (en) * 2005-07-05 2007-01-11 Martin Anthony G Categorization of locations and documents in a computer network
CN101079049A (en) * 2006-11-15 2007-11-28 腾讯科技(深圳)有限公司 Search system and method
CN101382938A (en) * 2008-10-23 2009-03-11 浙江大学 Network video ordering method based on focusing time of users
CN102227737A (en) * 2008-11-28 2011-10-26 Est软件公司 Web page searching system and method using access time and frequency
CN101782909A (en) * 2009-01-19 2010-07-21 杨云国 Search engine based on operation intention of user
CN102779136A (en) * 2011-05-13 2012-11-14 北京搜狗科技发展有限公司 Method and device for information search
CN102231165A (en) * 2011-07-11 2011-11-02 浙江大学 Method for searching and sequencing personalized web pages based on user retention time analysis

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426432A (en) * 2015-11-02 2016-03-23 广东欧珀移动通信有限公司 Website ranking method and device
CN106874165A (en) * 2015-12-14 2017-06-20 北京国双科技有限公司 Page detection method and device
CN107153908A (en) * 2017-03-24 2017-09-12 国家计算机网络与信息安全管理中心 Mobile news App influence power ranking methods
CN107402864A (en) * 2017-06-07 2017-11-28 阿里巴巴集团控股有限公司 Access processing method, device and equipment, the computer-readable recording medium of duration
CN107402864B (en) * 2017-06-07 2020-08-04 阿里巴巴集团控股有限公司 Method, device and equipment for processing access duration and readable medium
CN107590176A (en) * 2017-07-31 2018-01-16 北京奇艺世纪科技有限公司 A kind of preparation method of evaluation index, device and electronic equipment
CN107590176B (en) * 2017-07-31 2021-01-15 北京奇艺世纪科技有限公司 Evaluation index obtaining method and device and electronic equipment
CN107797906A (en) * 2017-10-09 2018-03-13 四川巧夺天工信息安全智能设备有限公司 A kind of method for monitoring a variety of browsing device net pages in real time and browsing record
CN107797906B (en) * 2017-10-09 2020-10-13 四川巧夺天工信息安全智能设备有限公司 Method for monitoring webpage browsing records of various browsers in real time
CN108920696A (en) * 2017-12-04 2018-11-30 重庆第二师范学院 A kind of Web page sequencing method and system based on transition probability
CN111382380A (en) * 2018-12-27 2020-07-07 北京奇虎科技有限公司 Statistical method and device for page access duration

Similar Documents

Publication Publication Date Title
CN103559203A (en) Method, device and system for web page sorting
US10394917B2 (en) User-trained searching application system and method
CN102831199B (en) Method and device for establishing interest model
KR101284875B1 (en) Systems and methods for analyzing a user's web history
KR101366408B1 (en) Mining web search user behavior to enhance web search relevance
CN102724059B (en) Website operation state monitoring and abnormal detection based on MapReduce
CN102932207B (en) The method of monitoring website access information and server
CN102932206B (en) The method and system of monitoring website access information
CN102855309B (en) A kind of information recommendation method based on user behavior association analysis and device
US20090089246A1 (en) System and method for history clustering
CN104063454A (en) Search push method and device for mining user demands
US20090089311A1 (en) System and method for inclusion of history in a search results page
CN102982134A (en) System enabling recommended web site information to be displayed in browser address bar
CN103617241B (en) Search information processing method, browser terminal and server
US8108379B2 (en) System and method for editing history in a search results page
CN103745006A (en) Internet information searching system and internet information searching method
CN107436940A (en) The method of web front-end Dynamic Display data based on user profile behavioural analysis
US11748436B2 (en) Web smart exploration and management in browser
Bokhari et al. Retrieval effectiveness of news search engines: a theoretical framework
CN104392000A (en) Method and device for determining catching quota of mobile station
Rushton et al. Searching for a new way to reach patrons: a search engine optimization pilot project at Binghamton University Libraries
Jayanthi et al. A novel framework to facilitate personalized web search in a dual mode
Thwe Web page access prediction based on integrated approach
Balaji et al. TOPCRAWL: Community mining in web search engines with emphasize on topical crawling
Ran et al. Research on Data Acquisition Strategy and Its Application in Web Usage Mining

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140205

RJ01 Rejection of invention patent application after publication