WO2016058521A1 - Method and apparatus for judging importance of news release location and news - Google Patents

Method and apparatus for judging importance of news release location and news Download PDF

Info

Publication number
WO2016058521A1
WO2016058521A1 PCT/CN2015/091870 CN2015091870W WO2016058521A1 WO 2016058521 A1 WO2016058521 A1 WO 2016058521A1 CN 2015091870 W CN2015091870 W CN 2015091870W WO 2016058521 A1 WO2016058521 A1 WO 2016058521A1
Authority
WO
WIPO (PCT)
Prior art keywords
news
importance
measured
source
location
Prior art date
Application number
PCT/CN2015/091870
Other languages
French (fr)
Chinese (zh)
Inventor
魏少俊
Original Assignee
北京奇虎科技有限公司
奇智软件(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201410539702.7A external-priority patent/CN104331419A/en
Priority claimed from CN201410539703.1A external-priority patent/CN104331420A/en
Application filed by 北京奇虎科技有限公司, 奇智软件(北京)有限公司 filed Critical 北京奇虎科技有限公司
Publication of WO2016058521A1 publication Critical patent/WO2016058521A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to the field of Internet technologies, and in particular, to a method and apparatus for determining the importance of a news release location and news.
  • the importance of news is an important basis for sorting search results.
  • the news source here refers to a page that aggregates a large amount of news, such as news source a
  • its page URL can be http://news.a.com.cn
  • the news release location can explain The value of the news.
  • the most important news may be placed in the headline.
  • the present invention has been made in order to provide a method and apparatus for judging the importance of a news posting location that overcomes the above problems or at least partially solves the above problems, and a method and apparatus for measuring the importance of news.
  • a method for determining the importance of a news posting location includes: counting various news items at a publishing location to be determined on a news source within a specified time period; determining the respective news items a parameter, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated; and the importance of the release position to be determined is determined according to the determined parameters of the pieces of news.
  • an apparatus for determining the importance of a location of a news release comprising: a statistics module adapted to count each piece of news on a posting location to be determined on a news source within a specified time period; a determining module, configured to determine parameters of the respective pieces of news, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated; and the determining module is adapted to be based on the determined pieces of news The parameter determines the importance of the posting location to be determined.
  • each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate.
  • the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
  • a method of measuring the importance of news comprising: determining at least one publishing attribute of the news to be measured, and a weight of each posting attribute in news importance; obtaining each publishing attribute Attribute value, and weighting the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured; The sex value is compared with a preset measurement rule to measure the importance of the news to be measured.
  • an apparatus for measuring the importance of news comprising: a determining module adapted to determine at least one posting attribute of the news to be measured, and a weight of each posting attribute in the importance of the news; a calculating module, configured to obtain an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the to-be-measured value
  • the importance value of the news a measurement module adapted to compare the importance value with a preset measurement rule to measure the importance of the news to be measured.
  • At least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained.
  • Weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured.
  • the importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect.
  • the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
  • a computer program comprising computer readable code, when said computer readable code is run on a computing device, causing said computing device to perform a determination of news according to said A method of publishing the importance of a location and a way to measure the importance of the news.
  • a computer readable medium storing the above computer program is provided.
  • FIG. 1 shows a flow chart of a method of determining the importance of a news release location, in accordance with one embodiment of the present invention
  • FIG. 2 illustrates another flow chart of a method of determining the importance of a news posting location, in accordance with one embodiment of the present invention
  • FIG. 3 is a block diagram showing a structure of an apparatus for determining the importance of a news release location according to an embodiment of the present invention
  • FIG. 4 is a block diagram showing another structure of an apparatus for determining the importance of a news release location according to an embodiment of the present invention
  • FIG. 5 illustrates a flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention
  • FIG. 6 shows another flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention
  • FIG. 7 is a block diagram showing a structure of an apparatus for measuring the importance of news according to an embodiment of the present invention.
  • FIG. 8 is a block diagram showing another structure of an apparatus for measuring the importance of news according to an embodiment of the present invention.
  • Figure 9 is a schematic block diagram of a computing device for performing a method of determining the importance of a news posting location and a method of measuring the importance of news in accordance with the present invention
  • Fig. 10 schematically shows a storage unit for holding or carrying a program code implementing a method of determining the importance of a news posting location and a method of measuring the importance of news according to the present invention.
  • FIG. 1 illustrates a flow chart of a method for determining the importance of a news release location according to an embodiment of the present invention. . As shown in FIG. 1, the method includes at least the following steps S102 to S106.
  • Step S102 Statistics each news item at a publishing location to be determined on a news source within a specified time period.
  • Step S104 Determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters in which each piece of news is linked or operated.
  • Step S106 Determine the importance of the publishing location to be determined according to the parameters of each news determined in step S104.
  • each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate.
  • the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
  • the news mentioned in the above step S102 may be a news link, a news page content, or the like.
  • the news source refers to a page that aggregates a large amount of news, such as a news source a, and its website address may be http://news.a.com.cn.
  • the publishing location on the news source refers to the location where the news is posted on the news source page.
  • the parameters of each piece of news in the above step S104 refer to parameters in which each piece of news is linked or operated, such as the page rank PageRank of each news in each news, the number of times each news item is clicked in each news, and each piece of news The number of times each news item is displayed, etc., the present invention is not limited thereto.
  • step S106 determines the importance of the release position to be determined according to the parameters of each piece of news determined in step S104, and the present invention provides a preferred embodiment, in which a preferred embodiment is provided.
  • Determining, according to each news parameter in each piece of news, an average value of each parameter of each piece of news, and then determining one or more of the average values of the respective parameters of each piece of news, determining the release position to be determined importance. For example, to count the news on the publishing location to be judged on a certain news source within a specified time period (for example, 12 hours) as News 1, News 2, News 3, and determine PageRank of News 1, News 2, and News 3. For P1, P2, and P3, respectively, the average value of PageRank is P0 (P1+P2+P3)/3.
  • one or more of the average values of the respective parameters of each piece of news may be weighted, and the calculated value is used as the importance value of the release position to be determined, and then the importance value and the preset judgment rule are Then, a comparison is made to determine the importance of the posting location to be determined.
  • the average values of the respective parameters of each piece of news may be weighted, and the calculated value is used as the importance value of the release position to be determined, and then the importance value and the preset judgment rule are Then, a comparison is made to determine the importance of the posting location to be determined.
  • the preset judgment rule here may be a correspondence relationship between the numerical interval and the importance of the release position.
  • the importance of the release position corresponding to the numerical interval P1 is “very high”, and the importance corresponding to the numerical interval P2 is The importance of the "high”, the release position corresponding to the numerical interval P3 is “medium”, and the importance of the release position corresponding to the numerical interval P4 is “low”, etc., which is only listed here, and may be other The correspondence between the numerical interval and the importance of the publishing location.
  • the present invention provides a preferred solution, that is, before step S102, multiple times can be taken at preset time intervals.
  • the news source captures news that is published no longer than the specified duration, records the time when the captured news was first published, the news source, and the location posted on the news source.
  • FIG. 2 illustrates another flow diagram of a method of determining the importance of a news posting location, in accordance with one embodiment of the present invention. As shown in FIG. 2, the method includes the following steps S202 to S210.
  • Step S202 Statistics each news item at a publishing location to be determined on a news source within a specified time period.
  • the news here can be news links, news page content, and the like.
  • the publishing time may be fetched from the plurality of news sources at a preset time interval, and the current time does not exceed the specified duration.
  • News recording the time when the crawled news was first published, the source of the news, and the location posted on the news source.
  • Step S204 Determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters in which each piece of news is linked or operated.
  • the parameters of each news item may be the page rank PageRank of each news in each news, the number of times each news item is clicked in each news, the number of times each news item is displayed in each news, and the like, and the present invention does not Limited to this.
  • Step S206 Calculate an average value of each parameter of each piece of news according to each news parameter in each determined news item.
  • Step S208 Perform weighting processing on one or more of the average values of the respective parameters of each piece of news, and calculate the obtained value as the importance value of the issuing position to be determined.
  • Step S210 comparing the importance value with a preset determination rule, and determining the importance of the release location to be determined.
  • the importance of the release location to be determined is determined according to the parameters of each news item at the release location to be determined, because the parameters of each news item at the release location to be determined means that each news item is linked or The parameters of the operation, thus being able to objectively reflect the actual situation regarding the importance of the release location, so that The judgment results are more objective and accurate.
  • the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
  • an embodiment of the present invention further provides a device for determining the importance of a news release location to implement the above method for determining the importance of a news release location.
  • FIG. 3 shows a schematic structural diagram of an apparatus for determining the importance of a news release location according to an embodiment of the present invention.
  • the apparatus at least includes: a statistics module 310, a determining module 320, and a determining module 330.
  • the statistics module 310 is configured to count each news on the publishing location to be determined on a certain news source within a specified time period;
  • the determining module 320 is coupled to the statistic module 310 and is adapted to determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters for which each piece of news is linked or operated;
  • the determining module 330 is coupled to the determining module 320, and is adapted to determine the importance of the publishing location to be determined according to the determined parameters of each news item.
  • the parameters of each piece of news include at least one of the following:
  • the determining module 330 is further adapted to: respectively calculate an average value of each parameter of each piece of news according to each of the determined news items; and based on an average value of each parameter of each piece of news One or more, determining the importance of the publishing location to be determined.
  • the determining module 330 is further adapted to: perform weighting processing on one or more of the average values of the respective parameters of the respective news, and calculate the obtained value as the importance value of the publishing position to be determined; The sex value is compared with a preset judgment rule to determine the importance of the release location to be determined.
  • FIG. 4 illustrates another structural diagram of an apparatus for determining the importance of a news posting location, in accordance with one embodiment of the present invention.
  • the method further includes: a capture module 410 coupled to the statistics module 310, The preset time interval captures news from a plurality of news sources that is not longer than the specified duration, records the time when the captured news is first released, the news source, and the posting location posted on the news source.
  • each piece of news is a link to each piece of news.
  • the embodiment of the present invention can achieve the following beneficial effects:
  • each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate.
  • the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
  • FIG. 5 illustrates a flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention. As shown in FIG. 5, the method includes at least the following steps S502 to S506.
  • Step S502 Determine at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance.
  • Step S504 Acquire an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and calculate the calculated value as the importance value of the news to be measured.
  • Step S506 comparing the importance value with a preset measurement rule, and measuring the importance of the news to be measured.
  • At least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained.
  • Weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured.
  • the importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect.
  • the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
  • the news mentioned in the above step S502 may be a news link, a news page content, or the like.
  • the posting attribute of the news to be measured refers to information related to publishing the news to be measured, such as the posting time, the news source that publishes the news to be measured, the position where the news to be measured is posted on the news source, the text in the posted content, or The picture information, the length of the posted content, and the like, the present invention is not limited thereto.
  • the news source mentioned here refers to a large aggregate A web page or page of news, such as a news source a, whose URL can be http://news.a.com.cn.
  • the weight of each publishing attribute in the importance of news refers to the relative importance of each publishing attribute in the importance of news, so multiple methods can be used to determine the weight of each publishing attribute in the importance of news.
  • determining at least one publishing attribute of the news to be measured is the publishing time, the news source that publishes the news to be measured, and the length of the published content.
  • the publishing time is more important in the importance of the news, and can be given a larger right.
  • the value (such as a weight of 0.5), and the weight of the news source that publishes the news to be measured and the content of the published content are 0.3 and 0.2, respectively.
  • determining at least one publishing attribute of the news to be measured is a publishing time, a news source that publishes the news to be measured, a location where the news to be measured is posted on the news source, and a length of the posted content, which may be given a weight of 0.5. , 0.3, 0.1 and 0.1. Only the common methods for determining the weights are listed above, and other methods for determining the weights are applicable to the present invention.
  • the attribute value of the published attribute in the above step S504 is a numerical representation of the publishing attribute.
  • the publishing time is 6:00.
  • the attribute value 1 indicates that the publishing time is 6:00, and the publishing time is 12:00.
  • the release time is 12:00 by the attribute value 2, and can of course be represented by other attribute values.
  • the content of the published content is less than 100 words, and the length of the 100 words can be represented by the attribute value 100.
  • the length of the published content is 100 words to 500 words, and the value 100 can be used to represent the 100 words to 500 words.
  • the length of the content to be published is 500 words to 1000 words, and the value of the 500 words to 1000 words can be expressed by the attribute value 1000, and can also be expressed by other attribute values to be applicable to the present invention.
  • the embodiment of the present invention further provides a preferred calculation of the two publishing attributes (a scheme for publishing a property value of a news source to be measured and a location where the news to be measured is posted on a news source, in which a posting can be calculated based on a webpage link relationship of a news source that issues the news to be measured The attribute value of the news source of the news to be measured and the attribute value of the location where the news to be measured is posted on the news source.
  • the calculation methods of the attribute values of the above two publishing attributes will be described in detail below.
  • the attribute value of the news source that publishes the news to be measured is calculated, which may be based on the webpage link relationship of the news source that issues the news to be measured, and the news to be released is calculated.
  • the PageRank of the news source is used as the attribute value of the news source that publishes the news to be measured. PageRank is a technology based on the hyperlinks between web pages to determine the level of a web page through the vast hyperlink relationship of the web.
  • the link from the A web page to the B web page can be interpreted as the A web page voting for the B web page, and the new level is determined according to the voting source (based on the source of the source, that is, the web page linked to the A web page) and the rating of the voting target. Simply put, a high-level web page can raise the level of other low-level web pages.
  • the attribute value of the news to be measured at a certain position on the news source is calculated. That is, counting a plurality of news items of the position published on the news source within a specified time period, and determining parameters of the plurality of news items, and then calculating the attribute value of the position on the news source according to the determined parameters of the plurality of news items.
  • the parameters of a plurality of news items refer to parameter values in which each piece of news is linked or operated, for example, multiple pieces. PageRank of each news in the news, the number of times each news item is clicked, the number of times each piece of news is displayed, and so on.
  • the present invention provides A preferred scheme for calculating an attribute value of the location on the news source, in which the plurality of news items are respectively calculated according to the PageRank of each news in the determined plurality of news, the number of times of being clicked, and the number of times of being displayed.
  • the average of PageRank, the number of times clicked, and the number of times displayed, and then the weighted sum of the calculated average values is used as the attribute value of the position on the news source.
  • counting a plurality of news items at the location posted on the news source within a specified time period is News 1, News 2, and News 3, and determining that the PageRank of News 1, News 2, and News 3 are respectively P1.
  • the weighted summation of P0, C0, and D0 is taken as the attribute value of the location on the news source, and News 1, News 2, and News 3 listed here are merely illustrative and are not intended to limit the present invention.
  • the above-mentioned parameters of multiple news items include PageRank of each news in multiple news items, the number of times each news item is clicked in multiple news items, and the number of times each news item is displayed in multiple news items, The scheme of the attribute value of the location on the news source.
  • the parameters of the plurality of news include the PageRank of each of the plurality of news items and the number of times each of the plurality of news items is clicked, or when the parameters of the plurality of news items include the number of times each of the plurality of news items is clicked and
  • the number of times each news item is displayed in a plurality of news items may also be used to calculate the attribute value of the position on the news source using the above scheme.
  • the parameters of the plurality of news include the PageRank of each of the plurality of news items and the number of times each of the plurality of news items is clicked, the PageRank of each of the plurality of pieces of the determined news may be clicked.
  • the PageRank of the plurality of news items and the average number of times of the clicks are respectively calculated, and the calculated average value is weighted and summed as the attribute value of the position on the news source.
  • the present invention may also capture news from a plurality of news sources at a preset time interval from the current time limit of not longer than the specified time. , the time at which the captured news was first published, the source of the news, and the location posted on the news source.
  • the above describes the number of news sources that publish news to be measured as one. If the number of news sources that publish news to be measured is multiple (for some particularly important news, it may be reprinted by multiple news sources at the same time). ), based on the webpage link relationship of the news source that publishes the news to be measured, calculating the attribute value of the news source that publishes the news to be measured and/or the attribute value of the location where the news to be measured is posted on the news source, which may be based on the release
  • the webpage link relationship of the plurality of news sources of the news to be measured, the attribute value of each news source that publishes the news to be measured, and/or the attribute value of the position where the news to be measured is posted on each news source, and the calculation release is to be measured
  • the attribute value of each news source of the news and/or the attribute value of the position where the news to be measured is posted on each news source may be implemented by using the above-described scheme of the number of news sources for which the news to be measured is one.
  • the preset measurement rule mentioned in the above step S506 may be a correspondence relationship between the numerical interval and the importance of the news, for example, the importance of the news corresponding to the numerical interval 1 is “very high” and falls within the numerical interval. 2 The importance of the corresponding news is "high”, the importance of the news corresponding to the numerical interval 3 is “medium”, and the importance of the news corresponding to the numerical interval 4 is “low”, etc. It can also be the correspondence between other numerical intervals and the importance of news.
  • Figure 6 illustrates another flow diagram of a method of measuring the importance of news in accordance with one embodiment of the present invention. As shown in FIG. 6, the method includes the following steps S602 to S614.
  • Step S602 determining at least one publishing attribute of the news to be measured, and weights of each publishing attribute in the news importance, wherein at least one publishing attribute includes a news source that publishes the news to be measured, and the news to be measured is published in the news source. Somewhere on the top.
  • the at least one publishing attribute may further include a publishing time, a text or picture information in the posted content, a length of the posted content, and the like.
  • Step S604 Calculate, according to the webpage link relationship of the news source that issues the news to be measured, the PageRank of the news source that issues the news to be measured as the attribute value of the news source that issues the news to be measured.
  • Step S606 Counting a plurality of news posts at the location on the news source within a specified time period, and determining a PageRank of each news in the plurality of news, a number of times of being clicked, and a number of times of being displayed.
  • the news that the publishing time is not longer than the specified duration may be fetched from the plurality of news sources at a preset time interval before the step S606. Record when the crawled news was first published, the source of the news, and the location posted on the news feed.
  • Step S608 Calculate, according to the PageRank of each news in the determined plurality of news, the number of times of being clicked, and the number of times of being displayed, the average of the PageRank of the plurality of news, the number of times of being clicked, and the number of times of being displayed.
  • Step S610 performing weighted summation on the calculated average value as the attribute value of the position on the news source.
  • Step S612 Acquire an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and calculate the calculated value as the importance value of the news to be measured.
  • step S614 the importance value is compared with a preset measurement rule to measure the importance of the news to be measured.
  • step S604 and step S606 to step S610 have no sequential execution order, and steps S606 to S610 may be performed first, and then step S604 is performed.
  • an embodiment of the present invention also provides a device for measuring the importance of news to implement the above method for measuring the importance of news.
  • FIG. 7 shows a schematic structural diagram of an apparatus for measuring the importance of news according to an embodiment of the present invention.
  • the apparatus at least includes: a determining module 710, a calculating module 720, and a measuring module 730.
  • the determining module 710 is adapted to determine at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance;
  • the calculating module 720 is coupled to the determining module 710, and is adapted to obtain an attribute value of each publishing attribute, and weighting the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is to be measured.
  • the measurement module 730 is coupled to the calculation module 720 and is adapted to compare the importance value with a preset measurement rule to measure the importance of the news to be measured.
  • the publishing attribute includes any of the following:
  • the location where the news to be measured is posted on the news source
  • the computing module 720 obtains the attribute value of each published attribute before the computing module 720 is further adapted to: calculate an attribute value of a news source that issues the news to be measured and/or an attribute value of a location where the news to be measured is posted on the news source based on a webpage link relationship of the news source that issues the news to be measured.
  • the calculating module 720 is further adapted to: count a plurality of news items that are posted on the news source within a specified time period; determine parameters of the plurality of news items; calculate the news according to the determined parameters of the plurality of news items
  • the attribute value of the location on the source is the attribute value of the location where the news to be measured is posted on the news source.
  • FIG. 8 illustrates another structural schematic of an apparatus for measuring the importance of news in accordance with one embodiment of the present invention.
  • the method further includes: a capture module 810 coupled to the calculation module 720 and adapted to be preset.
  • the time interval captures news from a plurality of news sources that are not longer than the specified duration, records the time when the captured news was first published, the news source, and the location posted on the news source.
  • the plurality of news parameters include at least one of the following:
  • the calculating module 720 is further configured to: calculate, according to the PageRank of each news, the number of times of being clicked, and the number of times of being displayed, the PageRank of the plurality of news, the number of times of being clicked, The average of the number of times displayed; the calculated average is weighted and summed as the attribute value of the position on the news source.
  • the calculating module 720 is further configured to: calculate the news to be measured based on the webpage link relationship of the plurality of news sources that are to be measured. The attribute value of each news source and/or the attribute value of the location where the news to be measured is posted on each news source.
  • the embodiment of the present invention can achieve the following beneficial effects:
  • At least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained.
  • Weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured.
  • the importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect.
  • the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
  • modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • the modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components.
  • any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined.
  • each feature disclosed in the specification, including the accompanying claims, the abstract and the drawings may be replaced by alternative features that provide the same, equivalent or similar purpose.
  • the various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. It will be understood by those skilled in the art that a microprocessor or digital signal processor (DSP) can be used in practice to implement a device for determining the importance of a news posting location and an apparatus for measuring the importance of news, in accordance with an embodiment of the present invention. Some or all of the features of some or all of the components.
  • the invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • FIG. 9 illustrates a computing device that can implement a method of determining the importance of a news posting location and a method of measuring the importance of news.
  • the computing device conventionally includes a processor 910 and a computer program product or computer readable medium in the form of a memory 920.
  • the memory 920 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM.
  • Memory 920 has a memory space 930 for program code 931 for performing any of the method steps described above.
  • storage space 930 for program code may include various program code 931 for implementing various steps in the above methods, respectively.
  • the program code can be read from or written to one or more computer program products.
  • Such computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG.
  • the storage unit may have storage segments, storage spaces, and the like that are similarly arranged to memory 920 in the computing device of FIG.
  • the program code can be compressed, for example, in an appropriate form.
  • the storage unit includes computer readable code 931', ie, code that can be read by a processor, such as 910, that when executed by a computing device causes the computing device to perform each of the methods described above step.

Abstract

A method and apparatus for judging the importance of a news release location, and a method and apparatus for measuring the importance of news. The method for judging the importance of a news release location comprises: counting each piece of news on a release location to be judged on a certain news source within a specified time period (S102); determining a parameter of each piece of news, wherein the parameter of each piece of news refers to a parameter that each piece of news is linked or operated (S104); and according to the determined parameter of each piece of news, judging the importance of the release location to be judged (S106). Since the parameter of each piece of news on the release location to be judged refers to the parameter that each piece of news is linked or operated, the practical situation with regard to the importance of the release location can be reflected objectively, so that the obtained judgement result is more objective and accurate.

Description

判断新闻发布位置和新闻的重要性的方法和装置Method and apparatus for judging the importance of news release location and news 技术领域Technical field
本发明涉及互联网技术领域,特别是一种判断新闻发布位置和新闻的重要性的方法和装置。The present invention relates to the field of Internet technologies, and in particular, to a method and apparatus for determining the importance of a news release location and news.
背景技术Background technique
对于新闻搜索引擎而言,新闻的重要性是其对搜索结果进行排序的重要依据。在人工运营的新闻源(这里的新闻源是指一个聚合了大量新闻的页面,比如新闻源a,其页面网址可以为http://news.a.com.cn)上,新闻发布位置能够说明新闻的价值。例如,最重要的新闻可能会放置于头条位置。又例如,对于出现在头条位置的新闻,能够判断该新闻可能比较重要。由此,判断新闻发布位置的重要性对于新闻重要性的判断有着重要的意义,因而亟需提供一种客观、准确地判断新闻发布位置的重要性的方案。For news search engines, the importance of news is an important basis for sorting search results. In a manually operated news source (the news source here refers to a page that aggregates a large amount of news, such as news source a, its page URL can be http://news.a.com.cn), and the news release location can explain The value of the news. For example, the most important news may be placed in the headline. As another example, for news that appears in the headline position, it can be important to be able to determine the news. Therefore, judging the importance of the location of the news release is of great significance for the judgment of the importance of the news, and therefore it is urgent to provide a solution for objectively and accurately determining the importance of the location of the news release.
另外,在搜索引擎中,在其它相关性因素相近时,通常将重要性、权威性高的网页尽量排在搜索结果的前面,方便用户尽快找到有价值的网页信息。对于新闻搜索引擎而言,新闻的重要性是其对搜索结果进行排序的重要依据。传统的网页等级(PageRank)方法对于新闻搜索引擎并不适用,原因是PageRank所依赖的链接指向关系是一个需要时间积累的数据,新闻由于其时效性特点,在第一时间发布时并未积累多少PageRank,以至于跟历史新闻比较权重时,依赖PageRank会起负面作用。因此,如何有效、准确地衡量新闻重要性成为目前亟待解决的技术问题。In addition, in the search engine, when other relevant factors are similar, the important and authoritative webpages are usually placed in front of the search results as much as possible, so that the user can find valuable webpage information as soon as possible. For news search engines, the importance of news is an important basis for sorting search results. The traditional PageRank method is not applicable to news search engines because the link-pointing relationship that PageRank relies on is a data that takes time to accumulate. Because of its timeliness, news does not accumulate in the first time. PageRank, so that when you compare weights with historical news, relying on PageRank will have a negative effect. Therefore, how to effectively and accurately measure the importance of news has become a technical problem that needs to be solved urgently.
发明内容Summary of the invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决上述问题的判断新闻发布位置的重要性的方法和装置,以及衡量新闻重要性的方法和装置。In view of the above problems, the present invention has been made in order to provide a method and apparatus for judging the importance of a news posting location that overcomes the above problems or at least partially solves the above problems, and a method and apparatus for measuring the importance of news.
根据本发明的一方面,提出一种判断新闻发布位置的重要性的方法,包括:统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻;确定所述各条新闻的参数,其中,所述各条新闻的参数是指所述各条新闻被链接或操作的参数;根据确定的所述各条新闻的参数判断出所述待判断的发布位置的重要性。According to an aspect of the present invention, a method for determining the importance of a news posting location includes: counting various news items at a publishing location to be determined on a news source within a specified time period; determining the respective news items a parameter, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated; and the importance of the release position to be determined is determined according to the determined parameters of the pieces of news.
根据本发明的另一方面,还提出一种判断新闻发布位置的重要性的装置,包括:统计模块,适于统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻;确定模块,适于确定所述各条新闻的参数,其中,所述各条新闻的参数是指所述各条新闻被链接或操作的参数;判断模块,适于根据确定的所述各条新闻的参数判断出所述待判断的发布位置的重要性。 According to another aspect of the present invention, there is also provided an apparatus for determining the importance of a location of a news release, comprising: a statistics module adapted to count each piece of news on a posting location to be determined on a news source within a specified time period; a determining module, configured to determine parameters of the respective pieces of news, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated; and the determining module is adapted to be based on the determined pieces of news The parameter determines the importance of the posting location to be determined.
本发明的有益效果为:The beneficial effects of the invention are:
依据本发明提供的技术方案,通过统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻,并确定各条新闻的参数。进而根据确定的各条新闻的参数判断出待判断的发布位置的重要性。由此可见,本发明提出了一种根据待判断的发布位置上的各条新闻的参数判断待判断的发布位置的重要性的方案,由于待判断的发布位置上的各条新闻的参数是指各条新闻被链接或操作的参数,因而能够客观地反映关于发布位置的重要性的实际情况,使得得到的判断结果更加客观、准确。并且,本发明提供的客观、准确的待判断的发布位置的重要性的判断结果,对于新闻搜索引擎或新闻发布引擎有着重要的意义,如新闻搜索引擎能够根据该判断结果更准确地判断新闻重要性,进而将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻,又如新闻发布引擎能够根据该判断结果将待发布的重要新闻发布在重要性较高的发布位置。According to the technical solution provided by the present invention, each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate. Moreover, the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
根据本发明的另一方面,还提出一种衡量新闻重要性的方法,包括:确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值;获取每个发布属性的属性值,并根据确定的所述权值以及获取的所述属性值对所述至少一个发布属性进行加权处理,计算得出的值作为所述待衡量新闻的重要性值;将所述重要性值与预设的衡量规则进行比较,衡量出所述待衡量新闻的重要性。According to another aspect of the present invention, there is also provided a method of measuring the importance of news, comprising: determining at least one publishing attribute of the news to be measured, and a weight of each posting attribute in news importance; obtaining each publishing attribute Attribute value, and weighting the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured; The sex value is compared with a preset measurement rule to measure the importance of the news to be measured.
根据本发明的另一方面,还提出一种衡量新闻重要性的装置,包括:确定模块,适于确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值;计算模块,适于获取每个发布属性的属性值,并根据确定的所述权值以及获取的所述属性值对所述至少一个发布属性进行加权处理,计算得出的值作为所述待衡量新闻的重要性值;衡量模块,适于将所述重要性值与预设的衡量规则进行比较,衡量出所述待衡量新闻的重要性。According to another aspect of the present invention, there is also provided an apparatus for measuring the importance of news, comprising: a determining module adapted to determine at least one posting attribute of the news to be measured, and a weight of each posting attribute in the importance of the news; a calculating module, configured to obtain an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the to-be-measured value The importance value of the news; a measurement module adapted to compare the importance value with a preset measurement rule to measure the importance of the news to be measured.
本发明的有益效果为:The beneficial effects of the invention are:
依据本发明提供的技术方案,确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值,并获取每个发布属性的属性值。进而根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值。随后将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。可见,本发明基于待衡量新闻的至少一个发布属性来综合衡量待衡量新闻的重要性,使得衡量结果更加客观、准确、全面。由此解决了相关技术中若采用网页等级(PageRank)方法确定新闻重要性而产生负面作用的问题。并且,新闻搜索引擎能够根据本发明提供的客观、准确、全面的待衡量新闻的重要性的衡量结果将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻。According to the technical solution provided by the present invention, at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained. And further performing weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured. The importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect. Moreover, the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
根据本发明的又一方面,提供了一种计算机程序,其包括计算机可读代码,当所述计算机可读代码在计算设备上运行时,导致所述计算设备执行根据上文所述的判断新闻发布位置的重要性的方法以及衡量新闻重要性的方法。 According to still another aspect of the present invention, a computer program is provided, comprising computer readable code, when said computer readable code is run on a computing device, causing said computing device to perform a determination of news according to said A method of publishing the importance of a location and a way to measure the importance of the news.
根据本发明的再一方面,提供了一种计算机可读介质,其中存储了上述的计算机程序。According to still another aspect of the present invention, a computer readable medium storing the above computer program is provided.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solutions of the present invention, and the above-described and other objects, features and advantages of the present invention can be more clearly understood. Specific embodiments of the invention are set forth below.
附图说明DRAWINGS
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not to be construed as limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:
图1示出了根据本发明一个实施例的判断新闻发布位置的重要性的方法的一种流程图;1 shows a flow chart of a method of determining the importance of a news release location, in accordance with one embodiment of the present invention;
图2示出了根据本发明一个实施例的判断新闻发布位置的重要性的方法的另一种流程图;2 illustrates another flow chart of a method of determining the importance of a news posting location, in accordance with one embodiment of the present invention;
图3示出了根据本发明一个实施例的判断新闻发布位置的重要性的装置的一种结构示意图;FIG. 3 is a block diagram showing a structure of an apparatus for determining the importance of a news release location according to an embodiment of the present invention; FIG.
图4示出了根据本发明一个实施例的判断新闻发布位置的重要性的装置的另一种结构示意图;4 is a block diagram showing another structure of an apparatus for determining the importance of a news release location according to an embodiment of the present invention;
图5示出了根据本发明一个实施例的衡量新闻重要性的方法的一种流程图;FIG. 5 illustrates a flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention;
图6示出了根据本发明一个实施例的衡量新闻重要性的方法的另一种流程图;6 shows another flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention;
图7示出了根据本发明一个实施例的衡量新闻重要性的装置的一种结构示意图;FIG. 7 is a block diagram showing a structure of an apparatus for measuring the importance of news according to an embodiment of the present invention; FIG.
图8示出了根据本发明一个实施例的衡量新闻重要性的装置的另一种结构示意图;FIG. 8 is a block diagram showing another structure of an apparatus for measuring the importance of news according to an embodiment of the present invention; FIG.
图9示意性地示出了用于执行根据本发明的判断新闻发布位置的重要性的方法以及衡量新闻重要性的方法的计算设备的框图;以及Figure 9 is a schematic block diagram of a computing device for performing a method of determining the importance of a news posting location and a method of measuring the importance of news in accordance with the present invention;
图10示意性地示出了用于保持或者携带实现根据本发明的判断新闻发布位置的重要性的方法以及衡量新闻重要性的方法的程序代码的存储单元。Fig. 10 schematically shows a storage unit for holding or carrying a program code implementing a method of determining the importance of a news posting location and a method of measuring the importance of news according to the present invention.
具体实施方式detailed description
下面结合附图和具体的实施方式对本发明作进一步的描述。The invention is further described below in conjunction with the drawings and specific embodiments.
为解决上述技术问题,本发明实施例提供了一种判断新闻发布位置的重要性的方法,图1示出了根据本发明一个实施例的判断新闻发布位置的重要性的方法的一种流程图。如图1所示,该方法至少包括以下步骤S102至步骤S106。In order to solve the above technical problem, an embodiment of the present invention provides a method for determining the importance of a news release location, and FIG. 1 illustrates a flow chart of a method for determining the importance of a news release location according to an embodiment of the present invention. . As shown in FIG. 1, the method includes at least the following steps S102 to S106.
步骤S102、统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻。 Step S102: Statistics each news item at a publishing location to be determined on a news source within a specified time period.
步骤S104、确定各条新闻的参数,其中,各条新闻的参数是指各条新闻被链接或操作的参数。Step S104: Determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters in which each piece of news is linked or operated.
步骤S106、根据步骤S104确定的各条新闻的参数判断出待判断的发布位置的重要性。Step S106: Determine the importance of the publishing location to be determined according to the parameters of each news determined in step S104.
依据本发明提供的技术方案,通过统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻,并确定各条新闻的参数。进而根据确定的各条新闻的参数判断出待判断的发布位置的重要性。由此可见,本发明提出了一种根据待判断的发布位置上的各条新闻的参数判断待判断的发布位置的重要性的方案,由于待判断的发布位置上的各条新闻的参数是指各条新闻被链接或操作的参数,因而能够客观地反映关于发布位置的重要性的实际情况,使得得到的判断结果更加客观、准确。并且,本发明提供的客观、准确的待判断的发布位置的重要性的判断结果,对于新闻搜索引擎或新闻发布引擎有着重要的意义,如新闻搜索引擎能够根据该判断结果更准确地判断新闻重要性,进而将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻,又如新闻发布引擎能够根据该判断结果将待发布的重要新闻发布在重要性较高的发布位置。According to the technical solution provided by the present invention, each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate. Moreover, the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
上文步骤S102提及的新闻可以是新闻链接、新闻页面内容等。进一步地,新闻源是指一个聚合了大量新闻的页面,例如某新闻源a,其网址可以为http://news.a.com.cn。此外,新闻源上的发布位置是指将新闻发布在新闻源页面上的位置。The news mentioned in the above step S102 may be a news link, a news page content, or the like. Further, the news source refers to a page that aggregates a large amount of news, such as a news source a, and its website address may be http://news.a.com.cn. In addition, the publishing location on the news source refers to the location where the news is posted on the news source page.
上文步骤S104中各条新闻的参数是指各条新闻被链接或操作的参数,如各条新闻中每条新闻的网页等级PageRank、各条新闻中每条新闻被点击的次数、各条新闻中每条新闻被展示的次数等等,本发明不限于此。在步骤S104确定各条新闻的参数后,步骤S106根据步骤S104确定的各条新闻的参数判断出待判断的发布位置的重要性,本发明提供了一种优选的实施方案,在该方案中可以根据确定的各条新闻中每条新闻参数,分别计算得到各条新闻的各个参数的平均值,进而基于各条新闻的各个参数的平均值中的一个或多个,确定待判断的发布位置的重要性。举例来说,统计指定时间段内(如12小时)某一新闻源上待判断的发布位置上的各条新闻为新闻1、新闻2、新闻3,确定新闻1、新闻2以及新闻3的PageRank分别为P1、P2和P3,则其PageRank的平均值为P0=(P1+P2+P3)/3。确定新闻1、新闻2以及新闻3被点击的次数分别为C1、C2和C3,则其被点击的次数的平均值为C0=(C1+C2+C3)/3。确定新闻1、新闻2以及新闻3被展示的次数分别为D1、D2和D3,其被展示的次数的平均值为D0=(D1+D2+D3)/3。进而基于P0、C0以及D0一个或多个,确定待判断的发布位置的重要性。需要说明的是,此处列举的新闻1、新闻2以及新闻3仅仅是示意性的,不用于限制本发明。The parameters of each piece of news in the above step S104 refer to parameters in which each piece of news is linked or operated, such as the page rank PageRank of each news in each news, the number of times each news item is clicked in each news, and each piece of news The number of times each news item is displayed, etc., the present invention is not limited thereto. After determining the parameters of each piece of news in step S104, step S106 determines the importance of the release position to be determined according to the parameters of each piece of news determined in step S104, and the present invention provides a preferred embodiment, in which a preferred embodiment is provided. Determining, according to each news parameter in each piece of news, an average value of each parameter of each piece of news, and then determining one or more of the average values of the respective parameters of each piece of news, determining the release position to be determined importance. For example, to count the news on the publishing location to be judged on a certain news source within a specified time period (for example, 12 hours) as News 1, News 2, News 3, and determine PageRank of News 1, News 2, and News 3. For P1, P2, and P3, respectively, the average value of PageRank is P0=(P1+P2+P3)/3. It is determined that the number of times that News 1, News 2, and News 3 are clicked is C1, C2, and C3, respectively, and the average number of times of being clicked is C0=(C1+C2+C3)/3. It is determined that the number of times News 1, News 2, and News 3 are displayed are D1, D2, and D3, respectively, and the average number of times of being displayed is D0 = (D1 + D2 + D3) / 3. Further, based on one or more of P0, C0, and D0, the importance of the posting location to be determined is determined. It should be noted that News 1, News 2, and News 3 listed herein are merely illustrative and are not intended to limit the present invention.
进一步地,可以对各条新闻的各个参数的平均值中的一个或多个进行加权处理,计算得到的值作为待判断的发布位置的重要性值,随后将重要性值与预设的判断规 则进行比较,确定待判断的发布位置的重要性。仍然以各条新闻为新闻1、新闻2以及新闻3为例,可以对P0、C0以及D0中的一个或多个进行加权处理,计算得到的值作为待判断的发布位置的重要性值,随后将重要性值与预设的判断规则进行比较,确定待判断的发布位置的重要性。这里的预设的判断规则可以是数值区间与发布位置的重要性的对应关系,例如落入数值区间P1对应的发布位置的重要性为“非常高”、落入数值区间P2对应的重要性为“高”、落入数值区间P3对应的发布位置的重要性为“中”、落入数值区间P4对应的发布位置的重要性为“低”等,此处仅是列举的,还可以是其他数值区间与发布位置的重要性的对应关系。Further, one or more of the average values of the respective parameters of each piece of news may be weighted, and the calculated value is used as the importance value of the release position to be determined, and then the importance value and the preset judgment rule are Then, a comparison is made to determine the importance of the posting location to be determined. Still taking news as news 1, news 2, and news 3 as an example, one or more of P0, C0, and D0 may be weighted, and the calculated value is used as the importance value of the release position to be judged, and then The importance value is compared with a preset judgment rule to determine the importance of the release location to be judged. The preset judgment rule here may be a correspondence relationship between the numerical interval and the importance of the release position. For example, the importance of the release position corresponding to the numerical interval P1 is “very high”, and the importance corresponding to the numerical interval P2 is The importance of the "high", the release position corresponding to the numerical interval P3 is "medium", and the importance of the release position corresponding to the numerical interval P4 is "low", etc., which is only listed here, and may be other The correspondence between the numerical interval and the importance of the publishing location.
此外,为了统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻,本发明提供了一种优选的方案,即在步骤S102之前,可以以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。In addition, in order to count the pieces of news on the posting position to be determined on a certain news source in a specified time period, the present invention provides a preferred solution, that is, before step S102, multiple times can be taken at preset time intervals. The news source captures news that is published no longer than the specified duration, records the time when the captured news was first published, the news source, and the location posted on the news source.
以上介绍了图1所示的实施例中各环节的多种实现方式,下面通过具体的优选实施例对本发明实施例提供的判断新闻发布位置的重要性的方法做进一步说明。The foregoing describes various implementation manners of the various links in the embodiment shown in FIG. 1. The method for determining the importance of the news release location provided by the embodiment of the present invention is further described below through a specific preferred embodiment.
图2示出了根据本发明一个实施例的判断新闻发布位置的重要性的方法的另一种流程图。如图2所示,该方法包括以下步骤S202至步骤S210。2 illustrates another flow diagram of a method of determining the importance of a news posting location, in accordance with one embodiment of the present invention. As shown in FIG. 2, the method includes the following steps S202 to S210.
步骤S202、统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻。这里的新闻可以是新闻链接、新闻页面内容等。Step S202: Statistics each news item at a publishing location to be determined on a news source within a specified time period. The news here can be news links, news page content, and the like.
为了统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻,在步骤S202之前,可以以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。In order to count the pieces of news on the publishing location to be determined on a certain news source in a specified time period, before the step S202, the publishing time may be fetched from the plurality of news sources at a preset time interval, and the current time does not exceed the specified duration. News, recording the time when the crawled news was first published, the source of the news, and the location posted on the news source.
步骤S204、确定各条新闻的参数,其中,各条新闻的参数是指各条新闻被链接或操作的参数。Step S204: Determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters in which each piece of news is linked or operated.
这里,各条新闻的参数可以是各条新闻中每条新闻的网页等级PageRank、各条新闻中每条新闻被点击的次数、各条新闻中每条新闻被展示的次数等等,本发明不限于此。Here, the parameters of each news item may be the page rank PageRank of each news in each news, the number of times each news item is clicked in each news, the number of times each news item is displayed in each news, and the like, and the present invention does not Limited to this.
步骤S206、根据确定的各条新闻中每条新闻参数,分别计算得到各条新闻的各个参数的平均值。Step S206: Calculate an average value of each parameter of each piece of news according to each news parameter in each determined news item.
步骤S208、对各条新闻的各个参数的平均值中的一个或多个进行加权处理,计算得到的值作为待判断的发布位置的重要性值。Step S208: Perform weighting processing on one or more of the average values of the respective parameters of each piece of news, and calculate the obtained value as the importance value of the issuing position to be determined.
步骤S210、将重要性值与预设的判断规则进行比较,确定待判断的发布位置的重要性。Step S210: comparing the importance value with a preset determination rule, and determining the importance of the release location to be determined.
本发明实施例中,根据待判断的发布位置上的各条新闻的参数判断待判断的发布位置的重要性,由于待判断的发布位置上的各条新闻的参数是指各条新闻被链接或操作的参数,因而能够客观地反映关于发布位置的重要性的实际情况,使得得到 的判断结果更加客观、准确。并且,本发明提供的客观、准确的待判断的发布位置的重要性的判断结果,对于新闻搜索引擎或新闻发布引擎有着重要的意义,如新闻搜索引擎能够根据该判断结果更准确地判断新闻重要性,进而将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻,又如新闻发布引擎能够根据该判断结果将待发布的重要新闻发布在重要性较高的发布位置。In the embodiment of the present invention, the importance of the release location to be determined is determined according to the parameters of each news item at the release location to be determined, because the parameters of each news item at the release location to be determined means that each news item is linked or The parameters of the operation, thus being able to objectively reflect the actual situation regarding the importance of the release location, so that The judgment results are more objective and accurate. Moreover, the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
需要说明的是,实际应用中,上述所有可选实施方式可以采用结合的方式任意组合,形成本发明的可选实施例,在此不再一一赘述。It should be noted that, in an actual application, all the foregoing optional embodiments may be combined in any combination to form an optional embodiment of the present invention, and details are not described herein again.
基于同一发明构思,本发明实施例还提供了一种判断新闻发布位置的重要性的装置,以实现上述判断新闻发布位置的重要性的方法。Based on the same inventive concept, an embodiment of the present invention further provides a device for determining the importance of a news release location to implement the above method for determining the importance of a news release location.
图3示出了根据本发明一个实施例的判断新闻发布位置的重要性的装置的一种结构示意图。参见图3,该装置至少包括:统计模块310、确定模块320以及判断模块330。FIG. 3 shows a schematic structural diagram of an apparatus for determining the importance of a news release location according to an embodiment of the present invention. Referring to FIG. 3, the apparatus at least includes: a statistics module 310, a determining module 320, and a determining module 330.
现介绍本发明实施例的判断新闻发布位置的重要性的装置的各组成或器件的功能以及各部分间的连接关系:The functions of each component or device of the apparatus for determining the importance of the location of the news release and the connection relationship between the components of the embodiment of the present invention will now be described:
统计模块310,适于统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻;The statistics module 310 is configured to count each news on the publishing location to be determined on a certain news source within a specified time period;
确定模块320,与统计模块310相耦合,适于确定各条新闻的参数,其中,各条新闻的参数是指各条新闻被链接或操作的参数;The determining module 320 is coupled to the statistic module 310 and is adapted to determine parameters of each piece of news, wherein the parameters of each piece of news refer to parameters for which each piece of news is linked or operated;
判断模块330,与确定模块320相耦合,适于根据确定的各条新闻的参数判断出待判断的发布位置的重要性。The determining module 330 is coupled to the determining module 320, and is adapted to determine the importance of the publishing location to be determined according to the determined parameters of each news item.
在一个实施例中,各条新闻的参数包括下列至少之一:In one embodiment, the parameters of each piece of news include at least one of the following:
各条新闻中每条新闻的网页等级PageRank;Page rank of each news in each news page PageRank;
各条新闻中每条新闻被点击的次数;The number of times each news item was clicked in each news item;
各条新闻中每条新闻被展示的次数。The number of times each news item was displayed in each news item.
在一个实施例中,判断模块330还适于:根据确定的各条新闻中每条新闻参数,分别计算得到各条新闻的各个参数的平均值;基于各条新闻的各个参数的平均值中的一个或多个,确定待判断的发布位置的重要性。In an embodiment, the determining module 330 is further adapted to: respectively calculate an average value of each parameter of each piece of news according to each of the determined news items; and based on an average value of each parameter of each piece of news One or more, determining the importance of the publishing location to be determined.
在一个实施例中,判断模块330还适于:对各条新闻的各个参数的平均值中的一个或多个进行加权处理,计算得到的值作为待判断的发布位置的重要性值;将重要性值与预设的判断规则进行比较,确定待判断的发布位置的重要性。In an embodiment, the determining module 330 is further adapted to: perform weighting processing on one or more of the average values of the respective parameters of the respective news, and calculate the obtained value as the importance value of the publishing position to be determined; The sex value is compared with a preset judgment rule to determine the importance of the release location to be determined.
在一个实施例中,图4示出了根据本发明一个实施例的判断新闻发布位置的重要性的装置的另一种结构示意图。如图4所示,在统计模块310统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻之前,还包括:抓取模块410,与统计模块310相耦合,适于以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的发布位置。 In one embodiment, FIG. 4 illustrates another structural diagram of an apparatus for determining the importance of a news posting location, in accordance with one embodiment of the present invention. As shown in FIG. 4, before the statistics module 310 counts each news on the publishing location to be determined on a certain news source in a specified time period, the method further includes: a capture module 410 coupled to the statistics module 310, The preset time interval captures news from a plurality of news sources that is not longer than the specified duration, records the time when the captured news is first released, the news source, and the posting location posted on the news source.
在一个实施例中,各条新闻为各条新闻链接。In one embodiment, each piece of news is a link to each piece of news.
根据上述任意一个优选实施例或多个优选实施例的组合,本发明实施例能够达到如下有益效果:According to any one of the preferred embodiments or the combination of the preferred embodiments, the embodiment of the present invention can achieve the following beneficial effects:
依据本发明提供的技术方案,通过统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻,并确定各条新闻的参数。进而根据确定的各条新闻的参数判断出待判断的发布位置的重要性。由此可见,本发明提出了一种根据待判断的发布位置上的各条新闻的参数判断待判断的发布位置的重要性的方案,由于待判断的发布位置上的各条新闻的参数是指各条新闻被链接或操作的参数,因而能够客观地反映关于发布位置的重要性的实际情况,使得得到的判断结果更加客观、准确。并且,本发明提供的客观、准确的待判断的发布位置的重要性的判断结果,对于新闻搜索引擎或新闻发布引擎有着重要的意义,如新闻搜索引擎能够根据该判断结果更准确地判断新闻重要性,进而将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻,又如新闻发布引擎能够根据该判断结果将待发布的重要新闻发布在重要性较高的发布位置。According to the technical solution provided by the present invention, each piece of news on a posting position to be judged on a news source within a specified time period is counted, and parameters of each piece of news are determined. Further, the importance of the posting location to be determined is determined according to the determined parameters of each piece of news. It can be seen that the present invention proposes a scheme for judging the importance of the posting position to be determined according to the parameters of each news item at the posting position to be determined, because the parameters of each news item at the publishing position to be determined refer to The parameters of each news link or operation can objectively reflect the actual situation about the importance of the release location, making the judgment result more objective and accurate. Moreover, the objective and accurate judgment result of the importance of the release position to be judged by the present invention has important significance for the news search engine or the news release engine, for example, the news search engine can more accurately judge the news important according to the judgment result. Sex, and then put the news of high importance in front of the search results as much as possible, so that users can find the news with higher value as soon as possible, and the news release engine can release the important news to be released according to the judgment result in the higher importance. Publishing location.
根据本发明的另一实施例,还提供了一种衡量新闻重要性的方法,图5示出了根据本发明一个实施例的衡量新闻重要性的方法的一种流程图。如图5所示,该方法至少包括以下步骤S502至步骤S506。In accordance with another embodiment of the present invention, a method of measuring the importance of news is also provided, and FIG. 5 illustrates a flow chart of a method of measuring the importance of news in accordance with one embodiment of the present invention. As shown in FIG. 5, the method includes at least the following steps S502 to S506.
步骤S502、确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值。Step S502: Determine at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance.
步骤S504、获取每个发布属性的属性值,并根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值。Step S504: Acquire an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and calculate the calculated value as the importance value of the news to be measured.
步骤S506、将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。Step S506, comparing the importance value with a preset measurement rule, and measuring the importance of the news to be measured.
依据本发明提供的技术方案,确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值,并获取每个发布属性的属性值。进而根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值。随后将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。可见,本发明基于待衡量新闻的至少一个发布属性来综合衡量待衡量新闻的重要性,使得衡量结果更加客观、准确、全面。由此解决了相关技术中若采用网页等级(PageRank)方法确定新闻重要性而产生负面作用的问题。并且,新闻搜索引擎能够根据本发明提供的客观、准确、全面的待衡量新闻的重要性的衡量结果将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻。According to the technical solution provided by the present invention, at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained. And further performing weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured. The importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect. Moreover, the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
上文步骤S502中提及的新闻可以是新闻链接、新闻页面内容等。进一步地,待衡量新闻的发布属性是指与发布待衡量新闻相关的信息,如发布时间、发布待衡量新闻的新闻源、待衡量新闻被发布在新闻源上的位置、发布内容中的文字或图片信息、发布内容的篇幅等等,本发明不限于此。这里提及的新闻源是指一个聚合了大 量新闻的网页或页面,例如某新闻源a,其网址可以为http://news.a.com.cn。此外,每个发布属性在新闻重要性中的权值是指每个发布属性在新闻重要性中的相对重要程度,因而可以采用多种方法确定每个发布属性在新闻重要性中的权值,如主观经验法、专家调查法等。例如,确定待衡量新闻的至少一个发布属性为发布时间、发布待衡量新闻的新闻源以及发布内容的篇幅,根据主观经验,其中的发布时间在新闻重要性中较重要,可以赋予较大的权值(如权值为0.5),并赋予发布待衡量新闻的新闻源以及发布内容的篇幅的权值分别为0.3和0.2。又例如,确定待衡量新闻的至少一个发布属性为发布时间、发布待衡量新闻的新闻源、待衡量新闻被发布在新闻源上的位置以及发布内容的篇幅,可以赋予它们的权值分别为0.5、0.3、0.1和0.1。以上仅列举了常见的确定权值的方法,其它用于确定权值的方法均适用于本发明。The news mentioned in the above step S502 may be a news link, a news page content, or the like. Further, the posting attribute of the news to be measured refers to information related to publishing the news to be measured, such as the posting time, the news source that publishes the news to be measured, the position where the news to be measured is posted on the news source, the text in the posted content, or The picture information, the length of the posted content, and the like, the present invention is not limited thereto. The news source mentioned here refers to a large aggregate A web page or page of news, such as a news source a, whose URL can be http://news.a.com.cn. In addition, the weight of each publishing attribute in the importance of news refers to the relative importance of each publishing attribute in the importance of news, so multiple methods can be used to determine the weight of each publishing attribute in the importance of news. Such as subjective experience law, expert investigation method, etc. For example, determining at least one publishing attribute of the news to be measured is the publishing time, the news source that publishes the news to be measured, and the length of the published content. According to the subjective experience, the publishing time is more important in the importance of the news, and can be given a larger right. The value (such as a weight of 0.5), and the weight of the news source that publishes the news to be measured and the content of the published content are 0.3 and 0.2, respectively. For another example, determining at least one publishing attribute of the news to be measured is a publishing time, a news source that publishes the news to be measured, a location where the news to be measured is posted on the news source, and a length of the posted content, which may be given a weight of 0.5. , 0.3, 0.1 and 0.1. Only the common methods for determining the weights are listed above, and other methods for determining the weights are applicable to the present invention.
上文步骤S504中发布属性的属性值是发布属性的一种数值表示,例如发布时间6:00整,可以通过属性值1来表示该发布时间6:00整,发布时间12:00整,可以通过属性值2来表示该发布时间12:00整,当然还可以通过其它属性值来表示。又例如,发布内容的篇幅为100字以内,可以通过属性值100来表示该100字以内的篇幅,发布内容的篇幅为100字至500字,可以通过属性值500来表示该100字至500字的篇幅,发布内容的篇幅为500字至1000字,可以通过属性值1000来表示该500字至1000字的篇幅,此外还可以通过其它属性值来表示均适用于本发明。The attribute value of the published attribute in the above step S504 is a numerical representation of the publishing attribute. For example, the publishing time is 6:00. The attribute value 1 indicates that the publishing time is 6:00, and the publishing time is 12:00. The release time is 12:00 by the attribute value 2, and can of course be represented by other attribute values. For another example, the content of the published content is less than 100 words, and the length of the 100 words can be represented by the attribute value 100. The length of the published content is 100 words to 500 words, and the value 100 can be used to represent the 100 words to 500 words. The length of the content to be published is 500 words to 1000 words, and the value of the 500 words to 1000 words can be expressed by the attribute value 1000, and can also be expressed by other attribute values to be applicable to the present invention.
若待衡量新闻的至少一个发布属性为发布待衡量新闻的新闻源或者待衡量新闻被发布在新闻源上的某个位置,本发明实施例还提供了一种优选的计算上述两种发布属性(发布待衡量新闻的新闻源以及待衡量新闻被发布在新闻源上的某个位置)的属性值的方案,在该方案中,可以基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的属性值以及待衡量新闻被发布在新闻源上的某个位置的属性值。下面将分别对上述两种发布属性的属性值的计算方式进行详细介绍。If the at least one publishing attribute of the news to be measured is the news source that publishes the news to be measured or the news to be measured is published in a certain position on the news source, the embodiment of the present invention further provides a preferred calculation of the two publishing attributes ( a scheme for publishing a property value of a news source to be measured and a location where the news to be measured is posted on a news source, in which a posting can be calculated based on a webpage link relationship of a news source that issues the news to be measured The attribute value of the news source of the news to be measured and the attribute value of the location where the news to be measured is posted on the news source. The calculation methods of the attribute values of the above two publishing attributes will be described in detail below.
首先,基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的属性值,可以是基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的PageRank作为发布待衡量新闻的新闻源的属性值。PageRank是一种根据网页之间相互的超链接计算的技术,通过网络浩瀚的超链接关系来确定一个网页的等级。可以把从A网页到B网页的链接解释为A网页给B网页投票,根据投票来源(基于来源的来源,即链接到A网页的网页)和投票目标的等级来决定新的等级。简单的说,一个高等级的网页可以使其他低等级网页的等级提升。First, based on the webpage link relationship of the news source that publishes the news to be measured, the attribute value of the news source that publishes the news to be measured is calculated, which may be based on the webpage link relationship of the news source that issues the news to be measured, and the news to be released is calculated. The PageRank of the news source is used as the attribute value of the news source that publishes the news to be measured. PageRank is a technology based on the hyperlinks between web pages to determine the level of a web page through the vast hyperlink relationship of the web. The link from the A web page to the B web page can be interpreted as the A web page voting for the B web page, and the new level is determined according to the voting source (based on the source of the source, that is, the web page linked to the A web page) and the rating of the voting target. Simply put, a high-level web page can raise the level of other low-level web pages.
其次,基于发布待衡量新闻的新闻源的网页链接关系,计算出待衡量新闻被发布在新闻源上的某个位置的属性值。即统计指定时间段内被发布在新闻源上的该位置的多条新闻,并确定多条新闻的参数,进而根据确定的多条新闻的参数,计算出新闻源上的该位置的属性值作为待衡量新闻被发布在新闻源上的该位置的属性值。这里,多条新闻的参数是指多条新闻中每条新闻被链接或操作的参数值,例如多条 新闻中每条新闻的PageRank、多条新闻中每条新闻被点击的次数、多条新闻中每条新闻被展示的次数,等等。Secondly, based on the webpage link relationship of the news source that publishes the news to be measured, the attribute value of the news to be measured at a certain position on the news source is calculated. That is, counting a plurality of news items of the position published on the news source within a specified time period, and determining parameters of the plurality of news items, and then calculating the attribute value of the position on the news source according to the determined parameters of the plurality of news items. The attribute value of the location at which the news is to be posted on the news source. Here, the parameters of a plurality of news items refer to parameter values in which each piece of news is linked or operated, for example, multiple pieces. PageRank of each news in the news, the number of times each news item is clicked, the number of times each piece of news is displayed, and so on.
进一步地,当多条新闻的参数包括多条新闻中每条新闻的PageRank、多条新闻中每条新闻被点击的次数以及多条新闻中每条新闻被展示的次数时,本发明提供了一种优选的计算新闻源上的该位置的属性值的方案,在该方案中根据确定的多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数,分别计算得到多条新闻的PageRank、被点击的次数、被展示的次数的平均值,进而将计算得到的平均值进行加权求和作为新闻源上的该位置的属性值。例如,统计指定时间段内(如24小时)被发布在新闻源上的该位置的多条新闻为新闻1、新闻2以及新闻3,确定新闻1、新闻2以及新闻3的PageRank分别为P1、P2和P3,则其PageRank的平均值为P0=(P1+P2+P3)/3。确定新闻1、新闻2以及新闻3被点击的次数分别为C1、C2和C3,则其被点击的次数的平均值为C0=(C1+C2+C3)/3。确定新闻1、新闻2以及新闻3被展示的次数分别为D1、D2和D3,其被展示的次数的平均值为D0=(D1+D2+D3)/3。对P0、C0以及D0进行加权求和作为新闻源上的该位置的属性值,这里列举的新闻1、新闻2以及新闻3仅仅是示意性的,不用于限制本发明。需要说明的是,上述列举了多条新闻的参数包括多条新闻中每条新闻的PageRank、多条新闻中每条新闻被点击的次数以及多条新闻中每条新闻被展示的次数时,计算出新闻源上的该位置的属性值的方案。当多条新闻的参数包括多条新闻中每条新闻的PageRank和多条新闻中每条新闻被点击的次数时,或者当多条新闻的参数包括多条新闻中每条新闻被点击的次数和多条新闻中每条新闻被展示的次数时也可以采用上述方案计算出新闻源上的该位置的属性值。例如,当多条新闻的参数包括多条新闻中每条新闻的PageRank和多条新闻中每条新闻被点击的次数时,可以根据确定的多条新闻中每条新闻的PageRank、被点击的次数,分别计算得到多条新闻的PageRank、被点击的次数的平均值,进而将计算得到的平均值进行加权求和作为新闻源上的该位置的属性值。Further, when the parameters of the plurality of news include the PageRank of each of the plurality of news, the number of times each of the plurality of news is clicked, and the number of times each of the plurality of news is displayed, the present invention provides A preferred scheme for calculating an attribute value of the location on the news source, in which the plurality of news items are respectively calculated according to the PageRank of each news in the determined plurality of news, the number of times of being clicked, and the number of times of being displayed. The average of PageRank, the number of times clicked, and the number of times displayed, and then the weighted sum of the calculated average values is used as the attribute value of the position on the news source. For example, counting a plurality of news items at the location posted on the news source within a specified time period (eg, 24 hours) is News 1, News 2, and News 3, and determining that the PageRank of News 1, News 2, and News 3 are respectively P1. For P2 and P3, the average value of PageRank is P0=(P1+P2+P3)/3. It is determined that the number of times that News 1, News 2, and News 3 are clicked is C1, C2, and C3, respectively, and the average number of times of being clicked is C0=(C1+C2+C3)/3. It is determined that the number of times News 1, News 2, and News 3 are displayed are D1, D2, and D3, respectively, and the average number of times of being displayed is D0 = (D1 + D2 + D3) / 3. The weighted summation of P0, C0, and D0 is taken as the attribute value of the location on the news source, and News 1, News 2, and News 3 listed here are merely illustrative and are not intended to limit the present invention. It should be noted that the above-mentioned parameters of multiple news items include PageRank of each news in multiple news items, the number of times each news item is clicked in multiple news items, and the number of times each news item is displayed in multiple news items, The scheme of the attribute value of the location on the news source. When the parameters of the plurality of news include the PageRank of each of the plurality of news items and the number of times each of the plurality of news items is clicked, or when the parameters of the plurality of news items include the number of times each of the plurality of news items is clicked and The number of times each news item is displayed in a plurality of news items may also be used to calculate the attribute value of the position on the news source using the above scheme. For example, when the parameters of the plurality of news include the PageRank of each of the plurality of news items and the number of times each of the plurality of news items is clicked, the PageRank of each of the plurality of pieces of the determined news may be clicked. The PageRank of the plurality of news items and the average number of times of the clicks are respectively calculated, and the calculated average value is weighted and summed as the attribute value of the position on the news source.
此外,为了统计指定时间段内被发布在新闻源上的该位置的多条新闻,本发明还可以以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。In addition, in order to count multiple news items of the location published on the news source within a specified time period, the present invention may also capture news from a plurality of news sources at a preset time interval from the current time limit of not longer than the specified time. , the time at which the captured news was first published, the source of the news, and the location posted on the news source.
以上介绍了发布待衡量新闻的新闻源的个数为一个的情况,若发布待衡量新闻的新闻源的个数为多个时(对于一些特别重要的新闻,可能会同时被多个新闻源转载),则基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的属性值和/或待衡量新闻被发布在新闻源上的位置的属性值,可以是基于发布待衡量新闻的多个新闻源的网页链接关系,计算出发布待衡量新闻的各个新闻源的属性值和/或待衡量新闻被发布在各个新闻源上的位置的属性值,且计算发布待衡量新闻的各个新闻源的属性值和/或待衡量新闻被发布在各个新闻源上的位置的属性值可以采用上述发布待衡量新闻的新闻源的个数为一个的情况的方案来实现。 The above describes the number of news sources that publish news to be measured as one. If the number of news sources that publish news to be measured is multiple (for some particularly important news, it may be reprinted by multiple news sources at the same time). ), based on the webpage link relationship of the news source that publishes the news to be measured, calculating the attribute value of the news source that publishes the news to be measured and/or the attribute value of the location where the news to be measured is posted on the news source, which may be based on the release The webpage link relationship of the plurality of news sources of the news to be measured, the attribute value of each news source that publishes the news to be measured, and/or the attribute value of the position where the news to be measured is posted on each news source, and the calculation release is to be measured The attribute value of each news source of the news and/or the attribute value of the position where the news to be measured is posted on each news source may be implemented by using the above-described scheme of the number of news sources for which the news to be measured is one.
另外,上文步骤S506中提及的预设的衡量规则可以是数值区间与新闻的重要性的对应关系,例如落入数值区间1对应的新闻的重要性为“非常高”、落入数值区间2对应的新闻的重要性为“高”、落入数值区间3对应的新闻的重要性为“中”、落入数值区间4对应的新闻的重要性为“低”等,此处仅是列举的,还可以是其他数值区间与新闻的重要性的对应关系。In addition, the preset measurement rule mentioned in the above step S506 may be a correspondence relationship between the numerical interval and the importance of the news, for example, the importance of the news corresponding to the numerical interval 1 is “very high” and falls within the numerical interval. 2 The importance of the corresponding news is "high", the importance of the news corresponding to the numerical interval 3 is "medium", and the importance of the news corresponding to the numerical interval 4 is "low", etc. It can also be the correspondence between other numerical intervals and the importance of news.
以上介绍了图5所示的实施例中各环节的多种实现方式,下面通过具体的优选实施例对本发明实施例提供的衡量新闻重要性的方法做进一步说明。The various implementations of the various steps in the embodiment shown in FIG. 5 are described above. The method for measuring the importance of the news provided by the embodiment of the present invention is further described below through a specific preferred embodiment.
图6示出了根据本发明一个实施例的衡量新闻重要性的方法的另一种流程图。如图6所示,该方法包括以下步骤S602至步骤S614。Figure 6 illustrates another flow diagram of a method of measuring the importance of news in accordance with one embodiment of the present invention. As shown in FIG. 6, the method includes the following steps S602 to S614.
步骤S602、确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值,其中,至少一个发布属性包括发布待衡量新闻的新闻源以及待衡量新闻被发布在新闻源上的某个位置。Step S602, determining at least one publishing attribute of the news to be measured, and weights of each publishing attribute in the news importance, wherein at least one publishing attribute includes a news source that publishes the news to be measured, and the news to be measured is published in the news source. Somewhere on the top.
这里,至少一个发布属性还可以包括发布时间、发布内容中的文字或图片信息、发布内容的篇幅等。Here, the at least one publishing attribute may further include a publishing time, a text or picture information in the posted content, a length of the posted content, and the like.
步骤S604、基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的PageRank作为发布待衡量新闻的新闻源的属性值。Step S604: Calculate, according to the webpage link relationship of the news source that issues the news to be measured, the PageRank of the news source that issues the news to be measured as the attribute value of the news source that issues the news to be measured.
步骤S606、统计指定时间段内被发布在新闻源上的该位置的多条新闻,并确定多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数。Step S606: Counting a plurality of news posts at the location on the news source within a specified time period, and determining a PageRank of each news in the plurality of news, a number of times of being clicked, and a number of times of being displayed.
为了统计指定时间段内被发布在新闻源上的该位置的多条新闻,在步骤S606之前可以以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。In order to count a plurality of news items of the location that are posted on the news source within a specified time period, the news that the publishing time is not longer than the specified duration may be fetched from the plurality of news sources at a preset time interval before the step S606. Record when the crawled news was first published, the source of the news, and the location posted on the news feed.
步骤S608、根据确定的多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数,分别计算得到多条新闻的PageRank、被点击的次数、被展示的次数的平均值。Step S608: Calculate, according to the PageRank of each news in the determined plurality of news, the number of times of being clicked, and the number of times of being displayed, the average of the PageRank of the plurality of news, the number of times of being clicked, and the number of times of being displayed.
步骤S610、将计算得到的平均值进行加权求和作为新闻源上的该位置的属性值。Step S610, performing weighted summation on the calculated average value as the attribute value of the position on the news source.
步骤S612、获取每个发布属性的属性值,并根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值。Step S612: Acquire an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and calculate the calculated value as the importance value of the news to be measured.
步骤S614、将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。In step S614, the importance value is compared with a preset measurement rule to measure the importance of the news to be measured.
需要说明的是,上述步骤S604与步骤S606至步骤S610无先后执行顺序之分,也可以先执行步骤S606至步骤S610,然后执行步骤S604。It should be noted that the above step S604 and step S606 to step S610 have no sequential execution order, and steps S606 to S610 may be performed first, and then step S604 is performed.
本发明实施例中,基于待衡量新闻的发布待衡量新闻的新闻源以及待衡量新闻被发布在新闻源上的某个位置等至少一个发布属性来综合衡量待衡量新闻的重要性,使得衡量结果更加客观、准确、全面。并且,新闻搜索引擎能够根据本发明提供的客观、准确、全面的待衡量新闻的重要性的衡量结果将重要性高的新闻尽量排 在搜索结果的前面,方便用户尽快找到价值较高的新闻。In the embodiment of the present invention, the importance of the news to be measured is comprehensively measured based on at least one publishing attribute, such as the news source of the news to be measured, the news source to be measured, and the location where the news to be measured is published on the news source, so that the measurement result is More objective, accurate and comprehensive. Moreover, the news search engine can rank the news of high importance as much as possible according to the objective, accurate and comprehensive measurement of the importance of the news to be measured provided by the present invention. In front of the search results, users can find high-value news as soon as possible.
需要说明的是,实际应用中,上述所有可选实施方式可以采用结合的方式任意组合,形成本发明的可选实施例,在此不再一一赘述。It should be noted that, in an actual application, all the foregoing optional embodiments may be combined in any combination to form an optional embodiment of the present invention, and details are not described herein again.
基于同一发明构思,本发明实施例还提供了一种衡量新闻重要性的装置,以实现上述衡量新闻重要性的方法。Based on the same inventive concept, an embodiment of the present invention also provides a device for measuring the importance of news to implement the above method for measuring the importance of news.
图7示出了根据本发明一个实施例的衡量新闻重要性的装置的一种结构示意图。参见图7,该装置至少包括:确定模块710、计算模块720以及衡量模块730。FIG. 7 shows a schematic structural diagram of an apparatus for measuring the importance of news according to an embodiment of the present invention. Referring to FIG. 7, the apparatus at least includes: a determining module 710, a calculating module 720, and a measuring module 730.
现介绍本发明实施例的衡量新闻重要性的装置的各组成或器件的功能以及各部分间的连接关系:The functions of the components or devices of the device for measuring the importance of news and the connection relationship between the components of the embodiment of the present invention will now be described:
确定模块710,适于确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值;The determining module 710 is adapted to determine at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance;
计算模块720,与确定模块710相耦合,适于获取每个发布属性的属性值,并根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值;The calculating module 720 is coupled to the determining module 710, and is adapted to obtain an attribute value of each publishing attribute, and weighting the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is to be measured. The importance value of the news;
衡量模块730,与计算模块720相耦合,适于将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。The measurement module 730 is coupled to the calculation module 720 and is adapted to compare the importance value with a preset measurement rule to measure the importance of the news to be measured.
在一个实施例中,发布属性包括下列任意之一:In one embodiment, the publishing attribute includes any of the following:
发布时间;release time;
发布待衡量新闻的新闻源;Publish news sources for news to be measured;
待衡量新闻被发布在新闻源上的位置;The location where the news to be measured is posted on the news source;
发布内容中的文字或图片信息;Publish text or image information in the content;
发布内容的篇幅。The length of the content posted.
在一个实施例中,若至少一个发布属性包括发布待衡量新闻的新闻源和/或待衡量新闻被发布在新闻源上的位置,在计算模块720获取每个发布属性的属性值之前,计算模块720还适于:基于发布待衡量新闻的新闻源的网页链接关系,计算出发布待衡量新闻的新闻源的属性值和/或待衡量新闻被发布在新闻源上的位置的属性值。In one embodiment, if at least one publishing attribute includes a news source that publishes the news to be measured and/or a location where the news to be measured is posted on the news source, the computing module 720 obtains the attribute value of each published attribute before the computing module 720 is further adapted to: calculate an attribute value of a news source that issues the news to be measured and/or an attribute value of a location where the news to be measured is posted on the news source based on a webpage link relationship of the news source that issues the news to be measured.
在一个实施例中,计算模块720还适于:统计指定时间段内被发布在新闻源上的位置的多条新闻;确定多条新闻的参数;根据确定的多条新闻的参数,计算出新闻源上的位置的属性值作为待衡量新闻被发布在新闻源上的位置的属性值。In one embodiment, the calculating module 720 is further adapted to: count a plurality of news items that are posted on the news source within a specified time period; determine parameters of the plurality of news items; calculate the news according to the determined parameters of the plurality of news items The attribute value of the location on the source is the attribute value of the location where the news to be measured is posted on the news source.
在一个实施例中,图8示出了根据本发明一个实施例的衡量新闻重要性的装置的另一种结构示意图。如图8所示,在计算模块720统计指定时间段内被发布在新闻源上的位置的多条新闻之前,还包括:抓取模块810,与计算模块720相耦合,适于以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。In one embodiment, FIG. 8 illustrates another structural schematic of an apparatus for measuring the importance of news in accordance with one embodiment of the present invention. As shown in FIG. 8 , before the calculation module 720 counts a plurality of news items that are posted on the news source within a specified time period, the method further includes: a capture module 810 coupled to the calculation module 720 and adapted to be preset. The time interval captures news from a plurality of news sources that are not longer than the specified duration, records the time when the captured news was first published, the news source, and the location posted on the news source.
在一个实施例中,多条新闻的参数包括下列至少之一:In one embodiment, the plurality of news parameters include at least one of the following:
多条新闻中每条新闻的网页等级PageRank; Page rank of each news in multiple news items PageRank;
多条新闻中每条新闻被点击的次数;The number of times each news item was clicked in multiple news items;
多条新闻中每条新闻被展示的次数。The number of times each news item was displayed in multiple news items.
在一个实施例中,计算模块720还适于:根据确定的多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数,分别计算得到多条新闻的PageRank、被点击的次数、被展示的次数的平均值;将计算得到的平均值进行加权求和作为新闻源上的位置的属性值。In an embodiment, the calculating module 720 is further configured to: calculate, according to the PageRank of each news, the number of times of being clicked, and the number of times of being displayed, the PageRank of the plurality of news, the number of times of being clicked, The average of the number of times displayed; the calculated average is weighted and summed as the attribute value of the position on the news source.
在一个实施例中,若发布待衡量新闻的新闻源的个数为多个,计算模块720还适于:基于发布待衡量新闻的多个新闻源的网页链接关系,计算出发布待衡量新闻的各个新闻源的属性值和/或待衡量新闻被发布在各个新闻源上的位置的属性值。In one embodiment, if the number of news sources for which the news to be measured is published is multiple, the calculating module 720 is further configured to: calculate the news to be measured based on the webpage link relationship of the plurality of news sources that are to be measured. The attribute value of each news source and/or the attribute value of the location where the news to be measured is posted on each news source.
根据上述任意一个优选实施例或多个优选实施例的组合,本发明实施例能够达到如下有益效果:According to any one of the preferred embodiments or the combination of the preferred embodiments, the embodiment of the present invention can achieve the following beneficial effects:
依据本发明提供的技术方案,确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值,并获取每个发布属性的属性值。进而根据确定的权值以及获取的属性值对至少一个发布属性进行加权处理,计算得出的值作为待衡量新闻的重要性值。随后将重要性值与预设的衡量规则进行比较,衡量出待衡量新闻的重要性。可见,本发明基于待衡量新闻的至少一个发布属性来综合衡量待衡量新闻的重要性,使得衡量结果更加客观、准确、全面。由此解决了相关技术中若采用网页等级(PageRank)方法确定新闻重要性而产生负面作用的问题。并且,新闻搜索引擎能够根据本发明提供的客观、准确、全面的待衡量新闻的重要性的衡量结果将重要性高的新闻尽量排在搜索结果的前面,方便用户尽快找到价值较高的新闻。According to the technical solution provided by the present invention, at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the news importance are determined, and the attribute value of each publishing attribute is obtained. And further performing weighting processing on the at least one publishing attribute according to the determined weight and the obtained attribute value, and the calculated value is used as the importance value of the news to be measured. The importance value is then compared to a preset measurement rule to measure the importance of the news to be measured. It can be seen that the present invention comprehensively measures the importance of the news to be measured based on at least one publishing attribute of the news to be measured, so that the measurement result is more objective, accurate and comprehensive. This solves the problem that the related art uses the PageRank method to determine the importance of news and has a negative effect. Moreover, the news search engine can rank the news with high importance as much as possible in front of the search results according to the objective, accurate and comprehensive measurement results of the importance of the news to be measured provided by the present invention, so that the user can find the news with higher value as soon as possible.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。Similarly, the various features of the invention are sometimes grouped together into a single embodiment, in the above description of the exemplary embodiments of the invention, Figure, or a description of it. However, the method disclosed is not to be interpreted as reflecting the intention that the claimed invention requires more features than those recited in the claims. Rather, as the following claims reflect, inventive aspects reside in less than all features of the single embodiments disclosed herein. Therefore, the claims following the specific embodiments are hereby explicitly incorporated into the embodiments, and each of the claims as a separate embodiment of the invention.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非 另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art will appreciate that the modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components. In addition to such features and/or at least some of the processes or units being mutually exclusive, any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined. Unless In addition, it is expressly stated that each feature disclosed in the specification, including the accompanying claims, the abstract and the drawings, may be replaced by alternative features that provide the same, equivalent or similar purpose.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本发明的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。In addition, those skilled in the art will appreciate that, although some embodiments described herein include certain features that are included in other embodiments and not in other features, combinations of features of different embodiments are intended to be within the scope of the present invention. Different embodiments are formed and formed. For example, in the following claims, any one of the claimed embodiments can be used in any combination.
本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本发明实施例的判断新闻发布位置的重要性的装置以及衡量新闻重要性的装置中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. It will be understood by those skilled in the art that a microprocessor or digital signal processor (DSP) can be used in practice to implement a device for determining the importance of a news posting location and an apparatus for measuring the importance of news, in accordance with an embodiment of the present invention. Some or all of the features of some or all of the components. The invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
例如,图9示出了可以实现判断新闻发布位置的重要性的方法以及衡量新闻重要性的方法的计算设备。该计算设备传统上包括处理器910和以存储器920形式的计算机程序产品或者计算机可读介质。存储器920可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。存储器920具有用于执行上述方法中的任何方法步骤的程序代码931的存储空间930。例如,用于程序代码的存储空间930可以包括分别用于实现上面的方法中的各种步骤的各个程序代码931。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。这些计算机程序产品包括诸如硬盘,紧致盘(CD)、存储卡或者软盘之类的程序代码载体。这样的计算机程序产品通常为如参考图10所述的便携式或者固定存储单元。该存储单元可以具有与图9的计算设备中的存储器920类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储单元包括计算机可读代码931’,即可以由例如诸如910之类的处理器读取的代码,这些代码当由计算设备运行时,导致该计算设备执行上面所描述的方法中的各个步骤。For example, FIG. 9 illustrates a computing device that can implement a method of determining the importance of a news posting location and a method of measuring the importance of news. The computing device conventionally includes a processor 910 and a computer program product or computer readable medium in the form of a memory 920. The memory 920 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM. Memory 920 has a memory space 930 for program code 931 for performing any of the method steps described above. For example, storage space 930 for program code may include various program code 931 for implementing various steps in the above methods, respectively. The program code can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks. Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG. The storage unit may have storage segments, storage spaces, and the like that are similarly arranged to memory 920 in the computing device of FIG. The program code can be compressed, for example, in an appropriate form. Typically, the storage unit includes computer readable code 931', ie, code that can be read by a processor, such as 910, that when executed by a computing device causes the computing device to perform each of the methods described above step.
本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在本发明的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。"an embodiment," or "an embodiment," or "an embodiment," In addition, it is noted that the phrase "in one embodiment" is not necessarily referring to the same embodiment.
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一 个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It is to be noted that the above-described embodiments are illustrative of the invention and are not intended to be limiting, and that the invention may be devised without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "one" or "one" before the component A plurality of such elements are not excluded. The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means It can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.
此外,还应当注意,本说明书中使用的语言主要是为了可读性和教导的目的而选择的,而不是为了解释或者限定本发明的主题而选择的。因此,在不偏离所附权利要求书的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。对于本发明的范围,对本发明所做的公开是说明性的,而非限制性的,本发明的范围由所附权利要求书限定。 In addition, it should be noted that the language used in the specification has been selected for the purpose of readability and teaching, and is not intended to be construed or limited. Therefore, many modifications and changes will be apparent to those skilled in the art without departing from the scope of the invention. The disclosure of the present invention is intended to be illustrative, and not restrictive, and the scope of the invention is defined by the appended claims.

Claims (30)

  1. 一种判断新闻发布位置的重要性的方法,包括:A method of determining the importance of a news release location, including:
    统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻;Counting the news on the release location of a news source within a specified time period;
    确定所述各条新闻的参数,其中,所述各条新闻的参数是指所述各条新闻被链接或操作的参数;Determining parameters of the respective pieces of news, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated;
    根据确定的所述各条新闻的参数判断出所述待判断的发布位置的重要性。Determining the importance of the posting location to be determined according to the determined parameters of the respective news items.
  2. 根据权利要求1所述的方法,其中,所述各条新闻的参数包括下列至少之一:The method of claim 1, wherein the parameters of the respective pieces of news comprise at least one of the following:
    所述各条新闻中每条新闻的网页等级PageRank;The page rank of each news in each of the news items PageRank;
    所述各条新闻中每条新闻被点击的次数;The number of times each news item in the various news items was clicked;
    所述各条新闻中每条新闻被展示的次数。The number of times each news item in the various news items was displayed.
  3. 根据权利要求1或2所述的方法,其中,根据确定的所述各条新闻的参数判断出所述待判断的发布位置的重要性,包括:The method according to claim 1 or 2, wherein determining the importance of the posting location to be determined based on the determined parameters of the respective pieces of news comprises:
    根据确定的所述各条新闻中每条新闻参数,分别计算得到所述各条新闻的各个参数的平均值;Calculating an average value of each parameter of each piece of news according to each news parameter in each of the determined pieces of news;
    基于所述各条新闻的各个参数的平均值中的一个或多个,确定所述待判断的发布位置的重要性。The importance of the posting location to be determined is determined based on one or more of the average values of the respective parameters of the respective pieces of news.
  4. 根据权利要求1-3任一项所述的方法,其中,基于所述各条新闻的各个参数的平均值中的一个或多个,确定所述待判断的发布位置的重要性,包括:The method according to any one of claims 1 to 3, wherein determining the importance of the posting position to be determined based on one or more of the average values of the respective parameters of the respective pieces of news comprises:
    对所述各条新闻的各个参数的平均值中的一个或多个进行加权处理,计算得到的值作为所述待判断的发布位置的重要性值;Performing weighting processing on one or more of the average values of the respective parameters of the pieces of news, and calculating the obtained value as the importance value of the publishing position to be determined;
    将所述重要性值与预设的判断规则进行比较,确定所述待判断的发布位置的重要性。Comparing the importance value with a preset determination rule to determine the importance of the release location to be determined.
  5. 根据权利要求1-4任一项所述的方法,其中,统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻之前,还包括:The method according to any one of claims 1 to 4, wherein, before counting each piece of news on the posting position to be determined on a news source within a specified time period, the method further comprises:
    以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的所述新闻首次被发布的时间、新闻源以及被发布在该新闻源上的发布位置。Grab news from multiple news sources at a preset time interval that is not longer than the specified duration, record the time when the captured news was first published, the news source, and the postings posted on the news source position.
  6. 根据权利要求1-5任一项所述的方法,其中,所述各条新闻为各条新闻链接。The method according to any one of claims 1 to 5, wherein the respective pieces of news are respective news links.
  7. 一种衡量新闻重要性的方法,包括:A method of measuring the importance of news, including:
    确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值;Determining at least one publishing attribute of the news to be measured, and the weight of each publishing attribute in the importance of the news;
    获取每个发布属性的属性值,并根据确定的所述权值以及获取的所述属性值对所述至少一个发布属性进行加权处理,计算得出的值作为所述待衡量新闻的重要性值;Obtaining an attribute value of each publishing attribute, and performing weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the importance value of the to-be-measured news ;
    将所述重要性值与预设的衡量规则进行比较,衡量出所述待衡量新闻的重要性。The importance value is compared with a preset measurement rule to measure the importance of the news to be measured.
  8. 根据权利要求7所述的方法,其中,所述发布属性包括下列任意之一: The method of claim 7, wherein the publishing attribute comprises any one of the following:
    发布时间;release time;
    发布所述待衡量新闻的新闻源;Publish the news source of the news to be measured;
    所述待衡量新闻被发布在新闻源上的位置;The location where the news to be measured is posted on the news source;
    发布内容中的文字或图片信息;Publish text or image information in the content;
    发布内容的篇幅。The length of the content posted.
  9. 根据权利要求7或8所述的方法,其中,The method according to claim 7 or 8, wherein
    若所述至少一个发布属性包括发布所述待衡量新闻的新闻源和/或所述待衡量新闻被发布在新闻源上的位置,If the at least one posting attribute includes a news source that publishes the news to be measured and/or a location where the news to be measured is posted on a news source,
    所述获取每个发布属性的属性值之前,还包括:Before the obtaining the attribute value of each publishing attribute, the method further includes:
    基于发布所述待衡量新闻的新闻源的网页链接关系,计算出发布所述待衡量新闻的新闻源的属性值和/或所述待衡量新闻被发布在所述新闻源上的位置的属性值。Calculating an attribute value of a news source that publishes the news to be measured and/or an attribute value of a position at which the news to be measured is posted on the news source based on a webpage link relationship of a news source that publishes the news to be measured .
  10. 根据权利要求7-9任一项所述的方法,其中,基于发布所述待衡量新闻的新闻源的网页链接关系,计算出所述待衡量新闻被发布在所述新闻源上的位置的属性值,包括:The method according to any one of claims 7 to 9, wherein, based on a webpage link relationship of a news source that issues the news to be measured, an attribute of a location at which the news to be measured is posted on the news source is calculated Values, including:
    统计指定时间段内被发布在所述新闻源上的所述位置的多条新闻;Counting a plurality of news items of the location posted on the news source during a specified time period;
    确定所述多条新闻的参数;Determining parameters of the plurality of news items;
    根据确定的所述多条新闻的参数,计算出所述新闻源上的所述位置的属性值作为所述待衡量新闻被发布在所述新闻源上的所述位置的属性值。And determining, according to the determined parameters of the plurality of news items, an attribute value of the location on the news source as an attribute value of the location where the to-be-measured news is posted on the news source.
  11. 根据权利要求7-10任一项所述的方法,其中,统计指定时间段内被发布在所述新闻源上的所述位置的多条新闻之前,还包括:The method according to any one of claims 7 to 10, wherein, before counting a plurality of news items of the location posted on the news source within a specified time period, the method further comprises:
    以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的所述新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。Grab news from multiple news sources at a preset time interval from the current time limit, and record the time when the captured news was first published, the news source, and the location posted on the news source. .
  12. 根据权利要求7-11任一项所述的方法,其中,所述多条新闻的参数包括下列至少之一:The method according to any one of claims 7 to 11, wherein the parameters of the plurality of news items comprise at least one of the following:
    所述多条新闻中每条新闻的网页等级PageRank;The page rank of each news in the plurality of news items PageRank;
    所述多条新闻中每条新闻被点击的次数;The number of times each of the plurality of news items was clicked;
    所述多条新闻中每条新闻被展示的次数。The number of times each of the plurality of news items was displayed.
  13. 根据权利要求7-12任一项所述的方法,其中,根据确定的所述多条新闻的参数,计算出所述新闻源上的所述位置的属性值,包括:The method according to any one of claims 7 to 12, wherein the attribute value of the location on the news source is calculated according to the determined parameters of the plurality of news items, including:
    根据确定的所述多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数,分别计算得到所述多条新闻的PageRank、被点击的次数、被展示的次数的平均值;Calculating, according to the determined PageRank of each of the plurality of news items, the number of times of being clicked, and the number of times of being displayed, respectively, the PageRank of the plurality of news items, the number of times of being clicked, and the average number of times of being displayed;
    将计算得到的所述平均值进行加权求和作为所述新闻源上的所述位置的属性值。The calculated average value is weighted and summed as an attribute value of the position on the news source.
  14. 根据权利要求7-13任一项所述的方法,其中,A method according to any one of claims 7 to 13, wherein
    若发布所述待衡量新闻的新闻源的个数为多个,If the number of news sources for publishing the news to be measured is plural,
    基于发布所述待衡量新闻的新闻源的网页链接关系,计算出发布所述待衡量新闻的新闻源的属性值和/或所述待衡量新闻被发布在所述新闻源上的位置的属性值,包 括:Calculating an attribute value of a news source that publishes the news to be measured and/or an attribute value of a position at which the news to be measured is posted on the news source based on a webpage link relationship of a news source that publishes the news to be measured Package include:
    基于发布所述待衡量新闻的多个新闻源的网页链接关系,计算出发布所述待衡量新闻的各个新闻源的属性值和/或所述待衡量新闻被发布在各个所述新闻源上的位置的属性值。Calculating an attribute value of each news source that issues the news to be measured and/or the news to be measured is published on each of the news sources based on a webpage link relationship of a plurality of news sources that publish the news to be measured The attribute value of the location.
  15. 一种判断新闻发布位置的重要性的装置,包括:A device for determining the importance of a news release location, including:
    统计模块,适于统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻;a statistical module, configured to count each news on a publishing location to be judged on a news source within a specified time period;
    确定模块,适于确定所述各条新闻的参数,其中,所述各条新闻的参数是指所述各条新闻被链接或操作的参数;a determining module, configured to determine parameters of the respective pieces of news, wherein the parameters of the pieces of news refer to parameters in which the pieces of news are linked or operated;
    判断模块,适于根据确定的所述各条新闻的参数判断出所述待判断的发布位置的重要性。The determining module is adapted to determine the importance of the publishing location to be determined according to the determined parameters of the respective news items.
  16. 根据权利要求15所述的装置,其中,所述各条新闻的参数包括下列至少之一:The apparatus of claim 15, wherein the parameters of the respective pieces of news comprise at least one of the following:
    所述各条新闻中每条新闻的网页等级PageRank;The page rank of each news in each of the news items PageRank;
    所述各条新闻中每条新闻被点击的次数;The number of times each news item in the various news items was clicked;
    所述各条新闻中每条新闻被展示的次数。The number of times each news item in the various news items was displayed.
  17. 根据权利要求15或16所述的装置,其中,所述判断模块还适于:The apparatus according to claim 15 or 16, wherein the determining module is further adapted to:
    根据确定的所述各条新闻中每条新闻参数,分别计算得到所述各条新闻的各个参数的平均值;Calculating an average value of each parameter of each piece of news according to each news parameter in each of the determined pieces of news;
    基于所述各条新闻的各个参数的平均值中的一个或多个,确定所述待判断的发布位置的重要性。The importance of the posting location to be determined is determined based on one or more of the average values of the respective parameters of the respective pieces of news.
  18. 根据权利要求15-17任一项所述的装置,其中,所述判断模块还适于:The apparatus of any one of claims 15-17, wherein the determining module is further adapted to:
    对所述各条新闻的各个参数的平均值中的一个或多个进行加权处理,计算得到的值作为所述待判断的发布位置的重要性值;Performing weighting processing on one or more of the average values of the respective parameters of the pieces of news, and calculating the obtained value as the importance value of the publishing position to be determined;
    将所述重要性值与预设的判断规则进行比较,确定所述待判断的发布位置的重要性。Comparing the importance value with a preset determination rule to determine the importance of the release location to be determined.
  19. 根据权利要求15-18任一项所述的装置,其中,在所述统计模块统计指定时间段内某一新闻源上待判断的发布位置上的各条新闻之前,还包括:The apparatus according to any one of claims 15 to 18, wherein before the statistics module counts each piece of news on the posting position to be determined on a certain news source within a specified time period, the method further comprises:
    抓取模块,适于以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的所述新闻首次被发布的时间、新闻源以及被发布在该新闻源上的发布位置。a capture module, configured to capture news from a plurality of news sources at a preset time interval from a current time that does not exceed a specified duration, and record the time when the captured news is first released, the news source, and the published The publishing location on this news feed.
  20. 根据权利要求15-19任一项所述的装置,其中,所述各条新闻为各条新闻链接。The apparatus according to any one of claims 15 to 19, wherein each of the pieces of news is a respective news link.
  21. 一种衡量新闻重要性的装置,包括:A device that measures the importance of news, including:
    确定模块,适于确定待衡量新闻的至少一个发布属性,以及每个发布属性在新闻重要性中的权值; a determining module adapted to determine at least one publishing attribute of the news to be measured, and a weight of each publishing attribute in the importance of the news;
    计算模块,适于获取每个发布属性的属性值,并根据确定的所述权值以及获取的所述属性值对所述至少一个发布属性进行加权处理,计算得出的值作为所述待衡量新闻的重要性值;a calculating module, configured to obtain an attribute value of each publishing attribute, and perform weighting processing on the at least one publishing attribute according to the determined weight value and the obtained attribute value, and the calculated value is used as the to-be-measured value The importance value of the news;
    衡量模块,适于将所述重要性值与预设的衡量规则进行比较,衡量出所述待衡量新闻的重要性。The measurement module is adapted to compare the importance value with a preset measurement rule to measure the importance of the news to be measured.
  22. 根据权利要求21所述的装置,其中,所述发布属性包括下列任意之一:The apparatus of claim 21, wherein the publishing attribute comprises any one of the following:
    发布时间;release time;
    发布所述待衡量新闻的新闻源;Publish the news source of the news to be measured;
    所述待衡量新闻被发布在新闻源上的位置;The location where the news to be measured is posted on the news source;
    发布内容中的文字或图片信息;Publish text or image information in the content;
    发布内容的篇幅。The length of the content posted.
  23. 根据权利要求21或22所述的装置,其中,The device according to claim 21 or 22, wherein
    若所述至少一个发布属性包括发布所述待衡量新闻的新闻源和/或所述待衡量新闻被发布在新闻源上的位置,If the at least one posting attribute includes a news source that publishes the news to be measured and/or a location where the news to be measured is posted on a news source,
    在所述计算模块获取每个发布属性的属性值之前,所述计算模块还适于:Before the calculating module acquires the attribute value of each publishing attribute, the calculating module is further adapted to:
    基于发布所述待衡量新闻的新闻源的网页链接关系,计算出发布所述待衡量新闻的新闻源的属性值和/或所述待衡量新闻被发布在所述新闻源上的位置的属性值。Calculating an attribute value of a news source that publishes the news to be measured and/or an attribute value of a position at which the news to be measured is posted on the news source based on a webpage link relationship of a news source that publishes the news to be measured .
  24. 根据权利要求21-23任一项所述的装置,其中,所述计算模块还适于:The apparatus according to any one of claims 21 to 23, wherein the calculation module is further adapted to:
    统计指定时间段内被发布在所述新闻源上的所述位置的多条新闻;Counting a plurality of news items of the location posted on the news source during a specified time period;
    确定所述多条新闻的参数;Determining parameters of the plurality of news items;
    根据确定的所述多条新闻的参数,计算出所述新闻源上的所述位置的属性值作为所述待衡量新闻被发布在所述新闻源上的所述位置的属性值。And determining, according to the determined parameters of the plurality of news items, an attribute value of the location on the news source as an attribute value of the location where the to-be-measured news is posted on the news source.
  25. 根据权利要求21-24任一项所述的装置,其中,在所述计算模块统计指定时间段内被发布在所述新闻源上的所述位置的多条新闻之前,还包括:The apparatus according to any one of claims 21 to 24, wherein before the calculating module counts a plurality of pieces of news posted at the location on the news source within a specified time period, the method further comprises:
    抓取模块,适于以预设的时间间隔从多个新闻源上抓取发布时间距当前不超过指定时长的新闻,记录抓取的所述新闻首次被发布的时间、新闻源以及被发布在该新闻源上的位置。a capture module, configured to capture news from a plurality of news sources at a preset time interval from a current time that does not exceed a specified duration, and record the time when the captured news is first released, the news source, and the published The location on the news source.
  26. 根据权利要求21-25任一项所述的装置,其中,所述多条新闻的参数包括下列至少之一:The apparatus according to any one of claims 21 to 25, wherein the parameters of the plurality of news items comprise at least one of the following:
    所述多条新闻中每条新闻的网页等级PageRank;The page rank of each news in the plurality of news items PageRank;
    所述多条新闻中每条新闻被点击的次数;The number of times each of the plurality of news items was clicked;
    所述多条新闻中每条新闻被展示的次数。The number of times each of the plurality of news items was displayed.
  27. 根据权利要求21-26任一项所述的装置,其中,所述计算模块还适于:The apparatus of any of claims 21-26, wherein the computing module is further adapted to:
    根据确定的所述多条新闻中每条新闻的PageRank、被点击的次数、被展示的次数,分别计算得到所述多条新闻的PageRank、被点击的次数、被展示的次数的平均值;Calculating, according to the determined PageRank of each of the plurality of news items, the number of times of being clicked, and the number of times of being displayed, respectively, the PageRank of the plurality of news items, the number of times of being clicked, and the average number of times of being displayed;
    将计算得到的所述平均值进行加权求和作为所述新闻源上的所述位置的属性值。 The calculated average value is weighted and summed as an attribute value of the position on the news source.
  28. 根据权利要求21-27任一项所述的装置,其中,若发布所述待衡量新闻的新闻源的个数为多个,所述计算模块还适于:The device according to any one of claims 21 to 27, wherein if the number of news sources for which the news to be measured is published is plural, the calculation module is further adapted to:
    基于发布所述待衡量新闻的多个新闻源的网页链接关系,计算出发布所述待衡量新闻的各个新闻源的属性值和/或所述待衡量新闻被发布在各个所述新闻源上的位置的属性值。Calculating an attribute value of each news source that issues the news to be measured and/or the news to be measured is published on each of the news sources based on a webpage link relationship of a plurality of news sources that publish the news to be measured The attribute value of the location.
  29. 一种计算机程序,包括计算机可读代码,当所述计算机可读代码在计算设备上运行时,导致所述计算设备执行根据权利要求1至14任一项所述的方法。A computer program comprising computer readable code that, when executed on a computing device, causes the computing device to perform the method of any one of claims 1-14.
  30. 一种计算机可读介质,其中存储了如权利要求29所述的计算机程序。 A computer readable medium storing the computer program of claim 29.
PCT/CN2015/091870 2014-10-13 2015-10-13 Method and apparatus for judging importance of news release location and news WO2016058521A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201410539702.7 2014-10-13
CN201410539703.1 2014-10-13
CN201410539702.7A CN104331419A (en) 2014-10-13 2014-10-13 Method and device for measuring importance of news
CN201410539703.1A CN104331420A (en) 2014-10-13 2014-10-13 Method and device for judging news releasing position significance

Publications (1)

Publication Number Publication Date
WO2016058521A1 true WO2016058521A1 (en) 2016-04-21

Family

ID=55746135

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/091870 WO2016058521A1 (en) 2014-10-13 2015-10-13 Method and apparatus for judging importance of news release location and news

Country Status (1)

Country Link
WO (1) WO2016058521A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1853183A (en) * 2003-09-16 2006-10-25 Google公司 Systems and methods for improving the ranking of news articles
CN101477556A (en) * 2009-01-22 2009-07-08 苏州智讯科技有限公司 Method for discovering hot sport in internet mass information
CN103399884A (en) * 2013-07-14 2013-11-20 王国栋 Random news system and automatic refresh method thereof
US8667037B1 (en) * 2007-11-12 2014-03-04 Google Inc. Identification and ranking of news stories of interest
CN104331420A (en) * 2014-10-13 2015-02-04 北京奇虎科技有限公司 Method and device for judging news releasing position significance
CN104331419A (en) * 2014-10-13 2015-02-04 北京奇虎科技有限公司 Method and device for measuring importance of news

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1853183A (en) * 2003-09-16 2006-10-25 Google公司 Systems and methods for improving the ranking of news articles
US8667037B1 (en) * 2007-11-12 2014-03-04 Google Inc. Identification and ranking of news stories of interest
CN101477556A (en) * 2009-01-22 2009-07-08 苏州智讯科技有限公司 Method for discovering hot sport in internet mass information
CN103399884A (en) * 2013-07-14 2013-11-20 王国栋 Random news system and automatic refresh method thereof
CN104331420A (en) * 2014-10-13 2015-02-04 北京奇虎科技有限公司 Method and device for judging news releasing position significance
CN104331419A (en) * 2014-10-13 2015-02-04 北京奇虎科技有限公司 Method and device for measuring importance of news

Similar Documents

Publication Publication Date Title
TWI539305B (en) Personalized information push method and device
JP6403787B2 (en) Method, apparatus and system for determining a location corresponding to an IP address
JP6211605B2 (en) Ranking search results based on click-through rate
Thompson et al. Effect of species richness and relative abundance on the shape of the species accumulation curve
CN106302350B (en) URL monitoring method, device and equipment
US20110166926A1 (en) Evaluating Online Marketing Efficiency
WO2015070735A1 (en) Traffic quality analysis method and device
Haustein et al. When is an article actually published? An analysis of online availability, publication, and indexation dates
CN103593415A (en) Method and device for detecting cheating on visitor volumes of web pages
CN107578263A (en) A kind of detection method, device and the electronic equipment of advertisement abnormal access
JP2014006898A5 (en) How to predict call topics
US9245035B2 (en) Information processing system, information processing method, program, and non-transitory information storage medium
CN106874335B (en) Behavior data processing method and device and server
CN106650433A (en) Detecting method and system for abnormal behavior
CN106327230B (en) Abnormal user detection method and equipment
CN104331419A (en) Method and device for measuring importance of news
CN109472017B (en) Method and device for obtaining relevant information of text court deeds of referee to be generated
CN104090908A (en) Method and device for counting mean detention time in page group and generalizing content in website
KR101212457B1 (en) Web page searching system and method using access time and frequency
JP2010020745A (en) Method of outputting reputation index and reputation index output device
CN105809379A (en) Logistics branch evaluation method, device and electronic device
JP2013050905A5 (en)
WO2015067179A1 (en) Method and apparatus for detecting invalid commodity templates
US20160307223A1 (en) Method for determining a user profile in relation to certain web content
WO2016058521A1 (en) Method and apparatus for judging importance of news release location and news

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15851028

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15851028

Country of ref document: EP

Kind code of ref document: A1