CN101751437A - Web active retrieval system based on reinforcement learning - Google Patents
Web active retrieval system based on reinforcement learning Download PDFInfo
- Publication number
- CN101751437A CN101751437A CN200810240358A CN200810240358A CN101751437A CN 101751437 A CN101751437 A CN 101751437A CN 200810240358 A CN200810240358 A CN 200810240358A CN 200810240358 A CN200810240358 A CN 200810240358A CN 101751437 A CN101751437 A CN 101751437A
- Authority
- CN
- China
- Prior art keywords
- web
- user
- agent module
- page
- learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000002787 reinforcement Effects 0.000 title abstract description 9
- 230000006870 function Effects 0.000 claims abstract description 26
- 238000004458 analytical method Methods 0.000 claims abstract description 15
- 230000006399 behavior Effects 0.000 claims abstract description 11
- 239000003795 chemical substances by application Substances 0.000 claims description 46
- 238000000034 method Methods 0.000 claims description 16
- 238000004422 calculation algorithm Methods 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 2
- 230000008676 import Effects 0.000 claims description 2
- 238000001914 filtration Methods 0.000 abstract description 6
- 238000007619 statistical method Methods 0.000 abstract description 3
- 230000009471 action Effects 0.000 description 8
- 238000013459 approach Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000007704 transition Effects 0.000 description 5
- 238000000342 Monte Carlo simulation Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000006855 networking Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000003542 behavioural effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000013549 information retrieval technique Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Abstract
The invention discloses a Web active retrieval system based on reinforcement learning; the system comprises a Web search Agent module, a Web filter Agent module, a Web interface Agent module and a user information learning Agent module; wherein, the Web search Agent module is used for searching subjects based on user interests, analyzing Web content and realizing Web download function; the Web filter Agent module is used for finishing web content analysis, page filtering and classified index; the Web interface Agent module is used for recommending webs on behalf of user interests after learning, receiving the user feedbacks and recording user browsing behaviors and having statistical analysis function; and the user information learning Agent module is used for realizing the interest updates based on reinforced learning, updating the user information continuously and finishing the optimum model on behalf of user interest. The Web active retrieval system based on reinforcement learning has strong self-adaptability, high accuracy and convenient use.
Description
Technical field
The present invention relates to Web user's active retrieval technique field, relate in particular to a kind of Web active retrieval system, be used to realize Web user is best embodied the Web page recommendation of user interest pattern based on intensified learning.
Background technology
The Markovian decision process comprises an ambient condition collection S, method behavior set A, award function R and state transition function P.Award function R (s, a, s ') is to adopt action a under the situation of state s, and ambient condition is transferred to the instantaneous reward value of s ' acquisition; Note P (s, a, s ') adopts action a to make ambient condition transfer to the probability of s ' under the situation of state s.The essence of Markovian decision process is: current state is only relevant with the action of selecting under current state and the current state with reward value to the probability of next state transitions, and has nothing to do with former historic state and historical action.Therefore under the knowledge frame of the environmental model that state transition probability function P and award function R have all been determined, the technology of dynamic programming can be used for finding the solution optimal strategy.Yet under the most of situation in real world, the environmental model of state transition probability function P and award function R is difficult to but determine that intensified learning mainly is that research is awarded under function and the state transition function condition of unknown emphatically, how to learn the optimum behavior strategy.
Intensified learning (reinforcement learning claims to encourage study again again, estimates study) is an important branch of machine learning method, in fields such as Based Intelligent Control machine people and analyses and prediction many application is arranged.Intensified learning is to the study of mapping from the environment to the behavior in the intelligence system, so that award (enhanced signal) the functional value maximum of accumulation, the supervised learning that intensified learning is different from the conventional machines study mainly shows on the teacher signal, the enhanced signal that is provided by environment in the intensified learning is that the action of making is made a kind of evaluation as reward value, rather than tells how direct reinforcement learning system (reinforcementlearning system) goes to produce correct action.Because the information that external environment condition provides is less, the experience that reinforcement learning system must obtain is on one's own account learnt.In this way, reinforcement learning system in action-obtain to calculate the basis in the environment estimated, improved action scheme is proposed to conform.The learning art of present intensified learning roughly can be divided into two classes: the one, and the action space of search intelligence system, thereby the behavior of the optimum that discovery can be made.Search techniques such as typical technology such as genetic algorithm; The 2nd, adopt based on the thought of the technology of statistics and dynamic programming and estimate and predict cost function value under a certain definite ambient condition, thereby determine optimum behavior by the cost function that obtains.
In the problem that intensified learning need solve, because environment is uncertain, the resulting R of study each time under tactful π instructs
tMight be inequality.Therefore to consider the mathematical expectation of all possible return function in difference study at the value function under the s state.Often adopt approach method to carry out the estimation of value function in the reality, a kind of topmost method is exactly the Monte Carlo method of sampling.The Monte Carlo method of sampling and dynamic programming technology are combined, pass through test of many times, remove to approach real state value function with the rewards and punishments rreturn value that reality obtains, the value function that the Monte Carlo method of sampling normally adopts once study circulation to be obtained goes to approach actual value function, and reinforcement Learning Method uses the value function (being the Bootstrapping method) and the instantaneous award of current acquisition of NextState to approach the current state value function.Reinforcement Learning Method needs repeatedly, and the study circulation could finally approach actual value function.
Information retrieval (Information Retrieval) is commonly referred to as the text based information retrieval, comprises the various aspects such as storage, tissue, performance, inquiry, access of information, and its core is the index and the retrieval of text message.From the historical development progress, information retrieval has experienced human manual information retrieval, computer automation retrieval, a plurality of developing stage such as intelligent network retrieval.At present, information retrieval has developed into networking and intelligentized stage.Retrieval object of information also from sealing, more stable unanimity, by independent database integrated management information expansion of content to opening, dynamic, quick, widely distributed, the loose complex Web content of pages of management; The user who uses information retrieval originally is the Intelligence Specialist, present information retrieval comprises business people, managerial personnel, teacher, student, each professional person's etc. ordinary populace, and they have proposed higher, more diversified requirement to information retrieval from result to the mode.The needs that adapt to networking, intellectuality and Web personalization are new trends of present information retrieval technique development.In the real world, exist at present more based on statistical method for network personalized method.But this method adaptive ability is relatively poor, and does not possess learning ability.Yet the characteristics of intensified learning can be improved present this network personalized analytical approach based on statistical method.
Summary of the invention
(1) technical matters that will solve
In view of this, fundamental purpose of the present invention is to provide a kind of Web active retrieval system based on intensified learning, browses Web more easily with assisting users, and finds the needed target pages of user more accurately.
(2) technical scheme
For achieving the above object, the invention provides a kind of Web active retrieval system based on intensified learning, this system comprises:
1) Web search Agent module is downloaded these several functional blocks by information search, Web page analysis and the Web page and is formed, and mainly finishes the download function of utilizing search, web page contents analysis and the page relevant with the user interest theme.At first, import original request earlier by the user, carry out obtaining of initial page, and the peer link on the page is extracted and be stored in a buffer zone according to search engine, page download module will be visited corresponding webpage according to link (URL address), preserve according to the subject key words classification simultaneously.
2) Web filters the Agent module, filtering two functional blocks by the page analysis and the page forms, mainly finish the page that search is obtained to information search Agent and carry out content analysis, utilize the Q learning system in the intensified learning that each Web page is carried out the calculating of Q study median, at this moment, reward value immediately during the TFIDF value for keyword of the Web page is learnt as Q, can calculate the pairing Q value of each Web page, to the ordering of Q value and the filtration of the Web page, take out the more forward Web page of Q value rank; The effect that Web filters the Agent module comprises that mainly Web page analysis, the Web page filter and Web page classifications index, and provides Web interface Agent module with the result.
3) Web interface Agent module, by page recommending module, page display module with browse the behavioral statistics module and form, finish the maximally related webpage of recommending to finish of user model and browsed and receive value of feedback, to functions such as the analysis of browsing behavior and statistics with study module.
4) user profile study Agent module, by initialization, major function pieces such as interest-degree corrected Calculation and interest-degree renewal are formed, at first need user's registration, automatically generate this user's initial interest file (Profiles) according to the information of knowledge base, in search browsing page process, utilize the TD learning algorithm in the intensified learning user's interest model to be upgraded and improve, the record of being finished according to Web interface Agent module to the behavior of user's browsing pages generates the user's who gets access to value of feedback, constantly user's interest is upgraded by interest-degree calculating and update module, utilize the TD learning algorithm that user's interest mode is calculated, the optimal weight that finally reaches the user profile model distributes.
(3) beneficial effect
From technique scheme as can be seen, the present invention has following beneficial effect:
1, this Web active retrieval system provided by the invention has the accuracy height, and characteristics easy to use can realize that Web user is carried out the personalized Web page to be recommended.
2, this Web active retrieval system provided by the invention, utilize the system of the Q study of intensified learning, the page by Web search Agent is filtered, can better consider the page of not just current recommendation, and consider that Web search Agent recommends the info web of the hyperlink of the page, has made full use of the Web page structure and has carried out the optimization of filtering system.
3, the present invention is in carrying out user profile study, having utilized the TD in the intensified learning to learn diversified model learns and upgrades, make user model can be more and more near user's best representative model, model is applied to Web filters Agent and just can extract the Web page that more can meet user interest model and recommended.
Description of drawings
Fig. 1 is the overall logic block diagram of Web active retrieval system provided by the invention;
Fig. 2 is the structural representation that in the Web active retrieval system provided by the invention the Web page is filtered;
Fig. 3 is the middle user profile study of Web active retrieval system provided by the invention and the structural representation that upgrades.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, below in conjunction with specific embodiment, and with reference to accompanying drawing, the present invention is described in more detail.
As shown in Figure 1, Fig. 1 is the overall logic block diagram of Web active retrieval system provided by the invention, comprises an original input end and a page recommendation output terminal.Wherein, original input end is used for sending original user's request to Webpage (Web) search Agent module, after receiving user's request, Web search Agent module utilizes search engine search, after search finishes, Web filtration Agent module is learnt the page wherein and is sorted, and extracts the page and recommends the user by Web interface Agent module; The user utilizes Web interface Agent module to carry out record after browsing, and obtains user's browsing information, analyzes and adds up, and value of feedback is submitted to user profile study Agent module, and user profile study Agent module will be upgraded and optimizes it.
As shown in Figure 2, Fig. 2 is the structural representation of the Web filtration Agent module in the Web active retrieval system provided by the invention.This Web filters the Agent module and comprises correlation estimation module, Q study module and Q value order module composition.After Web search Agent block search finishes, the page of storage is taken out, carry out the analysis of content of pages, according to the TFIDF technology, the Web page can be represented with vector pattern, utilize the method calculating user of cosine computed range and the similarity of the Web page, we need learn each keyword wherein for a given i (i pagefile), the reward value immediately that the TFIDF value of each corresponding page is obtained during just as this page of arrival, the pairing reward value that can be used as next state corresponding point of the award of one deck page down of the hyperlink of current page, when asking the reward value sum total, need be multiplied by a discount value, here, we taked based on the depth-first search algorithm of figure realize calculating maximum path reward value discount and, and give Q[i] with its assignment, Q[i] just be recorded as current i page Q value, calculate and the Q value of all webpage correspondences is carried out after the Q value of all pages.Preceding K the page that rank is forward recommended the user.
As shown in Figure 3, Fig. 3 is the structural representation of user profile study Agent module in the Web active retrieval system provided by the invention.This user profile study Agent module comprises that mainly calculating user model upgrades weight module and TD study update module, after Web interface Agent has obtained user's the Web information of browsing, the value of feedback that Web interface Agent can provide a user is learnt for the user profile study module to user profile study Agent module and is upgraded, value of feedback mainly is made up of with value of feedback implicit expression explicit, wherein explicit feedback mainly is to be obtained by user's scoring, and the value of feedback of implicit expression depends mainly on elemental: bookmarking (bm), reading time (rt), scrolling (sc), following up the hyperlinks in the filtered documents (fl).Utilize value of feedback to upgrade to user's weight:
W
p,k←W
p,k+βr
i,k
β is the speed of user learning.W
P, kThe value of the k dimension of expression user vector.
The reward value in measurement future of utilizing that the variation of the item weights (after the normalized) of vector is similar to.When the user model weight changes less than the threshold value that obtains through test in advance, represent that the interest of the selection representative of consumer of keyword has been tending towards optimum, when this variations be timing, illustrate that active user's model vector will obtain better value of feedback.
W
pk,t←W
pk,t-1+[R
t+γΔv
p,t]
If W
PkOccur in the page of recommending, then increase the keyword weight of these appearance, if changing value is a negative value, then the variation of the weights of these keywords is less.Handle after the content of this K page, the Web active retrieval system can proceed to next retrieval point, and study will be proceeded, until can reach the approximate optimal user model of representing.
Web active retrieval system provided by the invention has that learning ability is strong, easy to use, high reliability features, can be convenient to use this system's assisting users and carry out active retrieval and recommend the Web page.
Above-described specific embodiment; purpose of the present invention, technical scheme and beneficial effect are further described; institute is understood that; the above only is specific embodiments of the invention; be not limited to the present invention; within the spirit and principles in the present invention all, any modification of being made, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.
Claims (8)
1. the Web active retrieval system based on intensified learning is characterized in that, this system comprises:
Web searches for the Agent module, is used to receive user's initial request, and user's request is analyzed, and utilizes the related subject analysis to download the Web page, the result is committed to Web filters the Agent module;
Web filters the Agent module, is used to realize that the page that search is obtained to information search Agent carries out content analysis, utilizes the Q learning system in the intensified learning that each Web page is carried out the calculating of Q study median, and provides Web interface Agent module with the result;
Web interface Agent module is used to provide the user to recommend the Web page, and recording user is browsed behavior, and the result is submitted to user profile study Agent module;
User profile study Agent module, the TD learning algorithm that is used for utilizing intensified learning upgrades user's interest model and improves, the record of being finished according to Web interface Agent module to the behavior of user's browsing pages, the user's that generation gets access to value of feedback, constantly user's interest is upgraded by interest-degree calculating and update module, utilize the TD learning algorithm that user's interest mode is calculated, the optimal weight that finally reaches the user profile model distributes.
2. the Web active retrieval system based on intensified learning according to claim 1, it is characterized in that, this Web search Agent module is downloaded these several functional modules by information search, Web page analysis and the Web page and is formed, and is used to realize the download function of the search relevant with the user interest theme, web page contents analysis and the page; Import original request earlier by the user, carry out obtaining of initial page according to search engine, and the peer link on the page is extracted and be stored in a buffer zone, page download module will be visited corresponding webpage according to the link URL address, preserve according to the subject key words classification simultaneously.
3. the Web active retrieval system based on intensified learning according to claim 1, it is characterized in that, this Web filters the Agent module and receives after the Web search Agent module result of implementation, utilize the Q learning algorithm in conjunction with the user characteristics model Web content of pages to be analyzed and sorted, take out the forward Web page of Q value rank and filter; The effect that Web filters the Agent module comprises that mainly Web page analysis, the Web page filter and Web page classifications index, and provides Web interface Agent module with the result.
4. the Web active retrieval system based on intensified learning according to claim 1, it is characterized in that, this Web interface Agent module is used to realize that the maximally related webpage of recommending to finish with study module of user model browsed and receive value of feedback, to analysis and the statistics of browsing behavior.
5. the Web active retrieval system based on intensified learning according to claim 1, it is characterized in that, this user profile study Agent module is used for initially generating according to knowledge base this user's essential information file, in the process of search in the future and browsing page, constantly user's interest is upgraded, to determine the user profile model.
6. the Web active retrieval system based on intensified learning according to claim 5, it is characterized in that, this user profile study Agent module is according to the user feedback value, further the user profile model is upgraded, user model is carried out iterative computation and renewal and judge whether user model reaches convergence based on the TD algorithm.
7. according to claim 5 or 6 described Web active retrieval systems, it is characterized in that this user profile study Agent module is submitted to Web with operation result and filtered Agent, enters next step study based on intensified learning.
8. according to claim 5 or 6 described Web active retrieval systems based on intensified learning, it is characterized in that this user profile study Agent module is when the iteration result reaches convergence, it is optimum that the user profile model reaches, and, obtain and recommend the Web page according to the user profile model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810240358A CN101751437A (en) | 2008-12-17 | 2008-12-17 | Web active retrieval system based on reinforcement learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810240358A CN101751437A (en) | 2008-12-17 | 2008-12-17 | Web active retrieval system based on reinforcement learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101751437A true CN101751437A (en) | 2010-06-23 |
Family
ID=42478428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200810240358A Pending CN101751437A (en) | 2008-12-17 | 2008-12-17 | Web active retrieval system based on reinforcement learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101751437A (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102567408A (en) * | 2010-12-31 | 2012-07-11 | 阿里巴巴集团控股有限公司 | Method and device for recommending search keyword |
CN102722543A (en) * | 2012-05-24 | 2012-10-10 | 潘勇刚 | Method for storing files |
US8898180B2 (en) | 2009-01-12 | 2014-11-25 | Alibaba Group Holding Limited | Method and system for querying information |
CN105095279A (en) * | 2014-05-13 | 2015-11-25 | 深圳市腾讯计算机系统有限公司 | File recommendation method and apparatus |
CN105447376A (en) * | 2015-10-30 | 2016-03-30 | 广州市汇助惠电子商务有限公司 | User management system |
CN105631052A (en) * | 2016-03-01 | 2016-06-01 | 北京百度网讯科技有限公司 | Artificial intelligence based retrieval method and artificial intelligence based retrieval device |
CN106933959A (en) * | 2013-11-15 | 2017-07-07 | 北京奇虎科技有限公司 | Browser side carries out the method and browser of web search |
CN108596360A (en) * | 2018-03-16 | 2018-09-28 | 北京中科闻歌科技股份有限公司 | A kind of judgement prediction technique and system based on machine learning |
CN108604314A (en) * | 2015-12-01 | 2018-09-28 | 渊慧科技有限公司 | List is acted using intensified learning selection |
CN108701130A (en) * | 2015-10-20 | 2018-10-23 | 维尔塞特公司 | Hints model is updated using auto-browsing cluster |
CN109451038A (en) * | 2018-12-06 | 2019-03-08 | 北京达佳互联信息技术有限公司 | A kind of information-pushing method, device, server and computer readable storage medium |
CN110008332A (en) * | 2019-02-13 | 2019-07-12 | 阿里巴巴集团控股有限公司 | The method and device of trunk word is extracted by intensified learning |
CN110119411A (en) * | 2019-05-14 | 2019-08-13 | 张良 | A kind of technique transfers platform intelligent search method and system |
CN110221959A (en) * | 2019-04-16 | 2019-09-10 | 阿里巴巴集团控股有限公司 | Test method, equipment and the computer-readable medium of application program |
CN110226188A (en) * | 2017-01-26 | 2019-09-10 | 株式会社诺瑞韩国 | Support method, system and the non-transitorycomputer readable storage medium of arithmetic study |
CN110765345A (en) * | 2018-07-10 | 2020-02-07 | 阿里巴巴集团控股有限公司 | Searching method, device and equipment |
WO2020171944A1 (en) * | 2019-02-20 | 2020-08-27 | Alibaba Group Holding Limited | Method and apparatus for displaying page to user through reinforcement learning |
-
2008
- 2008-12-17 CN CN200810240358A patent/CN101751437A/en active Pending
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8898180B2 (en) | 2009-01-12 | 2014-11-25 | Alibaba Group Holding Limited | Method and system for querying information |
US9430568B2 (en) | 2009-01-12 | 2016-08-30 | Alibaba Group Holding Limited | Method and system for querying information |
CN102567408B (en) * | 2010-12-31 | 2014-06-04 | 阿里巴巴集团控股有限公司 | Method and device for recommending search keyword |
US8799306B2 (en) | 2010-12-31 | 2014-08-05 | Alibaba Group Holding Limited | Recommendation of search keywords based on indication of user intention |
CN102567408A (en) * | 2010-12-31 | 2012-07-11 | 阿里巴巴集团控股有限公司 | Method and device for recommending search keyword |
CN102722543A (en) * | 2012-05-24 | 2012-10-10 | 潘勇刚 | Method for storing files |
CN102722543B (en) * | 2012-05-24 | 2014-12-24 | 潘勇刚 | Method for storing files |
CN106933959A (en) * | 2013-11-15 | 2017-07-07 | 北京奇虎科技有限公司 | Browser side carries out the method and browser of web search |
CN105095279A (en) * | 2014-05-13 | 2015-11-25 | 深圳市腾讯计算机系统有限公司 | File recommendation method and apparatus |
CN108701130A (en) * | 2015-10-20 | 2018-10-23 | 维尔塞特公司 | Hints model is updated using auto-browsing cluster |
CN108701130B (en) * | 2015-10-20 | 2023-06-20 | 维尔塞特公司 | Updating hint models using auto-browse clusters |
CN105447376A (en) * | 2015-10-30 | 2016-03-30 | 广州市汇助惠电子商务有限公司 | User management system |
CN105447376B (en) * | 2015-10-30 | 2018-08-03 | 广州市汇助惠电子商务有限公司 | A kind of Subscriber Management System |
CN108604314A (en) * | 2015-12-01 | 2018-09-28 | 渊慧科技有限公司 | List is acted using intensified learning selection |
CN108604314B (en) * | 2015-12-01 | 2022-08-19 | 渊慧科技有限公司 | Selecting action lists using reinforcement learning |
CN105631052A (en) * | 2016-03-01 | 2016-06-01 | 北京百度网讯科技有限公司 | Artificial intelligence based retrieval method and artificial intelligence based retrieval device |
CN110226188A (en) * | 2017-01-26 | 2019-09-10 | 株式会社诺瑞韩国 | Support method, system and the non-transitorycomputer readable storage medium of arithmetic study |
CN108596360A (en) * | 2018-03-16 | 2018-09-28 | 北京中科闻歌科技股份有限公司 | A kind of judgement prediction technique and system based on machine learning |
CN110765345A (en) * | 2018-07-10 | 2020-02-07 | 阿里巴巴集团控股有限公司 | Searching method, device and equipment |
CN110765345B (en) * | 2018-07-10 | 2023-04-25 | 阿里巴巴集团控股有限公司 | Searching method, device and equipment |
CN109451038A (en) * | 2018-12-06 | 2019-03-08 | 北京达佳互联信息技术有限公司 | A kind of information-pushing method, device, server and computer readable storage medium |
CN110008332B (en) * | 2019-02-13 | 2020-11-10 | 创新先进技术有限公司 | Method and device for extracting main words through reinforcement learning |
CN110008332A (en) * | 2019-02-13 | 2019-07-12 | 阿里巴巴集团控股有限公司 | The method and device of trunk word is extracted by intensified learning |
WO2020171944A1 (en) * | 2019-02-20 | 2020-08-27 | Alibaba Group Holding Limited | Method and apparatus for displaying page to user through reinforcement learning |
US10956532B2 (en) | 2019-02-20 | 2021-03-23 | Advanced New Technologies Co., Ltd. | Method and apparatus for displaying page to user through reinforcement learning |
CN110221959A (en) * | 2019-04-16 | 2019-09-10 | 阿里巴巴集团控股有限公司 | Test method, equipment and the computer-readable medium of application program |
CN110221959B (en) * | 2019-04-16 | 2022-12-27 | 创新先进技术有限公司 | Application program testing method, device and computer readable medium |
CN110119411A (en) * | 2019-05-14 | 2019-08-13 | 张良 | A kind of technique transfers platform intelligent search method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101751437A (en) | Web active retrieval system based on reinforcement learning | |
Salehi | Application of implicit and explicit attribute based collaborative filtering and BIDE for learning resource recommendation | |
CN103064945B (en) | Based on the Situational searching method of body | |
KR101712988B1 (en) | Method and apparatus providing for internet service in mobile communication terminal | |
CN111241407B (en) | Personalized search method based on reinforcement learning | |
AU2020101885A4 (en) | A Novel Tensor Factorization Using Trust and Rating for Recommendation, system and method thereof | |
CN103514239A (en) | Recommendation method and system integrating user behaviors and object content | |
CN104268292A (en) | Label word library update method of portrait system | |
Casillo et al. | Context-aware recommender systems and cultural heritage: a survey | |
CN115048586B (en) | Multi-feature-fused news recommendation method and system | |
CN112288554B (en) | Commodity recommendation method and device, storage medium and electronic device | |
CN105677828A (en) | User information processing method based on big data | |
CN110069713A (en) | A kind of personalized recommendation method based on user's context perception | |
Zhong et al. | Design of a personalized recommendation system for learning resources based on collaborative filtering | |
Manrique et al. | Exploring knowledge graphs for the identification of concept prerequisites | |
CN105677825A (en) | Analysis method for client browsing operation | |
Luo et al. | Adaptive attention-aware gated recurrent unit for sequential recommendation | |
CN116956183A (en) | Multimedia resource recommendation method, model training method, device and storage medium | |
CN110189197A (en) | Electric business personalized recommendation method based on context multi-arm fruit machine | |
KR101827338B1 (en) | Method and apparatus providing for internet service in mobile communication terminal | |
Braunhofer et al. | Parsimonious and Adaptive Contextual Information Acquisition in Recommender Systems. | |
CN115481325A (en) | Personalized news recommendation method and system based on user global interest migration perception | |
Chen et al. | A recommendation system of personalized resource reliability for online teaching system under large-scale user access | |
An et al. | Research on an Agent-Based Intelligent Social Tagging Recommendation System | |
Wu et al. | Prediction of the intention of purchase of the user surfing on the Web using hidden Markov model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20100623 |