CN103577566A - Web reading content loading method and device - Google Patents

Web reading content loading method and device Download PDF

Info

Publication number
CN103577566A
CN103577566A CN201310513334.4A CN201310513334A CN103577566A CN 103577566 A CN103577566 A CN 103577566A CN 201310513334 A CN201310513334 A CN 201310513334A CN 103577566 A CN103577566 A CN 103577566A
Authority
CN
China
Prior art keywords
web data
sections
chapters
reading
web
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310513334.4A
Other languages
Chinese (zh)
Other versions
CN103577566B (en
Inventor
吴华铠
陈虞付
任寰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201310513334.4A priority Critical patent/CN103577566B/en
Publication of CN103577566A publication Critical patent/CN103577566A/en
Application granted granted Critical
Publication of CN103577566B publication Critical patent/CN103577566B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention discloses a web reading content loading method and device, and relates to the technical field of browsers. The method comprises the steps of extracting characteristic data corresponding to reading content in web data at the aim of web data in a browser, sending the characteristic data to a target server, ensuring that the target server obtains and integrates web data of characteristic data relevant chapters, without reading conditionality, from all webs, receiving the web data of integrated characteristic data relevant chapters fed back by the target server, and obtaining and loading the reading content of corresponding web data according to commands. The method and the device solve the problem that a user needs to search web data of different chapters of an identical theme, so that multi-step operation is caused, and the method and the device obtain the beneficial effects of reducing user operation and improving the efficiency.

Description

A kind of web page browing content loading method and device
Technical field
The present invention relates to calculate browser technology field, be specifically related to a kind of web page browing content loading method and device.
Background technology
Along with the development of network, increasing literary works are delivered on the net, and read in order to help reader, and the literary works on network, such as the network novel, have produced unique content and supplied a pattern.The novel website of each large main flow, it is all to use the form of segmentation merogenesis that network novel content is provided, popular understanding is by chapters and sections and issues.But may in different websites, issue for the same piece of writing network novel, and the speed of issue may be also different, because all can there be a collection of identical novel resource each novel website, but due to update time and opening strategy different, when user reads certain this novel, current site may only provide the part chapters and sections of this this novel, may exist other novel websites that other chapters and sections of this novel are provided.When but user searches in search engine, need to carry out the input of novel keyword, obtain after Search Results, then click one by one Search Results and investigate, until find all chapters and sections, in this process, for user, frequent operation, inefficiency.
Summary of the invention
In view of the above problems, the present invention has been proposed to provide a kind of web page browing content charger that overcomes the problems referred to above or address the above problem at least in part and corresponding web page browing content loading method.
According to one aspect of the present invention, a kind of web page browing content loading method is provided, comprising:
The web data of opening for browser, extracts the characteristic of reading content in corresponding web data;
Described characteristic is sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
The web data of the described characteristic related Sections after the described integration that receiving target server returns, and obtain the reading content of corresponding web data and load according to instruction.
Alternatively, described destination server from each website, obtain do not have the restriction of encrypting and with the web data of described characteristic related Sections, and integrate, comprising:
Obtain web data relevant to described characteristic in each website;
Whether the reading content that judges described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
A plurality of web datas for having identical reading content, retain one of them.
Alternatively, at the reading content that judges described web data, whether have when reading condition limits and also comprise:
Judge whether safety of each web data, and the web data of safety is retained.
Alternatively, also comprise:
According to the order of chapters and sections, by each related web page Data Integration, be a catalogue webpage.
Alternatively, also comprise:
Recording a state corresponding to catalogue webpage publishes in instalments or finishes;
Further, after the web data of the described characteristic related Sections after the described integration of returning at browser receiving target server, also comprise:
Whether browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state; Described server judgement has renewal chapters and sections, and catalogue and state after correspondence being upgraded return to client.
Alternatively, described in obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate and comprise:
According to browser when the current chapters and sections of front opening webpage, return current chapters and sections at least before N chapters and sections web data reading content and/or at least after the reading content of M chapters and sections web data.
Alternatively, also comprise:
The direction of operating that webpage is operated is sent to server; Described server is M chapter web data based on the corresponding at least front N chapter of described direction of operating or at least.
Alternatively, the described web data of opening for browser, the characteristic of extracting reading content in corresponding web data comprises:
Extract the URL of current web page, described URL is mated with reading domain name matched rule, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
Alternatively, the URL of described extraction current web page, mates described URL with reading domain name matched rule, and when meeting matching condition, in corresponding web data, the characteristic of reading content comprises:
Whether the URL that front opening is worked as in judgement is novel website;
If novel website judges whether described URL points to novel brief introduction webpage, and/or judge whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
Alternatively, the described web data of opening for browser, the characteristic of extracting reading content in corresponding web data comprises:
Described in browser resolves during web data, judge whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.
The invention also discloses a kind of web page browing content charger, comprising:
Extraction module, the web data that is suitable for opening for browser, extracts the characteristic of reading content in corresponding web data;
Sending module, is suitable for described characteristic to be sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
Receiver module, is suitable for the web data of the described characteristic related Sections after described integration that receiving target server returns, and obtains the reading content of corresponding web data and load according to instruction.
Alternatively, in described destination server, comprising:
Acquisition module, is suitable for obtaining web data relevant to described characteristic in each website;
Analysis module, is suitable for judging whether the reading content of described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
The first integrate module, is suitable for, for a plurality of web datas with identical reading content, retaining one of them.
Alternatively, also comprise:
Analysis module, is suitable for judging whether safety of each web data, and the web data of safety is retained.
Alternatively, in described destination server, also comprise:
The second integrate module, is suitable for the order according to chapters and sections, by each related web page Data Integration, is a catalogue webpage.
Alternatively, in described server, also comprise:
State recording module, is suitable for recording a state corresponding to catalogue webpage and publishes in instalments or finish;
Further, in browser, also comprise:
The first update module, whether be suitable for browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state;
In described server, also comprise:
The second update module, being suitable for server judgement has renewal chapters and sections, and the catalogue after correspondence being upgraded returns to client.
Alternatively, described the first integrate module comprises:
The 3rd integrate module, is suitable for the current chapters and sections when front opening webpage according to browser, returns to the reading content of at least front N chapters and sections web data and/or the reading content of at least rear M chapters and sections web data of current chapters and sections.
Alternatively, described web page browing content charger also comprises: direction of operating sending module, is suitable for the direction of operating that webpage is operated to send to server;
In described destination server, also comprise: the 4th integrate module, is suitable for based on the corresponding at least front N chapter of described direction of operating or at least M chapter web data.
Alternatively, described extraction module comprises:
The first extraction module, is suitable for extracting the URL of current web page, by described URL with read domain name matched rule and mate, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
Alternatively, described the first extraction module comprises:
Website judge module, is suitable for judgement and works as whether the URL of front opening is novel website; When judgement is novel website, enter brief introduction webpage judge module;
Type of webpage judge module, is suitable for judging whether described URL points to novel brief introduction webpage, and/or judges whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
Alternatively, described extraction module comprises:
Restrictive condition judge module, is suitable for, described in browser resolves during web data, judging whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.
According to web page browing content loading method of the present invention, each related Sections web data without the same Web page subject of reading condition restriction can be integrated, offering browser directly loads, solved thus the multistep operation that user need to search for the different chapters and sections web datas of same subject, obtained minimizing user operation, the beneficial effect of raising the efficiency.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Accompanying drawing explanation
By reading below detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing is only for the object of preferred implementation is shown, and do not think limitation of the present invention.And in whole accompanying drawing, by identical reference symbol, represent identical parts.In the accompanying drawings:
Fig. 1 shows a kind of according to an embodiment of the invention web page browing content loading method schematic flow sheet;
Fig. 2 shows a kind of according to an embodiment of the invention integration example of web page browing data;
Fig. 3 shows a kind of according to an embodiment of the invention web page browing content and loads concrete example; And
Fig. 4 shows a kind of according to an embodiment of the invention structural representation of web page browing content charger.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in accompanying drawing, yet should be appreciated that and can realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order more thoroughly to understand the disclosure that these embodiment are provided, and can by the scope of the present disclosure complete convey to those skilled in the art.
Embodiment mono-
With reference to Fig. 1, the schematic flow sheet that it shows a kind of web page browing content loading method of the embodiment of the present invention one, specifically can comprise:
Step 102, the web data of opening for browser, extracts the characteristic of reading content in corresponding web data;
In embodiments of the present invention, when user reads by browser the network works that certain this online novel or other provide with chapters and sections form, the novel that active user is read or the characteristic of network works extract.
Preferably, the described web data of opening for browser, the characteristic of extracting reading content in corresponding web data comprises:
Steps A 01, described in browser resolves during web data, judges whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.
The middle web data of reading content such as to(for) described web data, such as novel content concrete in novel chapters and sections webpage is defined as and needs a certain membership, could show while having certain special string in other words time, can extract the characteristics such as novel title information and/or author information and/or chapters and sections information by affiliated web site.
The judgement that the decision operation of this step can be carried out according to the web data returning or bullet window data in browser side.
The present invention is preferred, the described web data of opening for browser, and the characteristic of extracting reading content in corresponding web data comprises:
Steps A 10, the URL of extraction current web page, mates described URL with reading domain name matched rule, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
In embodiments of the present invention, its characteristic comprises: URL(Uniform Resource Locator, URL(uniform resource locator)), or comprise URL, and chapters and sections information; Or comprise URL, chapters and sections information, and title information; Or comprise URL, author information and title information; Comprise URL,, the multiple situations such as chapters and sections information author information and title information.
Preferably, the URL of described extraction current web page, mates described URL with reading domain name matched rule, and when meeting matching condition, the characteristic of extracting reading content in corresponding web data comprises:
Step B10, whether the URL that front opening is worked as in judgement is novel website; If novel website enters step B12, if not being novel website, stop subsequent operation;
Step B12, judges whether described URL points to novel brief introduction webpage, and/or judges whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
If neither point to novel brief introduction webpage, do not point to novel chapters and sections webpage yet, stop subsequent operation.
The present invention can set up in advance and read domain name matched rule:
1, judge whether current web page data are the matched rule of novel website:
Such as setting up matched rule for the domain name of each novel website, (as starting point Chinese network http://www.qidian.com, as long as domain name thinks that for " qidian.com " user is in access novel website; ).So first, can shift to an earlier date the URL of the webpage of user's current accessed, judge whether current URL is novel website: such as URL is http://read.qidian.com/BookReader/2884952,46619558.aspx, so with starting point.Com mates, and can judge that this URL is novel website.
After adopting this judgment rule to judge whether current web page data are the website of novel website, and then carry out follow-up judgement.Also after being judged as novel website, just can carry out 2 and/or 3 judgement, to reduce system consumption, save system resource.
2, judge whether current URL points to the matched rule of concrete certain this novel:
Such as can build " http://www.qidian.com/Book/*.aspx " matched rule for starting point website, so can judge that this URL points to concrete novel, such as for URL:http: //www.qidian.com/Book/2884952.aspx, after being judged as starting point website and webpage data, then judge that its sensing is encoded to 2884952 concrete novel.
After it is the web data of novel website in judgement, then judge whether it points to concrete novel, also can be understood as the brief introduction webpage that whether points to novel, by the web data in this brief introduction webpage, can enter the concrete chapters and sections webpage of novel.
3, judge whether current URL points to the matched rule of concrete certain this novel chapters and sections webpage:
Such as building http://read.qidian.com/BookReader/* for starting point website, * .aspx rule, * wherein, * is variable code.The URL:http of starting point novel website so: //read.qidian.com/BookReader/2884952,46619558.aspx, is judged as the web data of chapters and sections webpage of sensing one novel of starting point;
Can from the html document of this URL, obtain user's current read chapters and sections information so.Obtain the html document of described URL, analyze described html document, obtain the chapters and sections information of current web page.Such as the chapters and sections information getting is chapter 10.
In abovementioned steps B12, the embodiment of the present invention can be preferably following order:
Step C12, judges whether described URL points to novel brief introduction webpage; If point to novel brief introduction webpage, the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information; If not pointing to novel brief introduction webpage, enter step C14;
Step C14, judges whether described URL points to novel chapters and sections webpage; If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information; If not, stop subsequent operation.
Or step C14, judges whether described URL points to novel chapters and sections webpage; If point to novel chapters and sections webpage, the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information; If not pointing to novel chapters and sections webpage, enter step C14;
Step C12, judges whether described URL points to novel brief introduction webpage; If point to novel brief introduction webpage, the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information; If not pointing to novel brief introduction webpage, stop subsequent operation.
In above-mentioned characteristic, also can comprise URL itself.
Step 104, is sent to destination server by described characteristic; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
At browser, get after characteristic, characteristic can be sent to destination server, by server obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate.
Preferably, described destination server from each website, obtain do not have the restriction of encrypting and with the web data of described characteristic related Sections, and integrate, comprising:
Steps A 20, obtains web data relevant to described characteristic in each website;
Preferably, this step comprises:
Step D10, according to described characteristic, obtains title information and author information;
In this step, if only have the URL of novel brief introduction webpage in described characteristic, directly obtain the html document that URL is corresponding, from html document, obtain title information, author information.
If described characteristic has comprised novel brief introduction webpage URL, title information, author information, directly enter subsequent step.
If only have the URL of novel chapters and sections webpage in described characteristic, obtain the html document that URL is corresponding, from html document, obtain title information, author information, chapters and sections information.
If only have URL and the chapters and sections information of novel chapters and sections webpage in described characteristic, obtain the html document that URL is corresponding, from html document, obtain title information, author information, enter subsequent step.
If only have URL, chapters and sections information, title information, the author information of novel chapters and sections webpage in described characteristic, can directly enter subsequent step.
If only have URL and title information, author information in described characteristic, can obtain the html document that URL is corresponding, from html document, obtain chapters and sections information, enter subsequent step.
Certainly also only a title information, author information enter subsequent step.
Step D12, take described title information and author information is keyword, obtains the web data of each chapters and sections identical with author information with described title information.
In the present invention, can be captured in advance by server all information of each novel of each novel website: novel title information, author information and all chapters and sections and source website address URL.Then directly in server, obtain the web data of each chapters and sections identical with author information with described title information.
Also can from network, capture in real time the web data of each chapters and sections identical with author information with described title information.
In crawl, obtain after each web data, also can carry out safety detection to described web data, retain safe web data.
Steps A 22, judges whether the reading content of described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
Such as novel webpage needs VIP membership or needs special string or could access during other qualificationss, abandon this vertical web data; Otherwise, can retain this connection.
Preferably, also comprise: steps A 23, judges whether safety of each web data, and the web data of safety is retained.
Steps A 24, a plurality of web datas for having identical reading content, retain one of them.
So, can be only a with retaining for each chapters and sections webpage web data of same title, same author's novel webpage.
Further, also comprise: also comprise:
Steps A 26, according to the order of chapters and sections, is a catalogue webpage by each related web page Data Integration.
Each chapters and sections web data of a novel is integrated into after catalogue webpage, can conveniently stores.Also these chapters and sections Information Organizations can be become the full directory structure of this this novel represent to user, when user clicks certain chapters and sections in catalogue, jump to this chapters and sections content that this source web provides, to load for browser.
When follow-up browser is browsed certain chapters and sections, can directly by this catalogue webpage, browse.
Preferably, also comprise:
Steps A 28, records a state corresponding to catalogue webpage and publishes in instalments or finish;
In this step, server also can be inquired about the state of each novel: publish in instalments or finish, state is returned to client together.
Further, after the web data of the described characteristic related Sections after the described integration of returning at browser receiving target server, also comprise:
Steps A 30, whether browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state; Described server judgement has renewal chapters and sections, and the catalogue after correspondence being upgraded returns to client.
Browser is after obtaining catalogue webpage, for in publishing in instalments the catalogue webpage of state, can regularly go server interrogates whether to have up-to-date webpage, server goes inquiry whether to have the chapters and sections of renewal, if had, more new directory, then returns to client by the catalogue of renewal and state.
With reference to Fig. 2, it is a kind of webpage web data Integration Mode example of A20 of the present invention to A28.
The web data of browser access website A chapters and sections 1, after aforementioned determining step, obtain title+author, title+author is sent to server, server obtains the web data of all chapters and sections from each websites such as website B, C by abovementioned steps, form catalogue, and state is returned to browser together.Browser is in publishing in instalments state, each goes to check and accept whether have up-to-date chapters and sections (or whether query directory upgrades) for 1 hour, if catalogue is upgraded, return to up-to-date chapters and sections information and state (finish or publish in instalments), until this listing of novel is in the state of finishing.
Preferably, described according to the order of chapters and sections, by each related web page Data Integration, be that a catalogue webpage comprises:
Steps A 32, according to browser when the current chapters and sections of front opening webpage, return current chapters and sections at least before N chapters and sections web data reading content and/or at least after the reading content of M chapters and sections web data.
In abovementioned steps, if browser client is the web data that points to novel chapters and sections webpage when the web data of front opening, can obtain its chapters and sections information.So this step can return simultaneously the N chapters and sections above of these chapters and sections and below the reading content of M chapters and sections return to browser loading.
In the present invention, N and M maximal value can be set to 5, and the chapters and sections sum before current chapters and sections is greater than at 5 o'clock, and N is desirable 5, if be less than 5, gets actual value; Chapters and sections sum after current chapters and sections is greater than at 5 o'clock, and N is desirable 5, if be less than 5, gets actual value.
In addition, in embodiments of the present invention, described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate and comprise:
Steps A 33 is extracted when the relevant reading content of front opening web data from each website, and described reading content is integrated.
Such as only extracting the current document data (reading content) that is limited to load chapters and sections from third party website, then document data is returned to browser.
Or from third party website obtain current chapters and sections the reading content of N chapters and sections and/or rear M chapters and sections, integrate in order (be about to each reading content and be integrated into one piece of document data), then the reading content after integrating is returned to browser.
Step 106, the web data of the described characteristic related Sections after the described integration that receiving target server returns, and obtain the reading content of corresponding web data and load according to instruction.
When browser reads the content of current chapters and sections, while jumping to next chapters and sections or other chapters and sections, can directly in the web data from integrating, obtain corresponding reading content and load.
In addition,
In the present invention, the reading content of the web data returning for each, browser can judge whether it can load in reading model, described reading model can be removed the non-reading content in html document corresponding to web data for judgement browser, and the reading content that only retains novel loads.If of course, can enter reading model, load reading content; If cannot, in browser, newly play tab Shipping Options Page, the reading content of Web page loading data in tab Shipping Options Page.In reading model, can load the reading content of at least front N chapters and sections web data and/or the reading content of at least rear M chapters and sections web data of the current chapters and sections that meet reading model requirement.
In addition, in practical operation, when reading, user also may operate by roll mouse, and so preferred, also comprise:
Steps A 34, sends to server by the direction of operating that webpage is operated; Described server is M chapter web data based on the corresponding at least front N chapter of described direction of operating or at least.
Be user while scrolling up mouse, return at least before N chapter connect to browser and load; During the downward roll mouse of user, after returning at least, N chapter connects to browser and loads.
According to web page browing content loading method of the present invention, each related Sections web data without the same Web page subject of reading condition restriction can be integrated, offering browser directly loads, the present invention realizes for the chapters and sections content of the different network novel in source and carries out the integration in browser side, is the demonstration of unification for user; Can reduce user's operation, raise the efficiency; And can prevent that user is owing to being deceived by fishing when searching novel.
Embodiment bis-
With reference to Fig. 3, it shows the example of a kind of web page browing content of the present invention loading method, and it can comprise:
Step 302, after judging that current novel webpage is limited to load, obtains the chapters and sections information of current novel webpage;
Be that browser is opened webpage, when judgement current web page is the novel webpage of novel website, and be restricted to and cannot load after reading content, obtain the chapters and sections information of current novel webpage;
Step 304, uploads corresponding URL and chapters and sections information to server;
Step 306, server from each third party's novel website, obtains the text data of relevant novel chapters and sections according to the URL uploading;
Step 308, pushes to browser by text data;
Step 310, browser loads the text message returning under reading model.
The present embodiment is just for more simple and clear making an explanation to embodiment mono-, and the step principle of the present embodiment is basic and embodiment mono-is similar, is not described in detail in this.
Embodiment tri-
With reference to Fig. 4, the structural representation that it shows a kind of web page browing content of the present invention charger, comprising:
Extraction module 402, the web data that is suitable for opening for browser, extracts the characteristic of reading content in corresponding web data;
Sending module 404, is suitable for described characteristic to be sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
Receiver module 406, is suitable for the web data of the described characteristic related Sections after described integration that receiving target server returns, and obtains the reading content of corresponding web data and load according to instruction.
Preferably, in described destination server, comprising:
Acquisition module, is suitable for obtaining web data relevant to described characteristic in each website;
Analysis module, is suitable for judging whether the reading content of described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
The first integrate module, is suitable for, for a plurality of web datas with identical reading content, retaining one of them.
Preferably, also comprise:
Judge whether safety of each web data, and the web data of safety is retained.
Preferably, in described destination server, also comprise:
The second integrate module, is suitable for the order according to chapters and sections, by each related web page Data Integration, is a catalogue webpage.
Preferably, also comprise:
In described server, also comprise:
State recording module, is suitable for recording a state corresponding to catalogue webpage and publishes in instalments or finish;
Further, in browser, also comprise:
The first update module, whether be suitable for browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state;
In described server, also comprise:
The second update module, being suitable for server judgement has renewal chapters and sections, and the catalogue after correspondence being upgraded returns to client.
Preferably, described the first integrate module comprises:
The 3rd integrate module, is suitable for the current chapters and sections when front opening webpage according to browser, returns to the reading content of at least front N chapters and sections web data and/or the reading content of at least rear M chapters and sections web data of current chapters and sections.
Preferably, described web page browing content charger also comprises: direction of operating sending module, is suitable for the direction of operating that webpage is operated to send to server;
In described destination server, also comprise: the 4th integrate module, is suitable for based on the corresponding at least front N chapter of described direction of operating or at least M chapter web data.
Preferably, described extraction module comprises:
The first extraction module, is suitable for extracting the URL of current web page, by described URL with read domain name matched rule and mate, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
Preferably, described the first extraction module comprises:
Website judge module, is suitable for judgement and works as whether the URL of front opening is novel website; When judgement is novel website, enter brief introduction webpage judge module;
Type of webpage judge module, is suitable for judging whether described URL points to novel brief introduction webpage, and/or judges whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
Preferably, described extraction module comprises:
Restrictive condition judge module, is suitable for, described in browser resolves during web data, judging whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.
The algorithm providing at this is intrinsic not relevant to any certain computer, virtual bench or miscellaneous equipment with demonstration.Various fexible units also can with based on using together with this teaching.According to description above, it is apparent constructing the desired structure of this class device.In addition, the present invention is not also for any certain programmed language.It should be understood that and can utilize various programming languages to realize content of the present invention described here, and the description of above language-specific being done is in order to disclose preferred forms of the present invention.
In the instructions that provided herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can not put into practice in the situation that there is no these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the above in the description of exemplary embodiment of the present invention, each feature of the present invention is grouped together into single embodiment, figure or sometimes in its description.Yet, the method for the disclosure should be construed to the following intention of reflection: the present invention for required protection requires than the more feature of feature of clearly recording in each claim.Or rather, as reflected in claims below, inventive aspect is to be less than all features of disclosed single embodiment above.Therefore, claims of following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can the module in the equipment in embodiment are adaptively changed and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and can put them into a plurality of submodules or subelement or sub-component in addition.At least some in such feature and/or process or unit are mutually repelling, and can adopt any combination to combine all processes or the unit of disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and disclosed any method like this or equipment.Unless clearly statement in addition, in this instructions (comprising claim, summary and the accompanying drawing followed) disclosed each feature can be by providing identical, be equal to or the alternative features of similar object replaces.
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included in other embodiment, the combination of the feature of different embodiment means within scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module moved on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize according to the some or all functions of the some or all parts in the web page browing content loading equipemtn of the embodiment of the present invention.The present invention for example can also be embodied as, for carrying out part or all equipment or device program (, computer program and computer program) of method as described herein.Realizing program of the present invention and can be stored on computer-readable medium like this, or can there is the form of one or more signal.Such signal can be downloaded and obtain from internet website, or provides on carrier signal, or provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation that do not depart from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed as element or step in the claims.Being positioned at word " " before element or " one " does not get rid of and has a plurality of such elements.The present invention can be by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to carry out imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title by these word explanations.
The invention discloses A1, a kind of web page browing content loading method, comprising:
The web data of opening for browser, extracts the characteristic of reading content in corresponding web data;
Described characteristic is sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
The web data of the described characteristic related Sections after the described integration that receiving target server returns, and obtain the reading content of corresponding web data and load according to instruction.
A2, the method as described in A1, described destination server from each website, obtain do not have the restriction of encrypting and with the web data of described characteristic related Sections, and integrate, comprising:
Obtain web data relevant to described characteristic in each website;
Whether the reading content that judges described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
A plurality of web datas for having identical reading content, retain one of them.
Whether A3, the method as described in A2, have when reading condition limits and also comprise at the reading content that judges described web data:
Judge whether safety of each web data, and the web data of safety is retained.
A4, the method as described in A2, also comprise:
According to the order of chapters and sections, by each related web page Data Integration, be a catalogue webpage.
A5, the method as described in A2, also comprise:
Recording a state corresponding to catalogue webpage publishes in instalments or finishes;
Further, after the web data of the described characteristic related Sections after the described integration of returning at browser receiving target server, also comprise:
Whether browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state; Described server judgement has renewal chapters and sections, and catalogue and state after correspondence being upgraded return to client.
A6, the method as described in A1, described in obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate and comprise:
According to browser when the current chapters and sections of front opening webpage, return current chapters and sections at least before N chapters and sections web data reading content and/or at least after the reading content of M chapters and sections web data.
A7, the method as described in A2, also comprise:
The direction of operating that webpage is operated is sent to server; Described server is M chapter web data based on the corresponding at least front N chapter of described direction of operating or at least.
A8, the method as described in A1 or A2, the described web data of opening for browser, the characteristic of extracting reading content in corresponding web data comprises:
Extract the URL of current web page, described URL is mated with reading domain name matched rule, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
A9, the method as described in A8, the URL of described extraction current web page, mates described URL with reading domain name matched rule, and when meeting matching condition, in corresponding web data, the characteristic of reading content comprises:
Whether the URL that front opening is worked as in judgement is novel website;
If novel website judges whether described URL points to novel brief introduction webpage, and/or judge whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
A10, the method as described in A1, the described web data of opening for browser, the characteristic of extracting reading content in corresponding web data comprises:
Described in browser resolves during web data, judge whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.
The invention also discloses A11, a kind of web page browing content charger, comprising:
Extraction module, the web data that is suitable for opening for browser, extracts the characteristic of reading content in corresponding web data;
Sending module, is suitable for described characteristic to be sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
Receiver module, is suitable for the web data of the described characteristic related Sections after described integration that receiving target server returns, and obtains the reading content of corresponding web data and load according to instruction.
A12, the device as described in A11, in described destination server, comprising:
Acquisition module, is suitable for obtaining web data relevant to described characteristic in each website;
Analysis module, is suitable for judging whether the reading content of described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
The first integrate module, is suitable for, for a plurality of web datas with identical reading content, retaining one of them.
A13, the device as described in A12, also comprise:
Analysis module, is suitable for judging whether safety of each web data, and the web data of safety is retained.
A14, the device as described in A12, in described destination server, also comprise:
The second integrate module, is suitable for the order according to chapters and sections, by each related web page Data Integration, is a catalogue webpage.
A15, the device as described in A13 also comprise in described server:
State recording module, is suitable for recording a state corresponding to catalogue webpage and publishes in instalments or finish;
Further, in browser, also comprise:
The first update module, whether be suitable for browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state;
In described server, also comprise:
The second update module, being suitable for server judgement has renewal chapters and sections, and the catalogue after correspondence being upgraded returns to client.
A16, the device as described in A11, described the first integrate module comprises:
The 3rd integrate module, is suitable for the current chapters and sections when front opening webpage according to browser, returns to the reading content of at least front N chapters and sections web data and/or the reading content of at least rear M chapters and sections web data of current chapters and sections.
A17, the device as described in 12,
Described web page browing content charger also comprises: direction of operating sending module, is suitable for the direction of operating that webpage is operated to send to server;
In described destination server, also comprise: the 4th integrate module, is suitable for based on the corresponding at least front N chapter of described direction of operating or at least M chapter web data.
A18, the device as described in A11 or A12, described extraction module comprises:
The first extraction module, is suitable for extracting the URL of current web page, by described URL with read domain name matched rule and mate, when meeting matching condition, extract the characteristic of reading content in corresponding web data; Described characteristic comprises corresponding chapters and sections information and/or title information and/or author information.
A19, the device as described in 18, described the first extraction module comprises:
Website judge module, is suitable for judgement and works as whether the URL of front opening is novel website; When judgement is novel website, enter brief introduction webpage judge module;
Type of webpage judge module, is suitable for judging whether described URL points to novel brief introduction webpage, and/or judges whether described URL points to novel chapters and sections webpage;
If point to novel brief introduction webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises title information and/or author information;
If point to novel chapters and sections webpage, extract the characteristic of reading content in corresponding web data, described characteristic comprises chapters and sections information.
A20, the device as described in A11, described extraction module comprises:
Restrictive condition judge module, is suitable for, described in browser resolves during web data, judging whether web data exists reading condition restriction; If existed, extract the characteristic of reading content in corresponding web data.

Claims (10)

1. a web page browing content loading method, comprising:
The web data of opening for browser, extracts the characteristic of reading content in corresponding web data;
Described characteristic is sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
The web data of the described characteristic related Sections after the described integration that receiving target server returns, and obtain the reading content of corresponding web data and load according to instruction.
2. the method for claim 1, is characterized in that, described destination server from each website, obtain do not have the restriction of encrypting and with the web data of described characteristic related Sections, and integrate, comprising:
Obtain web data relevant to described characteristic in each website;
Whether the reading content that judges described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
A plurality of web datas for having identical reading content, retain one of them.
3. whether method as claimed in claim 2, is characterized in that, at the reading content that judges described web data, have when reading condition limits and also comprise:
Judge whether safety of each web data, and the web data of safety is retained.
4. method as claimed in claim 2, is characterized in that, also comprises:
According to the order of chapters and sections, by each related web page Data Integration, be a catalogue webpage.
5. method as claimed in claim 2, is characterized in that, also comprises:
Recording a state corresponding to catalogue webpage publishes in instalments or finishes;
Further, after the web data of the described characteristic related Sections after the described integration of returning at browser receiving target server, also comprise:
Whether browser regularly has renewal chapters and sections to server interrogates in publishing in instalments the catalogue webpage of state; Described server judgement has renewal chapters and sections, and catalogue and state after correspondence being upgraded return to client.
6. the method for claim 1, is characterized in that, described in obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate and comprise:
According to browser when the current chapters and sections of front opening webpage, return current chapters and sections at least before N chapters and sections web data reading content and/or at least after the reading content of M chapters and sections web data.
7. a web page browing content charger, comprising:
Extraction module, the web data that is suitable for opening for browser, extracts the characteristic of reading content in corresponding web data;
Sending module, is suitable for described characteristic to be sent to destination server; Described destination server from each website, obtain do not have reading condition restriction and with the web data of described characteristic related Sections, and integrate;
Receiver module, is suitable for the web data of the described characteristic related Sections after described integration that receiving target server returns, and obtains the reading content of corresponding web data and load according to instruction.
8. device as claimed in claim 7, is characterized in that, in described destination server, comprising:
Acquisition module, is suitable for obtaining web data relevant to described characteristic in each website;
Analysis module, is suitable for judging whether the reading content of described web data has reading condition restriction; If do not have reading condition to limit, retain this web data;
The first integrate module, is suitable for, for a plurality of web datas with identical reading content, retaining one of them.
9. device as claimed in claim 8, is characterized in that, also comprises:
Analysis module, is suitable for judging whether safety of each web data, and the web data of safety is retained.
10. device as claimed in claim 8, is characterized in that, in described destination server, also comprises:
The second integrate module, is suitable for the order according to chapters and sections, by each related web page Data Integration, is a catalogue webpage.
CN201310513334.4A 2013-10-25 2013-10-25 A kind of web page browing content loading method and device Active CN103577566B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310513334.4A CN103577566B (en) 2013-10-25 2013-10-25 A kind of web page browing content loading method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310513334.4A CN103577566B (en) 2013-10-25 2013-10-25 A kind of web page browing content loading method and device

Publications (2)

Publication Number Publication Date
CN103577566A true CN103577566A (en) 2014-02-12
CN103577566B CN103577566B (en) 2017-07-28

Family

ID=50049342

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310513334.4A Active CN103577566B (en) 2013-10-25 2013-10-25 A kind of web page browing content loading method and device

Country Status (1)

Country Link
CN (1) CN103577566B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104317903A (en) * 2014-10-24 2015-01-28 北京奇虎科技有限公司 Chapter type text chapter integrity identification method and device
CN104484415A (en) * 2014-12-16 2015-04-01 北京百度网讯科技有限公司 E-book supplying method and e-book supplying device
CN105302913A (en) * 2015-11-12 2016-02-03 北京奇虎科技有限公司 Network novel chapter list evaluating method and device
CN106709008A (en) * 2016-12-23 2017-05-24 掌阅科技股份有限公司 Internet article revising and renewing method and device and method and device for renewing internet article reading processing rate
CN106844769A (en) * 2017-02-27 2017-06-13 百度在线网络技术(北京)有限公司 With reference to the pattern of passing through and in limited time reading model information flow recommend method and apparatus
CN108268429A (en) * 2017-06-15 2018-07-10 广东神马搜索科技有限公司 The determining method and apparatus of online literature chapters and sections
CN104965825B (en) * 2014-04-16 2018-12-11 腾讯科技(深圳)有限公司 A kind of method and terminal of data processing
CN111143718A (en) * 2018-11-02 2020-05-12 上海奥陶网络科技有限公司 Branch novel management system and method
CN111428173A (en) * 2020-02-25 2020-07-17 泰康保险集团股份有限公司 Method and device for accessing third-party website

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1809827A (en) * 2000-11-21 2006-07-26 汤姆森许可公司 System and process for network site fragmented search
US20080039464A1 (en) * 2006-07-28 2008-02-14 Berry Angela Compounds Which Modulate The CB2 Receptor
US7340481B1 (en) * 2000-01-21 2008-03-04 International Business Machines Corp. Method and system for adding user-provided content to a content object stored in a data repository
US20120023133A1 (en) * 2009-04-01 2012-01-26 Woodt Inc. Document searching system and method
CN103020266A (en) * 2012-12-25 2013-04-03 北京奇虎科技有限公司 Method and device for extracting webpage text content

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7340481B1 (en) * 2000-01-21 2008-03-04 International Business Machines Corp. Method and system for adding user-provided content to a content object stored in a data repository
CN1809827A (en) * 2000-11-21 2006-07-26 汤姆森许可公司 System and process for network site fragmented search
US20080039464A1 (en) * 2006-07-28 2008-02-14 Berry Angela Compounds Which Modulate The CB2 Receptor
US20120023133A1 (en) * 2009-04-01 2012-01-26 Woodt Inc. Document searching system and method
CN103020266A (en) * 2012-12-25 2013-04-03 北京奇虎科技有限公司 Method and device for extracting webpage text content

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104965825B (en) * 2014-04-16 2018-12-11 腾讯科技(深圳)有限公司 A kind of method and terminal of data processing
CN104317903B (en) * 2014-10-24 2017-10-13 北京奇虎科技有限公司 The recognition methods of the chapters and sections integrality of chapters and sections formula text and device
CN104317903A (en) * 2014-10-24 2015-01-28 北京奇虎科技有限公司 Chapter type text chapter integrity identification method and device
CN104484415A (en) * 2014-12-16 2015-04-01 北京百度网讯科技有限公司 E-book supplying method and e-book supplying device
CN105302913A (en) * 2015-11-12 2016-02-03 北京奇虎科技有限公司 Network novel chapter list evaluating method and device
CN105302913B (en) * 2015-11-12 2018-09-18 北京奇虎科技有限公司 Network novel Chapter List appraisal procedure and device
CN106709008A (en) * 2016-12-23 2017-05-24 掌阅科技股份有限公司 Internet article revising and renewing method and device and method and device for renewing internet article reading processing rate
CN106844769A (en) * 2017-02-27 2017-06-13 百度在线网络技术(北京)有限公司 With reference to the pattern of passing through and in limited time reading model information flow recommend method and apparatus
CN108268429A (en) * 2017-06-15 2018-07-10 广东神马搜索科技有限公司 The determining method and apparatus of online literature chapters and sections
CN108268429B (en) * 2017-06-15 2021-08-06 阿里巴巴(中国)有限公司 Method and device for determining network literature chapters
CN111143718A (en) * 2018-11-02 2020-05-12 上海奥陶网络科技有限公司 Branch novel management system and method
CN111428173A (en) * 2020-02-25 2020-07-17 泰康保险集团股份有限公司 Method and device for accessing third-party website
CN111428173B (en) * 2020-02-25 2023-04-07 泰康保险集团股份有限公司 Method and device for accessing third-party website

Also Published As

Publication number Publication date
CN103577566B (en) 2017-07-28

Similar Documents

Publication Publication Date Title
CN103577566A (en) Web reading content loading method and device
CN110688554B (en) Indexing data for native applications
CN103023714B (en) The liveness of topic Network Based and cluster topology analytical system and method
CN103714115A (en) Method and device for loading web page content
CN102831252A (en) Method and device for updating index database and search method and system
CN102982174A (en) Method and device for performing web search in browser
CN102930057A (en) Search implementation method and device
US20070162524A1 (en) Network document management
CN103020239A (en) Web searching method and device
CN102968451A (en) Method for loading website data in browser format page and browser client
CN102930058A (en) Method and device for realizing search in address field of browser
CN102880711A (en) Processing method and processing device for input data in browser address bar
CN102982117A (en) Information search method and device
CN104050286A (en) Method and device for providing search result integration
CN102982118A (en) Searching method and device based on favorites
CN103793523A (en) Automatic search engine construction method based on content similarity calculation
CN107491465A (en) For searching for the method and apparatus and data handling system of content
CN103984757A (en) Method and system for inserting news information articles in search result page
CN104199865A (en) Searching method, client-side and system of custom result providing content provider
CN102567521B (en) Webpage data capturing and filtering method
CN102902784A (en) Web page classification storage system and method
CN104572719A (en) Information collecting method and device
CN102955847A (en) System for loading website data on browser format page
CN103617225A (en) Associated webpage searching method and system
CN103631906A (en) Method and device for recognizing page number identification in webpage URL

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20220729

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.