CA2242504C - Network information searching apparatus and network information searching method - Google Patents

Network information searching apparatus and network information searching method Download PDF

Info

Publication number
CA2242504C
CA2242504C CA002242504A CA2242504A CA2242504C CA 2242504 C CA2242504 C CA 2242504C CA 002242504 A CA002242504 A CA 002242504A CA 2242504 A CA2242504 A CA 2242504A CA 2242504 C CA2242504 C CA 2242504C
Authority
CA
Canada
Prior art keywords
information
user
extracted
network
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002242504A
Other languages
French (fr)
Other versions
CA2242504A1 (en
Inventor
Hiroshi Matsui
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of CA2242504A1 publication Critical patent/CA2242504A1/en
Application granted granted Critical
Publication of CA2242504C publication Critical patent/CA2242504C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

A collecting process of data which is managed by each server on a network is first started by information collection means. In the collecting process, data under a point serving as a root is collected while repeating processes such that data at the point serving as a certain root is collected on the basis of access setting information, a link destination is extracted from the collected data, and data on the extracted link destination is further collected.
Subsequently, the data which is necessary by the user is extracted by extraction means. The extracted data is outputted to an output destination. Thus, the information which is required by the user can be searched from the information managed and held in a plurality of information processing apparatuses on the network without forcing surplus labor and time to the user.

Description

CA 02242~04 1998-07-06 cA

NETWORK INFORMATION SEARCHING APPARATUS
AND NETWORK INFORMATION SEARCHING METHOD

BACKGROUND OF THE INVENTION
Field of the Invention The invention relates to a network information searching apparatus which is connected to a plurality of information processing apparatuses on a network through the network and is used to search information required by the user from information managed and held in each of the information processing apparatuses and also relates to a method for such an apparatus.
Related Background Art In case of searching desired information of the user from information managed and held by a plurality of computers on a network, it is necessary to repetitively execute an operation to obtain the information and an operation to confirm the contents of the obtained information until necessary information or information including the necessary information is obtained. That is, it is necessary to repetitively execute an operation for accessing from a computer of the user through the network to a server program operating on the computer connected to the network and for obtaining information in accordance with a procedure corresponding to the server program after accessing, an operation for reading the contents of the CA 02242~04 l998-07-06 obtained information and confirming whether the necessary information is included in the information or not, and an operation for getting next information if the necessary information is not included in the 5 obtained information.
In case of searching necessary information from information of a page construction which is managed by a WWW (World Wide Web) system constructed by a WWW
program operating on a computer connected to the internet and is logically linked by a URL (Uniform Resource Locator), the necessary information is searched by repeating operations such that the contents of an arbitrary page in information of the page construction managed are read, if the necessary 15 information does not exist in the page, characters or a figure, namely, what is called an anchor concerned with a URL of a page which may include the necessary information at a page that is at present opened is clicked by a mouse, a corresponding next page is displayed, and the contents of this page are read.
An example of the operations in case of searching necessary information from the information managed by the WWW system will now be described with reference to Fig. 14. Fig. 14 is a diagram for explaining the 25 operation example in case of searching the necessary information from the information managed by the conventional WWW system.

CA 02242~04 1998-07-06 For example, as shown in Fig. 14, the contents of an arbitrary page 8101 in information of a page construction managed by a plurality of WWW servers 81, 82, ..., and 8n are read. If the necessary information does not exist in this page, characters or a figure, that is, an anchor 8101-1 here concerned with a URL
which may include the necessary information at this page is clicked, so that a page 8102 is displayed. The user reads the contents of the page 8102. When he confirms that the information which is needed by the user does not exist in the page 8102 in a manner similar to the above operation, characters or a figure concerned with the URL of the page which may include the necessary information is clicked by the mouse, thereby displaying the next page.
However, in case of searching the necessary information from the information managed by the WWW
system mentioned above, as characters or a figure concerned with the URL of the page which may include the necessary information at the page that is at present opened, which characters or figure should be selected often depends on intuition of the user. The user cannot know the relations among the pages.
Therefore, it is necessary to repeat the operations many times and the user is forced to read unnecessary information other than the object for a period of time until the necessary information is searched. Much CA 02242~04 1998-07-06 surplus labor and surplus working time are required for the operation by the page including the necessary information is found.

SUMMARY OF THE INVENTION
It is an object of the invention to provide a network information searching apparatus and a network information searching method which can search information required by the user from information managed and held in a plurality of information processing apparatuses on a network without forcing surplus labor and time to the user.
To accomplish the above object, according to the invention, there is provided a network information searching apparatus which is connected through a network to a plurality of information processing apparatuses on the network and is used to search information required by the user from information stored in each memory means in each of the information processing apparatuses, comprising:
information collection means for collecting the information stored in each of the memory means through the network;
information holding means for holding the information collected by the information collection means;
extraction means for extracting information CA 02242~04 1998-07-06 corresponding to the information required by the user from the information held in the information holding means; and output means for outputting the information extracted by the extraction means and corresponding to the information required by the user.
To accomplish the above object, according to the invention, there is provided a network information searching method in a network information searching apparatus which is connected through a network to a plurality of information processing apparatuses on the network and is used to search information required by the user from information stored in each of the memory means in each of the information processing apparatuses, comprising:
an information collecting step of collecting the information stored in each of the memory means through the network;
an information holding step of holding the information collected by the information collecting step into information holing means;
an extracting step of extracting information corresponding to the information required by the user from the information held in the information holding means; and an output step of outputting the information extracted by the extracting step and corresponding to CA 02242~04 1998-07-06 the information required by the user.
To accomplish the above object, according to the invention, there is provided a storage medium which is connected through a network to a plurality of information processing apparatuses on the network and in which a network information searching program to search information required by the user from information stored in each memory means in each of the information processing apparatuses has been stored, wherein the network information searching program comprises:
an information collecting module for collecting the information stored in each of the memory means through the network;
an information holding module for holding the collected information into information holding means;
an extracting module for extracting information corresponding to the information required by the user from the information held in the information holding means; and an output module for outputting the extracted information corresponding to the information required by the user.
To accomplish the above object, according to the invention, there is provided a network information searching system constructed by a plurality of information processing apparatuses and a network CA 02242~04 1998-07-06 information searching apparatus which are mutually connected through a network, wherein the network information searching apparatus comprises:
information collection means for collecting the information stored in each of the memory means in each of the information processing apparatuses through the network;
information holding means for holding the information collected by the information collection means;
extraction means for extracting information corresponding to the information required by the user from the information held in the information holding means; and output means for outputting the information extracted by the extraction means and corresponding to the information required by the user.

BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a block diagram showing a construction of an embodiment of a network information searching apparatus of the invention;
Fig. 2 is a diagram showing the contents of environment setting information which is used in the network information searching apparatus of Fig. 1;
Fig. 3 is a diagram showing the contents of access CA 02242~04 1998-07-06 setting information which is used in the network information searching apparatus of Fig. l;
Fig. 4 is a diagram showing the contents of extraction condition setting information which is used in the network information searching apparatus of Fig.
l;
Fig. 5 is a diagram showing the contents of address information which is used in the network information searching apparatus of Fig. l;
Fig. 6 is a diagram showing the contents of image characteristics information which is used in the network information searching apparatus of Fig. l;
Fig. 7 is a diagram for explaining an example of the operation in an information search using the network information searching apparatus of Fig. l;
Fig. 8 is a diagram conceptually showing data stored in storage means in the network information searching apparatus of Fig. l;
Fig. 9 is a diagram for explaining an example of a directivity data collection from a WWW system in the network information searching apparatus of Fig. l;
Fig. 10 is a flowchart showing a processing procedure in the network information searching apparatus of Fig. l;
Fig. 11 is a flowchart showing a processing procedure for searching a server program which operates on a server computer on a network in the network CA 02242~04 1998-07-06 information searching apparatus of Fig. 1;
Fig. 12 is a flowchart showing a processing procedure in case of collecting information having directivity in the network information searching apparatus of Fig. 1;
Fig. 13 is a flowchart showing a processing procedure for extracting image data having designated characteristics in the network information searching apparatus of Fig. 1; and Fig. 14 is a diagram for explaining an operation example in case of searching necessary information from information managed by a conventional WWW system.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
An embodiment of the invention will now be described hereinbelow with reference to the drawings.
(a) First Embodiment:
Fig. 1 is a block diagram showing a construction of an embodiment of a network information searching apparatus of the invention.
As shown in Fig. 1, a plurality of server computers 103(m) (m = 1, 2, ..., n) are connected to an internet network (hereinafter, referred to as a network) 102. An information management system to manage holding information by server programs such as WWW server programs 1 to n, file server programs 1 to n, and the like is constructed in each of the server CA 02242~04 1998-07-06 computers 103(m).
A network information searching apparatus 101 is used as an apparatus for searching information required by the user from the information held in each of the server computers 103(m). The apparatus 101 is connected to the network 102 through a public line 112 and a provider 111. The network information searching apparatus 101 comprises input means 104, storage means 105, timer start means 106, information collection means 107, memory means 108, extraction means 109, and output means 110.
The input means 104 inputs various information such as environment setting information, access setting information, extraction condition setting information, address information, extraction image information, and the like in accordance with the operation by the user.
The inputted various information is stored into the storage means 105.
Fig. 2 is a diagram showing the contents of the environment setting information which is used in the network information searching apparatus of Fig. 1.
As shown in Fig. 2, in the environment setting information, there have been registered: a date, a day of the week, and time of "start"; a date, a day of the week, and time of "end"; a timer setting to set everyday, every week, or every month in case of periodically starting; a timeout of the connection; the CA 02242~04 1998-07-06 number of times of retry; a transfer amount upon downloading; and the number of sessions which are simultaneously accessed.
Fig. 3 is a diagram showing the contents of the access setting information which is used in the network information searching apparatus of Fig. 1.
As shown in Fig. 3, in the access setting information, there have been registered: an access setting No.; an address of the server which is used; a classification of the server; a URL serving as a root in case of accessing to the WWW server; a hierarchical layer showing a depth of downloading; a mail address of the user in case of accessing to a mail system; a directory serving as a root in case of accessing to a file server; a user ID; a password; a directivity collection to set whether the directivity collection is performed or not; and a hierarchical layer serving as a unit of discrimination about the directivity collection.
Fig. 4 is a diagram showing the contents of the extraction condition setting information which is used in the network information searching apparatus of Fig.
1.
As shown in Fig. 4, in the extraction condition setting information, there have been registered: an extraction condition setting No.; an access setting No.
concerned with the access setting No. of the access CA 02242~04 l998-07-06 setting information; a search extraction to set whether the extraction by the search is performed or not; a synonym extraction to set whether the extraction by a synonym is performed or not; an analogue extraction to set whether the extraction by an analogue is performed or not; a search object to designate a search object;
search conditions 1 to n to set a search keyword and the like; a data type extraction to set whether the extraction by a data type is performed or not; data types 1 to n to designate a type of data to be extracted; an image extraction to set whether the extraction by characteristics of the image data is performed or not; image conditions 1 to n to designate characteristics of the image data to be extracted; an output destination designation to set an output destination; an address No. in case of outputting to a mail or FAX; and extra condition data to set whether the data which is not congruous with the extraction conditions is left or not.
Fig. 5 is a diagram showing the contents of the address information which is used in the network information searching apparatus of Fig. 1. Fig. 6 is a diagram showing the contents of the image characteristics information.
As shown in Fig. 5, a mail or an FAX telephone number of the address has been registered in the address information. As shown in Fig. 6, an image CA 02242~04 1998-07-06 condition No. and image characteristics information have been registered in the image extraction information.
The information collection means 107 comprises means for accessing to the corresponding server computer (m) on the basis of the access setting information and collecting information held by each server computer (m) through the network 102. The information collected by the information collection means 107 is held in the memory means 108. The extraction means 109 extracts the information corresponding to the information required by the user on the basis of the extraction condition setting information set by the user from the information held in the memory means 108. The extracted information is outputted to the output destination designated by the output destination designation included in the extraction condition setting information by the output means 110. The timer start means 106 comprises means for starting the information collection means 107 on the basis of the information included in the environment setting information.
Subsequently, a series of operations in case of searching the information which is needed by the user will now be described by using the network information searching apparatus 101 with reference to Figs. 1 to 5 and 7. Fig. 7 is a diagram for explaining an example CA 02242~04 1998-07-06 of the operation in the information search using the network information searching apparatus of Fig. 1. The series of operations will now be described with respect to an example in the case where the apparatus is connected to the network 102 at night when the network 102 is free, a series of WWW pages designated by the user is collected, the pages which are needed by the user are extracted from the collected pages, and the extracted pages are browsed in the daytime.
It is assumed that the information which is needed by the user, for example, the pages including the information regarding a printing device are a page 718 in a server 710 of A newspaper company and a page 7n4 in a server 7nn of N newspaper company and that a page of an index having a logical link to each newspaper company is a page 702 under the management of the WWW
server 701 of CC Co. Ltd.
In this case, first, the user sets various information through the input means 104. In the access setting information, as shown in the access setting No.
1 in Fig. 3, a URL of the page 702 as a URL serving as a root is set to 3 as a hierarchical layer which traces the link. In case of storing the extracted information into a homepage and confirming, in the extraction condition setting information, as shown in the extraction condition No. 1 in Fig. 4, 1 is set into the access setting No. and the storage device (storage CA 02242~04 l998-07-06 means 105 here) is set to the output destination, respectively. Further, for example, in case of setting so as to start the information collection means 1 at 3 a.m. when it is presumed that the network 102 and WWW
server 701 will be free, as shown in Fig. 2, the start time in the environment information setting information is set to 3 a.m..
When the time of 3 a.m. comes, the information collection means 107 is started by the timer start means 106. The information collection means 107 refers to the access setting information held in the storage means lOS, connects to the provider 111 through the public line 112, and connects to the network 102 via the provider 111. After it was connected to the network 102, first, the data at the page 702 under the management of the WWW server 701 of CC Co., Ltd. is collected. The information of the linked URL is extracted from the collected data. The data at pages 711 to 7nl on the linked destination side is collected from the page 702 on the basis of the extracted link information. The information of the linked URL is extracted from those pages. By repeating the processes for collecting the page data of further lower hierarchical layers on the basis of the extracted link information, the data as much as four hierarchical layers is collected. After completion of the collection of the data, the connection to the provider CA 02242~04 l998-07-06 111 through the public line 112 is disconnected. The data collected by the information collection means 107 is held in the memory means 108.
The data corresponding to the information which is needed by the user is extracted by the extraction means 109. In the extracting process, with reference to the extraction condition setting information, the data held in the memory means 108 is discriminated to see whether a keyword of "printing device", a synonym, or an analogue is included in the whole text or not on a page unit basis. If there is a page including the keyword, synonym, or analogue, the data of this page is stored into the storage means 105.
After the data was stored, the user displays the data stored in the storage means 105 to display means (not shown) and confirms the contents of the data.
The data stored in the storage means 105 will now be described with reference to Fig. 8. Fig. 8 is a diagram conceptually showing the data stored in the storage means of the network information searching apparatus of Fig. 1.
As shown in Fig. 8, page data 9101, 9102, and 9103 extracted by the extracting process is stored in the storage means 105. The page data 9101 is the data of the page having index information of the extraction result. The page data 9102 is the data of the page corresponding to the page 718 shown in Fig. 7. The CA 02242~04 1998-07-06 page data 9103 is the data of the page corresponding to the page 7n4 shown in Fig. 7. Therefore, it is sufficient for the user to confirm the page data 9102 and 9103. It will be understood that the work can be remarkably reduced as compared with a work to confirm the pages of the lower hierarchical layers in which the page 702 in Fig. 7 is the top hierarchy.
A processing procedure in the network information searching apparatus 101 will now be described with reference to Fig. 10. Fig. 10 is a flowchart showing a processing procedure in the network information searching apparatus of Fig. 1.
First, the processing routine is started in step S1. The "start" of the processing routine denotes that the information collecting process by the information collection means 107 is started. As a starting method, there is a start by the timer start means 106 or a start by the operation of the input means 104.
In step S2, the collecting process of the network information by the information collection means 107 is subsequently executed. In the collecting process, the data under the point serving as a root is collected up to the set hierarchical layer while repeating processes for collecting the data at a point serving as a certain root on the basis of the access setting information, extracting the link destination from the collected data, and further collecting the data on the extracted CA 02242~04 l998-07-06 link destination. The "point serving as a root" here denotes a page shown by the URL in the WWW system, a directory of the file system in the file server system on the network, or a mail address in the mail system.
In step S3, the data which is needed by the user is extracted by the extraction means 109. The extraction condition which is used at the time of data extraction is held as extraction condition setting information in the storage means 105 as mentioned above. As an extracting method of the data, there is an extraction of data including the image set by the user, an extraction of data of the type set by the user, an extraction of data including the data set by the user, an extraction by the keyword set by the user, an extraction by the synonym of the keyword set by the user, or an extraction by the analogue of the keyword set by the user.
The extraction of data of the type set by the user denotes that, for example, in case of the data of the page under the management of the WWW system, only the data of the designated type is extracted from the page.
As a type of data, there is text data, image data, audio data, or binary data.
The extraction of data including the data set by the user denotes that, for example, in case of the data of the page under the management of the WWW system, all of the data of the page including the data of the CA 02242~04 1998-07-06 designated type is extracted from the page, and that in case of the file, the file itself including the data of the designated type is extracted.
The extraction by the keyword, synonym of the keyword, or analogue of the keyword denotes that on the basis of the keyword and its synonym or analogue, the search is performed for the title of the data, file name, URL name, and the whole sentences of the text as targets, and the data including the keyword and its synonym or analogue is extracted. Whether the extracted data is congruous with the search condition or not is determined by designating by the user after the data was extracted.
In step S4, the extracted data is outputted. In the data output, the extracted data is outputted to the output destination designated by the output destination designation included in the extraction condition setting information. As an output destination, it is possible to set the storage means 105, printing device, display means, arbitrary FAX number, arbitrary mail address, or the like.
In step S5, the confirmation of the user for the outputted information is performed. The processing routine is finished.

(b) Second Embodiment In step S2 mentioned above, the process has been CA 02242~04 1998-07-06 performed on the assumption of the case where the information such as server address, URL, or the like in which the point serving as a root exists has been set by the access setting information, namely, the case where the user knows the information of the server address, URL, or the like as a prerequisite. However, processes in the case where the user does not know such information will now be described with reference to Fig. 11. Fig. 11 is a flowchart showing a procedure for processes for searching a server program operating on the server computer on the network in the network information searching apparatus of Fig. 1.
First in step S21, a response request is issued to each of the server computers 103(m) on the network 102.
In step S22, a check is made to see if there is a response to the response request from each server computer 103(m). If there is a response, step S23 follows and a request is issued to the server on the server computer 103(m) which responded. In step S24, a check is made to see if there is a response to the request from the corresponding server. If there is a response to the request from the corresponding server, the processing routine advances to a process to search the data which is needed by the user while using the server as a root. If the target data does not exist in the hierarchical data constructed under the dominant of the root, the processing routine is returned to step CA 02242~04 1998-07-06 S21 and the process to search the next server is performed.
The order of the server computers 103(m) to generate the response request in step S21 is determined by preferentially using the method selected by the user among the well-known method of preferentially using the typical address, a method of preferentially using the address near the address of the user, and a method of accessing at random.

(c) Third Embodiment:
In step S2, a processing procedure in case of collecting the information having directivity will now be described with reference to Fig. 12. Fig. 12 is a flowchart showing the processing procedure in case of collecting the information having directivity in the network information searching apparatus of Fig. 1.
First in step S31, the data as much as the hierarchical layers set in the access setting information is collected. In step S32, a grade congruous with the extraction condition is numerically evaluated.
In step S33, the hierarchical data of data located under the page having a high grade congruous with the extraction condition is collected by only an amount of the directivity collection hierarchical layers set in the access setting information. In step S34, a check CA 02242~04 1998-07-06 is made to see if an end condition has been satisfied.
If the end condition is not satisfied, the processing routine is returned to step S32. Processes from step S32 are repetitively executed by the end condition is satisfied. When the end condition is satisfied, the processing routine is finished.
The collection of the information having the directivity will now be described with reference to Fig. 9 with respect to an example in the case of collecting the data from the WWW system. Fig. 9 is a diagram for explaining an example of the directivity data collection from the WWW system in the network information searching apparatus of Fig. 1.
In Fig. 9, in the case where the hierarchical layer to collect the data in a lump is labelled as "1"
while using a page 10101 as a root, first, the data of pages 10102 to 10104 of the hierarchical layers under the page 10101 is collected, a grade congruous with the extraction condition is numerically evaluated, and it is assumed that the page in which the congruous grade is the highest is the page 10103. In this instance, the data of pages 10106 to 10108 under the page 10103 is collected. Similar processes are repeated. When it is assumed that the pages in which the grade congruous with the extraction condition is high are the page 10106 among the pages 10106 to 10108 and a page 10202 among pages 10202 to 10204, the pages included in the CA 02242~04 l998-07-06 range of a page 11111 to be collected are collected.

(d) Fourth Embodiment:
In step S3, the case of extracting the image data having the designated characteristics will now be described with reference to Fig. 13. Fig. 13 is a flowchart showing a processing procedure for extracting the image data having designated characteristics in the network information searching apparatus of Fig. 1.
First in step S41, the image data is extracted from the information collected in step S2 ( shown in Fig. 10). In step S42, characteristics of the extracted image data are extracted. In step S43, by comparing the extracted characteristics of the image with the image extraction information (shown in Fig. 6) set by the user, the image data having the characteristics of the image data which is needed by the user is extracted.
As characteristics of the image which is set by the user, there are an outline and a color such as human object, house, or the like solely having certain characteristics in case of an outline, scenery characteristics such as mountain, plain, sea, four seasons, or the like in case of a scene, and the like.
As mentioned above, in the network information searching apparatus 101, the data held by the server system of each server computer 103(m) is collected CA 02242~04 l998-07-06 through the network 102. The data corresponding to the data required by the user is extracted from the collected data. The extracted data is outputted to the set output destination. Therefore, the data which is needed by the user can be searched from the data managed and held by the server system of each of the server computers 103(m) on the network 102 without forcing surplus labor and time to the user.
Data can be automatically collected in a time zone when the network is free, the communication time can be reduced, and the communication costs can be reduced.
Further, since the extracted data is outputted to the designated output destination, for example, it is possible to easily set a mode to print and output the pages which are seen everyday by a predetermined time in the morning, a mode to output them as a mail to himself, a mode to transmit them to an FAX apparatus of a predetermined address, or the like. The extracted information can be confirmed in a desired output form 20 of the user.
In the embodiment, although the example in which the network information searching apparatus 101 is constructed as a single apparatus has been shown, the network information searching apparatus 101 can be also constructed by a personal computer. In this case, there is used a personal computer comprising at least:
a keyboard or a mouse constructing the input means 104;

CA 02242~04 1998-07-06 a display; an external memory device such as hard disk or the like constructing the storage means 105 or memory means 108; and an interface to connect to the provider 111 through the public line. A network information searching program including an information collecting module constructing the information collection means 107, an information holding module for holding the collected data into the external memory device, an extracting module constructing the extraction means 109, and an output module constructing the output means 110 is stored in the external memory device. By reading out and executing the network information searching program as necessary, the network information searching apparatus is constructed. The network information searching program is supplied by a memory medium such as FD (floppy disk), CD-ROM, or the like.
As described above, according to the first to fourth embodiments, the information which is required by the user can be searched from the information managed and held by the information processing program of a plurality of information processing apparatuses on the network without forcing surplus labor and time to the user.
The data can be automatically collected in a time zone when the network is free, the communication time can be reduced, and the communication costs can be CA 02242~04 1998-07-06 decreased.
It is also possible to easily connect to the information processing apparatus in which the connection is permitted from a plurality of information processing apparatuses on the network.
The information required by the user can be properly extracted.
It is possible to easily set the mode to print and output the pages which are seen everyday by a predetermined time, the mode to output them as a mail to himself, the mode to transmit them to the FAX
apparatus of a predetermined address, or the like. The extracted information can be confirmed in a desired output form of the user can be confirmed.

Claims (25)

1. A network information searching apparatus which is connected through a network to a plurality of information processing apparatuses on the network and is used to search information required by a user from information stored in each memory means in each of said information processing apparatuses, comprising:
information collection means for collecting the information stored in each of said memory means through said network;
information holding means for holding the information collected by said information collection means;
extraction means for extracting information corresponding to the information required by said user from the information held in said information holding means; and output means for outputting the information extracted by said extraction means and corresponding to the information required by said user.
2. An apparatus according to claim 1, further comprising timer start means for starting said information collection means at a start time set by said user.
3. An apparatus according to claim 1, wherein said information collection means has detection means for detecting an address of the information processing apparatus in which a connection is permitted for said user among said information processing apparatuses and an information management system of said address.
4. An apparatus according to claim 1, wherein each of said memory means is controlled by an information management program such as World Wide Web server program, file server program for a file server system, or the like.
5. An apparatus according to claim 1, wherein said information collection means collects the information stored in each of said memory means with directivity for preferentially tracing information concerned with the information required by said user at a predetermined probability.
6. An apparatus according to claim 1, wherein when the information required by said user is image data, said extraction means extracts image data from the information held in said information holding means, extracts characteristics of said extracted image data, compares said extracted image data with characteristics of the image data required by said user, and extracts image data corresponding to the image data required by said user in accordance with a result of said comparison.
7. An apparatus according to claim 1, wherein said extraction means analyzes a data type of the information held in said information holding means, compares a result of said analysis with a data type of the information required by said user, and extracts information corresponding to the information required by said user in accordance with a result of said comparison.
8. An apparatus according to claim 1, wherein said extraction means analyzes a data type of the information held in said information holding means, compares a result of said analysis with a data type of the information required by said user, and extracts information including information of a data type corresponding to the data type of the information required by said user in accordance with a result of said comparison.
9. An apparatus according to claim 1, wherein said extraction means analyzes an attribute of the information held in said information holding means, compares a result of said analysis with an attribute of the information required by said user, and extracts information corresponding to the information required by said user in accordance with a result of said comparison.
10. An apparatus according to claim 1, wherein said extraction means performs a search to the information held in said information holding means on the basis of a keyword included in a text portion of said information and a synonym or an analogue of said keyword and extracts information corresponding to the information required by said user by said search.
11. An apparatus according to claim 1, wherein said extraction means extracts information corresponding to the information required by said user and forms an index of information concerned in an extracting step of said extracted information, and said output means outputs said index formed together with the information extracted by said extraction means.
12. An apparatus according to claim 1, wherein said output means outputs said extracted information to an apparatus which can output said extracted information in a desired output form of said user in accordance with a selection of said user.
13. A network information searching method of a network information searching apparatus which is connected through a network to a plurality of information processing apparatuses on the network and is used to search information required by a user from information stored in each memory means in each of said information processing apparatuses, comprising:
an information collecting step of collecting the information stored in each of said memory means through said network;
an information holding step of holding said collected information into information holding means;
an extracting step of extracting information corresponding to the information required by said user from the information held in said information holding means; and an output step of outputting said extracted information corresponding to the information required by said user.
14. A method according to claim 13, further comprising a step of instructing so as to start the collection of the information stored in each of said memory means at a start time set by said user.
15. A method according to claim 13, wherein each of said memory means is controlled by an information management program such as World Wide Web server program, file server program for a file server system, or the like.
16. A method according to claim 13, wherein in said information collecting step, the information stored in each of said memory means is collected with directivity for preferentially tracing information concerned with the information required by said user at a predetermined probability.
17. A method according to claim 13, wherein in said extracting step, when the information required by said user is image data, image data is extracted from the information held in said information holding means, characteristics of said extracted image data are extracted, said extracted image data is compared with characteristics of the image data required by said user, and image data corresponding to the image data required by said user is extracted in accordance with a result of said comparison.
18. A method according to claim 13, wherein in said extracting step, a data type of the information held in said information holding means is analyzed, a result of said analysis is compared with a data type of the information required by said user, and information corresponding to the information required by said user is extracted in accordance with a result of said comparison.
19. A method according to claim 13, wherein in said extracting step, a data type of the information held in said information holding means is analyzed, a result of said analysis is compared with a data type of the information required by said user, and information including information of a data type corresponding to the data type of the information required by said user is extracted in accordance with a result of said comparison.
20. A method according to claim 13, wherein in said extracting step, an attribute of the information held in said information holding means is analyzed, a result of said analysis is compared with an attribute of the information required by said user, and information corresponding to the information required by said user is extracted in accordance with a result of said comparison.
21. A method according to claim 13, wherein in said extracting step, a search is executed to the information held in said information holding means on the basis of a keyword included in a text portion of said information and a synonym or an analogue of said keyword and information corresponding to the information required by said user by said search is extracted.
22. A method according to claim 13, wherein in said extracting step, information corresponding to the information required by said user is extracted and an index of information concerned in an extracting step of said extracted information is formed, and in said output step, said formed index is outputted together with the information extracted by said extracting step.
23. A method according to claim 13, wherein in said output step, said extracted information is outputted to an apparatus which can output said extracted information in a desired output form of said user in accordance with a selection of said user.
24. A storage medium which is connected through a network to a plurality of information processing apparatuses on the network and in which a network information searching program to search information required by a user from information stored in each of memory means arranged in each of said information processing apparatus has been stored, wherein said network information searching program comprises computer-readable program codes of:
executing an information collecting of collecting the information stored in each of said memory means through said network;
executing an information holding of holding said collected information into information holding means;
executing an extracting of extracting information corresponding to the information required by said user from the information held in said information holding means; and executing an output of outputting said extracted information corresponding to the information required by said user.
25. A network information searching system constructed by a plurality of information processing apparatuses and a network information searching apparatus which are mutually connected through a network, wherein said network information searching apparatus comprises:
information collection means for collecting information stored in each memory means in each of said information processing apparatuses through said network;
information holding means for holding the information collected by said information collection means;

extraction means for extracting information corresponding to the information required by said user from the information held in said information holding means; and output means for outputting the information extracted by said extraction means and corresponding to the information required by said user.
CA002242504A 1997-07-08 1998-07-06 Network information searching apparatus and network information searching method Expired - Fee Related CA2242504C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP9196433A JPH1125125A (en) 1997-07-08 1997-07-08 Network information retrieving device, its method and storage medium
JP9-196433 1997-07-08

Publications (2)

Publication Number Publication Date
CA2242504A1 CA2242504A1 (en) 1999-01-08
CA2242504C true CA2242504C (en) 2002-02-26

Family

ID=16357757

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002242504A Expired - Fee Related CA2242504C (en) 1997-07-08 1998-07-06 Network information searching apparatus and network information searching method

Country Status (5)

Country Link
US (1) US6163804A (en)
JP (1) JPH1125125A (en)
AU (1) AU754845B2 (en)
CA (1) CA2242504C (en)
GB (1) GB2328050B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6427165B1 (en) * 1998-11-18 2002-07-30 Gateway, Inc. Method and apparatus for information retrieval from a network using parameter value sampling
JP3418563B2 (en) * 1999-01-20 2003-06-23 昇作 川合 Network communication system
JP2001075859A (en) * 1999-08-31 2001-03-23 Just Syst Corp Device for cyclic acquiring information
JP4225703B2 (en) * 2001-04-27 2009-02-18 インターナショナル・ビジネス・マシーンズ・コーポレーション Information access method, information access system and program
AUPR914601A0 (en) * 2001-11-27 2001-12-20 Webtrack Media Pty Ltd Method and apparatus for information retrieval
US7072883B2 (en) * 2001-12-21 2006-07-04 Ut-Battelle Llc System for gathering and summarizing internet information
JP2003271169A (en) * 2002-03-19 2003-09-25 Nec Corp Information speech system, information speech method and information speech program
AU2003278006A1 (en) * 2002-10-15 2004-05-04 Creo Inc. Automated information management system and methods
US20050010556A1 (en) * 2002-11-27 2005-01-13 Kathleen Phelan Method and apparatus for information retrieval
JP2005310008A (en) * 2004-04-23 2005-11-04 Fuji Xerox Co Ltd Device and method for synchronizing tree structure data
US8661102B1 (en) * 2005-11-28 2014-02-25 Mcafee, Inc. System, method and computer program product for detecting patterns among information from a distributed honey pot system
US20080195590A1 (en) * 2007-02-08 2008-08-14 Mitsuo Nakamura Network device, image forming device, and data searching method

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5717913A (en) * 1995-01-03 1998-02-10 University Of Central Florida Method for detecting and extracting text data using database schemas
EP0807291B1 (en) * 1995-01-23 2000-01-05 BRITISH TELECOMMUNICATIONS public limited company Methods and/or systems for accessing information
US5802292A (en) * 1995-04-28 1998-09-01 Digital Equipment Corporation Method for predictive prefetching of information over a communications network
US5659732A (en) * 1995-05-17 1997-08-19 Infoseek Corporation Document retrieval over networks wherein ranking and relevance scores are computed at the client for multiple database documents
US5931907A (en) * 1996-01-23 1999-08-03 British Telecommunications Public Limited Company Software agent for comparing locally accessible keywords with meta-information and having pointers associated with distributed information
US5828837A (en) * 1996-04-15 1998-10-27 Digilog As Computer network system and method for efficient information transfer
US5752242A (en) * 1996-04-18 1998-05-12 Electronic Data Systems Corporation System and method for automated retrieval of information
EP0898754B1 (en) * 1996-05-20 2003-07-09 BRITISH TELECOMMUNICATIONS public limited company Information retrieval in cache database
US5893091A (en) * 1997-04-11 1999-04-06 Immediata Corporation Multicasting with key words

Also Published As

Publication number Publication date
CA2242504A1 (en) 1999-01-08
US6163804A (en) 2000-12-19
GB9814612D0 (en) 1998-09-02
JPH1125125A (en) 1999-01-29
GB2328050A (en) 1999-02-10
AU7504098A (en) 1999-01-21
GB2328050B (en) 2002-07-10
AU754845B2 (en) 2002-11-28

Similar Documents

Publication Publication Date Title
US7099861B2 (en) System and method for facilitating internet search by providing web document layout image
US6006217A (en) Technique for providing enhanced relevance information for documents retrieved in a multi database search
US6161124A (en) Method and system for preparing and registering homepages, interactive input apparatus for multimedia information, and recording medium including interactive input programs of the multimedia information
US6237006B1 (en) Methods for graphically representing web sites and hierarchical node structures
US6651065B2 (en) Search and index hosting system
CN100485603C (en) Systems and methods for generating concept units from search queries
US6947924B2 (en) Group based search engine generating search results ranking based on at least one nomination previously made by member of the user group where nomination system is independent from visitation system
JP4846922B2 (en) Method and system for accessing information on network
JP3987133B2 (en) Search hypertext information using profiles and topics
US8739027B2 (en) Methods and apparatus for enabling use of web content on various types of devices
US6470383B1 (en) System and methods for generating and displaying web site usage data
US6721729B2 (en) Method and apparatus for electronic file search and collection
US5958008A (en) Software system and associated methods for scanning and mapping dynamically-generated web documents
US8276065B2 (en) System and method for classifying electronically posted documents
CA2242504C (en) Network information searching apparatus and network information searching method
JP2003173280A (en) Apparatus, method and program for generating database
KR20010009983A (en) Method for extracing web information and the apparatus therefor
JP2002099568A (en) Www server having function of automatically generating book mark for personal use
US20060116992A1 (en) Internet search environment number system
KR100390855B1 (en) method and system of service providing on internet
WO2007073097A1 (en) Method and system for sorting/searching file and record media therefor
JP2005056371A (en) Management method and system for web retrieval information, and computer software program
CN102449609A (en) Browsing information gathering system, browsing information gathering method, server, and recording medium
JP4879612B2 (en) Annotation management device, web display terminal, annotation management method, and web display method
JP2965018B2 (en) Search information display method and search information display device in hypermedia system

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 20160706