US20090164418A1 - Retrieval system and method of searching information in the Internet - Google Patents

Retrieval system and method of searching information in the Internet Download PDF

Info

Publication number
US20090164418A1
US20090164418A1 US11/959,501 US95950107A US2009164418A1 US 20090164418 A1 US20090164418 A1 US 20090164418A1 US 95950107 A US95950107 A US 95950107A US 2009164418 A1 US2009164418 A1 US 2009164418A1
Authority
US
United States
Prior art keywords
information
internet
retrieval system
web
automatic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/959,501
Inventor
Valentina Pulnikova
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US11/959,501 priority Critical patent/US20090164418A1/en
Publication of US20090164418A1 publication Critical patent/US20090164418A1/en
Priority to US13/076,688 priority patent/US9524341B2/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • This invention relates, in general, to retrieval systems and methods of searching information in the Internet.
  • the Internet is one of the main sources of information for people together with TV, the radio, newspapers, books, magazines and other kinds of press products.
  • Web sites The main part of information in the Internet is present in the form of Web sites, which are stored on the numerous network servers.
  • Retrieval systems are used for the search information in the Internet. These are Google.com, Yahoo.com, Search.com, Rambler.ru and others. Web sites are registered on retrieval systems. Web sites specified URL addresses and key words for Web sites in general and for separated Web pages. This information is stored in database of server of retrieval system.
  • a user In order to find needed information, a user has to fill any key words in the specified field in a retrieval system.
  • the search for information is performed on the basis of said key words.
  • the search for information is implemented with the help of special searching programs that retrieve relevant key words in databases of the retrieval system and provide corresponding links to the accessible Web sites and/or Web pages in the Internet.
  • the collected information is stored on the server of the retrieval system in the form of a list of URL addresses of Web sites and Web pages corresponding to key words specified by the user.
  • the user normally sees on a screen of his computer a portion of the collected information, i.e. a list with 10-20 URL addresses out of the total number of the Web sites found by the retrieval system. Then user can get an access to any Web site and/or Web page with the help of the browser by selecting a corresponding URL address provided by the retrieval system.
  • a flow of unnecessary information slows down operation of local and global computer networks increases demands for extra space on hard disks of servers of retrieval systems, puts additional requirements on improvement of searching programs based on analysis of key words and causes inefficient usage of other material and human resources.
  • a special skill is needed in the selection of key words in order to find required information.
  • a change of the order of key words a change of the search phrase often affects the search result. If key words have homonyms one can get information for needed and not needed significances of these key words.
  • catalogues are available at Google.com, Yahoo.com, Apport.ru and others. These catalogues have a small numbers of the main categories (generally less than 20). But this is insufficient for the existing amount of information available in the Internet and does not solve the problem of increasing the quality of the search of information in the Internet. These catalogues typically include the following categories: computers, work, education, house, society, entertainment, recreation, sport, manufacture, business, Internet for kids, mass media, inquiries and so on. Obviously, retrieval systems make attempts to classify information on edutainment and entertainment, as this kind of information seems more popular among the users of the Internet on opinion retrieval systems. However, all the information available in the Internet must be classified including information required for scientists, politicians, students and others.
  • a user can find needed information. This information is searched in the following way. The user selects the descriptor phrase. Then the number of this descriptor phrase is searched in the Composite Catalogue. The desired information is searched using this number. An alphabet sorting and sorting on the basis of the level in the catalogue are proposed in this patent. Only the use of specified descriptor phrases is proposed to use in said patent. Arbitrary descriptor phrases cannot be used for search in this patent. This limits freedom and capability of searching. Moreover, the proposed retrieval system does not deal with search information in the Internet.
  • An application of any Classification of Information in the Internet for categorizing and searching of information in the Internet can solve existing problems.
  • the new classification of the information in the Internet could be represented as a catalogue, similar to the classification in the librarianship. Of course, such classification would have to be adapted to the needs and specifics of the Internet.
  • An every division and a subdivision of the catalogue cover a certain field of information. For users' comfort, a brief characteristic has to be provided for an every division and subdivision of the catalogue.
  • An every division and subdivision of the catalogue must have a specific code.
  • the classification must have a possibility of evolution and take into account all possible future changes in the world information system and in the Internet.
  • the retrieval system and the method of searching information in the Internet are proposed in this invention.
  • the algorithm of allocating information about Web sites in the database of the retrieval system and the algorithm of searching information are based on the Global Classification of Information in the Internet.
  • This Global Classification of Information in the Internet is classification modified and adopted to conditions of the Internet and covering all the known forms of information in the Internet.
  • a supplier of information fills in an application form and inserts therein the following data: an URL address of the Web site, a name of the owner of the Web site, a home or an official address of the owner of the Web site, the name of a division of the Global Classification Information in the Internet relevant to the information presented in the Web site, key words which completely characterize information presented in the Web site, a kind of information, an author of information, country where the Web site is situated, a language of information, free information or information to be paid for, a free access to information or a registration is required, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • Data files created during the registration procedure and/or data files created during an update of the data on Web sites are sorted according to the codes of the Global Classification Information in the Internet and allocated in the corresponding parts of the database containing information on the Web sites registered in the retrieval system.
  • a searcher of information can find required information by searching through the tree of the Global Classification of Information in the Internet.
  • the searcher of information can define in which division or a subdivision of the Global Classification of Information in the Internet does he have to look for information. Then he inserts the name of this division into a search window of a browser of the retrieval system and begins the search.
  • the second way of search is based on key words specified by the searcher of information.
  • the retrieval system presents the user a list of divisions of the Global Classification of Information in the Internet where said key words match key words provided by suppliers of information.
  • the searcher of information has to choose a name of division of the Global Classification of Information in the Internet corresponding to his/her key words and insert the name of this division into a search window of a browser of the retrieval system and start the search.
  • the retrieval system provides a list of addresses of Web sites and Web pages stored in the database and relevant to the selected division of the Global Classification of Information in the Internet.
  • sorting and selecting of search results is provided in the retrieval system. This will provide additional comfort to users.
  • the retrieval system creates data files for an every Web site. A part of information from these files will be used as a criterion for sorting and selecting of search results.
  • FIG. 1 is a picture showing a structure of a retrieval system
  • FIG. 2 is a block diagram of a method of searching of information in the Internet
  • FIG. 3 is a block diagram showing registration procedure
  • FIG. 4 is a picture showing an example of an application form
  • FIG. 5 is a picture showing a process of choosing a required division from the Global Classification of Information in the Internet
  • FIG. 6 is a block diagram showing a process of forming the database of information on registered Web sites on the server of the Retrieval System
  • FIG. 7 is a block diagram showing a process of searching information
  • FIG. 8 is a block diagram showing a process of automatic sorting and selecting
  • FIG. 9 is a block diagram showing a process of stepwise sorting and selecting
  • FIG. 10 is a block diagram showing a procedure of sorting and selecting on the basis of key words
  • FIG. 11 is a picture showing organization of a data transfer for automation systems
  • FIG. 12 is a block diagram showing organization of a data transfer for automation systems.
  • a retrieval system wherein acquisition, storing and searching for information is build on the Global Classification of Information in the Internet (GCII), is proposed in the present invention.
  • the Global Classification of Information in the Internet is a classification, modified and adopted to conditions of the Internet classifying all known information presented in the Internet and covering all the forms of human activity: a material production, trade, science, education, history, culture and so on and represent the properties and characteristics of outward things, the nature, the animals and vegetal.
  • the Global Classification of Information in the Internet represents by itself a hierarchical tree with names and corresponding numbers of all divisions and subdivisions. Further the Global Classification of Information in the Internet include specific characteristics of information for every division and subdivision, which will use for future sorting and selecting of any retrieved information.
  • the Global Classification of Information in the Internet is classification, which will be in progress constantly and will consider all changes occurred in the Internet.
  • the retrieval system designed for searching of information in the Internet comprises a server of retrieval system ( 1 ) interconnected with the Internet including multilingual Web site of retrieval system with searching programs ( 2 ), database of the Global Classification of Information in the Internet ( 3 ) and the other databases of the retrieval system ( 4 ).
  • the retrieval system also comprises numerous network servers ( 5 ) wherein information belonging to information suppliers is stored, and numerous computers of users ( 6 ) interconnected with the Internet.
  • the users of retrieval system having an access to the Internet can either a as suppliers of information or searchers of information or both.
  • the method of searching of information in the Internet comprises a registration procedure of Web sites ( 1 ) by information suppliers, a procedure of building the database of information about Web sites ( 2 ) registered in retrieval system, including procedures of building the information database of the retrieval system and updating and adding information to the database of the retrieval system, a procedure of searching information ( 3 ) and procedure of sorting and selecting the search results ( 4 ).
  • An information supplier has to complete the registration procedure ( FIG. 3 ) in order to have information about his/her Web site included into the database of the retrieval system.
  • the information supplier has to enter the registration page of retrieval system and fill in an application form ( 1 . 1 ).
  • the application form includes the following data ( FIG. 3 , FIG.
  • an URL address of the Web site a name of the owner of the Web site, a home or an official address of the owner of the Web site, an e-mail address, a name of a division of the Global Classification of Information in the Internet relevant to the Web site, key words which completely characterize information presented on the Web site, a kind of information, an author of information, country, where the Web site is situated, a language of information, free information or information to be paid for, a free access to information or an access for the registered users only.
  • the information supplier also wishes to register separate Web pages of the Web site, he/she can fill into the application form additional data on these Web pages comprising: an URL address of the Web page, a name of a division of the Global Classification of Information in the Internet relevant to information on the Web page, key words completely characterizing information presented on the Web page, a kind of information, an author of information, a country and a language of information. Further the information supplier can choose any characteristics, which are specific for selected division of the Global Classification of Information in the Internet. These characteristics include in every division and subdivision of the Global Classification of Information in the Internet.
  • the information supplier registers Web site with information about a yacht and selects division “Motor yachts”, he/she can specify a tonnage of ship, a vessel speed and so on. Specified information will use for future sorting and selecting also.
  • the retrieval system will propose a hierarchic tree of the classification system ( FIG. 5 ) for the choice of a division of the Global Classification of Information in the Internet during registration procedure. Choosing an end division in a selected branch of the tree of the classification system would be the best option for an information supplier, because searchers of information would also most probably choose an end division of the classification for searching. After selecting a required division of the Global Classification of Information in the Internet the information supplier has to click or press on the name of the required division of the Global Classification of Information in the Internet. Then the name of this division will appear in the right field of the application form and the corresponding number of this division will be written into a determined place in the data file of the Web site.
  • the retrieval system will propose different kinds of information in accordance with the Classification of Kinds of information, for example: news, advertisements, announcements, scientific information, information of electronic shops and so on.
  • the information supplier should choose a suitable kind of information and the corresponding name of kind of information would appear in the right field of the application form and the corresponding number of this kind of information would have been written into the determined place of the data file of the Web site.
  • the retrieval system After the application form is filled, the retrieval system will create a data file of the Web site or the Web page and allocate it into an intermediate database. Then the retrieval system will evaluate the Web site ( 1 . 2 , FIG. 3 ) with respect to readability of this site by Internet browsers, with respect to compliance of the Web site to the general aspects of the Web technology, national and international legal regulations and so on. Eventually, the user gets a message confirming a successful evaluation ( 1 . 6 ) or a message with a request urging to correct and improve the Web site with a list of detected errors ( 1 . 4 ).
  • the retrieval system creates a memory space for the user wherein the information previously provided during the registration procedure would be stored ( 1 . 6 ).
  • the retrieval system will ask the user to pay a registration fee ( 1 . 7 ). In case if no payment is received during a specified period, the retrieval system will delete the memory space of the user ( 1 . 9 ). In case if the registration fee is paid, the retrieval system will send the user a message about successful completion of the registration procedure ( 1 . 10 ). Whereat the information supplier can check with the help of the retrieval system that the information about his/her Web site is situated in the database of retrieval system.
  • the existence of a payment of an information supplier for the service of the retrieval system switches mutual relation of the retrieval system and an information supplier to a frame of a contract relation.
  • the retrieval system is obliged to execute entered into an undertakings for the delivery of information by suppliers of information to searchers of information.
  • the information supplier can demand of improving of quality of provided by the retrieval system service, if this quality seems to him not enough high.
  • the retrieval system is obliged to respond for complaints of their partners.
  • the retrieval system will recommend information suppliers to provide a letter with a notarial acknowledgement of either he name and the home address of the owner of the Web site, if he/she is a private person, or a notarial acknowledgement of the name of the company and the official address of the owner of the Web site, if the owner of the Web site is a company or any other juridical person ( 1 . 11 ).
  • FIG. 6 A procedure of building the database of information on the Web sites and Web pages registered in the retrieval system is presented on FIG. 6 .
  • Data files, created during the registration procedure ( 2 . 2 ), or data files, created or modified during the process of an update of the data about Web sites and Web pages ( 2 . 3 ), are sorted in accordance with codes of the Global Classification of Information in the Internet ( 2 . 4 ) and placed into corresponding parts of the database of information on Web sites and Web pages ( 2 . 5 ) registered in the retrieval system.
  • Data storing in the database of the server of retrieval system ( 2 .
  • Web site or Web page for every registered Web site or Web page comprise: a code of information according to the Global Classification of Information in the Internet, a URL address of this Web site or Web page, key words concerning to main content of Web site or Web pages, a kind of information according to Classification of Kinds of information, a author of information, a country, where is situated Web site, a language of information, a free or fee-based information, a free access to information or needed registration procedure, a data characterizing an authenticity of information, a volume of information, a date registration or update of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • the registered information supplier can modify and add data about his information. He can correct characteristics of existing Web site or Web page comprising key words, a kind of information, an author of information, country, where is situated Web site, a language of information, a free or fee-based information, free access to information or needed registration procedure, volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet in case of necessity. He can add data about new Web pages of registering Web site comprising a choice of a division of the Global Classification of Information in the Internet to which this Web page are related on opinion of the information supplier.
  • He also can insert data about characteristics of new Web pages comprising: key words, a kind of information, an author of information, a country, where is situated Web site, a language of information, a free or feebased information, a free access to information or needed registration procedure, a volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • a searcher of information has two ways for finding required information ( FIG. 7 ).
  • the searcher can use the hierarchical tree of the Global Classification of Information in the Internet ( 3 . 3 ).
  • the searcher of information can define within which division or an end division of the Global Classification of Information in the Internet is the needed information located. Then the searcher of information has to insert this division or an end division into a searching window in the retrieval system and starts the search.
  • the retrieval system provides the user a list of addresses of Web sites and Web pages, which are stored in the database and correspond to the selected division of the Global Classification of Information in the Internet.
  • the second way of search is the search based on key words ( 3 . 4 ).
  • For searching the user can type any combination of key words in a searching window of the retrieval system ( 3 . 5 ).
  • the retrieval system give to the user a list of divisions of the Global Classification of Information in the Internet, where said key words, inserted by information suppliers during of registration, are occurred.
  • the searcher of information has to choose a division of the Global Classification of Information in the Internet, which to his opinion better correspondents to key words. Then he/she has to insert the name of these divisions in a searching window of the retrieval system and start searching.
  • the retrieval system will provide the user a list of addresses of Web sites and Web pages, which are stored in database and correspond to the selected division of the Classification of Information in the Internet.
  • the retrieval system will give a message that none of divisions contain said key words and suggest the searcher of information either to change key words or use the first way of searching ( 3 . 3 ).
  • a user can ask a question to the retrieval system in the case if some difficulties arise with determination of a required division of the Global Classification of Information in the Internet and determination of key words. Recommendations of the retrieval system for determination of key words both for searching of information and for the registration procedure will be freely available for users.
  • a searcher of information is familiar with any classification of information or he/she has been using divisions of the Global Classification of Information in the Internet for some time, he/she can directly insert the name of a required division in a searching window of the retrieval system ( 3 . 3 ). If the user not quit correct inserts a name of required division, the retrieval system will correct his and propose to him more correct name of this division.
  • the searcher of information will get a list of addresses of Web sites and Web pages, which are stored in the database and correspond to the selected division of the Global Classification of Information in the Internet.
  • the obtained list of results can be large and contain thousands of addresses of Web sites and Web pages. Not every user would be able to review all the information.
  • For user's comfort sorting and selecting of search results is offered in the proposed invention ( FIG. 8 , FIG. 9 ).
  • the retrieval system creates data files for every Web site and Web page after the registration procedure. A part of information from said data files will be used as criterions for sorting and selecting the search results.
  • This information comprises: a kind of information, author of information, a country, where Web site is situated, a language of information, free or to be paid for information is provided, free access or registration is needed, characteristics of information specific for selected division of the Global Classification of Information in the Internet, key words. Moreover, the date of registration or the date of the last update of information on the Web site or the Web page, a volume of information presented on the Web site or Web pages, data confirming authenticity of information are included into this information.
  • a searcher of information can use two kinds of sorting and selecting: an automatic sorting and selecting and a stepwise sorting and selecting.
  • the user has to choose criterions for sorting and selecting, define priorities for these criterions and choose key words ( 4 . 2 , FIG. 8 ).
  • the retrieval system will sort and select the search results in accordance with priorities defined for the selected criterions ( 4 . 3 ). That is, the retrieval system will sort first the addresses of Web sites and Web pages from the list of search results in accordance with a criterion having the first priority. After that, the retrieval system will select the addresses of Web sites and Web pages from the sorted list of search results in accordance with specified criterion.
  • the retrieval system will sort and select the obtained result on the basis of key words ( 4 . 4 ).
  • the user will get a list of addresses of Web sites and Web pages sorted in accordance with selected criterions and key words ( 4 . 5 ). If for any reason the obtained information does not satisfy the user, he can return to the beginning of sorting and selecting and sort and select the originally obtained list of search results in accordance with other criterions and other priorities set to these criterions. If obtained information satisfies the user, he can proceed further with the obtained list ( 4 . 7 ). The user can save the search results before or after sorting and selecting on hard disk of his computer by means of a corresponding service of the retrieval system.
  • stepwise sorting and selecting the user chooses one of criterions for the first sorting and selecting ( 4 . 1 . 2 ).
  • the searcher of information should insert required key words into corresponding fields in the retrieval system.
  • the retrieval system sorts and selects the search results in accordance with the selected criterion ( 4 . 1 . 3 ).
  • the retrieval system will perform sorting and selecting on the basis of the selected key words and find all the Web pages from obtained result of sorting and selecting, which correspond to the defined key words ( 4 . 1 . 4 ). If obtained information satisfies the user, he can further proceed with the results of searching, sorting and selecting ( 4 . 1 . 6 ).
  • the searcher of information can return to the beginning of the previous step and perform sorting and selecting using another criterion.
  • the searcher of information will perform sorting and selecting using the next criterion ( 4 . 1 . 8 ) and so on, until a suitable result is obtained. Thereat, the searcher of information uses key words inserted on the first sorting and selecting only for first step of sorting and selecting.
  • Two main methods are proposed for sorting and selecting on the basis of key words. If user chooses the first method ( 4 . 4 . 1 ), then the retrieval system proposes to the user to choose suitable key words from the list of key words situated in the database and corresponding to the selected division of the Global Classification of Information in the Internet. A list of key words will be presented in the alphabetical order using the first key word ( 4 . 4 . 2 , 4 . 4 . 3 ). After choosing suitable key words, the retrieval system will sort and select information accordingly ( 4 . 4 . 4 ). Then the retrieval system starts a program of searching on the basis key words in order to find all the Web pages of retrieved Web sites, which relevant to selected key words ( 4 . 4 . 5 ).
  • the second method of sorting and selecting on the basis of key words provides sorting and selecting on the basis of arbitrary key words typed in by the user ( 4 . 4 . 7 ).
  • the retrieval system After typing in arbitrary key words in a specified field of the retrieval system, the retrieval system checks whether said key words match keywords in the considered division of the database corresponding to the selected division of the Global Classification of Information in the Internet. In case of the occurrence of the same key words in said division of the database, the retrieval system sorts and selects information in accordance with these key words ( 4 . 4 . 10 ). Then the retrieval system starts a program of searching on the basis of key words in order to find all the Web pages of retrieved Web sites, which relevant to selected key words ( 4 . 4 .
  • the retrieval system will start a program of searching on the basis of said key words in order to find these key words in all retrieved Web pages and to choose information ( 4 . 4 . 13 ) relevant to these key words.
  • the retrieval system and the Global Classification of Information in the Internet can be used for coding and storing information in the Internet that could be used for automatic control, automatic data exchange and functioning of automatic systems ( FIG. 11 ). Such information could be stored on network servers. Data on this information will be stored in the database of the retrieval system. In this case the supplier of information dedicated for automatic systems will have to register this information accordingly in the retrieval system. The supplier of information will have to include the corresponding URL address, the name of the owner of information, an official address of the owner of information, an e-mail address, the name of the division of the Global Classification of Information in the Internet to which the provided information is related, a kind of information according to the Classification of Kinds of information.
  • Information for automatic control and operation must have a strictly defined search path. Therefore data on this information must be stored in the section of the database of the retrieval system corresponding to the end division of the Global Classification of Information in the Internet. Besides, this information must be of a specific type as described in the manual of the retrieval system. This could, for instance, be “information for automatic control”, “information for operation of automatic systems” and so on.
  • the retrieval system will sort the information for automatic control and operation in accordance with the Global Classification of Information in the Internet and store the acquired information in the part of the database corresponding to the division of the Global Classification of Information in the Internet.
  • the following data will be stored: the code of information according to the Global Classification of Information in the Internet, the code of the kind of information according to the Classification of Kinds of information, URL address, and additional information dedicated to the automatic control and operation.
  • the owner of the automatic system has to complete the registration procedure.
  • the owner of the automatic system has to specify the IP address used by the automatic system, the name of the owner of the automatic system, an official or a home address of the owner of automatic system, an e-mail address.
  • the owner of automatic system Upon completion the registration procedure, the owner of automatic system will acquire a password for entering the retrieval system.
  • the automatic system In order to activate the process of information interchange, the automatic system has to send a request to the retrieval system in the automatic mode ( FIG. 11 and item 12 . 1 in FIG. 12 ) specifying the user password.
  • This request will include the code of information according to the Global Classification of Information in the Internet, the code of the kind of information according to the Classification of Kinds of information, URL address of control information, an instruction file containing a description on how the information transfer must be implemented including a transfer protocol and other conditions for information transfer.
  • the specified information code points to a certain section in the Global Classification of Information in the Internet rather than to a specific reference within a section, the information retrieval system will search for the most suitable reference.
  • the retrieval system finds the mentioned information in its database. After the required or directly requested link is found, the instruction file is opened and analyzed. The retrieval system organizes the data transfer in accordance with the instruction file ( 12 . 2 FIG. 12 ). This could be establishing a connection with a server, reading the control information or any other information dedicated for automatic systems and terminating the connection. This could also be establishing a connection with an automatic control device through the server of the retrieval system and terminating connection with the server and so on ( 12 . 2 - 12 . 4 ).
  • An instruction file can contain commands for creating a control program out of different control programs distributed in the Internet.
  • the automatic system can use an intermediate service that, for instance, could allow data conversion from the format of the supplier of information into the format of the user of information and vice versa.
  • This way the retrieval system would be able not only to establish a direct connection between the suppliers of information and the users of information, but also to create more elaborate schemes and data transfer structures.
  • Information support for navigation systems of vehicles, ships or planes are examples of possible applications of data exchange between automatic systems.
  • Other examples for such a data exchange are information support for meteorological services, sophisticated motion control, control of machines, control of automatic production lines and so on.
  • the retrieval system and the Global Classification of Information in the Internet can be used for coding and storing information in the Internet that could be used for automatic collecting of information by means of autonomously operating programs as well as for information exchange between autonomously operating programs.
  • Automatic collecting of information implies the presence of certain standards for the representation of information.
  • a supplier of information In order to make information suitable for automatic collecting, a supplier of information must either create his Web site or Web page in the specified format, or provide additional files containing information dedicated for automatic collecting. Information dedicated for automatic collecting must be split into information elements, which could be used or considered separately.
  • the content of additional files as well as the format of the presented information must be in accordance with the considered division of the Global Classification of Information in the Internet.
  • Automatic collecting of information is an optional service.
  • a supplier of information is able to refuse from providing information for automatic collecting. If he wishes to make his information available for automatic collecting, he will have to complete an additional registration procedure. During this procedure he will have to provide a file created in accordance with the standard of retrieval system for considered division of the Global Classification of Information in the Internet. Instead of creating such a file himself, the user will be able to use a corresponding program of the retrieval system for creating such a file. During registration the retrieval system will check whether the file corresponds to the standard. If necessary some changes will be requested. In order to facilitate automatic generation of such files, the retrieval system will provide corresponding libraries, programs and services.
  • the retrieval system will conduct sorting of data on information dedicated for automatic collecting on basis of the Global Classification of information in the Internet.
  • the data on this information will be stored in the database of the retrieval system.
  • the searcher of information will have to send a request to the retrieval system including: the name or code of the division of the Global Classification of Information in the Internet, key words defining the content of information to be found, a code of a kind of information according to the Classification of Kinds of information, the language of information, characteristics of information specific for the considered division of the Global Classification of Information in the Internet.
  • the searcher of information can include in the request a necessary level of authenticity of information according to recommendation of the retrieval system.
  • the searcher of information can include in the request a date of publication of information or a preferred time range, a country where this information was published, an author or authors or the owner of information. If the user has an access to Web sites where registration is required, corresponding registration information could be provided in the request.
  • the searcher of information will have a possibility to record his actions in the retrieval system into a macro. He can also record preparation of a request for automatic collecting of information. The user will be able to modify macros.
  • the retrieval system will start a program for automatic searching and collecting of information.
  • This program will first find the links on the required Web sites and Web pages. These Web sites and Web pages must be available for automatic collecting of information. After that the program will conduct an analysis of information in the found Web sites and Web pages. Then the program will produce an output, for instance, in a form of file out of information elements collected from the found Web sites and Web pages with or without reference being provided depending on the user's choice.
  • Some examples for automatic collecting of information are: collecting news or facts on a certain event, collecting historical information, collecting information for educational or technical purposes and so on.
  • the searcher of information will be able to create his own program for searching or automatic collecting of information. He will be able to use this program without actually starting the retrieval system.
  • the user In order to be able to use the database of the retrieval system, the user has to be registered accordingly in the retrieval system. In this case the retrieval system will provide the user with a password for accessing the database of the retrieval system. This password must be included into the program for searching and/or automatic collecting of information. This service of the retrieval system will be provided for a certain fee.
  • corresponding examples will be provided along with modules and libraries written in all the popular programming languages.

Abstract

A retrieval system and a method of searching of information in the Internet are proposed. The algorithm of allocation of information about Web sites in the database of retrieval system and the algorithm of searching information are based on the Global Classification of Information in the Internet, which comprises all known forms of information in the Internet. The retrieval system designed for searching of information in the Internet comprise a interconnected with the Internet server of retrieval system including multi-language Web site of retrieval system with searching programs and database of retrieval system. The retrieval system comprise also plural network servers wherein information, that is belong to information suppliers, is stored, and plural computers of users interconnected with the Internet. The retrieval system also provides organization of a data transfer for automatic systems, and provides organization of an automatic collection of information. The users of retrieval system, computers of which interconnected with the Internet, can be as suppliers of information as searchers of information. The method of searching of information in the Internet comprise a procedure of registration of Web sites by information suppliers, a procedure of forming database of information about registered in retrieval system Web sites, including procedures of forming database of information of retrieval system and renovate and addendum information to database of retrieval system, a procedure of searching of information and procedure of sorting and selecting results of the search.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • This invention relates, in general, to retrieval systems and methods of searching information in the Internet.
  • 2. Description of the Prior Art
  • At present the Internet is one of the main sources of information for people together with TV, the radio, newspapers, books, magazines and other kinds of press products.
  • The main part of information in the Internet is present in the form of Web sites, which are stored on the numerous network servers. Retrieval systems are used for the search information in the Internet. These are Google.com, Yahoo.com, Search.com, Rambler.ru and others. Web sites are registered on retrieval systems. Web sites specified URL addresses and key words for Web sites in general and for separated Web pages. This information is stored in database of server of retrieval system.
  • In order to find needed information, a user has to fill any key words in the specified field in a retrieval system. The search for information is performed on the basis of said key words. The search for information is implemented with the help of special searching programs that retrieve relevant key words in databases of the retrieval system and provide corresponding links to the accessible Web sites and/or Web pages in the Internet. The collected information is stored on the server of the retrieval system in the form of a list of URL addresses of Web sites and Web pages corresponding to key words specified by the user. The user normally sees on a screen of his computer a portion of the collected information, i.e. a list with 10-20 URL addresses out of the total number of the Web sites found by the retrieval system. Then user can get an access to any Web site and/or Web page with the help of the browser by selecting a corresponding URL address provided by the retrieval system.
  • There are various algorithms of searching on the basis of key words used by retrieval systems. The common feature of these algorithms is that for some requests extremely long lists can be provided with hundreds, thousands and even millions of URL addresses, if according to the retrieval system there is any relation between the requested keywords and the provided URL addresses. For the available amount of information in the Internet, this situation is not uncommon. In most cases the user is unable to browse all the provided offered information. As experience shows, there is no need to browse all the provided URL addresses, because there are only a few tens or a few hundreds of addresses, which are truly related to what the user is looking for. The rest of information is in the most cases irrelevant to the request. This is variegated information from different branch of knowledge of people or from different field of activity of people and so on. Moreover, it is not always certain that the required information could be found on the first page of result of search, even. The above mentioned problem takes place, because a search via key words is based on mathematical algorithms, such as a comparison of requested key words with key words specified for or in Web sites, an estimation of a number of matches between the requested key words and the words in the title or in the text of Web pages, and so on. The search results on the basis of mathematical algorithms do not always represent the meaning of site's information. Therefore the user gets a huge amount of unnecessary information on his request. As the amount of information in the Internet steadily increases, this problem will worsen. The improvement of search algorithms operating on the basis of key words will not solve this problem, because identical key words can be situated in sources of information belonging to different branches of knowledge, different fields of people's activity and so on.
  • A flow of unnecessary information slows down operation of local and global computer networks increases demands for extra space on hard disks of servers of retrieval systems, puts additional requirements on improvement of searching programs based on analysis of key words and causes inefficient usage of other material and human resources.
  • A special skill is needed in the selection of key words in order to find required information. A change of the order of key words, a change of the search phrase often affects the search result. If key words have homonyms one can get information for needed and not needed significances of these key words.
  • Existent retrieval systems do not provide a sorting and selecting of obtained search results of searching on the basis of specified criterions.
  • Existent retrieval systems do not give any guarantee to owners of Web sites that their site would appear in the list of search result even if its content completely corresponds to the specified key words. Some retrieval systems apply mathematical methods for estimation to the specified key words. Some retrieval systems apply mathematical methods for estimation of popularity and ranking of Web sites, which gives a possibility for the Web sites with the highest rank to appear in the list of the first 10-20 URL addresses. For artificial increasing the rating of a Web site, some owners of Web sites create spam-Web sites, which increase number of references to needed Web sites. Some companies of Web designers elaborate and propose methods of increasing rating of Web sites. These measures not improve situation for searches of information.
  • Some retrieval systems attempt improving the quality of search of information by introducing catalogues. Catalogues are available at Google.com, Yahoo.com, Apport.ru and others. These catalogues have a small numbers of the main categories (generally less than 20). But this is insufficient for the existing amount of information available in the Internet and does not solve the problem of increasing the quality of the search of information in the Internet. These catalogues typically include the following categories: computers, work, education, house, society, entertainment, recreation, sport, manufacture, business, Internet for kids, mass media, inquiries and so on. Obviously, retrieval systems make attempts to classify information on edutainment and entertainment, as this kind of information seems more popular among the users of the Internet on opinion retrieval systems. However, all the information available in the Internet must be classified including information required for scientists, politicians, students and others.
  • There are a great number of patents devoted to the problem of the search of information in the Internet. The following patents are more relevant to the subject of the proposed invention.
  • In patent, U.S. Pat. No. 5,369,763 “Data storage and retrieval system with improved database structure” by Biles from 29th of November 1994, a system of storing and searching information, based on the modified Library of Congress of USA Classification System, is proposed for a local computer system. According to this patent, data on numerous topics and subjects are stored in the Subject Database. Descriptor phrases, associated with an every subject and topic, are introduced into this Data Base together with identifying information. Data based on a classification system are stored in the Typology Database. The Identification Database facilitates an access to the information stored in the Subject Database. Titles of topics, designation numbers and corresponding descriptor phrases, identification information from the Subject Database are stored in the Composite Catalogue. With the help of stored descriptor phrases related to a specific topic, a user can find needed information. This information is searched in the following way. The user selects the descriptor phrase. Then the number of this descriptor phrase is searched in the Composite Catalogue. The desired information is searched using this number. An alphabet sorting and sorting on the basis of the level in the catalogue are proposed in this patent. Only the use of specified descriptor phrases is proposed to use in said patent. Arbitrary descriptor phrases cannot be used for search in this patent. This limits freedom and capability of searching. Moreover, the proposed retrieval system does not deal with search information in the Internet.
  • In patent, U.S. Pat. No. 5,907,838 “Information search and collection method and system” by Miyasaka et al. from 25th of May 1999, the method of searching for information in the Internet based on object-oriented programming is proposed. According to the proposed method, properties are set for information units for each category of class and the method of data collection is described for each property. A user formulates his request for search of required information in terms of key words, which is transformed in a format understandable for the system. The request is then classified into the class category and information units are found according to the properties of the class, which are determined by the request of the user. This method is designed for collecting specific information in the Internet.
  • In patent, U.S. Pat. No. 6,233,575 “Multilevel taxonomy based on features derived from training documents classification using Fisher values as discrimination values” by Agrawal et al. from 15th of May 2001, the method is proposed for evaluation of large text documents on the basis of Fisher value and addition of these documents into a hierarchic structure. A topic path of hierarchic structure is used along with key words for the purpose of improving searching.
  • Unfortunately, the problem of searching of information in the Internet not finds a full solution in existing retrieval systems and in patents literature, at present time. In order to release of a searcher of information from browsing of large numbers of sites with unneeded information, a classification to a different direction of human activity and different branches of knowledge for registered in a retrieval system information needs. In this case, a searcher of information will get information not from all volume of information of the Internet, but from part of information that is interested for a user. There is a need in a search system and a method of searching information based on a global classification of information. Such a system would be capable of automatic classifying incoming information in accordance with the categories of information and retrieve information in accordance with these categories. This would be a solution for increasing the efficiency of a search for information.
  • At present, there are some library classifications of information available. These classifications exist next ages before. Within these classifications a successful system of classifying a large amount of existing information has been developed. Well-known examples of such classifications are the Library of Congress of USA Classification System, the Decimal Classification, the Bibliothecal-bibliographical Classification and others. The amount of information within a library is comparable with the amount of information in the Internet. Library Classifications are convenient and simple in usage. They are logical and understandable for users. Library Classifications are constantly improving and accommodate changes happening in the information world. Evidently, some Library Classifications of information can be used as an example for the development of Global Classification of Information in the Internet. An application of any Classification of Information in the Internet for categorizing and searching of information in the Internet can solve existing problems. The new classification of the information in the Internet could be represented as a catalogue, similar to the classification in the librarianship. Of course, such classification would have to be adapted to the needs and specifics of the Internet. There is a need in classifying additional sources of information, such as electronic shops, forums and others available only in the Internet. An every division and a subdivision of the catalogue cover a certain field of information. For users' comfort, a brief characteristic has to be provided for an every division and subdivision of the catalogue. An every division and subdivision of the catalogue must have a specific code. The classification must have a possibility of evolution and take into account all possible future changes in the world information system and in the Internet.
  • Therefore there is a need in a system and a method addressing the above-mentioned problems in the search of information in the Internet.
  • SUMMARY
  • The retrieval system and the method of searching information in the Internet are proposed in this invention. The algorithm of allocating information about Web sites in the database of the retrieval system and the algorithm of searching information are based on the Global Classification of Information in the Internet. This Global Classification of Information in the Internet is classification modified and adopted to conditions of the Internet and covering all the known forms of information in the Internet.
  • During registration procedure a supplier of information fills in an application form and inserts therein the following data: an URL address of the Web site, a name of the owner of the Web site, a home or an official address of the owner of the Web site, the name of a division of the Global Classification Information in the Internet relevant to the information presented in the Web site, key words which completely characterize information presented in the Web site, a kind of information, an author of information, country where the Web site is situated, a language of information, free information or information to be paid for, a free access to information or a registration is required, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • Data files created during the registration procedure and/or data files created during an update of the data on Web sites are sorted according to the codes of the Global Classification Information in the Internet and allocated in the corresponding parts of the database containing information on the Web sites registered in the retrieval system.
  • A searcher of information can find required information by searching through the tree of the Global Classification of Information in the Internet. The searcher of information can define in which division or a subdivision of the Global Classification of Information in the Internet does he have to look for information. Then he inserts the name of this division into a search window of a browser of the retrieval system and begins the search. The second way of search is based on key words specified by the searcher of information. The retrieval system presents the user a list of divisions of the Global Classification of Information in the Internet where said key words match key words provided by suppliers of information. The searcher of information has to choose a name of division of the Global Classification of Information in the Internet corresponding to his/her key words and insert the name of this division into a search window of a browser of the retrieval system and start the search. As a result of search, the retrieval system provides a list of addresses of Web sites and Web pages stored in the database and relevant to the selected division of the Global Classification of Information in the Internet.
  • According to the proposed invention, sorting and selecting of search results is provided in the retrieval system. This will provide additional comfort to users. During the registration procedure the retrieval system creates data files for an every Web site. A part of information from these files will be used as a criterion for sorting and selecting of search results.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and/or other aspects and advantages of the embodiment will become more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
  • FIG. 1 is a picture showing a structure of a retrieval system;
  • FIG. 2 is a block diagram of a method of searching of information in the Internet;
  • FIG. 3 is a block diagram showing registration procedure;
  • FIG. 4 is a picture showing an example of an application form;
  • FIG. 5 is a picture showing a process of choosing a required division from the Global Classification of Information in the Internet;
  • FIG. 6 is a block diagram showing a process of forming the database of information on registered Web sites on the server of the Retrieval System;
  • FIG. 7 is a block diagram showing a process of searching information;
  • FIG. 8 is a block diagram showing a process of automatic sorting and selecting;
  • FIG. 9 is a block diagram showing a process of stepwise sorting and selecting;
  • FIG. 10 is a block diagram showing a procedure of sorting and selecting on the basis of key words;
  • FIG. 11 is a picture showing organization of a data transfer for automation systems;
  • FIG. 12 is a block diagram showing organization of a data transfer for automation systems.
  • DETAILED DESCRIPTION
  • Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings.
  • A retrieval system, wherein acquisition, storing and searching for information is build on the Global Classification of Information in the Internet (GCII), is proposed in the present invention. The Global Classification of Information in the Internet is a classification, modified and adopted to conditions of the Internet classifying all known information presented in the Internet and covering all the forms of human activity: a material production, trade, science, education, history, culture and so on and represent the properties and characteristics of outward things, the nature, the animals and vegetal. The Global Classification of Information in the Internet represents by itself a hierarchical tree with names and corresponding numbers of all divisions and subdivisions. Further the Global Classification of Information in the Internet include specific characteristics of information for every division and subdivision, which will use for future sorting and selecting of any retrieved information. The Global Classification of Information in the Internet is classification, which will be in progress constantly and will consider all changes occurred in the Internet.
  • The retrieval system designed for searching of information in the Internet (FIG. 1) comprises a server of retrieval system (1) interconnected with the Internet including multilingual Web site of retrieval system with searching programs (2), database of the Global Classification of Information in the Internet (3) and the other databases of the retrieval system (4). The retrieval system also comprises numerous network servers (5) wherein information belonging to information suppliers is stored, and numerous computers of users (6) interconnected with the Internet. The users of retrieval system having an access to the Internet can either a as suppliers of information or searchers of information or both.
  • The method of searching of information in the Internet (FIG. 2) comprises a registration procedure of Web sites (1) by information suppliers, a procedure of building the database of information about Web sites (2) registered in retrieval system, including procedures of building the information database of the retrieval system and updating and adding information to the database of the retrieval system, a procedure of searching information (3) and procedure of sorting and selecting the search results (4).
  • An information supplier has to complete the registration procedure (FIG. 3) in order to have information about his/her Web site included into the database of the retrieval system. The information supplier has to enter the registration page of retrieval system and fill in an application form (1.1). The application form includes the following data (FIG. 3, FIG. 4): an URL address of the Web site, a name of the owner of the Web site, a home or an official address of the owner of the Web site, an e-mail address, a name of a division of the Global Classification of Information in the Internet relevant to the Web site, key words which completely characterize information presented on the Web site, a kind of information, an author of information, country, where the Web site is situated, a language of information, free information or information to be paid for, a free access to information or an access for the registered users only. If the information supplier also wishes to register separate Web pages of the Web site, he/she can fill into the application form additional data on these Web pages comprising: an URL address of the Web page, a name of a division of the Global Classification of Information in the Internet relevant to information on the Web page, key words completely characterizing information presented on the Web page, a kind of information, an author of information, a country and a language of information. Further the information supplier can choose any characteristics, which are specific for selected division of the Global Classification of Information in the Internet. These characteristics include in every division and subdivision of the Global Classification of Information in the Internet. For example, if the information supplier registers Web site with information about a yacht and selects division “Motor yachts”, he/she can specify a tonnage of ship, a vessel speed and so on. Specified information will use for future sorting and selecting also.
  • The retrieval system will propose a hierarchic tree of the classification system (FIG. 5) for the choice of a division of the Global Classification of Information in the Internet during registration procedure. Choosing an end division in a selected branch of the tree of the classification system would be the best option for an information supplier, because searchers of information would also most probably choose an end division of the classification for searching. After selecting a required division of the Global Classification of Information in the Internet the information supplier has to click or press on the name of the required division of the Global Classification of Information in the Internet. Then the name of this division will appear in the right field of the application form and the corresponding number of this division will be written into a determined place in the data file of the Web site.
  • During filling the paragraph “kind of information”, the retrieval system will propose different kinds of information in accordance with the Classification of Kinds of information, for example: news, advertisements, announcements, scientific information, information of electronic shops and so on. The information supplier should choose a suitable kind of information and the corresponding name of kind of information would appear in the right field of the application form and the corresponding number of this kind of information would have been written into the determined place of the data file of the Web site.
  • After the application form is filled, the retrieval system will create a data file of the Web site or the Web page and allocate it into an intermediate database. Then the retrieval system will evaluate the Web site (1.2, FIG. 3) with respect to readability of this site by Internet browsers, with respect to compliance of the Web site to the general aspects of the Web technology, national and international legal regulations and so on. Eventually, the user gets a message confirming a successful evaluation (1.6) or a message with a request urging to correct and improve the Web site with a list of detected errors (1.4).
  • In case of successful evaluation of the Web site, the retrieval system creates a memory space for the user wherein the information previously provided during the registration procedure would be stored (1.6).
  • Then the retrieval system will ask the user to pay a registration fee (1.7). In case if no payment is received during a specified period, the retrieval system will delete the memory space of the user (1.9). In case if the registration fee is paid, the retrieval system will send the user a message about successful completion of the registration procedure (1.10). Whereat the information supplier can check with the help of the retrieval system that the information about his/her Web site is situated in the database of retrieval system.
  • The existence of a payment of an information supplier for the service of the retrieval system switches mutual relation of the retrieval system and an information supplier to a frame of a contract relation. In this case, the retrieval system is obliged to execute entered into an undertakings for the delivery of information by suppliers of information to searchers of information. In case of existing of a contract relation, the information supplier can demand of improving of quality of provided by the retrieval system service, if this quality seems to him not enough high. The retrieval system is obliged to respond for complaints of their partners.
  • For increasing the level of authenticity of information, the retrieval system will recommend information suppliers to provide a letter with a notarial acknowledgement of either he name and the home address of the owner of the Web site, if he/she is a private person, or a notarial acknowledgement of the name of the company and the official address of the owner of the Web site, if the owner of the Web site is a company or any other juridical person (1.11).
  • Except for the said information, the date of registration of the Web site, an amount of information presented in the Web site, data confirming authenticity of information will also be included in data files built by the retrieval system.
  • A procedure of building the database of information on the Web sites and Web pages registered in the retrieval system is presented on FIG. 6. Data files, created during the registration procedure (2.2), or data files, created or modified during the process of an update of the data about Web sites and Web pages (2.3), are sorted in accordance with codes of the Global Classification of Information in the Internet (2.4) and placed into corresponding parts of the database of information on Web sites and Web pages (2.5) registered in the retrieval system. Data, storing in the database of the server of retrieval system (2.5), for every registered Web site or Web page comprise: a code of information according to the Global Classification of Information in the Internet, a URL address of this Web site or Web page, key words concerning to main content of Web site or Web pages, a kind of information according to Classification of Kinds of information, a author of information, a country, where is situated Web site, a language of information, a free or fee-based information, a free access to information or needed registration procedure, a data characterizing an authenticity of information, a volume of information, a date registration or update of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • The registered information supplier can modify and add data about his information. He can correct characteristics of existing Web site or Web page comprising key words, a kind of information, an author of information, country, where is situated Web site, a language of information, a free or fee-based information, free access to information or needed registration procedure, volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet in case of necessity. He can add data about new Web pages of registering Web site comprising a choice of a division of the Global Classification of Information in the Internet to which this Web page are related on opinion of the information supplier. He also can insert data about characteristics of new Web pages comprising: key words, a kind of information, an author of information, a country, where is situated Web site, a language of information, a free or feebased information, a free access to information or needed registration procedure, a volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
  • A searcher of information has two ways for finding required information (FIG. 7). In the first way the searcher can use the hierarchical tree of the Global Classification of Information in the Internet (3.3). The searcher of information can define within which division or an end division of the Global Classification of Information in the Internet is the needed information located. Then the searcher of information has to insert this division or an end division into a searching window in the retrieval system and starts the search. As a result of the search, the retrieval system provides the user a list of addresses of Web sites and Web pages, which are stored in the database and correspond to the selected division of the Global Classification of Information in the Internet.
  • The second way of search is the search based on key words (3.4). For searching the user can type any combination of key words in a searching window of the retrieval system (3.5). The retrieval system give to the user a list of divisions of the Global Classification of Information in the Internet, where said key words, inserted by information suppliers during of registration, are occurred. The searcher of information has to choose a division of the Global Classification of Information in the Internet, which to his opinion better correspondents to key words. Then he/she has to insert the name of these divisions in a searching window of the retrieval system and start searching. As a result of searching, as in the first case, the retrieval system will provide the user a list of addresses of Web sites and Web pages, which are stored in database and correspond to the selected division of the Classification of Information in the Internet. In the case if said key words are not encountered within the division of the Global Classification of Information in the Internet, the retrieval system will give a message that none of divisions contain said key words and suggest the searcher of information either to change key words or use the first way of searching (3.3).
  • A user can ask a question to the retrieval system in the case if some difficulties arise with determination of a required division of the Global Classification of Information in the Internet and determination of key words. Recommendations of the retrieval system for determination of key words both for searching of information and for the registration procedure will be freely available for users.
  • If a searcher of information is familiar with any classification of information or he/she has been using divisions of the Global Classification of Information in the Internet for some time, he/she can directly insert the name of a required division in a searching window of the retrieval system (3.3). If the user not quit correct inserts a name of required division, the retrieval system will correct his and propose to him more correct name of this division.
  • As a result of searching, the searcher of information will get a list of addresses of Web sites and Web pages, which are stored in the database and correspond to the selected division of the Global Classification of Information in the Internet. The obtained list of results can be large and contain thousands of addresses of Web sites and Web pages. Not every user would be able to review all the information. For user's comfort sorting and selecting of search results is offered in the proposed invention (FIG. 8, FIG. 9). As shown earlier, the retrieval system creates data files for every Web site and Web page after the registration procedure. A part of information from said data files will be used as criterions for sorting and selecting the search results. This information comprises: a kind of information, author of information, a country, where Web site is situated, a language of information, free or to be paid for information is provided, free access or registration is needed, characteristics of information specific for selected division of the Global Classification of Information in the Internet, key words. Moreover, the date of registration or the date of the last update of information on the Web site or the Web page, a volume of information presented on the Web site or Web pages, data confirming authenticity of information are included into this information.
  • A searcher of information can use two kinds of sorting and selecting: an automatic sorting and selecting and a stepwise sorting and selecting. In the first case, the user has to choose criterions for sorting and selecting, define priorities for these criterions and choose key words (4.2, FIG. 8). Then the retrieval system will sort and select the search results in accordance with priorities defined for the selected criterions (4.3). That is, the retrieval system will sort first the addresses of Web sites and Web pages from the list of search results in accordance with a criterion having the first priority. After that, the retrieval system will select the addresses of Web sites and Web pages from the sorted list of search results in accordance with specified criterion. Then the remaining list of search results will be sorted and selected in accordance with a criterion having the second priority and so on down to the last criterion of sorting and selecting. After completion of the last step of sorting and selecting based on selected criterions, the retrieval system will sort and select the obtained result on the basis of key words (4.4). As a result of automatic sorting and selecting, the user will get a list of addresses of Web sites and Web pages sorted in accordance with selected criterions and key words (4.5). If for any reason the obtained information does not satisfy the user, he can return to the beginning of sorting and selecting and sort and select the originally obtained list of search results in accordance with other criterions and other priorities set to these criterions. If obtained information satisfies the user, he can proceed further with the obtained list (4.7). The user can save the search results before or after sorting and selecting on hard disk of his computer by means of a corresponding service of the retrieval system.
  • In case of stepwise sorting and selecting (FIG. 9), the user chooses one of criterions for the first sorting and selecting (4.1.2). In addition to that the searcher of information should insert required key words into corresponding fields in the retrieval system. Then the retrieval system sorts and selects the search results in accordance with the selected criterion (4.1.3). Whereat the retrieval system will perform sorting and selecting on the basis of the selected key words and find all the Web pages from obtained result of sorting and selecting, which correspond to the defined key words (4.1.4). If obtained information satisfies the user, he can further proceed with the results of searching, sorting and selecting (4.1.6). If obtained information does not satisfy the user, he has two variants. In the first variant, the searcher of information can return to the beginning of the previous step and perform sorting and selecting using another criterion. In the second variant, the searcher of information will perform sorting and selecting using the next criterion (4.1.8) and so on, until a suitable result is obtained. Thereat, the searcher of information uses key words inserted on the first sorting and selecting only for first step of sorting and selecting.
  • Lets consider in more detail the sorting and selecting on the basis of key words (FIG. 10). Two main methods are proposed for sorting and selecting on the basis of key words. If user chooses the first method (4.4.1), then the retrieval system proposes to the user to choose suitable key words from the list of key words situated in the database and corresponding to the selected division of the Global Classification of Information in the Internet. A list of key words will be presented in the alphabetical order using the first key word (4.4.2, 4.4.3). After choosing suitable key words, the retrieval system will sort and select information accordingly (4.4.4). Then the retrieval system starts a program of searching on the basis key words in order to find all the Web pages of retrieved Web sites, which relevant to selected key words (4.4.5).
  • The second method of sorting and selecting on the basis of key words provides sorting and selecting on the basis of arbitrary key words typed in by the user (4.4.7). After typing in arbitrary key words in a specified field of the retrieval system, the retrieval system checks whether said key words match keywords in the considered division of the database corresponding to the selected division of the Global Classification of Information in the Internet. In case of the occurrence of the same key words in said division of the database, the retrieval system sorts and selects information in accordance with these key words (4.4.10). Then the retrieval system starts a program of searching on the basis of key words in order to find all the Web pages of retrieved Web sites, which relevant to selected key words (4.4.11) like in the first case. If there are no such key words in said division of the database, then the retrieval system will start a program of searching on the basis of said key words in order to find these key words in all retrieved Web pages and to choose information (4.4.13) relevant to these key words.
  • The retrieval system and the Global Classification of Information in the Internet can be used for coding and storing information in the Internet that could be used for automatic control, automatic data exchange and functioning of automatic systems (FIG. 11). Such information could be stored on network servers. Data on this information will be stored in the database of the retrieval system. In this case the supplier of information dedicated for automatic systems will have to register this information accordingly in the retrieval system. The supplier of information will have to include the corresponding URL address, the name of the owner of information, an official address of the owner of information, an e-mail address, the name of the division of the Global Classification of Information in the Internet to which the provided information is related, a kind of information according to the Classification of Kinds of information.
  • Information for automatic control and operation must have a strictly defined search path. Therefore data on this information must be stored in the section of the database of the retrieval system corresponding to the end division of the Global Classification of Information in the Internet. Besides, this information must be of a specific type as described in the manual of the retrieval system. This could, for instance, be “information for automatic control”, “information for operation of automatic systems” and so on.
  • After the registration procedure, the retrieval system will sort the information for automatic control and operation in accordance with the Global Classification of Information in the Internet and store the acquired information in the part of the database corresponding to the division of the Global Classification of Information in the Internet. The following data will be stored: the code of information according to the Global Classification of Information in the Internet, the code of the kind of information according to the Classification of Kinds of information, URL address, and additional information dedicated to the automatic control and operation.
  • In order to be able to use the service of the retrieval system for arranging information interchange between automatic systems, the owner of the automatic system has to complete the registration procedure. The owner of the automatic system has to specify the IP address used by the automatic system, the name of the owner of the automatic system, an official or a home address of the owner of automatic system, an e-mail address. Upon completion the registration procedure, the owner of automatic system will acquire a password for entering the retrieval system.
  • In order to activate the process of information interchange, the automatic system has to send a request to the retrieval system in the automatic mode (FIG. 11 and item 12.1 in FIG. 12) specifying the user password. This request will include the code of information according to the Global Classification of Information in the Internet, the code of the kind of information according to the Classification of Kinds of information, URL address of control information, an instruction file containing a description on how the information transfer must be implemented including a transfer protocol and other conditions for information transfer. In case if the specified information code points to a certain section in the Global Classification of Information in the Internet rather than to a specific reference within a section, the information retrieval system will search for the most suitable reference. Such a search is conducted within a limited number of links, which will provide a fast response. The retrieval system finds the mentioned information in its database. After the required or directly requested link is found, the instruction file is opened and analyzed. The retrieval system organizes the data transfer in accordance with the instruction file (12.2 FIG. 12). This could be establishing a connection with a server, reading the control information or any other information dedicated for automatic systems and terminating the connection. This could also be establishing a connection with an automatic control device through the server of the retrieval system and terminating connection with the server and so on (12.2-12.4). An instruction file can contain commands for creating a control program out of different control programs distributed in the Internet. If necessary the automatic system can use an intermediate service that, for instance, could allow data conversion from the format of the supplier of information into the format of the user of information and vice versa. This way the retrieval system would be able not only to establish a direct connection between the suppliers of information and the users of information, but also to create more elaborate schemes and data transfer structures. Information support for navigation systems of vehicles, ships or planes are examples of possible applications of data exchange between automatic systems. Other examples for such a data exchange are information support for meteorological services, sophisticated motion control, control of machines, control of automatic production lines and so on.
  • The retrieval system and the Global Classification of Information in the Internet can be used for coding and storing information in the Internet that could be used for automatic collecting of information by means of autonomously operating programs as well as for information exchange between autonomously operating programs. Automatic collecting of information implies the presence of certain standards for the representation of information. In order to make information suitable for automatic collecting, a supplier of information must either create his Web site or Web page in the specified format, or provide additional files containing information dedicated for automatic collecting. Information dedicated for automatic collecting must be split into information elements, which could be used or considered separately. The content of additional files as well as the format of the presented information must be in accordance with the considered division of the Global Classification of Information in the Internet.
  • Automatic collecting of information is an optional service. A supplier of information is able to refuse from providing information for automatic collecting. If he wishes to make his information available for automatic collecting, he will have to complete an additional registration procedure. During this procedure he will have to provide a file created in accordance with the standard of retrieval system for considered division of the Global Classification of Information in the Internet. Instead of creating such a file himself, the user will be able to use a corresponding program of the retrieval system for creating such a file. During registration the retrieval system will check whether the file corresponds to the standard. If necessary some changes will be requested. In order to facilitate automatic generation of such files, the retrieval system will provide corresponding libraries, programs and services.
  • After registration procedure is completed, some files would be suitable as for a common use, as for the automatic collecting.
  • After registration procedure is completed, the retrieval system will conduct sorting of data on information dedicated for automatic collecting on basis of the Global Classification of information in the Internet. The data on this information will be stored in the database of the retrieval system.
  • It is possible that certain Web pages could be related to a few sections of the Global Classification of Information in the Internet. Such files will have a few registration numbers. This means that the retrieval system will have a few links on the Web page or the same file.
  • In order to conduct automatic collecting of information, the searcher of information will have to send a request to the retrieval system including: the name or code of the division of the Global Classification of Information in the Internet, key words defining the content of information to be found, a code of a kind of information according to the Classification of Kinds of information, the language of information, characteristics of information specific for the considered division of the Global Classification of Information in the Internet. Additionally the searcher of information can include in the request a necessary level of authenticity of information according to recommendation of the retrieval system. In addition to that the searcher of information can include in the request a date of publication of information or a preferred time range, a country where this information was published, an author or authors or the owner of information. If the user has an access to Web sites where registration is required, corresponding registration information could be provided in the request.
  • The searcher of information will have a possibility to record his actions in the retrieval system into a macro. He can also record preparation of a request for automatic collecting of information. The user will be able to modify macros.
  • After the request for automatic collecting of information is sent, the retrieval system will start a program for automatic searching and collecting of information. This program will first find the links on the required Web sites and Web pages. These Web sites and Web pages must be available for automatic collecting of information. After that the program will conduct an analysis of information in the found Web sites and Web pages. Then the program will produce an output, for instance, in a form of file out of information elements collected from the found Web sites and Web pages with or without reference being provided depending on the user's choice. Some examples for automatic collecting of information are: collecting news or facts on a certain event, collecting historical information, collecting information for educational or technical purposes and so on.
  • The searcher of information will be able to create his own program for searching or automatic collecting of information. He will be able to use this program without actually starting the retrieval system. In order to be able to use the database of the retrieval system, the user has to be registered accordingly in the retrieval system. In this case the retrieval system will provide the user with a password for accessing the database of the retrieval system. This password must be included into the program for searching and/or automatic collecting of information. This service of the retrieval system will be provided for a certain fee. In order to facilitate creating a program for searching or automatic collecting of information, corresponding examples will be provided along with modules and libraries written in all the popular programming languages.
  • In order to increase the speed of automatic collecting of information, the most popular information will be located directly in the database of the retrieval system.

Claims (8)

1. A retrieval system for searching of information in the Internet for users, including information suppliers and searchers of information as equitable participants of the process, comprising:
1.1) a server of the retrieval system including a database of an information and a Web site of retrieval system with multi-language support and searching programs;
1.2) the Global Classification of Information in the Internet, classifying the all known information presented in the Internet, storing in a database of the server of the retrieval system, representing by itself a hierarchical tree with names and corresponding numbers of all divisions and subdivisions, including characteristics of information specific for every division and subdivision, and situated in progress constantly and considered all changes, occurred in the Internet; and
1.3) user computers capable of communicating with the server of the retrieval system and plurals network servers;
1.4) wherein the server of the retrieval system registers information suppliers, provides a modification and an addition of an information, provides an information for searchers of information, provides a sorting and selecting information for searchers of information, provides organization of a data transfer for automatic systems, and provides organization of an automatic collection of information.
2. A method acquisition, storing and searching of information in the Internet by means of retrieval system, comprising steps of:
2.1) registering of information provided by information suppliers, comprising step of:
a) filling an application form comprising data of: URL address, a name of owner of Web site, a home address or official address of owner of Web site, e-mail address, a name of a required division of the Global Classification of Information in the Internet, to which this Web site or this Web-page are related on opinion of the information supplier, characteristics of Web site or Web-page, which will use for future sorting and selecting, comprising key words, a kind of information according to Classification of Kinds of information, an author of information, country, where is situated Web site, a language of information, a free or fee-based information, free access to information or needed registration procedure, characteristics of information specific for selected division of the Global Classification of Information in the Internet;
b) examining of Web site and Web-pages by retrieval system;
c) sending a message to supplier of information in case of finding of defects of Web site with suggestion about detected errors and request to improve Web site or Web-pages;
d) sending a message to supplier of information in case of successful examination about registration of Web site and Web-pages;
2.2) providing a modification and an addition of an information by registered information suppliers;
2.3) acquisition and storing the information provided by information suppliers comprising steps of:
a) creating the memory space for new Web site and Web-pages in the database of the server of retrieval system and inserting to this memory space data provided by the information supplier during the registration procedure or during the procedure of the modification and the addition of information;
b) sorting data about Web site and Web-pages, provided in result the registration procedure or the procedure of the modification and the addition of information, according to codes of the Global Classification of Information in the Internet; and
c) storing said data in the database of the server of retrieval system, wherein these date comprise for every registered Web site or Web-page: a code of information according to the Global Classification of Information in the Internet, a URL address of this Web site or Web-page, key words concerning to main content of Web site or Web-pages, a kind of information according to Classification of Kinds of information, a author of information, a country, where is situated Web site, a language of information, a free or fee-based information, a free access to information or needed registration procedure, a data characterizing an authenticity of information, a volume of information, a date registration or update of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet;
2.4) searching for an information to searchers of information comprising steps of:
a) preparation of a request of a searcher of information comprising a name of relevant division of the Global Classification of Information in the Internet or key words relating to a needed information;
b) analyzing of the list of the name divisions in the Global Classification of Information in the Internet, where said key words include, and a choice of the needed division of the Global Classification of Information in the Internet by a searcher of information, in case of using of key words for searching;
c) searching for needed information in database of retrieval system according to codes of said information in the Global Classification of Information in the Internet;
d) storing of found information in temporary storage of the server of retrieval system;
e) delivering information to computer of a searcher of information;
2.5) sorting and selecting of a retrieved information for searchers of information in the automatic mode or in the stepwise operation on the basis of criterions provided during registration procedure.
3. The method according to claim 2, wherein the procedure of a modification and an addition of information by the registered information suppliers comprises steps of:
3.1) correcting characteristics of Web site or Web-page comprising key words, a kind of information, an author of information, country, where is situated Web site, a language of information, a free or fee-based information, free access to information or needed registration procedure, volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet in case of necessity;
3.2) adding data for new Web-pages of registering Web site comprising:
a) choosing of a division of the Global Classification of Information in the Internet to which this Web-page are related on opinion of the information supplier; and
b) inserting data about characteristics of new Web-pages comprising key words, a kind of information, an author of information, a country, where is situated Web site, a language of information, a free or feebased information, a free access to information or needed registration procedure, a volume of information, characteristics of information specific for selected division of the Global Classification of Information in the Internet.
4. The method according to claim 2, wherein sorting and selecting information for searchers of information in the automatic mode comprises steps of:
4.1) choosing of any combination of criterions and ranking of selected criterions of information for sorting and selecting, which comprise:
a) a kind of information;
b) a author of information;
c) a country, where is situated Web site;
d) a language of information;
e) a authenticity of information;
f) a date of registration or a date of last update of information;
g) a free or fee-based information;
h) a free access to information or needed registration procedure;
i) a volume of information;
j) characteristics of information specific for selected division of the Global Classification of Information in the Internet;
4.2) choosing key words;
4.3) sorting and selecting information according to numerical order of selected criterions;
4.4) sorting and selecting of a retrieved information on the basis of key words;
4.5) analyzing of results of sorting and selecting;
4.6) repeating of the automatic sorting and selecting information with other criterions, if obtained information not satisfies a user;
4.7) working with results of search, sorting and selecting information in case of suitable result of sorting and selecting;
4.8) storing of search, sorting and selecting results on the computer of a searcher of information.
5. The method according to claim 2, wherein sorting and selecting information for searchers of information in the stepwise operation comprises steps of:
5.1) choosing of key words, which will use for the first step of sorting and selecting;
5.2) choosing of first criterions for sorting and selecting;
5.3) sorting and selecting information according to selected criterion;
5.4) sorting and selecting of a retrieved information on the basis of key words for the first step of sorting and selecting;
5.5) analysis of results of sorting and selecting;
5.6) continuation of sorting and selecting information with next criterion, if obtained information not satisfies a searcher of information, until a suitable result will be reach;
5.7) working with results of search, sorting and selecting in case of suitable result of sorting;
5.8) storing of search, sorting and selecting results on the computer of a searcher of information.
6. The method according to claim 4 and claim 5, wherein sorting and selecting of a retrieved information on the basis of key words comprises two possible ways:
6.1) wherein first way comprises steps of:
a) choosing suitable key words from list of key words, which are situated in database in accordance with selected division of the Global Classification of Information in the Internet;
b) sorting and selecting information in accordance with selected key words;
c) finding all Web pages from obtained result of sorting and selecting, which correspond to defined key words;
6.2) wherein second way comprises steps of:
a) inserting of arbitrary key words;
b) checking of the existence said key words in the division of database, which correspond to selected division of the Global Classification of Information in the Internet;
c) sorting and selecting information in accordance with selected key words and finding all Web pages from obtained result of sorting and selecting, which correspond to these key words, in case of the occurrence the same key words in said division of database;
d) searching Web pages relevant to said arbitrary key words, if there are no such key words in said division of database.
7. A method of coding and storing information in the Internet for automatic control and automatic data exchange, a method of organizing automatic control and automatic data exchange between automatic systems via the retrieval system, a method of creating, coding and storing information in the Internet for the purpose of automatic collecting of information and a method of organizing automatic collecting of information by means of the retrieval system, comprising:
7.1) registration of information on automatic control and automatic data exchange in the retrieval system comprising the following data: an URL address where information is located, a name of owner of information, an official address of the owner of information, an e-mail address, the name of the section of the Global Classification of Information in the Internet to which the proposed information could be related, a kind of information according to the Classification of Kinds of information;
7.2) sorting of information on the basis of the Global Classification of Information in the Internet and storing said information in the database of the retrieval system, where the data on the said information comprise the following for each piece of information:
a) a code of information according to the Global Classification of Information in the Internet;
b) a code of a kind of information according to the Classification of Kinds of information;
c) URL address where control information is located;
7.3) registration of an automatic system in the retrieval system comprising the following data: the IP address of the automatic system, the name of owner of the automatic system, a home or an official address of the owner of the automatic system, an e-mail address;
7.4) automatic transfer of a request from an automatic system to the retrieval system for activating an automatic control and data exchange between the automatic system and the retrieval system, wherein said request comprises:
a) the password of the automatic system;
b) a code of information according to the Global Classification of Information in the Internet;
c) a code of a kind of information according to the Classification of Kinds of information;
d) an URL address where the control information is located;
e) a file with an instruction on how the data transfer could be arranged including the information transfer protocol and other conditions for organizing the data transfer;
7.5) reading the file with an instruction on how the data transfer could be arranged and analyzing the instruction by the retrieval system, organizing the data transfer by the retrieval system from a server containing required information or from another automatic system supplying information to the automatic system requesting information during the process of the data transfer;
7.6) registration of information, dedicated for automatic collecting, in the retrieval system, wherein said information comprises: an URL address where information is located, the name of an owner of information, a home or an official address of the owner of information, an e-mail address, the name of the division of the Global Classification of Information in the Internet to which the proposed information could be related, key words reflecting the content of information, a kind of information according to the Classification of Kinds of information, an author or authors of information, the country, the language of information, a free or fee-based information, a free access to information or needed registration procedure, characteristics of information specific for the considered division of the Global Classification of Information in the Internet.
7.7) converting information dedicated for automatic collecting of information into the format corresponding to the standard of the Global Classification of Information in the Internet for the considered division, wherein conversion can either be done by a provided program of the retrieval system, or a converted file can also come directly from the supplier of information;
7.8) sorting information dedicated to automatic collecting of information on the basis of the Global Classification of Information in the Internet and storing the provided data into the database of the retrieval system, where these data on each piece of information comprise the following data: a code of information according to the Global Classification of Information in the Internet, an URL address where information is located, keywords reflecting the content of information, a code of kinds of information according to the Classification of Kinds of information, an author or authors of information, the country, the language of information, a free or fee-based information, a free access to information or needed registration procedure, a authenticity of information, a date of registration or updating of information, characteristics of information specific for the considered division of the Global Classification of Information in the Internet;
7.9) creating a request for automatic collecting of information by the searcher of information, wherein said request comprises: a name of the division of information according to the Global Classification of Information in the Internet, key words reflecting the content of information, a code of kinds of information according to the Classification of Kinds of information, the language of information, characteristics of information specific for the considered division of the Global Classification of Information in the Internet, data included on optional of the searcher of information: a authenticity of information, the date of publication of information or a period of publication, the country, an author or authors of information, a name of sites where the user is currently registered and required registration information;
7.10) an automatic collecting of information using a program for automatic searching and collecting information comprising steps of:
a) searching for the required addresses of Web sites and Web-pages in the database of the retrieval system according to the request of the searcher of information;
b) an analysis of information containing in the found Web sites and Web-pages;
c) providing the collected information consisting of elements acquired from the found Web sites or Web-pages in the form of a file or any other format with or without a reference for each source being provided;
7.11) an automatic collecting of information using a program for automatic collecting of information, made by a user, and executed from a user computer, comprising steps of:
a) getting an access to the database of the retrieval system on the basis of the user's password;
b) searching for addresses of the required Web sites and Web-pages in the database of the retrieval system on the basis of the user's request included into the program for automatic collecting of information;
c) providing an output in the form of the file or any other format with a list of addresses of found Web sites and Web-pages;
7.12) an automatic collecting of information using the program for automatic collecting of information, made by a user, and executed from a user computer, comprising steps of:
a) getting an access to the database of the retrieval system on the basis of the user's password;
b) searching for addresses of the required Web sites and Web-pages in the database of the retrieval system on the basis of the user's request included into the program for automatic collecting of information;
c) providing the collected information consisting of elements acquired from the found Web sites or Web-pages in the form of a file or any other format with or without a reference for each source being provided.
8. The method according to claim 7, wherein an automatic system transfers automatically of a request to the retrieval system for activating an automatic control and data exchange between the automatic system and the retrieval system, wherein said request comprises: the password of the automatic system, a code of information according to the Global Classification of Information in the Internet, a code of a kind of information according to the Classification of Kinds of information, a file with an instruction on how the data transfer could be arranged including the information transfer protocol and other conditions for organizing the data transfer, and wherein the retrieval system offers a suitable reference to a control program on the basis of analysis of the said request.
US11/959,501 2007-12-19 2007-12-19 Retrieval system and method of searching information in the Internet Abandoned US20090164418A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US11/959,501 US20090164418A1 (en) 2007-12-19 2007-12-19 Retrieval system and method of searching information in the Internet
US13/076,688 US9524341B2 (en) 2007-12-19 2011-03-31 Retrieval system and method of searching of information in the internet

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/959,501 US20090164418A1 (en) 2007-12-19 2007-12-19 Retrieval system and method of searching information in the Internet

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/076,688 Continuation US9524341B2 (en) 2007-12-19 2011-03-31 Retrieval system and method of searching of information in the internet

Publications (1)

Publication Number Publication Date
US20090164418A1 true US20090164418A1 (en) 2009-06-25

Family

ID=40789801

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/959,501 Abandoned US20090164418A1 (en) 2007-12-19 2007-12-19 Retrieval system and method of searching information in the Internet
US13/076,688 Active 2028-12-23 US9524341B2 (en) 2007-12-19 2011-03-31 Retrieval system and method of searching of information in the internet

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/076,688 Active 2028-12-23 US9524341B2 (en) 2007-12-19 2011-03-31 Retrieval system and method of searching of information in the internet

Country Status (1)

Country Link
US (2) US20090164418A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120221582A1 (en) * 2011-02-25 2012-08-30 Oracle International Corporation Setting and displaying primary objects for one or more purposes in a table for enterprise business applications
US20130198445A1 (en) * 2011-07-29 2013-08-01 Yosuke Bando Semiconductor memory device and information processing device
CN103617286A (en) * 2013-12-13 2014-03-05 仲兆满 Multi-topic information collecting method based on search strategy
CN106960160A (en) * 2011-11-24 2017-07-18 商业合伙有限公司 The database search of safety

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3171281A1 (en) * 2015-11-17 2017-05-24 Dassault Systèmes Thematic web corpus
CN106202498A (en) * 2016-07-20 2016-12-07 淮阴工学院 A kind of network behavior custom quantization method based on classification corpus key word word frequency record association
US11042509B1 (en) * 2020-03-26 2021-06-22 Adp, Llc Mobile learning system

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5369763A (en) * 1989-02-01 1994-11-29 Kansas State University Research Foundation Data storage and retrieval system with improved data base structure
US5794236A (en) * 1996-05-29 1998-08-11 Lexis-Nexis Computer-based system for classifying documents into a hierarchy and linking the classifications to the hierarchy
US5842206A (en) * 1996-08-20 1998-11-24 Iconovex Corporation Computerized method and system for qualified searching of electronically stored documents
US5907838A (en) * 1996-12-10 1999-05-25 Seiko Epson Corporation Information search and collection method and system
US6151624A (en) * 1998-02-03 2000-11-21 Realnames Corporation Navigating network resources based on metadata
US6233575B1 (en) * 1997-06-24 2001-05-15 International Business Machines Corporation Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
US6381607B1 (en) * 1999-06-19 2002-04-30 Kent Ridge Digital Labs System of organizing catalog data for searching and retrieval
US20020087599A1 (en) * 1999-05-04 2002-07-04 Grant Lee H. Method of coding, categorizing, and retrieving network pages and sites
US6427123B1 (en) * 1999-02-18 2002-07-30 Oracle Corporation Hierarchical indexing for accessing hierarchically organized information in a relational system
US6502081B1 (en) * 1999-08-06 2002-12-31 Lexis Nexis System and method for classifying legal concepts using legal topic scheme
US6571240B1 (en) * 2000-02-02 2003-05-27 Chi Fai Ho Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases
US20040167931A1 (en) * 1999-04-02 2004-08-26 Sherwin Han Internet organizer
US20050080854A1 (en) * 2003-10-09 2005-04-14 Jay Tervo Internet-based system and method for providing selected information to recipients
US20070083423A1 (en) * 2005-10-06 2007-04-12 Delbridge David M Method and system for unmoderated content collaboration
US20070239760A1 (en) * 2006-04-09 2007-10-11 Daniel Simon System for providing an interactive intelligent internet based knowledgebase
US20080270418A1 (en) * 2007-04-27 2008-10-30 Te-Tsung Chen Method for registering a domain name and signing up with a search website using a computer network service provider on behalf of a user, and a modem

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998012616A2 (en) * 1996-09-23 1998-03-26 Lowrie Mcintosh Defining a uniform subject classification system incorporating document management/records retention functions
US20030061243A1 (en) * 1998-05-21 2003-03-27 Kim Jeong Jung Information auto classification method and information search and analysis method
US20020040363A1 (en) * 2000-06-14 2002-04-04 Gadi Wolfman Automatic hierarchy based classification
US7139747B1 (en) * 2000-11-03 2006-11-21 Hewlett-Packard Development Company, L.P. System and method for distributed web crawling
US7194490B2 (en) * 2001-05-22 2007-03-20 Christopher Zee Method for the assured and enduring archival of intellectual property
US20040162738A1 (en) * 2003-02-19 2004-08-19 Sanders Susan O. Internet directory system
US20080059461A1 (en) * 2006-08-29 2008-03-06 Attributor Corporation Content search using a provided interface

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5369763A (en) * 1989-02-01 1994-11-29 Kansas State University Research Foundation Data storage and retrieval system with improved data base structure
US5794236A (en) * 1996-05-29 1998-08-11 Lexis-Nexis Computer-based system for classifying documents into a hierarchy and linking the classifications to the hierarchy
US5842206A (en) * 1996-08-20 1998-11-24 Iconovex Corporation Computerized method and system for qualified searching of electronically stored documents
US5907838A (en) * 1996-12-10 1999-05-25 Seiko Epson Corporation Information search and collection method and system
US6233575B1 (en) * 1997-06-24 2001-05-15 International Business Machines Corporation Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
US6151624A (en) * 1998-02-03 2000-11-21 Realnames Corporation Navigating network resources based on metadata
US6427123B1 (en) * 1999-02-18 2002-07-30 Oracle Corporation Hierarchical indexing for accessing hierarchically organized information in a relational system
US20040167931A1 (en) * 1999-04-02 2004-08-26 Sherwin Han Internet organizer
US20020087599A1 (en) * 1999-05-04 2002-07-04 Grant Lee H. Method of coding, categorizing, and retrieving network pages and sites
US6381607B1 (en) * 1999-06-19 2002-04-30 Kent Ridge Digital Labs System of organizing catalog data for searching and retrieval
US6502081B1 (en) * 1999-08-06 2002-12-31 Lexis Nexis System and method for classifying legal concepts using legal topic scheme
US6571240B1 (en) * 2000-02-02 2003-05-27 Chi Fai Ho Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases
US20050080854A1 (en) * 2003-10-09 2005-04-14 Jay Tervo Internet-based system and method for providing selected information to recipients
US20070083423A1 (en) * 2005-10-06 2007-04-12 Delbridge David M Method and system for unmoderated content collaboration
US20070239760A1 (en) * 2006-04-09 2007-10-11 Daniel Simon System for providing an interactive intelligent internet based knowledgebase
US20080270418A1 (en) * 2007-04-27 2008-10-30 Te-Tsung Chen Method for registering a domain name and signing up with a search website using a computer network service provider on behalf of a user, and a modem

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120221582A1 (en) * 2011-02-25 2012-08-30 Oracle International Corporation Setting and displaying primary objects for one or more purposes in a table for enterprise business applications
US9244989B2 (en) * 2011-02-25 2016-01-26 Oracle International Corporation Setting and displaying primary objects for one or more purposes in a table for enterprise business applications
US10146821B2 (en) 2011-02-25 2018-12-04 Oracle International Corporation Method and system for sorting and displaying data
US20130198445A1 (en) * 2011-07-29 2013-08-01 Yosuke Bando Semiconductor memory device and information processing device
US9530499B2 (en) * 2011-07-29 2016-12-27 Kabushiki Kaisha Toshiba Semiconductor memory device and information processing device
CN106960160A (en) * 2011-11-24 2017-07-18 商业合伙有限公司 The database search of safety
CN103617286A (en) * 2013-12-13 2014-03-05 仲兆满 Multi-topic information collecting method based on search strategy

Also Published As

Publication number Publication date
US9524341B2 (en) 2016-12-20
US20110179077A1 (en) 2011-07-21

Similar Documents

Publication Publication Date Title
Chowdhury et al. Introduction to digital libraries
US6636853B1 (en) Method and apparatus for representing and navigating search results
JP4365074B2 (en) Document expansion system with user-definable personality
US9069853B2 (en) System and method of goal-oriented searching
US9524341B2 (en) Retrieval system and method of searching of information in the internet
US20130018805A1 (en) Method and system for linking information regarding intellectual property, items of trade, and technical, legal or interpretive analysis
US20180004850A1 (en) Method for inputting and processing feature word of file content
US20090222444A1 (en) Query disambiguation
US20070027861A1 (en) Automated content categorization
US20120246135A1 (en) Image search engine augmenting search text based upon category selection
US20080243787A1 (en) System and method of presenting search results
KR20070040162A (en) System and method for offering searching service based on topics
US7024405B2 (en) Method and apparatus for improved internet searching
CN101303698A (en) Information process apparatus and information process method
Balabanovic Learning to Surf: Multiagent systems for adaptive Web page recommendation
KR20000054312A (en) Establishing provide Method for ordered web information
US8918403B2 (en) Semantically ranking content in a website
KR100616152B1 (en) Control method for automatically sending to other web site news automatically classified on internet
Zhang et al. Informing the curious negotiator: Automatic news extraction from the internet
CN107357881A (en) A kind of Chinese Text Classification System based on news data
KR20020089677A (en) Method for classifying a document automatically and system for the performing the same
Tsay Literature growth, journal characteristics, and author productivity in subject indexing, 1977 to 2000
Rosenbusch Are our users being served?: a report on online archival databases.
US20060190534A1 (en) Method and system for browsing a plurality of information items
JPH10162011A (en) Information retrieval method, information retrieval system, information retrieval terminal equipment, and information retrieval device

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: EXPRESSLY ABANDONED -- DURING EXAMINATION