CN1315084C - A professional searching engine data gathering method - Google Patents

A professional searching engine data gathering method Download PDF

Info

Publication number
CN1315084C
CN1315084C CNB2004100401910A CN200410040191A CN1315084C CN 1315084 C CN1315084 C CN 1315084C CN B2004100401910 A CNB2004100401910 A CN B2004100401910A CN 200410040191 A CN200410040191 A CN 200410040191A CN 1315084 C CN1315084 C CN 1315084C
Authority
CN
China
Prior art keywords
search engine
data
article
picture
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100401910A
Other languages
Chinese (zh)
Other versions
CN1595401A (en
Inventor
朱龙安
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNB2004100401910A priority Critical patent/CN1315084C/en
Publication of CN1595401A publication Critical patent/CN1595401A/en
Application granted granted Critical
Publication of CN1315084C publication Critical patent/CN1315084C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The present invention discloses a professional search engine data gathering method which is characterized in that the present invention comprises the following steps: (1) a management thermal receives pictures or articles inputted by users and descriptive data or keywords related to the pictures or the articles, and the pictures or the articles and the descriptive data or the keywords related to the pictures or the articles are uploaded to own websites of the users according to uploading commands of the users; (2) a sending assembly is started, and the descriptive data or the keywords inputted by the users and related to the pictures or the articles are sent to search engine websites; (3) the descriptive data or the keywords are received by the search engine websites which analyze and conform to form a picture or article descriptive database of a search engine, and the picture or the article descriptive database can be searched by the users. The present invention can establish the professional search engine, the occupation space of useless data is avoided as far as possible when the search accuracy is greatly improved, and the present invention increases the effective utilization rate of the search engine websites.

Description

A kind of specialized search engine data collection method
Technical field:
The present invention relates to a kind of method of data capture that is used for search engine.
Background technology:
At present, various search websites emerge in an endless stream on the internet.Industry calls " search engine " year to 2003, and each macroreticular giant is one after another the huge online cake of this piece of target diversion " search engine ".One of first cause is exactly the growth along with how much levels of internet information amount, pendulum in face of the netizen is the various information of flood tide numerous and complicated, not only mixed and disorderly but also a lot of repetition of these information wants to find the content that is fit to oneself in boundless and indistinct net sea, just must be by search engine.The quantity of information of internet is big more, and the importance of search engine is just high more.
But these search engine web sites are the ubiquity weak point aspect commercial picture of search and professional article, present search engine obtains relevant search data by thousands upon thousands webpages on the scanning internet basically, these huge data are to the value of picture or the professional very difficult judgement of article content, so these search engine web sites are difficult to satisfactory to the search of valuable commercial picture or professional article.Particularly aspect commercial picture searching, present search engine works hardly.
The reasons for the above problems are partly relevant with " comprehensive " of search engine.Present search engine mainly is the search engine of " comprehensive " of picture Google, Baidu, Yahoo and so on, so-calledly comprehensively refers to that almost what can both search out on their search engine.Wide more as an individual knowledge face, he just is difficult to accomplish to be proficient in each intellectual grasp.Because present search engine groundwork principle is to scan in thousands upon thousands webpages, and the major part of scanning is static page, and the content that scanning is come out involves a wide range of knowledge and be assorted, therefore is difficult in very meticulous that certain professional search aspect does.But along with Internet development, people must be more and more higher to the professional requirement of search! We have reason to believe that specialized search engine will be following main flow! What people need search for will arrive removal search on the search engine of certain aspect specialty.
Summary of the invention:
Purpose of the present invention is exactly in order to overcome the above problems, and a kind of specialized search engine method of data capture is provided, and utilizes it can set up professional search engine, can farthest reject gibberish void again and take up space.
For achieving the above object, the present invention proposes a kind of specialized search engine method of data capture, step (1) office terminal receives picture or the article and data of description or the keyword relevant with this picture or article of user's input, and according to user's last teletype command, to website uploading pictures or article and data of description or the keyword relevant that the user controls oneself with this picture or article, it is characterized in that comprising the steps: that (2) start sending assembly, data of description or the keyword relevant with this picture or article of user's input are sent to search engine web site; (3) search engine web site receives this data of description or keyword, and carries out analytical integration, forms the picture or the article descriptive data base of search engine, for the search subscriber search.
(picture then is to be stored in thousands upon thousands user websites because search engine has only been collected the descriptor relevant with picture or article, disperseed the storage area cleverly), thereby the requirement of search engine self website requisite space greatly reduced, especially for the picture searching website, this point is extremely important! Because utilize the present invention can not need the picture storage area basically.On the other hand, the mode that the present invention collects relevant data is that the office terminal initiatively starts the form that sending assembly sends, search engine web site only needs data of description or the keyword of passive reception about picture or article, therefore, the data of search engine web site are real-time update basically, unlike existing search engine web site, need a regular week of Spider system program, two weeks even just carried out the collection and the renewal of new data in one month, the real-time update of search engine web site data and variation have guaranteed that search subscriber can at every moment search out up-to-date information.The information age that this point became for moment ten thousand, extremely important! Also have because picture or article system user's regularity and professional, resulting data of description of search engine and keyword comparatively speaking more accurately, specialty more.Therefore, the result's who comes out by search engine searches accuracy will improve greatly! Can be good at realizing specialized search.
Description of drawings:
Fig. 1 is the embodiment of the invention one a flow process synoptic diagram.
Fig. 2 is the overview synoptic diagram of searching the figure engine according to the embodiment of the invention one operation.
Embodiment:
Also the present invention is described in further detail in conjunction with the accompanying drawings below by specific embodiment.
Embodiment one:
See Fig. 1,2, this example has illustrated the situation of a photographic search engine, is called for short " searching the figure engine ".
Shown in Fig. 1 process flow diagram, this search engine data collection method comprises the steps: that (1) office terminal receives picture or the article and data of description or the keyword relevant with this picture or article of user's input, and according to user's last teletype command, to website uploading pictures or article and data of description or the keyword relevant with this picture or article that the user controls oneself: (2) start sending assembly, and the data of description relevant with this picture or article or the keyword of user's input are sent to search engine web site (server address of a preassigned search engine); (3) search engine web site receives this data of description or keyword, and automatically to these data according to classification, data such as professional parameter, descriptive information, author, price, carry out analytical integration (for example: also can as existing search engine web site), the category classification and storage is set up index, form the picture or the article descriptive data base of search engine, be stored as the form of suitable search, for the search subscriber search.
Need to prove, utilizing the present invention to set up to search the figure engine needs office terminal (being the computer terminal that the user manages the website of controlling oneself), also need the program assembly that sends and receives information, this assembly can be (assembly that the Windows that uses of following literary grace carries) that computer operating system carries, also can be install separately or picture, article supervisory routine carry.The present embodiment specialized designs one " pictures management platform " select for use for the user: this is that a cover offers the individual, the procedure site that uses such as photographer, shutterbugs, artist, artwork collector, artist particularly, the user can upload and manage the picture of oneself by this procedure site, show the works of oneself on the net, exchange works gains in depth of comprehension and relevant technologies etc. with the online friend.The user uses this " pictures management platform ", in uploading pictures, automatically call Microsoft.XMLHTTP (the Microsoft spare company provides) assembly that Windows carries, the relevant data of the picture of being uploaded or keyword (as picture classification, picture name, author, image parameters, date issued, picture character (original still reprinting) and the picture owner and price thereof etc.) are sent to the server at " searching the figure engine " place by this assembly.
Correspondingly, in the server of searching figure engine place, also need a receiving unit, program can be by calling Microsoft.XMLDOM (the Microsoft spare company provides) assembly that Windows operating system carries for example " to search the figure engine ", all data that automatic reception sends over from " pictures management platform ", and automatically to these data according to classification, professional parameter, descriptive information, the author, data such as price, carry out analytical integration, formed the database that comprises numerous pictorial informations, finally become " search engine " that a special search pictures is used, promptly " search the figure engine ".(transmission of image data and reception all are to carry out automatically, do not need manual intervention)
Because user's faulty operation or other reasons, image data needs probably to be modified or is deleted, therefore the present invention also provides more operational means such as new data and deleted data, comprise step: (4) office terminal receives the instruction of user's input, send to website that the user controls oneself and to revise or delete instruction, picture that the user has been uploaded or article or the data of description relevant with this picture or article are made amendment or are deleted; (5) start sending assembly, described modification or delete instruction information are sent to search engine web site; (6) search engine web site receives this modification or delete instruction information, and the picture in the search engine or article data of description are carried out website corresponding modification or deletion action with the user.Because its principle is similar to Fig. 1, therefore omit its process flow diagram.
Specifically, comprising:
One, new data more:
Use the user of " pictures management platform ", revise pictorial information each time, the capital is by sending information to " searching the figure engine " with the same mode of transmission data, " search the figure engine " and revise relevant pictorial information database by the mode identical again, accomplish the image data information of " searching the figure engine " lining and the image data information synchronization renewal of " pictures management platform " with receiving data.(" synchronously " here be not refer to proper synchronously, can allow the regular hour poor)
Two, deleted data:
Use the user of " pictures management platform ", delete pictorial information each time, the capital is by sending information to " searching the figure engine " with the same mode of transmission data, after " searching the figure engine " and receiving deletion information, automatically the relevant pictorial information of deletion remains image data information of " searching the figure engine " lining and the image data information synchronization deletion of " pictures management platform " lining.(" synchronously " here be not refer to proper synchronously, can allow the regular hour poor)
According to the abovementioned embodiments of the present invention, by searching the cooperation of figure engine website and picture management system, can collect valuable and specialized image data targetedly,, and finally realize commercial value with related service on the net with the commercial value of lifting website self.Fig. 2 shows an operation situation of searching the figure engine.
Embodiment two:
Same reason, for the article management system, only need when uploading article, set up some data such as keyword about this piece article, then data such as these keywords even entire article are sent to the server at search engine place, so just can do the article management system of certain aspect specialty and relevant article search engine.
Protection scope of the present invention is not limited only to the foregoing description, utilize the present invention to conceive and develop realization any program language,, picture management system, article management system and the relevant image searching method and the article searching method of any database all belong to protection scope of the present invention.

Claims (3)

1, a kind of specialized search engine method of data capture, step (1): the office terminal receives picture or the article and data of description or the keyword relevant with this picture or article of user's input, and according to user's last teletype command, to website uploading pictures or article and data of description or the keyword relevant that the user controls oneself with this picture or article, it is characterized in that comprising the steps: that (2) start sending assembly, data of description or the keyword relevant with this picture or article of user's input are sent to search engine web site; (3) search engine web site receives this data of description or keyword, and carries out analytical integration, forms the picture or the article descriptive data base of search engine, for the search subscriber search.
2, a kind of specialized search engine method of data capture as claimed in claim 1, it is characterized in that described sending assembly comprise that computer operating system carries, install separately or picture, article supervisory routine carry.
3, a kind of specialized search engine method of data capture as claimed in claim 1 or 2, it is characterized in that comprising the steps: that (4) office terminal receives the instruction of user's input, send to website that the user controls oneself and to revise or delete instruction, picture that the user has been uploaded or article or the data of description relevant with this picture or article or keyword are made amendment or are deleted; (5) start sending assembly, described modification or delete instruction information are sent to search engine web site; (6) search engine web site receives this modification or delete instruction information, and the picture in the search engine or article data of description or keyword are carried out website corresponding modification or deletion action with the user.
CNB2004100401910A 2004-07-05 2004-07-05 A professional searching engine data gathering method Expired - Fee Related CN1315084C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100401910A CN1315084C (en) 2004-07-05 2004-07-05 A professional searching engine data gathering method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100401910A CN1315084C (en) 2004-07-05 2004-07-05 A professional searching engine data gathering method

Publications (2)

Publication Number Publication Date
CN1595401A CN1595401A (en) 2005-03-16
CN1315084C true CN1315084C (en) 2007-05-09

Family

ID=34664521

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100401910A Expired - Fee Related CN1315084C (en) 2004-07-05 2004-07-05 A professional searching engine data gathering method

Country Status (1)

Country Link
CN (1) CN1315084C (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101009009B (en) * 2006-01-26 2015-07-15 腾讯科技(深圳)有限公司 Individualized Information display method
CN101814080B (en) * 2006-09-05 2013-02-13 阿里巴巴集团控股有限公司 Method and device for realizing information search
CN101140573B (en) * 2006-09-05 2010-07-14 阿里巴巴集团控股有限公司 Method and system for realizing information searching
US7941467B2 (en) 2007-05-29 2011-05-10 Research In Motion Limited System and method for integrating image upload objects with a message list
CN102339292A (en) * 2010-07-27 2012-02-01 中国电信股份有限公司 Distributed searching method and system
CN104572741A (en) * 2013-10-24 2015-04-29 北京万嘉祉洋教育科技有限公司 Data and material processing method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001069450A2 (en) * 2000-03-10 2001-09-20 General Electric Company Method for automated web site maintenance via searching
WO2001080077A1 (en) * 2000-04-18 2001-10-25 Korea Telecom Method and system for retrieving information based on meaningful core word
WO2003027907A1 (en) * 2001-09-28 2003-04-03 Client Dynamics,Inc. Method and system for database queries and information delivery

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001069450A2 (en) * 2000-03-10 2001-09-20 General Electric Company Method for automated web site maintenance via searching
WO2001080077A1 (en) * 2000-04-18 2001-10-25 Korea Telecom Method and system for retrieving information based on meaningful core word
WO2003027907A1 (en) * 2001-09-28 2003-04-03 Client Dynamics,Inc. Method and system for database queries and information delivery

Also Published As

Publication number Publication date
CN1595401A (en) 2005-03-16

Similar Documents

Publication Publication Date Title
US20230259491A1 (en) Context-based file selection
JP6854041B2 (en) Project management in a content management system
US11023858B2 (en) System and method for generating desktop focus work areas
US8533199B2 (en) Intelligent bookmarks and information management system based on the same
US7725451B2 (en) Generating clusters of images for search results
US7930301B2 (en) System and method for searching computer files and returning identified files and associated files
US11461428B2 (en) Intelligently generating and managing third-party sources within a contextual hub
US20070112719A1 (en) System and method for dynamically generating and managing an online context-driven interactive social network
US7346607B2 (en) System, method, and software to automate and assist web research tasks
GB2327787A (en) Data classification and retrieval system
US20100293157A1 (en) Information processing apparatus for generating ranking information representing degree of popularity of data and information processing method therefor
US20040054670A1 (en) Dynamic object type for information management and real time graphic collaboration
CN110362740B (en) Water conservancy portal information hybrid recommendation method
CN110417873B (en) Network information extraction system for realizing recording webpage interactive operation
CN102214183A (en) Search engine query method for combining feedback contents of pages with fixed ranking
CN108874722A (en) A kind of electronic-book reading system
CN107291940A (en) Content of pages management method, device and associated server
CN103745006A (en) Internet information searching system and internet information searching method
CN1315084C (en) A professional searching engine data gathering method
AU2009215809A1 (en) Methods, systems, and computer program products for retrieving a file of machine-readable data
JP4469432B2 (en) INTERNET INFORMATION PROCESSING DEVICE, INTERNET INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM CONTAINING PROGRAM FOR CAUSING COMPUTER TO EXECUTE THE METHOD
RU2635886C2 (en) Systems and methods for managing files through mobile computer devices
JP7336400B2 (en) Image processing device, image processing method, program and image processing system
CN110134851B (en) Search engine system based on domain intranet and construction method
Wang et al. Integration system of network information resources based on multi-agent collaboration

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070509

Termination date: 20110705