CN1296860C - Digital data characteristic management system and method thereof - Google Patents

Publication number
CN1296860C
Authority
CN
China
Prior art keywords
characteristic
classification
digital data
numerical data
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2003101023854A
Other languages
Chinese (zh)
Other versions
CN1612152A (en)
Inventor
叶秀雄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Besta Co Ltd
Original Assignee
Inventec Besta Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Besta Co Ltd
Priority to CNB2003101023854A
Publication of CN1612152A
Application granted
Publication of CN1296860C
Anticipated expiration
Expired - Fee Related

Abstract

The present invention relates to a digital data characteristic management system and method. The features of digital data captured by an image capture device are analyzed according to a feature setting specified by the user, and storage of that data is managed according to a classification setting specified by the user. When the user enters a search condition, the system associates it with the stored classifications and retrieves the digital data the user needs. The digital data captured by the image capture device can thus be classified and stored systematically and searched efficiently.

Description

Digital data characteristic management system and method thereof
Technical field
The present invention relates to a digital data system and method, and in particular to a system and method, applied in an image capture device, that classifies and stores digital data according to the features of its text blocks and provides associative retrieval.
Background art
Image capture devices, such as scanners, digital cameras, and the cameras built into other portable devices (for example mobile phones, portable computers, and personal digital assistants), usually only capture and store images. They generally cannot classify and manage the captured images effectively, so once a large amount of image data has been stored, the user can no longer keep track of its content or make good use of it.
Many existing image capture devices can file captured images in a simple way, for example by capture time or by a uniform naming rule. These schemes, however, are mostly fixed by the system, so the filed digital data carries no meaning for the user and remains quite inconvenient to search. An improved method of classified storage is therefore needed.
How to use computer software in an image capture device to classify and store the captured image data appropriately, and to give the user an efficient way to search it, is therefore a function that future image capture devices will need.
Summary of the invention
The object of the present invention is to provide a digital data characteristic management system and method that uses computer software on an existing image capture device to classify, store, and manage the digital data the device captures.
To achieve this object, the invention provides a digital data characteristic management system that runs in an image capture device and performs feature parsing and classified storage management on the digital data the device captures. The image capture device includes a central processing unit that coordinates communication among the units and modules, an image acquisition unit that captures an external image and digitizes it, a storage unit that stores the system and its data, and a display unit that displays data and results. The system comprises: an operation setting module, which provides setting operations for a feature setting and a classification setting and an input operation for a search condition, and which carries out operations among the units and modules through the central processing unit, wherein the feature setting includes a font Boolean value, a typeface Boolean value, and an order value, each with its own weight; and a parsing and classification module, which, in response to user operation, determines the unique feature data of the digital data according to the feature setting, performs classified storage management of the digital data according to the classification setting, and, on receiving the search condition, parses it and produces a list of the feature data.
The invention also provides a digital data characteristic management method that performs feature parsing and classified storage management on the digital data captured by an image capture device. The method comprises the following steps: capturing an image and converting it into digital data; performing image-and-text feature parsing to determine feature data; classifying the feature data according to a classification setting; and assigning the corresponding classification number to the digital data and storing it. The step of performing image-and-text feature parsing to determine feature data further comprises: recognizing the image to produce a plurality of text blocks; parsing each text block according to a feature setting; computing a weighted conformity ratio for each text block; selecting the text blocks whose conformity ratio exceeds a preset conformity ratio; and converting the text block with the highest conformity ratio into the feature data.
In other words, for classified storage, the invention uses the features set by the user to parse the images and text in a captured image, takes the text block that best matches those features as the feature data, and then stores the digital data according to the classification scheme set by the user. For data retrieval, it parses the search condition entered by the user to establish associations, then finds the corresponding digital data according to those associations and displays it to the user.
The system and method of the invention thus provide, in the field of digital data processing, a technical means of extracting and recognizing the features of digital data, which assists in classifying and storing that data, together with a technical means of associative retrieval, which lets the user find the required digital data quickly and efficiently.
Feasible embodiments of the invention are described below with reference to the drawings.
Description of drawings
Fig. 1 is a system block diagram of the digital data characteristic management system and method of the present invention;
Fig. 2 is a flow chart of classified storage in the digital data characteristic management system and method of the present invention;
Fig. 3 is a flow chart of feature parsing in the digital data characteristic management system and method of the present invention;
Fig. 4 is a flow chart of data retrieval in the digital data characteristic management system and method of the present invention; and
Fig. 5 is a schematic diagram of a feature parsing embodiment of the digital data characteristic management system and method of the present invention.
The reference numerals are as follows:
10 - image acquisition unit; 20 - parsing and classification module; 30 - operation setting module;
40 - storage unit; 50 - display unit; 60 - central processing unit;
70 - image source; 100 - image capture device;
Step 200 - capture an image and convert it into digital data;
Step 300 - perform image-and-text feature parsing and determine feature data;
Step 310 - recognize the image to produce a plurality of text blocks;
Step 320 - parse each text block according to a feature setting;
Step 330 - compute a weighted conformity ratio for each text block;
Step 340 - select the text blocks whose conformity ratio exceeds the preset conformity ratio;
Step 350 - convert the text block with the highest conformity ratio into the feature data;
Step 400 - classify the feature data according to a classification setting;
Step 500 - assign the corresponding classification number to the digital data and store it;
Step 600 - receive a search condition entered by the user;
Step 700 - parse the search condition and map it to classification numbers;
Step 800 - produce a feature data list from the classification numbers;
Step 900 - retrieve the corresponding digital data according to the user's selection.
Embodiment
The invention proposes a digital data characteristic management system and method in which computer software running in the image capture device 100 classifies and stores a captured image according to its features when the image source 70 is captured, and lets the user perform associative retrieval.
Referring to Fig. 1, the system adds computer software to an existing image capture device 100. The parts of the system are described below. A typical image capture device 100 includes the following components. The central processing unit 60 coordinates communication among the units and modules and carries out any operation or setting the user makes. The image acquisition unit 10 captures the external image source 70 and digitizes it; it usually consists of a lens, a photosensitive component, and an analog-to-digital conversion component. After the lens captures the image, the photosensitive component senses and receives the image information, and the analog-to-digital conversion component converts the analog information into digital data for subsequent processing. The operating principle of the image acquisition unit 10 is well known and is not described further here. The storage unit 40 stores the system program, the related settings, and the captured digital data; at present it is most commonly flash memory. The display unit 50 displays the digital data and the results of the user's retrieval; at present it is most commonly a liquid crystal display (LCD).
The components above are essential to a conventional image capture device 100. The system of the invention is characterized by the following modules, which are implemented in computer software:
(1) The parsing and classification module 20. In response to user operation, when the image capture device 100 captures the image source 70, this module determines the unique feature data of the captured digital data according to the user's feature setting. The feature data must satisfy the conformity ratio set in the parsing and classification module 20; otherwise it would not be accurate enough. The module then classifies and stores the feature data, which has been converted to text form (text mode), together with its corresponding digital data, according to the user's classification setting. In other words, the parsing and classification module also holds a conformity ratio used to screen the feature data.
During data retrieval, the module parses the search condition entered by the user, finds the matching feature data by association, and produces a feature data list from which the user can choose.
(2) The operation setting module 30. This module lets the user make the feature setting and the classification setting in advance and, during data retrieval, enter the search condition. It also controls operation between the central processing unit 60 and the units and modules in order to carry out the other functions of the image capture device 100.
The feature setting lets the system extract the most representative content of the digital data. It includes at least the following settings, each with its own weight: a font Boolean value, a typeface Boolean value, and an order value. That is, for each text block in the digital data, the font, the typeface, and the layout position all affect which block is chosen as the feature data (the higher the priority of the layout position, the higher the order value). The user can also freely adjust the weight of each setting to suit the nature of the digital data, so that the recognized feature data better meets the user's needs.
The classification setting associates keywords with classifications. The user can enter any word as a keyword, and the classification setting contains at least one keyword. The system assigns each keyword a corresponding classification number, so that the corresponding feature data and digital data can be found quickly during retrieval.
The search condition is what the user enters to look up digital data. It can be one or more query words, which are combined into a compound query, or it can be natural language. Natural language input is parsed by the parsing and classification module 20, mapped to keywords, and turned into the corresponding classification numbers, which are used to find the corresponding digital data.
Fig. 2 is a flow chart of the classified storage method of the invention. First, the image capture device 100 captures the image source 70 and converts it into digital data (step 200); this is a basic operation of any image capture device 100. Next, image-and-text feature parsing is performed to determine the feature data of the digital data (step 300); the details of this step are described with reference to Fig. 3. The digital data, which now has feature data, is then classified according to the classification setting made by the user (step 400). The classification setting contains one or more user-defined keywords, each with a corresponding classification number; when a keyword appears in the feature data, the digital data is assigned to that keyword's classification number. Finally, the corresponding classification number is assigned to the digital data and the data is stored (step 500). This completes the classified storage flow.
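Steps 400 and 500 can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: the keyword table, the `classify` and `store` functions, the in-memory `storage` dictionary, and the case-insensitive substring match are all assumptions made for the example. The text above does not say how a keyword match is tested, nor whether one item may receive several classification numbers.

```python
# Hypothetical classification setting: user keyword -> system-assigned number.
CLASSIFICATION_SETTING = {"invoice": 1, "meeting": 2, "recipe": 3}


def classify(feature_text, setting=CLASSIFICATION_SETTING):
    """Step 400: return the classification numbers whose keyword
    appears in the feature text (assumed: case-insensitive substring)."""
    lowered = feature_text.lower()
    return sorted(num for kw, num in setting.items() if kw.lower() in lowered)


def store(storage, image_id, feature_text, setting=CLASSIFICATION_SETTING):
    """Step 500: file the item under every matching classification number.

    `storage` is a plain dict standing in for the device's storage unit.
    """
    numbers = classify(feature_text, setting)
    for number in numbers:
        storage.setdefault(number, []).append((image_id, feature_text))
    return numbers
```

Keeping the feature data as text next to the image identifier is what later allows a keyword lookup to reach the stored image.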
The flow for parsing feature data out of the digital data is described with reference to Fig. 3. First, text block recognition is performed on the captured digital data (step 310); image and text recognition is a known technique and is not described further. Each text block is then parsed in turn according to the feature setting made in advance (step 320); the feature setting includes a font Boolean value, a typeface Boolean value, and an order value, each with its own weight. Next, a weighted conformity ratio is computed for each text block from the differences in its font, typeface, and order (step 330). All text blocks whose conformity ratio exceeds the preset conformity ratio are then selected (step 340); the preset conformity ratio is a parameter the user sets to screen for accurate feature data. Finally, the text block with the highest conformity ratio is set as the feature data of the digital data (step 350). The feature data is in text form, so it can be used for subsequent associative retrieval. This completes the feature parsing flow.
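Steps 330 to 350 amount to a weighted sum, a threshold filter, and a maximum. The sketch below assumes the text blocks have already been recognized and parsed (steps 310 and 320). The `TextBlock` class, its field names, and the default weights and threshold are hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class TextBlock:
    text: str
    font_flag: int      # 1 if the block's font differs from the other blocks
    typeface_flag: int  # 1 if the block's typeface differs from the others
    order_value: int    # higher value = higher-priority layout position


# Assumed defaults; in the described system the user sets these values.
WEIGHTS = (0.25, 0.25, 0.50)
THRESHOLD = 1.0


def conformity(block, weights=WEIGHTS):
    """Step 330: weighted sum of the three feature values."""
    w_font, w_typeface, w_order = weights
    return (w_font * block.font_flag
            + w_typeface * block.typeface_flag
            + w_order * block.order_value)


def select_feature(blocks, weights=WEIGHTS, threshold=THRESHOLD):
    """Steps 340-350: keep blocks scoring above the threshold and return
    the text of the highest-scoring one, or None if no block qualifies."""
    passing = [(conformity(b, weights), b) for b in blocks]
    passing = [(score, b) for score, b in passing if score > threshold]
    if not passing:
        return None
    _, best = max(passing, key=lambda pair: pair[0])
    return best.text
```

Returning `None` when no block passes the threshold is a design choice of this sketch; the text does not say what the system does in that case.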
Fig. 4 is a flow chart of data retrieval according to the invention. To find previously stored digital data, the user only needs to enter a search condition. First, the search condition entered by the user is received (step 600); it can be one or more query words for a compound search, or natural language. The search condition is then parsed and mapped to classification numbers (step 700). If the search condition consists of query words, the classification setting is searched directly for matching keywords. If it is natural language, lexical analysis is first applied to extract precise query words, and the keyword lookup is then performed; the lexical analysis can use existing techniques. The matching classification numbers are found by association. Next, the corresponding feature data is found from the classification numbers and presented to the user as a feature data list (step 800). Finally, the user selects any entry in the feature data list, and the corresponding digital data is retrieved for the user (step 900). This completes the data retrieval flow.
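Steps 700 and 800 can be sketched as a lookup from query words to classification numbers, followed by a lookup from numbers to stored feature data. The function names and data shapes below are assumptions. In particular, the regular-expression word split is only a stand-in for the lexical analysis mentioned above, and it would not segment unspaced text such as Chinese.

```python
import re


def parse_query(query, setting):
    """Step 700: map the words of a search condition to classification
    numbers. `setting` maps keyword -> classification number."""
    words = re.findall(r"\w+", query.lower())
    keywords = {kw.lower(): num for kw, num in setting.items()}
    return sorted({keywords[w] for w in words if w in keywords})


def feature_list(query, setting, storage):
    """Step 800: collect the (image_id, feature_text) entries stored under
    every matched classification number. `storage` maps number -> entries."""
    results = []
    for number in parse_query(query, setting):
        results.extend(storage.get(number, []))
    return results
```

For step 900, the image identifier in the entry the user selects would then be used to load the stored image itself.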
Fig. 5 is a schematic diagram of a simple feature parsing embodiment of the invention. Suppose the weights of the font Boolean value, the typeface Boolean value, and the order value in the feature setting are 25%, 25%, and 50% respectively, and the preset conformity ratio requires a value greater than 1. Block A differs from blocks B and C in both font and typeface, and it also has the highest-priority order (the higher the priority, the higher the order value), so its feature values are (1, 1, 3). Blocks B and C have the same font and typeface and differ only in order, so their feature values are (0, 0, 2) and (0, 0, 1) respectively. After weighting, the conformity ratios of the three blocks are 2, 1, and 0.5. Block A has the highest conformity ratio and exceeds the preset conformity ratio, so it is taken as the feature data of this digital data and is used for classified storage. In practice, the items of the feature setting are not limited to those in this example; the more items there are, each with its own weight, the more accurate and the easier the determination of the feature data becomes.
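The arithmetic behind the Fig. 5 example can be written out as a weighted sum. The symbols $s$, $w_i$, and $f_i$ are notation introduced here for illustration and do not appear in the original text; $f_1$, $f_2$, and $f_3$ stand for the two Boolean values and the order value of a block.

```latex
\begin{aligned}
s   &= w_1 f_1 + w_2 f_2 + w_3 f_3, \qquad (w_1, w_2, w_3) = (0.25,\ 0.25,\ 0.5) \\
s_A &= 0.25 \cdot 1 + 0.25 \cdot 1 + 0.5 \cdot 3 = 2 \\
s_B &= 0.25 \cdot 0 + 0.25 \cdot 0 + 0.5 \cdot 2 = 1 \\
s_C &= 0.25 \cdot 0 + 0.25 \cdot 0 + 0.5 \cdot 1 = 0.5
\end{aligned}
```

Only $s_A$ is strictly greater than the threshold of 1. Block B sits exactly on the threshold and is therefore screened out, together with block C.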
The above is only a preferred embodiment of the invention and is not intended to limit its scope; all equivalent changes and modifications made according to the claims of the present application are covered by the claims of the invention.

Claims (9)

1. A digital data characteristic management system that runs in an image capture device and performs feature parsing and classified storage management on the digital data captured by the image capture device, the image capture device including a central processing unit that coordinates communication among the units and modules, an image acquisition unit that captures an external image and digitizes it, a storage unit that stores the system and data, and a display unit that displays data and results, characterized in that the system comprises:
an operation setting module, which provides setting operations for a feature setting and a classification setting and an input operation for a search condition, and which carries out operations among the units and modules through the central processing unit, wherein the feature setting includes a font Boolean value, a typeface Boolean value, and an order value with different weights; and
a parsing and classification module, which, in response to operation, determines the unique feature data of the digital data according to the feature setting, performs classified storage management of the digital data according to the classification setting, and, on receiving the search condition, parses the search condition and produces a list of the feature data.
2. The digital data characteristic management system as claimed in claim 1, characterized in that the classification setting includes at least one keyword, and each keyword has a corresponding classification number.
3. The digital data characteristic management system as claimed in claim 1, characterized in that the parsing and classification module further includes a conformity ratio used to screen the feature data.
4. The digital data characteristic management system as claimed in claim 1, characterized in that the feature data is in text form.
5. A digital data characteristic management method that performs feature parsing and classified storage management on the digital data captured by an image capture device, characterized in that the method comprises the following steps:
capturing an image and converting it into digital data;
performing image-and-text feature parsing to determine feature data, wherein this step further comprises: recognizing the image to produce a plurality of text blocks; parsing each text block according to a feature setting; computing a weighted conformity ratio for each text block; selecting the text blocks whose conformity ratio exceeds a preset conformity ratio; and converting the text block with the highest conformity ratio into the feature data;
classifying the feature data according to a classification setting; and
assigning the corresponding classification number to the digital data and storing it.
6. The digital data characteristic management method as claimed in claim 5, characterized in that the feature data is in text form.
7. The digital data characteristic management method as claimed in claim 5, characterized in that the classification setting includes at least one keyword, and each keyword has a corresponding classification number.
8. The digital data characteristic management method as claimed in claim 5, characterized in that the feature setting includes at least a font Boolean value, a typeface Boolean value, and an order value with different weights.
9. The digital data characteristic management method as claimed in claim 5, characterized in that the method further comprises a data retrieval step, which comprises the following steps:
receiving a search condition entered by the user;
parsing the search condition and mapping it to classification numbers;
producing a feature data list from the classification numbers; and
retrieving the corresponding digital data according to the user's selection.
CNB2003101023854A 2003-10-27 2003-10-27 Digital data characteristic management system and method thereof Expired - Fee Related CN1296860C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2003101023854A CN1296860C (en) 2003-10-27 2003-10-27 Digital data characteristic management system and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2003101023854A CN1296860C (en) 2003-10-27 2003-10-27 Digital data characteristic management system and method thereof

Publications (2)

Publication Number Publication Date
CN1612152A CN1612152A (en) 2005-05-04
CN1296860C true CN1296860C (en) 2007-01-24

Family

ID=34756389

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2003101023854A Expired - Fee Related CN1296860C (en) 2003-10-27 2003-10-27 Digital data characteristic management system and method thereof

Country Status (1)

Country Link
CN (1) CN1296860C (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635763A (en) * 2008-07-23 2010-01-27 深圳富泰宏精密工业有限公司 Picture classification system and method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6175830B1 (en) * 1999-05-20 2001-01-16 Evresearch, Ltd. Information management, retrieval and display system and associated method
CN1363898A (en) * 2000-12-28 2002-08-14 富士通株式会社 Purchasing method and systems on line

Also Published As

Publication number Publication date
CN1612152A (en) 2005-05-04

Similar Documents

Publication Publication Date Title
CN1240011C (en) File classifying management system and method for operation system
US9552511B2 (en) Identifying images using face recognition
Wang et al. Large-scale duplicate detection for web image search
CN1278533C (en) Handset capable of automatically recording characters and images, and method of recording and processing thereof
CN1432947A (en) Multimedia object searchine device and method
CN1783069A (en) Systems and methods for document data analysis
Valsesia et al. Large-scale image retrieval based on compressed camera identification
CN105824862A (en) Image classification method based on electronic equipment and electronic equipment
CN102165486B (en) Image characteristic amount extraction device
CN108389394B (en) Method and system for analyzing initial city entry of vehicle
CN1818908A (en) Feedbakc information use of searcher in search engine
CN1666200A (en) Method and apparatus for classification of a data object in a database
CN102165490A (en) Image identity scale calculating system
CN1439997A (en) Fingerprint identifying method and system
CN111107319A (en) Target tracking method, device and system based on regional camera
CN101038742A (en) Apparatus and method for assistant voice remote control using image feature
CN105095468A (en) Novel image retrieval method and system
CN1253815C (en) Computer recognizing and indexing method of Chinese names
CN1296860C (en) Digital data characteristic management system and method thereof
CN1949732A (en) Method and system for network community and searching combination
US20220019764A1 (en) Method and device for classifying face image, electronic device and storage medium
CN1271537C (en) Method of converting handwritten note into literal text and traveling equipment therefor
CN1271134A (en) Dynamic feedback and inquiring method for network system
CN1828605A (en) Metadata analysis and processing method in multimedia service, and mobile communication terminal
CN1193309C (en) Key association system and method for searching engine

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070124

Termination date: 20141027

EXPY Termination of patent right or utility model