WO2003017141A1 - Method for adding metadata to data - Google Patents

Method for adding metadata to data Download PDF

Info

Publication number
WO2003017141A1
WO2003017141A1 PCT/US2002/025700 US0225700W WO03017141A1 WO 2003017141 A1 WO2003017141 A1 WO 2003017141A1 US 0225700 W US0225700 W US 0225700W WO 03017141 A1 WO03017141 A1 WO 03017141A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
labels
file
label
program
Prior art date
Application number
PCT/US2002/025700
Other languages
French (fr)
Other versions
WO2003017141A9 (en
Inventor
Robert S. Block
Gary Gannon
Uday Kothari
Jonathan Lau
Fred Chapus
Original Assignee
Ubmatrix Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ubmatrix Llc filed Critical Ubmatrix Llc
Priority to KR1020047002364A priority Critical patent/KR100990018B1/en
Priority to CN02820705XA priority patent/CN1571968B/en
Priority to JP2003521979A priority patent/JP2005500623A/en
Priority to EP02753452A priority patent/EP1425683A4/en
Priority to NZ531175A priority patent/NZ531175A/en
Publication of WO2003017141A1 publication Critical patent/WO2003017141A1/en
Publication of WO2003017141A9 publication Critical patent/WO2003017141A9/en
Priority to HK05106337.4A priority patent/HK1073898A1/en
Priority to AU2009201447A priority patent/AU2009201447A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/84Mapping; Conversion
    • G06F16/86Mapping to a database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99943Generating database or data structure, e.g. via user interface
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • Y10S707/99945Object-oriented database structure processing

Definitions

  • the invention relates generally to the field of data processing.
  • XML extensible Markup Language
  • XBRL extensible Business Reporting Language
  • Semantic Web where information Labels are used to facilitate computers talking to computers making decisions and taking action as a result of the communication.
  • Other standards already exist and more will be developed that will benefit by the basic theory of this invention.
  • the recognized practical approach to standardizing the meaning of data is to attach defined Information Labels to the information being conveyed. In this way the meaning of the data can be determined by reviewing the definition of the label. It also means that computers can recognize the "meaning" of the tagged information and act on it based on that meaning. For example, data with the same "tag” can be added or compared without fear of adding or comparing apples and oranges. Taxonomies and their extensions are used to define the Information Labels.
  • the label ⁇ Sales> followed by a numerical value indicates that the numerical value relates to company's Sales.
  • the taxonomies used by standards organizations also include synonyms and alternative phrases that have the same meaning.
  • synonyms for Sales could include "Revenues” or “Fees”.
  • Cost of Goods Sold might be "Cost of Goods” or "Cost of Sales”.
  • the Information Labels can also carry information regarding the organizational authority that defined the label. If the taxonomy were authored by the US Securities & Exchange Commission the labels based on that taxonomy might be identified as USSEC, and so on.
  • a data stream is captured, data in the captured stream are identified, and then the identified data are mapped to a file structure, a schema, or a taxonomy.
  • the output data stream is a data stream to a display screen, a memory, a hard drive, a CD ROM drive, a floppy disk drive, or a printer.
  • the output data stream can be conveyed through serial or parallel ports (including Universal Serial Bus or "USB", FireWireTM, ), via wireless interfaces, and so forth.
  • the identified data are mapped to an XBRL (extensible Business Reporting Language) taxonomy, a spreadsheet, a database, or a flat file.
  • XBRL extensible Markup Language
  • a method for adding labels to data includes a) identifying data in an electronically represented file, b) selecting labels that correspond to text strings in the identified data, based on a list associating labels with text strings, and c) adding the selected labels into the electronically represented file to label the text strings and elements in the identified data associated with the text strings.
  • the labels include information about the data and are defined in one or more taxonomies. In the event the list does not associate a label with the text string, a user can be prompted to select a label corresponding to a text string in the identified data. The association indicated by the user's selection, can then be added to the list associating labels with text strings.
  • the labels are consistent with XML (extensible Markup Language), and also conform to an XBRL (extensible Business Reporting Language) specification.
  • This embodiment can be implemented by a transformation program that receives the electronically represented file from a target program.
  • the transformation program a) performs the steps of identifying, selecting and adding, and b) is configured to appear to the target program as a printer driver.
  • the transformation program can be independent and separate from the target program.
  • a method for importing at least a portion of an XBRL compliant data set into a non XBRL compliant target application.
  • the method includes the steps of exporting data from the target program in an export file, a user associating entries in the export file with labels defined in one or more appropriate XBRL taxonomies, and forming an import file for import into the target program by replacing data in the export file at entries associated with specific labels, with data from the data set having corresponding labels.
  • the associations made by the user are stored for later use, so that an import file can be automatically created by replacing data in a file having the same format as the originally exported file, based on the stored associations.
  • a method for importing at least a portion of a set of data into a target application, where the data set including labels indicating information about data in the data set, and where the labels are defined in one or more taxonomies.
  • the data set is XBRL compliant and the labels are defined in one or more XBRL taxonomies.
  • the method includes a data entry program observing a user entering data associated with the labels into the target application, and storing key strokes associated with the entry of data for each different label.
  • the data entry program can enter the data from the data set into the target application, by performing the stored key strokes corresponding to the labels associated with the data in the data set.
  • the data entry program can prompt the user to enter the data item into the target application, and then observe and store the user's keystrokes for future use.
  • a method for importing at least a portion of a data set into a target database.
  • the method includes entering test data into the target database, and then searching or scanning the database for patterns corresponding to the test data.
  • a pattern recognition application that is independent from the database can be used for this purpose.
  • a structure of the database is modeled based on the search results. Thereafter, the database can be directly accessed using the modeled structure.
  • the modeling process includes associating locations within the database structure with labels, where the labels correspond to elements of the test data that were found at the locations during the step of searching.
  • a data element can then be imported directly to a specific location within the database, using for example an independent software application, based on a label associated with both the location and the element.
  • Exemplary embodiments of the invention include a synonym dictionary that includes synonyms of known labels or terms, or synonymous links between labels and/or terms, to facilitate automatic or user-assisted mapping.
  • the dictionary can include terms that are not part of a taxonomy or schema such as an XML taxonomy, but that are synonymously related to terms in a taxonomy, schema, etc.
  • the synonym dictionary includes foreign languages, so that a label or datum can be mapped from one language into another language.
  • currency values are identified in the data stream, and are converted to corresponding values in different currencies (e.g., from yen to dollars) based on a known or designated exchange rate.
  • the mapping process converts data from one standard to another, for example from U.S. GAAP (Generally Accepted Accounting Principles) to International GAAP.
  • the mapping process includes replacing labels corresponding to identified data, with other labels, for example where minimizing file size is important.
  • data output from a first computer platform or system can be automatically converted by a software module on the first platform, from a first format into an intermediate format, transferred to a second platform or system, and then converted from the intermediate format into a second format by a second software module on the second platform.
  • the intermediate format can be an XML taxonomy, and the software modules can effectively "translate" so that data can be transparently exchanged between the two platforms regardless of whether the first and second formats are compatible or known to each of the two platforms.
  • the intermediate format can also be encrypted, e.g. for secure transfer.
  • processing steps and mechanisms described above are performed in a remote or distributed fashion, in realtime or non-realtime.
  • FIG. IA shows a flowchart in accordance with an exemplary embodiment of the invention.
  • FIG. IB shows a flowchart in accordance with an exemplary embodiment of the invention.
  • FIG. 2 shows a flowchart in accordance with an exemplary embodiment of the invention.
  • FIG. 3 shows a flowchart in accordance with an exemplary embodiment of the invention.
  • FIG. 4 shows a flowchart in accordance with an exemplary embodiment of the invention.
  • FIG. 5 shows a relationship between a target program and a transformation program in accordance with an embodiment of the invention.
  • FIG. 6 shows a relationship between a target module and a transformation module in accordance with an embodiment of the invention.
  • FIG. 7 shows software layers in an exemplary embodiment of the invention. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • a data stream is captured in step 150, data in the captured data stream are identified in step 152, and then in step 154 the identified data are mapped to a file structure, a schema, or a taxonomy.
  • the output data stream is a data stream to a display screen, a memory, a hard drive, a CD ROM drive, a floppy disk drive, or a printer.
  • the output data stream can be conveyed within a computer, through serial or parallel ports (including Universal Serial Bus or "USB”, FireWireTM, etc.), via wireless interfaces, and so forth, and can be captured via duplication or redirection, at any point along the conveyance, via software and/or hardware mechanisms.
  • the identified data are mapped to an XBRL (extensible Business Reporting Language) taxonomy, a spreadsheet, a database, an XML (extensible Markup Language) taxonomy, a standard (e.g. , U.S. GAAP or International GAAP), or a flat file.
  • a specification or "data definition" file can also be generated to indicate the meaning or character of information at different locations in the flat file (e.g. , in different columns, at different locations within a given text string, etc.), and to optionally indicate delimiters (e.g. tabs, commas, spaces, semicolons, etc.) between discrete elements of information or groups of information in the flat file.
  • the flat file and an accompanying data definition can, for example, be generated in accordance with known techniques and formats relating to flat files.
  • a method for adding labels to data includes a) identifying data in an electronically represented file, b) selecting labels that correspond to metadata in the identified data, based on a list associating labels with metadata, and c) adding the selected labels into the electronically represented file to label the metadata and/or elements in the identified data associated with the metadata.
  • the labels include information about the data and are defined in one or more taxonomies.
  • "metadata" or "meta information” is data about data, or information that describes other information.
  • the metadata in the identified data identifies or describes other data elements within the identified data, and can include for example text strings, various control characters (e.g. , various ASCII control characters), and so forth
  • metadata in the captured data stream or file can be used to identify the data to which the metadata refer, and then additional metadata referring to the identified data can be added to the captured data stream or file.
  • the list can contain labels from multiple taxonomies, standards, and so forth, including words from languages, link synonymous or related labels. When a label from a first taxonomy, etc. is recognized in the captured data stream or file, the data element it labels can also be further labeled with a corresponding label from a second, different taxonomy, standard, etc.
  • a new, transformed data stream or file can be formed by adding the new labels for the second taxonomy, and optionally removing the old labels from the first taxonomy (or standard, schema, etc.).
  • the labels are consistent with XML (extensible Markup Language), and also conform to an XBRL (extensible Business Reporting Language) specification.
  • the labels can also be consistent with data formats for spreadsheets, relational databases, and other file structures or schemas or standards.
  • This embodiment can be implemented by a transformation program that receives the electronically represented file from a target program.
  • the transformation program a) performs the steps of identifying, selecting and adding, and b) can be configured to appear to the target program as a type of software known to the target program.
  • the transformation program can appear to the target program as a printer driver.
  • the transformation program can be independent and separate from the target program.
  • the transformation program can also be entirely resident on the same computer or system as the target program, or can be remotely located on a different system, or distributed among different systems.
  • the transformation module can be a single module, or a plurality of cooperating modules.
  • a list and/or synonym dictionary that the transformation program or module(s) use to identify metadata and add corresponding metadata can be stored as a data file separately from the program or module(s), and can be stored or accessed remotely, for example via an Internet web server.
  • the data stream can be captured at an information provider's site, transferred (as a real-time stream of data or as a data file containing contents of the captured data stream) to another location such as an intermediate location or the information receiver's site, and then provided to the information receiver's site.
  • the information provider computer could have, for example, a transformation program emulating a print driver, that is selected when information is to be output for mapping. The output would be provided to the transformation program, and then conveyed to the information receiver machine (by email, modem, file on floppy disk, etc.).
  • a transformation program on the information receiver machine would then open or receive the data, and map it to a batch file format useable by a target import program or to a file format useable by a program written to update a database.
  • the transformation programs on the provider and receiver machines can be identical and both capable of receiving, transferring and mapping data, or can have different capabilities.
  • the transformation programs can be configured to handle an intermediate format so that the transformation program at the information provider would map the data to an intermediate format, and transfer the data in the intermediate format to the transformation program on the receiver machine.
  • the receiver machine would map the data from the intermediate format to another format useful on the receiver machine (or as desired by a user).
  • the programs could be different versions, so that the transformation program recognizes more formats than the transformation program at the receiver machine and thus can map more formats to or from the intermediate format.
  • the transformation program on the receiver machine can be configured or featured to only map the data out of the intermediate format to another format, without being able to map data into the intermediate format in much the same way that Adobe AcrobatTM Readers can open and view, but not create, .pdf files.
  • the transformation programs can also be configured to operate automatically without user intervention.
  • the transformation program on the provider machine can automatically transfer data in response to a request from the transformation program on the receiver machine, subject for example to rules or requirements (e.g. , a user's prior approval to allow public access to information on the provider machine) in place on the provider machine.
  • the provider and receiver machines can communicate via the Internet.
  • the provider machine can interface the Internet or function as a web server, and the receiver machine can interface the Internet or function as a web browser.
  • the intermediate format can be encrypted, and can be decrypted at the receiver machine in a fashion transparent to a user of the receiver machine.
  • the encryption/decryption mechanism can be a proprietary function of the transformation programs.
  • the transformation program can alter or transform the file it receives from the target program, for example by adding appropriate XBRL labels to the file.
  • the transformation program can combine data from the file received from the target program, with the selected labels to generate and output a new, transformed file.
  • the transformation program can replace labels in the file with the newly added labels, for example when converting from one standard or language to another. This is advantageous when it is desirable to minimize the size and complexity of the transformed file or transformed data stream.
  • step 102 data in an electronically represented file is identified.
  • step 104 labels are selected that correspond to metadata such as text strings in the identified data, based on a list that associates labels with text strings.
  • metadata such as text strings in the identified data
  • metadata can be substituted for each occurrence of "text string(s)” .
  • FIG. IB applies also to all other forms of metadata, not just to text strings. This also holds true for the other embodiments described herein.
  • control proceeds to step 106, where a determination is made whether an un-identified text string, or a text string that does not have an associated label on the list, has been encountered. If yes, then control proceeds to step 108, where the user is prompted to select a label that corresponds to the text string. For example, the user can be provided with one or more taxonomies in a pop-up window or as part of the dialog, so that the correct label can be quickly and easily selected.
  • step 110 an association selected by the user in response to the prompt is stored for future use. From step 110, control proceeds to step 112. If in step 106 the determination is negative, then control proceeds from step 106 to step 112.
  • step 112 a determination is made whether labels have been selected (using the list, for example) for all relevant text strings in the identified data.
  • the assumption here is that there will be a label in some form associated with each datum, which can be used to map the datum to an appropriate label in, for example, an XBRL taxonomy.
  • the software application performing this function can exercise a degree of intelligence to filter out extraneous or superfluous text, and to properly interpret text and nearby data.
  • the output may contain a Report Header and a Report Footer, one or both of which need not be translated depending on the circumstances.
  • the text being interpreted and correlated with an XBRL label may span more than one line but data related to the text will be only on one line. In this situation the software application would appropriately merge multiple lines.
  • a text string may be a label referring or applying to multiple items of data, for example a financial statement with a text label called "cash on hand” and another label for the reporting period of "2000". Placement or location of a datum in the file can also help indicate which XBRL label is appropriate for the datum.
  • step 112 Any information relative to the position of the datum in relationship to other data that helps to label it (for example, a placement in a document that would show a data item nested in a specific location within another item, like a hierarchy), can be used help determine an appropriate XBRL label for the datum. If in step 112 the determination is negative, then control returns to step
  • step 112 determines whether the data are re-formatted in accordance with selected labels.
  • the data are re-formatted based on the determined correspondence between the data and defined labels in one or more XBRL taxonomies. This re-formatting can include adding the corresponding XBRL labels into the data.
  • the reformatting can also include reordering the data in accordance with a hierarchy of the selected/corresponding XBRL labels.
  • the transformation program can transform the data in various ways, including inserting and/or interpreting information labels or tags used to describe, characterize, and/or organize the data, to make the data more usable.
  • the transformation program can be made appropriately compatible with various operating systems, including (but not limited to) MS Windows, Unix, Mac OS, Solaris, Linux, and so forth.
  • the transformation program can acquire the data file to be transformed in any of various formats, including as a database file, a flat file, EDI, screen data, or any other collection or stream of data that can be analyzed in a digital format.
  • the transformation program can also output a transformation file including the transformed data, in any appropriate format.
  • the output file can be in any format that is XBRL compliant.
  • the transformation program can also launch or invoke an application or submodule to validate the output file, and can launch a Compare Program to analyze a received file by comparing text strings in the File with a standardized XBRL taxonomy. Then, the transformation program can compare the text strings in the file with the appropriate XBRL taxonomy (including Synonyms). The comparison may be done either by parsing the data or by using Rev-Gen pattern recognition scanning techniques. Any previous User mapping of XBRL Information Labels to data can also be checked.
  • the transformation program can also link the appropriate XBRL Information Label to the related information whenever such a link can be clearly established without user intervention. Any text strings that cannot be automatically identified and linlced with XBRL taxonomy Information Label will be presented to the User on the first occurrence. Using drag and drop or any other convenient mapping technique, the user will link the information in question with the appropriate XBRL Information Label (tag).
  • tag XBRL Information Label
  • the name of the company may not be recognized as ⁇ Company Name> data.
  • the user would simply drag the ⁇ Company Name> Information Label to the name of the company and the link would be established. This link would then remain in the Transformation Program for subsequent reports so the User would make this connection only once.
  • the transformation program also can create a new XBRL output file that includes all the appropriate Information Labels, Style information and the proper XML file extension to be XBRL compliant. Once the XBRL Information Labels have been linked to the appropriate data, some of the steps can be bypassed when producing subsequent reports unless a term in the application program has been changed or a new term has been added to the report.
  • Exemplary embodiments of the invention include a synonym dictionary that includes synonyms of known labels or terms, or synonymous links between labels and/or terms, to facilitate automatic or user-assisted mapping.
  • the dictionary can include synonyms such as "Fees” and “Revenues” so that when the synonyms are identified in the captured data stream the datum they refer to will be mapped appropriately to (or labeled with) the label "Sales".
  • the synonym dictionary can be incorporated within the list associating data and metadata.
  • the dictionary can include terms that are not part of a taxonomy or schema such as an XML taxonomy, but that are synonymously related to terms in a taxonomy, schema, etc.
  • the synonym dictionary includes foreign languages, so that a label or datum can be mapped from one language into another language.
  • the transformation program can also be used to translate terms in a document from one language to another.
  • the list associating data and metadata which the transformation program uses to identify data and select additional or replacement labels, can include languages or portions of languages together links indicating synonyms among the languages.
  • the language portions can be, for example, English language descriptive terms that appear in the U.S. GAAP, and corresponding synonyms in French, German, Spanish, etc., and similar terms that might appear in other standards such as
  • a user can provide a document containing financial information consistent with U.S. GAAP, to the transformation program, and specify that the transformation program output the document with French words instead of English words.
  • a user can also request the transformation program to convert the U.S. GAAP document into an International GAAP document with German words instead of English words, and so forth.
  • the user can specify the desired output language, and optionally the original language.
  • the transformation program can automatically identify the original language, for example when it finds labels in the captured data, that correspond to labels in its list, that it knows are in a specific language.
  • the transformation program can be used to identify currency values identified in the captured data stream or file, and then convert the identified currency values to corresponding values in different currencies (e.g., from yen to dollars) based on a known or designated exchange rate.
  • a default exchange rate can be used, for example the exchange rate that was in effect when a) the original data were created, b) the data stream or file was captured, c) the conversion was performed, or d) a date indicated by a user.
  • the user can also specify the exchange rate.
  • a method for importing at least a portion of an XBRL compliant data set into a non XBRL compliant target application.
  • the method includes the steps of exporting data from the target program in an export file, a user associating entries in the export file with labels defined in one or more appropriate XBRL taxonomies, and forming an import file for import into the target program by replacing data in the export file at entries associated with specific labels, with data from the data set having corresponding labels.
  • the associations made by the user are stored for later use, so that an import file can be automatically created by replacing data in a file having the same format as the originally exported file, based on the stored associations.
  • An import file template can be generated based on the structure of the export file and the associations made by the user, and an import file can then be formed by populating the import file template with data by entering the data based on labels associated with both the data being entered and entries in the import file template.
  • the template can of course be reused to import different sets of data.
  • the user can indicate associations between entries in an export/import file format in any appropriate or suitable way. For example, the user can insert data associated with labels into various entries of the export file, and then software can scan the entries in the export file, discern the associated labels based on the newly entered data, and then store the associations for later use when populating an (empty) import file template with data for import into the target program or target application.
  • a structure of the export file together with the stored associations can represent an import file template.
  • the newly entered data can include the labels themselves.
  • software can, for each entry in the export file, present a list of labels, and a user can select one or more appropriate labels from the list to indicate the association, which is then stored.
  • the template can be populated with data for import, for example, by discerning a label associated with a datum to be imported, locating an entry in the template associated with the same label, entering the datum into the located entry in the template, and repeating these steps for all data in a data set to be imported.
  • step 202 data is exported from target application or program in an export file.
  • control proceeds to step 204, where a user associates entries in the export file, with labels, for example labels defined in an XBRL taxonomy.
  • step 204 control proceeds to step 206, where the associations made by the user are stored.
  • step 206 control proceeds to step 208, where an import file is generated by replacing data in the export file at entries or locations associated with the (e.g. , XBRL) labels, with new data having corresponding labels.
  • a method for importing at least a portion of a set of data into a target application, where the data set including labels indicating information about data in the data set, and where the labels are defined in one or more taxonomies.
  • the data set is XBRL compliant and the labels are defined in one or more XBRL taxonomies.
  • the method includes a program observing a user entering data associated with the labels into the target application, and storing key strokes associated with the entry of data for each different label.
  • the data entry program receives an XBRL compliant data set for entry into the target application (which can be non XBRL and non XML compliant)
  • this program or a different program can enter the data from the data set into the target application, by performing the stored key strokes corresponding to the labels associated with the data in the data set.
  • the program that is automatically entering data into the target application encounters a data item having a label for which no keystrokes are stored, it can prompt the user to enter the data item into the target application, and then observe and store the user's keystrokes for future use.
  • a first software application observes a user entering data associated with the labels, into a target application.
  • control proceeds to step 304, wherein the first application stores observed keystrokes associated with entry of data for each different label (e.g. , XBRL label).
  • step 304 control proceeds to step 306, where the first application receives a data set for entry into the target application.
  • control proceeds to the 308, where the first application enters data from the data set into the target application, by performing the stored keystrokes corresponding to the labels associated with the data in the data set.
  • control proceeds to step 310, where a determination is made by the first application, whether it has encountered any data in the data set for which it has no stored keystrokes. In other words, whether there is any data in the data set having a label for which the first application has not stored or observed any keystrokes. If yes, then control proceeds to step 312, where the first application prompts the user to enter the data item into the target application, or otherwise provide an appropriate sequence of keystrokes to enter the data item into the target application. For example, an appropriate sequence could be selected from a menu or group of prerecorded keystroke sequences. From step 312, control proceeds to step 314, where the provided keystroke sequence is stored for future use by the first application. From step 314, control proceeds to step 316.
  • step 310 determines whether all relevant data in the data set has been entered into the target application. If yes, then control proceeds to step 318, where the process ends. If no, then control returns to step 308.
  • "Relevant" data can be determined or handled subject to the considerations discussed above with respect to step 112 of FIG. 1.
  • a method for importing or inputting at least a portion of a data set into a target database.
  • the method includes entering test data into the target database, and then searching or scanning the database for patterns corresponding to the test data.
  • a pattern recognition application that is independent from the database can be used for this purpose.
  • a structure of the database is modeled based on the search results. Thereafter, the database can be directly accessed using the modeled structure.
  • the modeling process includes associating locations within the database structure with labels, where the labels correspond to elements of the test data that were found at the locations during the step of searching.
  • a data element can then be inserted directly to a specific location within the database, using for example an independent software application, based on a label associated with both the location and the element.
  • a set of test data is imported or inputted into a target database.
  • the set is entered into the database in a conventional fashion, for example by key entry through an interface of an application that manages the database.
  • the database can be separate from the managing application, or can be embedded within the managing application.
  • control proceeds to step 404, where the database is scanned by an independent software application, for example a pattern recognition application suc as that manufactured by the British company RevGen Pic. and distributed by their U.S. affiliate, Generos corporation.
  • the independent application searches or scans the database for patterns corresponding to the set of test data.
  • control proceeds to step 406, where an independent application (for example, the pattern recognition application or another, separate application) constructs a model of the structure of the database, based on the search/scan results.
  • an independent application for example, the pattern recognition application or another, separate application constructs a model of the structure of the database, based on the search/scan results.
  • control proceeds to step 408, where locations in the database structure are associated with labels, for example labels defined in one or more XBRL taxonomies. The labels correspond to elements of the test data found at those locations in the database structure during the search/scan.
  • control proceeds to step 410, where an element from a data set is imported directed into the database based on a label associated with both the location and the element.
  • FIG. 5 shows a transformation program consistent with the embodiment described in FIG. 1.
  • the transformation program can be independent and separate from the target program.
  • a target program 502 provides an output file such as a print file to a transformation program 504.
  • the transformation program 504 is configured so that it appears to the target program as a printer driver.
  • the transformation program 504 does not require or perform any modification or alteration of the target program 502, and can be designed or configured to function compatibly with commercially available target programs such as spreadsheets, accounting programs, word processing programs, and so forth.
  • FIG. 6 shows that the transformation program and the target program of the embodiment described in FIG. 1, can alternatively be implemented respectively as a module 604 and a target program module 602 together within an application 606.
  • the module 604 can appear as a printer driver to the module 602.
  • the transformation program module 604 can be implemented as a DLL, OCX, Active X Control program, or in any other form that can be marketed to software vendors for integration into independently developed applications.
  • FIG. 7 shows an exemplary structure of various embodiments of the invention, with respect to the software layers of a computer.
  • FIG. 7 shows an application layer 702, at which any WindowsTM application 701 operates on the computer.
  • a WindowsTM OS (Operating System) layer 704 in which can be found a GDI (Graphics Interface Device) 703.
  • a data conversion layer 710 which includes a Data Converter 709 that outputs or can output the data in a variety of formats, including the formats 712- 722 shown (HTML, ExcelTM, XML, SQL, xBase, and ASCII respectively).
  • a transformation program that performs various functions of the invention includes the XBRL Printer Driver 705, the XBRL Mapping Agent 707, and the Data Converter 709.
  • the elements 705, 707 are shown as being XBRL- related, the elements 705, 707 can be related to any or all of the formats, taxonomies, protocols, standards, etc. described above and their equivalents.
  • the formats 712-722 are exemplary and not limiting.
  • information provided by the user can be made using drag-and-drop, cut-and-paste, selection of items from a proffered menu, keyboard entry, or any other appropriate technique.
  • Extracting data from the target program or target application can include, in addition to or instead of obtaining a print file, accessing data directly from a file or out of a database without running or launching the parent (target) application, scraping data off of a display screen or window, and so forth.

Abstract

A method for adding labels to data (Fig. 1B), for example XML compliant or XBRL compliant labels, includes a) identifying data in an electronically represented file (102), b) selecting labels that correspond to text strings in the identified data, based on a list associating labels with text strings (104), and c) adding the selected labels into the electronically represented file to label the text strings and elements in the identified data associated with the text strings (114). The labels include information about the data and are defined in one or more taxonomies. When the list does not associate a label with the text string, a user can be prompted to select a label corresponding to a text string in the identified data. The association indicated by the user's selection, can then be added to the list associating labels with text strings (116).

Description

METHOD FOR ADDING METADATA TO DATA
BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates generally to the field of data processing.
2. Background Information
Currently there are thousands upon thousands of software programs installed in millions of computers that cannot transfer meaning from one to the other. For example, large companies with many branches or subsidiaries often find that the accounting or operating software programs used by one division or subsidiary is not compatible with the software used by other divisions or subsidiaries or the central corporate programs. This requires substantial conversions of data and often results in a great deal of data reentry along with the attendant costs and data integrity problems that attend data entry.
Because of the great variety of programs, operating systems and software standards currently used by software developers there is a great deal of incompatibility between suppliers and their customers. This also requires substantial conversions of data and often results in a great deal of data reentry and its implications. The unstructured and undefined nature of the current computer software environment imposes great burdens and expense on regulatory organizations such as the SEC, FDIC, Federal and State tax authorities, banks, etc. and the companies reporting to them.
To overcome this problem many standards organizations have been formed and are being formed to establish defined input/output vocabularies for use with the XML (extensible Markup Language) file format. XBRL (extensible Business Reporting Language) is one of the XML language formats being developed. It is expected to become a global standard for financial reporting. Throughout this disclosure we will use XBRL as the example of an XML language. It is not intended to limit the invention to XBRL or XML languages. We find many similarities for the Semantic Web where information Labels are used to facilitate computers talking to computers making decisions and taking action as a result of the communication. Other standards already exist and more will be developed that will benefit by the basic theory of this invention.
Virtually none of the existing software applications can automatically or semi-automatically convert conventional documents or data into outputs tagged with the standardized Information Labels called for by XML or other standards committees. In most cases the standards themselves are still in development. In order for XML and other data dictionaries or business vocabularies to take root, it is required that existing applications and data be associated or tagged with these standard vocabularies. This harsh reality will long delay the widespread use of these standards because it will take years for companies to migrate to new software products that are designed to output the appropriate Information Labels. In some cases that may never happen because it is virtually impossible to replace legacy software systems. For example, retrofitting all the accounting software in current use would be a very complex task that could not be accomplished in any short-term.
The recognized practical approach to standardizing the meaning of data is to attach defined Information Labels to the information being conveyed. In this way the meaning of the data can be determined by reviewing the definition of the label. It also means that computers can recognize the "meaning" of the tagged information and act on it based on that meaning. For example, data with the same "tag" can be added or compared without fear of adding or comparing apples and oranges. Taxonomies and their extensions are used to define the Information Labels.
For example in a financial report, the label <Sales> followed by a numerical value indicates that the numerical value relates to company's Sales. <Cost of Goods Sold> followed by a numerical value indicates that the value represents the company's Cost of Goods Sold. Since Gross Profit is Sales minus Cost of Goods Sold, computers could access 3rd party reports that show these values and easily calculate the Gross Profit with a simple rule that says <Sales> <minus> <Cost of Goods Sold> = <Gross Profιt>.
Because not all companies use the same terminology, the taxonomies used by standards organizations also include synonyms and alternative phrases that have the same meaning. For example synonyms for Sales could include "Revenues" or "Fees". Cost of Goods Sold might be "Cost of Goods" or "Cost of Sales". The Information Labels can also carry information regarding the organizational authority that defined the label. If the taxonomy were authored by the US Securities & Exchange Commission the labels based on that taxonomy might be identified as USSEC, and so on.
Accordingly, there is a need for methods and mechanisms to accurately and efficiently transform data into XML, and in particular XBRL, compliant formats. The transformation would include, for example, adding appropriate labels to the data as defined in relevant XBRL taxonomies. There is also a need for methods and mechanisms to automate entry of XML and XBRL compliant data into non- XML or non-XBRL compliant programs or applications.
XBRL Essentials, authored by Charles Hoffman and Carolyn Strand, copyright 2001 by XBRL Solutions, Inc., ISBN 0-87051-353-2, is hereby incorporated by reference.
SUMMARY OF THE INVENTION
In an exemplary embodiment of the invention, a data stream is captured, data in the captured stream are identified, and then the identified data are mapped to a file structure, a schema, or a taxonomy. In exemplary embodiments of the invention, the output data stream is a data stream to a display screen, a memory, a hard drive, a CD ROM drive, a floppy disk drive, or a printer. The output data stream can be conveyed through serial or parallel ports (including Universal Serial Bus or "USB", FireWire™, ), via wireless interfaces, and so forth. In other exemplary embodiments of the invention, the identified data are mapped to an XBRL (extensible Business Reporting Language) taxonomy, a spreadsheet, a database, or a flat file.
In another exemplary embodiment of the present invention, a method for adding labels to data includes a) identifying data in an electronically represented file, b) selecting labels that correspond to text strings in the identified data, based on a list associating labels with text strings, and c) adding the selected labels into the electronically represented file to label the text strings and elements in the identified data associated with the text strings. The labels include information about the data and are defined in one or more taxonomies. In the event the list does not associate a label with the text string, a user can be prompted to select a label corresponding to a text string in the identified data. The association indicated by the user's selection, can then be added to the list associating labels with text strings. Preferably the labels are consistent with XML (extensible Markup Language), and also conform to an XBRL (extensible Business Reporting Language) specification. This embodiment can be implemented by a transformation program that receives the electronically represented file from a target program. The transformation program a) performs the steps of identifying, selecting and adding, and b) is configured to appear to the target program as a printer driver. The transformation program can be independent and separate from the target program.
In accordance with another embodiment of the invention, a method is provided for importing at least a portion of an XBRL compliant data set into a non XBRL compliant target application. The method includes the steps of exporting data from the target program in an export file, a user associating entries in the export file with labels defined in one or more appropriate XBRL taxonomies, and forming an import file for import into the target program by replacing data in the export file at entries associated with specific labels, with data from the data set having corresponding labels. The associations made by the user are stored for later use, so that an import file can be automatically created by replacing data in a file having the same format as the originally exported file, based on the stored associations.
In accordance with another embodiment of the invention, a method is provided for importing at least a portion of a set of data into a target application, where the data set including labels indicating information about data in the data set, and where the labels are defined in one or more taxonomies. For example, where the data set is XBRL compliant and the labels are defined in one or more XBRL taxonomies. The method includes a data entry program observing a user entering data associated with the labels into the target application, and storing key strokes associated with the entry of data for each different label. Then, when the data entry program receives an XBRL compliant data set for entry into the target application (which can be non XBRL and non XML compliant), the data entry program can enter the data from the data set into the target application, by performing the stored key strokes corresponding to the labels associated with the data in the data set. When the data entry program is automatically entering data into the target application, and encounters a data item having a label for which no keystrokes are stored, the data entry program can prompt the user to enter the data item into the target application, and then observe and store the user's keystrokes for future use.
In accordance with another embodiment of the invention, a method is provided for importing at least a portion of a data set into a target database. The method includes entering test data into the target database, and then searching or scanning the database for patterns corresponding to the test data. A pattern recognition application that is independent from the database can be used for this purpose. A structure of the database is modeled based on the search results. Thereafter, the database can be directly accessed using the modeled structure. In particular, the modeling process includes associating locations within the database structure with labels, where the labels correspond to elements of the test data that were found at the locations during the step of searching. A data element can then be imported directly to a specific location within the database, using for example an independent software application, based on a label associated with both the location and the element. Exemplary embodiments of the invention include a synonym dictionary that includes synonyms of known labels or terms, or synonymous links between labels and/or terms, to facilitate automatic or user-assisted mapping. The dictionary can include terms that are not part of a taxonomy or schema such as an XML taxonomy, but that are synonymously related to terms in a taxonomy, schema, etc. In an exemplary embodiment of the invention, the synonym dictionary includes foreign languages, so that a label or datum can be mapped from one language into another language. In an exemplary embodiment of the invention, currency values are identified in the data stream, and are converted to corresponding values in different currencies (e.g., from yen to dollars) based on a known or designated exchange rate. In accordance with an embodiment of the invention, the mapping process converts data from one standard to another, for example from U.S. GAAP (Generally Accepted Accounting Principles) to International GAAP. In accordance with an embodiment of the invention, the mapping process includes replacing labels corresponding to identified data, with other labels, for example where minimizing file size is important.
In accordance with an embodiment of the invention, data output from a first computer platform or system can be automatically converted by a software module on the first platform, from a first format into an intermediate format, transferred to a second platform or system, and then converted from the intermediate format into a second format by a second software module on the second platform. For example, the intermediate format can be an XML taxonomy, and the software modules can effectively "translate" so that data can be transparently exchanged between the two platforms regardless of whether the first and second formats are compatible or known to each of the two platforms. The intermediate format can also be encrypted, e.g. for secure transfer.
In accordance with embodiments of the invention, the processing steps and mechanisms described above, are performed in a remote or distributed fashion, in realtime or non-realtime.
BRIEF DESCRIPTION OF THE DRAWINGS Other objects and advantages of the present invention will become apparent to those skilled in the art from the following detailed description of preferred embodiments, when read in conjunction with the accompanying drawings wherein like elements have been designated with like reference numerals and wherein:
FIG. IA shows a flowchart in accordance with an exemplary embodiment of the invention.
FIG. IB shows a flowchart in accordance with an exemplary embodiment of the invention. FIG. 2 shows a flowchart in accordance with an exemplary embodiment of the invention.
FIG. 3 shows a flowchart in accordance with an exemplary embodiment of the invention.
FIG. 4 shows a flowchart in accordance with an exemplary embodiment of the invention.
FIG. 5 shows a relationship between a target program and a transformation program in accordance with an embodiment of the invention.
FIG. 6 shows a relationship between a target module and a transformation module in accordance with an embodiment of the invention. FIG. 7 shows software layers in an exemplary embodiment of the invention. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
In accordance with an embodiment of the invention shown in FIG. IA, a data stream is captured in step 150, data in the captured data stream are identified in step 152, and then in step 154 the identified data are mapped to a file structure, a schema, or a taxonomy. The output data stream is a data stream to a display screen, a memory, a hard drive, a CD ROM drive, a floppy disk drive, or a printer. The output data stream can be conveyed within a computer, through serial or parallel ports (including Universal Serial Bus or "USB", FireWire™, etc.), via wireless interfaces, and so forth, and can be captured via duplication or redirection, at any point along the conveyance, via software and/or hardware mechanisms. The identified data are mapped to an XBRL (extensible Business Reporting Language) taxonomy, a spreadsheet, a database, an XML (extensible Markup Language) taxonomy, a standard (e.g. , U.S. GAAP or International GAAP), or a flat file. When the identified data are mapped to a flat file, a specification or "data definition" file can also be generated to indicate the meaning or character of information at different locations in the flat file (e.g. , in different columns, at different locations within a given text string, etc.), and to optionally indicate delimiters (e.g. tabs, commas, spaces, semicolons, etc.) between discrete elements of information or groups of information in the flat file. The flat file and an accompanying data definition can, for example, be generated in accordance with known techniques and formats relating to flat files.
The embodiment shown in FIG. IA, can be implemented as shown in FIG. IB. In accordance with an exemplary embodiment of the invention illustrated in FIG. IB, a method for adding labels to data includes a) identifying data in an electronically represented file, b) selecting labels that correspond to metadata in the identified data, based on a list associating labels with metadata, and c) adding the selected labels into the electronically represented file to label the metadata and/or elements in the identified data associated with the metadata. The labels include information about the data and are defined in one or more taxonomies. In the context of the present application, "metadata" or "meta information" is data about data, or information that describes other information. In this example the metadata in the identified data identifies or describes other data elements within the identified data, and can include for example text strings, various control characters (e.g. , various ASCII control characters), and so forth For example, metadata in the captured data stream or file can be used to identify the data to which the metadata refer, and then additional metadata referring to the identified data can be added to the captured data stream or file. For example, the list can contain labels from multiple taxonomies, standards, and so forth, including words from languages, link synonymous or related labels. When a label from a first taxonomy, etc. is recognized in the captured data stream or file, the data element it labels can also be further labeled with a corresponding label from a second, different taxonomy, standard, etc. Thus a computer program that recognizes the second taxonomy but not the first, will now be able to use or recognize and organize the information in the data stream or file. A new, transformed data stream or file can be formed by adding the new labels for the second taxonomy, and optionally removing the old labels from the first taxonomy (or standard, schema, etc.).
In the event the list does not associate a label with metadata in the identified data, a user can be prompted to select a label corresponding to the metadata. The association indicated by the user's selection, can then be added to the list associating labels with metadata. Preferably the labels are consistent with XML (extensible Markup Language), and also conform to an XBRL (extensible Business Reporting Language) specification. Of course, the labels can also be consistent with data formats for spreadsheets, relational databases, and other file structures or schemas or standards.
This embodiment can be implemented by a transformation program that receives the electronically represented file from a target program. The transformation program a) performs the steps of identifying, selecting and adding, and b) can be configured to appear to the target program as a type of software known to the target program. For example, the transformation program can appear to the target program as a printer driver.
The transformation program can be independent and separate from the target program. The transformation program can also be entirely resident on the same computer or system as the target program, or can be remotely located on a different system, or distributed among different systems. The transformation module can be a single module, or a plurality of cooperating modules. A list and/or synonym dictionary that the transformation program or module(s) use to identify metadata and add corresponding metadata, can be stored as a data file separately from the program or module(s), and can be stored or accessed remotely, for example via an Internet web server.
For example, the data stream can be captured at an information provider's site, transferred (as a real-time stream of data or as a data file containing contents of the captured data stream) to another location such as an intermediate location or the information receiver's site, and then provided to the information receiver's site. The information provider computer could have, for example, a transformation program emulating a print driver, that is selected when information is to be output for mapping. The output would be provided to the transformation program, and then conveyed to the information receiver machine (by email, modem, file on floppy disk, etc.). A transformation program on the information receiver machine would then open or receive the data, and map it to a batch file format useable by a target import program or to a file format useable by a program written to update a database. The transformation programs on the provider and receiver machines can be identical and both capable of receiving, transferring and mapping data, or can have different capabilities. For example, the transformation programs can be configured to handle an intermediate format so that the transformation program at the information provider would map the data to an intermediate format, and transfer the data in the intermediate format to the transformation program on the receiver machine. The receiver machine would map the data from the intermediate format to another format useful on the receiver machine (or as desired by a user). The programs could be different versions, so that the transformation program recognizes more formats than the transformation program at the receiver machine and thus can map more formats to or from the intermediate format. In addition or as an alternative, the transformation program on the receiver machine can be configured or featured to only map the data out of the intermediate format to another format, without being able to map data into the intermediate format in much the same way that Adobe Acrobat™ Readers can open and view, but not create, .pdf files. The transformation programs can also be configured to operate automatically without user intervention. For example, the transformation program on the provider machine can automatically transfer data in response to a request from the transformation program on the receiver machine, subject for example to rules or requirements (e.g. , a user's prior approval to allow public access to information on the provider machine) in place on the provider machine. The provider and receiver machines can communicate via the Internet. For example, the provider machine can interface the Internet or function as a web server, and the receiver machine can interface the Internet or function as a web browser. Also, the intermediate format can be encrypted, and can be decrypted at the receiver machine in a fashion transparent to a user of the receiver machine. For example, the encryption/decryption mechanism can be a proprietary function of the transformation programs.
The transformation program can alter or transform the file it receives from the target program, for example by adding appropriate XBRL labels to the file.
Alternatively, the transformation program can combine data from the file received from the target program, with the selected labels to generate and output a new, transformed file. As a further alternative, the transformation program can replace labels in the file with the newly added labels, for example when converting from one standard or language to another. This is advantageous when it is desirable to minimize the size and complexity of the transformed file or transformed data stream.
As shown in FIG. IB, in a first step 102, data in an electronically represented file is identified. Next, in step 104, labels are selected that correspond to metadata such as text strings in the identified data, based on a list that associates labels with text strings. Although "text strings" are specifically referred to in FIG. IB, "metadata" can be substituted for each occurrence of "text string(s)" . In other words, the concepts shown in FIG. IB apply also to all other forms of metadata, not just to text strings. This also holds true for the other embodiments described herein.
From step 104 control proceeds to step 106, where a determination is made whether an un-identified text string, or a text string that does not have an associated label on the list, has been encountered. If yes, then control proceeds to step 108, where the user is prompted to select a label that corresponds to the text string. For example, the user can be provided with one or more taxonomies in a pop-up window or as part of the dialog, so that the correct label can be quickly and easily selected.
From step 108, control proceeds to step 110. In step 110, an association selected by the user in response to the prompt is stored for future use. From step 110, control proceeds to step 112. If in step 106 the determination is negative, then control proceeds from step 106 to step 112.
In step 112, a determination is made whether labels have been selected (using the list, for example) for all relevant text strings in the identified data. The assumption here is that there will be a label in some form associated with each datum, which can be used to map the datum to an appropriate label in, for example, an XBRL taxonomy. The software application performing this function can exercise a degree of intelligence to filter out extraneous or superfluous text, and to properly interpret text and nearby data. For example, in the output from an accounting system, say a Balance Sheet, the output may contain a Report Header and a Report Footer, one or both of which need not be translated depending on the circumstances. Also, it is possible that the text being interpreted and correlated with an XBRL label, may span more than one line but data related to the text will be only on one line. In this situation the software application would appropriately merge multiple lines. In addition, it is possible that a text string may be a label referring or applying to multiple items of data, for example a financial statement with a text label called "cash on hand" and another label for the reporting period of "2000". Placement or location of a datum in the file can also help indicate which XBRL label is appropriate for the datum. Any information relative to the position of the datum in relationship to other data that helps to label it (for example, a placement in a document that would show a data item nested in a specific location within another item, like a hierarchy), can be used help determine an appropriate XBRL label for the datum. If in step 112 the determination is negative, then control returns to step
104. If in step 112 the determination is positive, then control proceeds from step 112 to step 114. In step 114, the data are re-formatted in accordance with selected labels. In other words, the data are re-formatted based on the determined correspondence between the data and defined labels in one or more XBRL taxonomies. This re-formatting can include adding the corresponding XBRL labels into the data. As indicated in step 116, the reformatting can also include reordering the data in accordance with a hierarchy of the selected/corresponding XBRL labels.
In summary, the transformation program can transform the data in various ways, including inserting and/or interpreting information labels or tags used to describe, characterize, and/or organize the data, to make the data more usable. The transformation program can be made appropriately compatible with various operating systems, including (but not limited to) MS Windows, Unix, Mac OS, Solaris, Linux, and so forth. The transformation program can acquire the data file to be transformed in any of various formats, including as a database file, a flat file, EDI, screen data, or any other collection or stream of data that can be analyzed in a digital format. The transformation program can also output a transformation file including the transformed data, in any appropriate format. For example, the output file can be in any format that is XBRL compliant.
The transformation program can also launch or invoke an application or submodule to validate the output file, and can launch a Compare Program to analyze a received file by comparing text strings in the File with a standardized XBRL taxonomy. Then, the transformation program can compare the text strings in the file with the appropriate XBRL taxonomy (including Synonyms). The comparison may be done either by parsing the data or by using Rev-Gen pattern recognition scanning techniques. Any previous User mapping of XBRL Information Labels to data can also be checked.
The transformation program can also link the appropriate XBRL Information Label to the related information whenever such a link can be clearly established without user intervention. Any text strings that cannot be automatically identified and linlced with XBRL taxonomy Information Label will be presented to the User on the first occurrence. Using drag and drop or any other convenient mapping technique, the user will link the information in question with the appropriate XBRL Information Label (tag).
For example, the first time the company publishes financial statements using this technique the name of the company may not be recognized as <Company Name> data. To link the <Company Name> label with the company name data, the user would simply drag the <Company Name> Information Label to the name of the company and the link would be established. This link would then remain in the Transformation Program for subsequent reports so the User would make this connection only once.
The transformation program also can create a new XBRL output file that includes all the appropriate Information Labels, Style information and the proper XML file extension to be XBRL compliant. Once the XBRL Information Labels have been linked to the appropriate data, some of the steps can be bypassed when producing subsequent reports unless a term in the application program has been changed or a new term has been added to the report. Exemplary embodiments of the invention include a synonym dictionary that includes synonyms of known labels or terms, or synonymous links between labels and/or terms, to facilitate automatic or user-assisted mapping. For example, where a known label in a standard, schema or taxonomy to which captured data stream or file is being mapped is "Sales", the dictionary can include synonyms such as "Fees" and "Revenues" so that when the synonyms are identified in the captured data stream the datum they refer to will be mapped appropriately to (or labeled with) the label "Sales". The synonym dictionary can be incorporated within the list associating data and metadata. The dictionary can include terms that are not part of a taxonomy or schema such as an XML taxonomy, but that are synonymously related to terms in a taxonomy, schema, etc. In an exemplary embodiment of the invention, the synonym dictionary includes foreign languages, so that a label or datum can be mapped from one language into another language.
For example, the transformation program can also be used to translate terms in a document from one language to another. For example, the list associating data and metadata, which the transformation program uses to identify data and select additional or replacement labels, can include languages or portions of languages together links indicating synonyms among the languages. The language portions can be, for example, English language descriptive terms that appear in the U.S. GAAP, and corresponding synonyms in French, German, Spanish, etc., and similar terms that might appear in other standards such as
International GAAP. Thus, a user can provide a document containing financial information consistent with U.S. GAAP, to the transformation program, and specify that the transformation program output the document with French words instead of English words. A user can also request the transformation program to convert the U.S. GAAP document into an International GAAP document with German words instead of English words, and so forth. The user can specify the desired output language, and optionally the original language. The transformation program can automatically identify the original language, for example when it finds labels in the captured data, that correspond to labels in its list, that it knows are in a specific language.
In addition, in an exemplary embodiment of the invention, the transformation program can be used to identify currency values identified in the captured data stream or file, and then convert the identified currency values to corresponding values in different currencies (e.g., from yen to dollars) based on a known or designated exchange rate. A default exchange rate can be used, for example the exchange rate that was in effect when a) the original data were created, b) the data stream or file was captured, c) the conversion was performed, or d) a date indicated by a user. The user can also specify the exchange rate. In accordance with another embodiment of the invention illustrated in FIG.
2, a method is provided for importing at least a portion of an XBRL compliant data set into a non XBRL compliant target application. The method includes the steps of exporting data from the target program in an export file, a user associating entries in the export file with labels defined in one or more appropriate XBRL taxonomies, and forming an import file for import into the target program by replacing data in the export file at entries associated with specific labels, with data from the data set having corresponding labels. The associations made by the user are stored for later use, so that an import file can be automatically created by replacing data in a file having the same format as the originally exported file, based on the stored associations. An import file template can be generated based on the structure of the export file and the associations made by the user, and an import file can then be formed by populating the import file template with data by entering the data based on labels associated with both the data being entered and entries in the import file template. The template can of course be reused to import different sets of data. The user can indicate associations between entries in an export/import file format in any appropriate or suitable way. For example, the user can insert data associated with labels into various entries of the export file, and then software can scan the entries in the export file, discern the associated labels based on the newly entered data, and then store the associations for later use when populating an (empty) import file template with data for import into the target program or target application. A structure of the export file together with the stored associations can represent an import file template. The newly entered data can include the labels themselves. Alternatively, software can, for each entry in the export file, present a list of labels, and a user can select one or more appropriate labels from the list to indicate the association, which is then stored. The template can be populated with data for import, for example, by discerning a label associated with a datum to be imported, locating an entry in the template associated with the same label, entering the datum into the located entry in the template, and repeating these steps for all data in a data set to be imported.
As shown in FIG. 2, in step 202 data is exported from target application or program in an export file. From step 202 control proceeds to step 204, where a user associates entries in the export file, with labels, for example labels defined in an XBRL taxonomy. From step 204 control proceeds to step 206, where the associations made by the user are stored. From step 206 control proceeds to step 208, where an import file is generated by replacing data in the export file at entries or locations associated with the (e.g. , XBRL) labels, with new data having corresponding labels.
In another embodiment of the invention illustrated in FIG. 3, a method is provided for importing at least a portion of a set of data into a target application, where the data set including labels indicating information about data in the data set, and where the labels are defined in one or more taxonomies. For example, where the data set is XBRL compliant and the labels are defined in one or more XBRL taxonomies. The method includes a program observing a user entering data associated with the labels into the target application, and storing key strokes associated with the entry of data for each different label. Then, when the data entry program receives an XBRL compliant data set for entry into the target application (which can be non XBRL and non XML compliant), this program or a different program can enter the data from the data set into the target application, by performing the stored key strokes corresponding to the labels associated with the data in the data set. If the program that is automatically entering data into the target application, encounters a data item having a label for which no keystrokes are stored, it can prompt the user to enter the data item into the target application, and then observe and store the user's keystrokes for future use.
As shown in FIG. 3, in a first step 302 a first software application observes a user entering data associated with the labels, into a target application. From step 302 control proceeds to step 304, wherein the first application stores observed keystrokes associated with entry of data for each different label (e.g. , XBRL label). From step 304, control proceeds to step 306, where the first application receives a data set for entry into the target application. From step 306, control proceeds to the 308, where the first application enters data from the data set into the target application, by performing the stored keystrokes corresponding to the labels associated with the data in the data set. From step 308, control proceeds to step 310, where a determination is made by the first application, whether it has encountered any data in the data set for which it has no stored keystrokes. In other words, whether there is any data in the data set having a label for which the first application has not stored or observed any keystrokes. If yes, then control proceeds to step 312, where the first application prompts the user to enter the data item into the target application, or otherwise provide an appropriate sequence of keystrokes to enter the data item into the target application. For example, an appropriate sequence could be selected from a menu or group of prerecorded keystroke sequences. From step 312, control proceeds to step 314, where the provided keystroke sequence is stored for future use by the first application. From step 314, control proceeds to step 316.
If in step 310 the determination is negative, then control proceeds to step 316. In step 316, the first application determines whether all relevant data in the data set has been entered into the target application. If yes, then control proceeds to step 318, where the process ends. If no, then control returns to step 308. "Relevant" data can be determined or handled subject to the considerations discussed above with respect to step 112 of FIG. 1. In accordance with another embodiment of the invention illustrated in FIG.
4, a method is provided for importing or inputting at least a portion of a data set into a target database. The method includes entering test data into the target database, and then searching or scanning the database for patterns corresponding to the test data. A pattern recognition application that is independent from the database can be used for this purpose. A structure of the database is modeled based on the search results. Thereafter, the database can be directly accessed using the modeled structure. In particular, the modeling process includes associating locations within the database structure with labels, where the labels correspond to elements of the test data that were found at the locations during the step of searching. A data element can then be inserted directly to a specific location within the database, using for example an independent software application, based on a label associated with both the location and the element.
As shown in FIG. 4, in a step 402, a set of test data is imported or inputted into a target database. Preferably the set is entered into the database in a conventional fashion, for example by key entry through an interface of an application that manages the database. The database can be separate from the managing application, or can be embedded within the managing application. From step 402, control proceeds to step 404, where the database is scanned by an independent software application, for example a pattern recognition application suc as that manufactured by the British company RevGen Pic. and distributed by their U.S. affiliate, Generos corporation. The independent application searches or scans the database for patterns corresponding to the set of test data.
From step 404 control proceeds to step 406, where an independent application (for example, the pattern recognition application or another, separate application) constructs a model of the structure of the database, based on the search/scan results. From step 406 control proceeds to step 408, where locations in the database structure are associated with labels, for example labels defined in one or more XBRL taxonomies. The labels correspond to elements of the test data found at those locations in the database structure during the search/scan. From step 408, control proceeds to step 410, where an element from a data set is imported directed into the database based on a label associated with both the location and the element.
FIG. 5 shows a transformation program consistent with the embodiment described in FIG. 1. As shown in FIG. 5, the transformation program can be independent and separate from the target program. Specifically, a target program 502 provides an output file such as a print file to a transformation program 504. The transformation program 504 is configured so that it appears to the target program as a printer driver. The transformation program 504 does not require or perform any modification or alteration of the target program 502, and can be designed or configured to function compatibly with commercially available target programs such as spreadsheets, accounting programs, word processing programs, and so forth.
FIG. 6 shows that the transformation program and the target program of the embodiment described in FIG. 1, can alternatively be implemented respectively as a module 604 and a target program module 602 together within an application 606. The module 604 can appear as a printer driver to the module 602. For example, the transformation program module 604 can be implemented as a DLL, OCX, Active X Control program, or in any other form that can be marketed to software vendors for integration into independently developed applications.
FIG. 7 shows an exemplary structure of various embodiments of the invention, with respect to the software layers of a computer. In particular, FIG. 7 shows an application layer 702, at which any Windows™ application 701 operates on the computer. Below the layer 702 is a Windows™ OS (Operating System) layer 704, in which can be found a GDI (Graphics Interface Device) 703. Below the layer 704 is a low level OS interface layer 706, at which an XBRL Printer Driver 705 in accordance with the invention can be found. Below the layer 706, is an XML mapping agent layer 708, with an XML Mapping Agent 707. Below the layer 708 is a data conversion layer 710, which includes a Data Converter 709 that outputs or can output the data in a variety of formats, including the formats 712- 722 shown (HTML, Excel™, XML, SQL, xBase, and ASCII respectively). In accordance with various exemplary embodiments of the invention, a transformation program that performs various functions of the invention includes the XBRL Printer Driver 705, the XBRL Mapping Agent 707, and the Data Converter 709. Although the elements 705, 707 are shown as being XBRL- related, the elements 705, 707 can be related to any or all of the formats, taxonomies, protocols, standards, etc. described above and their equivalents. In addition, the formats 712-722 are exemplary and not limiting.
With respect to each of the described embodiments, information provided by the user, for example associations between data from a target application and XBRL labels or tags, can be made using drag-and-drop, cut-and-paste, selection of items from a proffered menu, keyboard entry, or any other appropriate technique. In addition, the described embodiments can be variously combined. Extracting data from the target program or target application can include, in addition to or instead of obtaining a print file, accessing data directly from a file or out of a database without running or launching the parent (target) application, scraping data off of a display screen or window, and so forth. Those skilled in the art will recognize that the software functions described herein can be variously implemented as a) software instructions running on a hardware machine such as a desktop computer having a central microprocessor, b) appropriately configured Field Programmable Gate Array(s) (FPGAs), c) Application Specific Integrated Circuit(s) (ASICs), or any other equivalent or suitable computation device.
It will be appreciated by those skilled in the art that the present invention can be embodied in other specific forms without departing from the spirit or essential characteristics thereof, and that the invention is not limited to the specific embodiments described herein. The presently disclosed embodiments are therefore considered in all respects to be illustrative and not restrictive. The scope of the invention is indicated by the appended claims rather than the foregoing description, and all changes that come within the meaning and range and equivalents thereof are intended to be embraced therein.

Claims

Claims:
1. A computer implemented method for organizing data, the method comprising the steps of: capturing a data stream; identifying data in the captured data stream; and mapping the identified data to at least one of a file structure, a schema, and a taxonomy.
2. The method of claim 1, wherein the steps of capturing and identifying are performed by different computers.
3. The method of claim 1, comprising: storing the captured data stream; wherein the identifying step is performed on the stored data stream.
4. The method of claim 1, wherein the steps of capturing and identifying are performed at different locations.
5. The method of claim 1, wherein: the step of identifying comprises identifying a first value in a first currency; and the step of mapping comprises determining a second value in a second currency, based on the first value and a conversion factor.
6. The method of claim 5, wherein a user specifies the second currency and the conversion factor.
7. The method of claim 6, wherein the user specifies the conversion factor by indicating a date on which the conversion factor is known.
8. The method of 1, wherein: the identified data are organized in accordance with a first standard; and the step of mapping comprises organizing the identified data in accordance with a second standard.
9. The method of 8, wherein: the first standard is one of United States GAAP (Generally Accepted Accounting Principles), and International GAAP; and the second standard is the other of United States GAAP and International GAAP.
10. The method of claim 1, wherein the step of mapping maps the identified data to an extensible Markup Language (XML) taxonomy.
11. The method of claim 1 , wherein the step of mapping maps the identified data into a spreadsheet.
12. The method of claim 1, wherein the step of mapping maps the identified data into a database.
13. The method of claim 1, wherein the step of mapping maps the identified data to a flat file.
14. The method of claim 13, comprising outputting a data definition that defines a structure of the flat file.
15. The method of claim 14, wherein the structure indicates locations of the mapped data within the flat file.
16. The method of claim 1, wherein the data stream is in the form of a data output to a computer display screen.
17. The method of claim 1, wherein the data stream is in the form of a data output to a computer data port.
18. The method of claim 1 , wherein the data stream is in the form of a data output to a data storage device.
19. The method of claim 18, wherein the data storage device is a Random Access Memory in a computer.
20. The method of claim 1, wherein the data storage device is a disk drive.
21. The method of claim 1, wherein the data stream is generated at an
Operating System level of a computer implementing the method.
22. The method of claim 1, wherein: the identifying step comprises identifying metadata in the data stream; and the mapping step comprises selecting labels that correspond to the identified metadata, based on a list associating labels with metadata.
23. The method of claim 22, comprising a step of adding the selected labels into the data stream to label at least one of the identified metadata and the identified data.
24. The method of 23, comprising the step of removing non-selected labels from the data stream.
25. The method of claim 23, comprising a step of creating a file by combining the selected labels with at least the identified data.
26. The method of claim 22, comprising a step of requesting a user to select a label corresponding to a metadatum in the identified data, when the list does not associate a label with the metadatum.
27. The method of claim 26, comprising a step of adding the association indicated by the user's selection, to the list associating labels with metadata.
28. The method of claim 22, wherein the list comprises a plurality of labels associated with a metadatum.
29. The method of claim 28, wherein the plurality of labels comprises synonymous labels.
30. The method of claim 28, wherein the metadatum is identified in the data stream based on a label in the data stream corresponding to the metadatum.
31. The method of claim 30, wherein the selected label is different from the label in the data stream.
32. The method of 28, wherein: the identifying step comprises identifying the metadatum in the data stream; and the mapping step comprises selecting a label from the plurality of labels associated with the identified metadatum.
33. The method of claim 32, wherein the labels in the plurality are in different languages.
34. The method of claim 33, wherein the selected labels are in a specified one of the different languages.
35. The method of claim 34, wherem the specified language is XBRL
(Extensible Business Markup Language).
36. The method of claim 34, wherein the selected labels are in a specified human spoken language.
37. The method of claim 22, wherein the labels are consistent with XML (extensible Markup Language).
38. The method of claim 37, wherein the labels conform to an XBRL (extensible Business Reporting Language) specification.
39. The method of claim 38, wherein the labels are defined in at least one XBRL taxonomy.
40. The method of claim 22, wherein the metadata comprises at least one text string.
41. The method of claim 1, comprising a step of providing the data stream from a target program to a transformation program, wherein the transformation program a) performs the steps of identifying and mapping, and b) appears to the target program as a device driver.
42. The method of claim 41, wherein the transformation program is independent from the target program.
43. The method of claim 41, wherein the transformation program and the target program are modules incorporated within a single program.
44. The method of 1 , wherein the data stream is in a form of data output to a computer printer.
45. A method for transferring information between computers, comprising: in a first computer, converting the mapped data from a first format to an intermediate format; and transferring the converted information to a second computer.
46. The method of 45, comprising: at the second computer, receiving the transferred information and converting the received information from the intermediate format to a second format.
47. The method of 46, wherein the intermediate format is consistent with an XML (extensible Markup Language) taxonomy.
48. The method of 46, comprising: encrypting the information prior to transferring; and decrypting the received transferred information.
49. The method of 46, comprising: performing the steps of converting in the first computer and transferring, in response to a request from the second computer.
50. A method for transferring information between computers, comprising: in a first computer, converting information that is to be transferred, from a first format to an intermediate format; and transferring the converted information to a second computer.
51. The method of 50, comprising: at the second computer, receiving the transferred information and converting the received information from the intermediate format to a second format.
52. The method of 51, wherein the intermediate format is consistent with an XML (extensible Markup Language) taxonomy.
53. The method of 51, comprising: encrypting the information prior to transferring; and decrypting the received transferred information.
54. The method of 51, comprising: performing the steps of converting in the first computer and transferring, in response to a request from the second computer.
55. A method for adding labels to data, the labels including information about the data and being defined in at least one taxonomy, the method comprising the steps of: a transformation program receiving an electronically represented file from a target program, wherein the transformation program appears to the target program as a printer driver; transformation program identifying data in an electronically represented file; and the transformation program selecting labels that correspond to metadata in the identified data, based on a list associating labels with metadata.
56. The method of claim 55, comprising the step of the transformation program adding the selected labels into the electronically represented file to label at least one of a) the elements in the identified data associated with the metadata, and b) the metadata.
57. The method of claim 55, comprising the step of the transformation program creating a new file by combining the selected labels with at least the identified data from the electronically represented file to label at least one of a) the elements in the identified data associated with the metadata, and b) the metadata.
58. A method for forming an import file template for importing at least a portion of a data set into a target application, the data set including labels indicating information about data in the data set, the labels being defined in at least one taxonomy, the method comprising the steps of: the target application exporting data in an export file; a user associating at least one of the entries in the export file with at least one of the labels; and forming the import file template based on a format of the export file and the associated at least one entry and at least one label; and entering data from the date set into the import file template based on labels associated with both the data from the data set being entered and entries in the import file template.
59. The method of 58, comprising the step of storing the associations made by the user.
60. The method of 59, wherein the labels are consistent with XML (extensible Markup Language).
61. The method of 60, wherein the labels conform to an XBRL (extensible Business Reporting Language) specification.
62. The method of claim 61, wherein the target program is not XBRL compliant.
63. A method for importing at least a portion of a data set into a target application, the data set including labels indicating information about data in the data set, the labels being defined in at least one taxonomy, the method comprising the steps of: the target program exporting data in an export file; a user associating entries in the export file with ones of the labels; and forming an import file by replacing data in the export file at entries associated with ones of the labels, with data from the data set, the replacement data having the same labels as the entries.
64. A method for inputting at least a portion of a set of data into a target application, the data set including labels indicating information about data in the data set, the labels being defined in at least one taxonomy, the method comprising the steps of: monitoring entry of data associated with the labels into the target application, and storing key strokes associated with the entry of data for each different label; receiving the data set; and entering data from the data set into the target application, by performing the stored key strokes corresponding to the labels associated with the data in the data set.
65. The method of 64, wherein the program observing the user is a memory resident program.
66. The method of 64, comprising the step of prompting the user to enter a data item into the target application, when no key strokes have been stored for a label associated with the data item.
67. A method for inputting at least a portion of a data set into a target database, the data set including labels indicating information about data in the data set, the labels being defined in at least one taxonomy, the method comprising the steps of: inputting test data into the target database; searching the database for patterns corresponding to the test data; modeling a structure of the database based on the search results; and directly accessing the database using the modeled structure to perform at least one of inserting data into, or retrieving data from, the database.
68. The method of claim 67, wherein the step of searching is performed by a pattern recognition application.
69. The method of claim 67, comprising the step of associating locations within the database structure with labels, the labels corresponding to elements of the test data found at the locations during the step of searching.
70. The method of claim 69, comprising the step of inserting an element of the data set into a location within the database, based on a label associated with both the location and the element.
PCT/US2002/025700 2001-08-17 2002-08-14 Method for adding metadata to data WO2003017141A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
KR1020047002364A KR100990018B1 (en) 2001-08-17 2002-08-14 Method for adding metadata to data
CN02820705XA CN1571968B (en) 2001-08-17 2002-08-14 Method for adding metadata correlated with data to the data in data aggregate
JP2003521979A JP2005500623A (en) 2001-08-17 2002-08-14 How to add metadata to data
EP02753452A EP1425683A4 (en) 2001-08-17 2002-08-14 Method for adding metadata to data
NZ531175A NZ531175A (en) 2001-08-17 2002-08-14 Method for adding metadata to data
HK05106337.4A HK1073898A1 (en) 2001-08-17 2005-07-25 Method for adding to collection of data metadata associated with the data
AU2009201447A AU2009201447A1 (en) 2001-08-17 2009-04-14 Method for adding metadata to data

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US31278801P 2001-08-17 2001-08-17
US60/312,788 2001-08-17
US10/086,522 US6947947B2 (en) 2001-08-17 2002-03-04 Method for adding metadata to data
US10/086,522 2002-03-04

Publications (2)

Publication Number Publication Date
WO2003017141A1 true WO2003017141A1 (en) 2003-02-27
WO2003017141A9 WO2003017141A9 (en) 2003-09-12

Family

ID=26774839

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/025700 WO2003017141A1 (en) 2001-08-17 2002-08-14 Method for adding metadata to data

Country Status (9)

Country Link
US (2) US6947947B2 (en)
EP (2) EP2325757A3 (en)
JP (2) JP2005500623A (en)
KR (1) KR100990018B1 (en)
CN (1) CN1571968B (en)
AU (1) AU2009201447A1 (en)
HK (1) HK1073898A1 (en)
NZ (1) NZ531175A (en)
WO (1) WO2003017141A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005302008A (en) * 2004-03-22 2005-10-27 Microsoft Corp Method and system for schema-less data mapping using nested table
JP2006252201A (en) * 2005-03-10 2006-09-21 Fujitsu Ltd Source code analysis program
JP2007527058A (en) * 2004-02-06 2007-09-20 ユービーマトリックス・インク Form composition mechanism and method for linking data and meta data
GB2444514A (en) * 2006-12-04 2008-06-11 Glasswall Electronic file re-generation
WO2009005964A2 (en) * 2007-06-29 2009-01-08 Microsoft Corporation Content-based tagging of rss feeds and e-mail
US8869283B2 (en) 2005-06-09 2014-10-21 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US9330264B1 (en) 2014-11-26 2016-05-03 Glasswall (Ip) Limited Statistical analytic method for the determination of the risk posed by file based content
US9729513B2 (en) 2007-11-08 2017-08-08 Glasswall (Ip) Limited Using multiple layers of policy management to manage risk
US9832222B2 (en) 2013-10-04 2017-11-28 Glasswall (Ip) Limited Anti-malware mobile content data management apparatus and method

Families Citing this family (135)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10839321B2 (en) 1997-01-06 2020-11-17 Jeffrey Eder Automated data storage system
US20040172484A1 (en) * 2000-04-04 2004-09-02 Gudmundur Hafsteinsson Device-specific communicating between a transmitting device and a receving device
US20040193503A1 (en) * 2000-10-04 2004-09-30 Eder Jeff Scott Interactive sales performance management system
US20040148568A1 (en) * 2001-06-13 2004-07-29 Springer Timothy Stephen Checker and fixer algorithms for accessibility standards
US20050144166A1 (en) * 2003-11-26 2005-06-30 Frederic Chapus Method for assisting in automated conversion of data and associated metadata
US7220910B2 (en) 2002-03-21 2007-05-22 Microsoft Corporation Methods and systems for per persona processing media content-associated metadata
JP2003316765A (en) * 2002-04-23 2003-11-07 Hitachi Ltd Hierarchized document mapping device
US20030210428A1 (en) * 2002-05-07 2003-11-13 Alex Bevlin Non-OCR method for capture of computer filled-in forms
US7283989B1 (en) * 2002-09-27 2007-10-16 At&T Bls Intellectual Property, Inc. System and method for use of application metadata
US7900052B2 (en) 2002-11-06 2011-03-01 International Business Machines Corporation Confidential data sharing and anonymous entity resolution
US8538840B2 (en) 2002-12-20 2013-09-17 Siebel Systems, Inc. Financial services data model
US7856454B2 (en) 2002-12-20 2010-12-21 Siebel Systems, Inc. Data model for business relationships
EP1441499A1 (en) * 2003-01-21 2004-07-28 Deutsche Thomson-Brandt Gmbh Device and method for generating metadata from essence
US20040167905A1 (en) * 2003-02-21 2004-08-26 Eakin William Joseph Content management portal and method for managing digital assets
US8117130B2 (en) * 2003-02-25 2012-02-14 Stragent, Llc Batch loading and self-registration of digital media files
US8392298B2 (en) * 2003-03-04 2013-03-05 Siebel Systems, Inc. Invoice adjustment data object for a common data object format
US8473399B2 (en) * 2003-03-04 2013-06-25 Siebel Systems, Inc. Invoice data object for a common data object format
US8489470B2 (en) * 2003-03-24 2013-07-16 Siebel Systems, Inc. Inventory location common object
AU2003901428A0 (en) * 2003-03-24 2003-04-10 Objective Systems Pty Ltd A system and method for formatting and distributing reading material
US8510179B2 (en) 2003-03-24 2013-08-13 Siebel Systems, Inc. Inventory transaction common object
US9704120B2 (en) 2003-03-24 2017-07-11 Oracle International Corporation Inventory balance common object
US20070208577A1 (en) * 2003-03-24 2007-09-06 Leon Maria T B Position common object
US7912932B2 (en) * 2003-03-24 2011-03-22 Siebel Systems, Inc. Service request common object
US7904340B2 (en) * 2003-03-24 2011-03-08 Siebel Systems, Inc. Methods and computer-readable medium for defining a product model
JP2006523353A (en) * 2003-03-24 2006-10-12 シーベル システムズ,インコーポレイティド Common common object
US20070226037A1 (en) * 2003-03-25 2007-09-27 Shailendra Garg Modeling of opportunity data
US20040199537A1 (en) * 2003-04-03 2004-10-07 Duff Robert Cory System for storing and retrieving database information
US20040205547A1 (en) * 2003-04-12 2004-10-14 Feldt Kenneth Charles Annotation process for message enabled digital content
US7856388B1 (en) * 2003-08-08 2010-12-21 University Of Kansas Financial reporting and auditing agent with net knowledge for extensible business reporting language
US7546288B2 (en) * 2003-09-04 2009-06-09 Microsoft Corporation Matching media file metadata to standardized metadata
US7908548B2 (en) * 2003-10-24 2011-03-15 Oracle International Corporation User interface for establishing mappings from internal metadata to external metadata
US20080052687A1 (en) * 2003-11-03 2008-02-28 Agustin Gonzales-Tuchmann Development environment for data transformation applications
US8135755B2 (en) * 2005-06-29 2012-03-13 Microsoft Corporation Templates in a schema editor
US8112296B2 (en) * 2004-05-21 2012-02-07 Siebel Systems, Inc. Modeling of job profile data
US7865390B2 (en) * 2004-05-21 2011-01-04 Siebel Systems, Inc. Modeling of employee performance result data
US7904488B2 (en) 2004-07-21 2011-03-08 Rockwell Automation Technologies, Inc. Time stamp methods for unified plant model
EP1789894A4 (en) * 2004-08-02 2007-09-19 Justsystems Corp Document processing and management approach to making changes to a document and its representation
KR20070058684A (en) * 2004-09-27 2007-06-08 유비매트릭스 인코포레이티드 Method for searching data elements on the web using a conceptual metadata and contextual metadata search engine
US8756521B1 (en) 2004-09-30 2014-06-17 Rockwell Automation Technologies, Inc. Systems and methods for automatic visualization configuration
CA2489628A1 (en) * 2004-12-07 2006-06-07 Ibm Canada Limited - Ibm Canada Limitee System and method for dynamic editing support and validation of application specific information on business objects
US7415482B2 (en) * 2005-02-11 2008-08-19 Rivet Software, Inc. XBRL enabler for business documents
US20090019358A1 (en) * 2005-02-11 2009-01-15 Rivet Software, Inc. A Delaware Corporation Extensible business reporting language (xbrl) enabler for business documents
US7392263B2 (en) * 2005-02-28 2008-06-24 Microsoft Corporation File system represented inside a database
US7472346B2 (en) * 2005-04-08 2008-12-30 International Business Machines Corporation Multidimensional XBRL engine
WO2006113895A2 (en) * 2005-04-20 2006-10-26 Dynamic Access Systems Llc Xbrl data conversion
US20060242181A1 (en) * 2005-04-22 2006-10-26 Business Objects Apparatus and method for transforming XBRL data into database schema
US20060259519A1 (en) * 2005-05-12 2006-11-16 Microsoft Corporation Iterative definition of flat file data structure by using document instance
US8799800B2 (en) 2005-05-13 2014-08-05 Rockwell Automation Technologies, Inc. Automatic user interface generation
US7676281B2 (en) 2005-05-13 2010-03-09 Rockwell Automation Technologies, Inc. Distributed database in an industrial automation environment
US7672737B2 (en) 2005-05-13 2010-03-02 Rockwell Automation Technologies, Inc. Hierarchically structured data model for utilization in industrial automation environments
US7809683B2 (en) 2005-05-13 2010-10-05 Rockwell Automation Technologies, Inc. Library that includes modifiable industrial automation objects
US7650405B2 (en) 2005-05-13 2010-01-19 Rockwell Automation Technologies, Inc. Tracking and tracing across process boundaries in an industrial automation environment
US8527540B2 (en) * 2005-05-24 2013-09-03 Business Objects Software Ltd. Augmenting a report with metadata for export to a non-report document
US7831913B2 (en) * 2005-07-29 2010-11-09 Microsoft Corporation Selection-based item tagging
US20070050698A1 (en) * 2005-08-29 2007-03-01 Stefan Chopin Add-in tool and method for rendering financial data into spreadsheet compliant format
US7917841B2 (en) * 2005-08-29 2011-03-29 Edgar Online, Inc. System and method for rendering data
US7877678B2 (en) * 2005-08-29 2011-01-25 Edgar Online, Inc. System and method for rendering of financial data
EP3133809A3 (en) * 2005-09-02 2017-03-01 GVBB Holdings S.A.R.L Automatic metadata extraction and metadata controlled production process
WO2007034482A2 (en) * 2005-09-20 2007-03-29 Sterna Technologies (2005) Ltd. A method and system for managing data and organizational constraints
US7881812B2 (en) 2005-09-29 2011-02-01 Rockwell Automation Technologies, Inc. Editing and configuring device
US7548789B2 (en) 2005-09-29 2009-06-16 Rockwell Automation Technologies, Inc. Editing lifecycle and deployment of objects in an industrial automation environment
US7734590B2 (en) 2005-09-30 2010-06-08 Rockwell Automation Technologies, Inc. Incremental association of metadata to production data
US8484250B2 (en) 2005-09-30 2013-07-09 Rockwell Automation Technologies, Inc. Data federation with industrial control systems
US7660638B2 (en) 2005-09-30 2010-02-09 Rockwell Automation Technologies, Inc. Business process execution engine
US8275680B2 (en) 2005-09-30 2012-09-25 Rockwell Automation Technologies, Inc. Enabling transactional mechanisms in an automated controller system
US7801628B2 (en) 2005-09-30 2010-09-21 Rockwell Automation Technologies, Inc. Industrial operator interfaces interacting with higher-level business workflow
KR101240917B1 (en) * 2005-10-12 2013-03-08 주식회사 더존다스 Taxonomy definition structure for extendible business reporting language and method of editing financial information using the same
WO2007071548A1 (en) * 2005-12-22 2007-06-28 International Business Machines Corporation Method and system for automatically generating multilingual electronic content from unstructured data
AU2007200384B2 (en) * 2006-01-31 2012-08-23 Thomson Reuters Enterprise Centre Gmbh Definitions in master documents
US7743026B2 (en) * 2006-01-31 2010-06-22 Microsoft Corporation Redirection to local copies of server-based files
US7703099B2 (en) * 2006-02-24 2010-04-20 Microsoft Corporation Scalable transformation and configuration of EDI interchanges
US8230329B2 (en) * 2006-03-22 2012-07-24 Rivet Software, Inc. Enterprise-level transaction analysis and reporting
US8204213B2 (en) * 2006-03-29 2012-06-19 International Business Machines Corporation System and method for performing a similarity measure of anonymized data
US7836394B2 (en) * 2006-04-18 2010-11-16 Savanet Llc Interactive, customizable display and analysis of electronically tagged financial information
US7953713B2 (en) * 2006-09-14 2011-05-31 International Business Machines Corporation System and method for representing and using tagged data in a management system
US7783641B2 (en) * 2006-10-26 2010-08-24 Microsoft Corporation Taxonometric personal digital media organization
US8949864B2 (en) * 2006-10-30 2015-02-03 Sap Se Method and system for providing a report connector
US8204831B2 (en) 2006-11-13 2012-06-19 International Business Machines Corporation Post-anonymous fuzzy comparisons without the use of pre-anonymization variants
US8185872B2 (en) * 2006-11-27 2012-05-22 Red Hat, Inc. Cross-platform software package generation
US7788247B2 (en) 2007-01-12 2010-08-31 Microsoft Corporation Characteristic tagging
US8156154B2 (en) * 2007-02-05 2012-04-10 Microsoft Corporation Techniques to manage a taxonomy system for heterogeneous resource domain
US20080189265A1 (en) * 2007-02-06 2008-08-07 Microsoft Corporation Techniques to manage vocabulary terms for a taxonomy system
US20080262965A1 (en) * 2007-03-23 2008-10-23 Niko Spyridonos System and method for processing payment of property taxes between multiple payors and multiple municipalities
US8099370B2 (en) * 2007-04-03 2012-01-17 Microsoft Corporation System for financial documentation conversion
US20080275733A1 (en) * 2007-05-03 2008-11-06 Volker Schmidt Method for evaluation of patient identification
EP1995667A1 (en) * 2007-05-25 2008-11-26 Software Ag Method and system for processing a non-XML document for storage in a XML database
US20080294604A1 (en) * 2007-05-25 2008-11-27 International Business Machines Xquery join predicate selectivity estimation
US20080301096A1 (en) * 2007-05-29 2008-12-04 Microsoft Corporation Techniques to manage metadata fields for a taxonomy system
US7970649B2 (en) * 2007-06-07 2011-06-28 Christopher Jay Wu Systems and methods of task cues
KR100884200B1 (en) * 2007-06-29 2009-02-18 이화여자대학교 산학협력단 System for management contents based on meaning identification of tag and method thereof
US8176003B2 (en) 2007-06-29 2012-05-08 Microsoft Corporation Automatic designation of XBRL taxonomy tags
US8560950B2 (en) * 2007-09-04 2013-10-15 Apple Inc. Advanced playlist creation
US8140531B2 (en) * 2008-05-02 2012-03-20 International Business Machines Corporation Process and method for classifying structured data
WO2010008685A2 (en) 2008-05-28 2010-01-21 Aptima, Inc. Systems and methods for analyzing entity profiles
US9076126B2 (en) * 2008-11-07 2015-07-07 R.R. Donnelley Financial, Inc. Document management system
US9047346B2 (en) * 2008-11-11 2015-06-02 Microsoft Technology Licensing, Llc Reporting language filtering and mapping to dimensional concepts
WO2010074655A1 (en) * 2008-12-24 2010-07-01 Nachiappan Sudha System and method for providing an incentivized tagging system for electronic resources
US8504624B2 (en) * 2009-09-08 2013-08-06 Ricoh Co., Ltd. Stroke and image aggregation and analytics
US20110072047A1 (en) * 2009-09-21 2011-03-24 Microsoft Corporation Interest Learning from an Image Collection for Advertising
JP5424807B2 (en) * 2009-10-15 2014-02-26 キヤノン株式会社 Image processing apparatus and image processing apparatus control method
US8478705B2 (en) * 2010-01-15 2013-07-02 International Business Machines Corporation Portable data management using rule definitions
US8484401B2 (en) 2010-04-15 2013-07-09 Rockwell Automation Technologies, Inc. Systems and methods for conducting communications among components of multidomain industrial automation system
US8984533B2 (en) 2010-04-15 2015-03-17 Rockwell Automation Technologies, Inc. Systems and methods for conducting communications among components of multidomain industrial automation system
US9392072B2 (en) 2010-04-15 2016-07-12 Rockwell Automation Technologies, Inc. Systems and methods for conducting communications among components of multidomain industrial automation system
US20110276873A1 (en) * 2010-05-06 2011-11-10 Chethan Gorur System and Method for Re-Using XBRL-Tags Across Period Boundaries
US9703782B2 (en) * 2010-05-28 2017-07-11 Microsoft Technology Licensing, Llc Associating media with metadata of near-duplicates
US8903798B2 (en) 2010-05-28 2014-12-02 Microsoft Corporation Real-time annotation and enrichment of captured video
US8666998B2 (en) 2010-09-14 2014-03-04 International Business Machines Corporation Handling data sets
US8949166B2 (en) 2010-12-16 2015-02-03 International Business Machines Corporation Creating and processing a data rule for data quality
US9430801B2 (en) * 2011-01-25 2016-08-30 Intuit Inc. Methods systems and computer program products for generating financial statement complying with accounting standard
WO2012126015A2 (en) * 2011-03-17 2012-09-20 Xbrl Cloud, Inc. Xbrl database mapping system and method
CN102222083A (en) * 2011-05-06 2011-10-19 中国科学院研究生院 Creation-object-based extensible business reporting language (XBRL) taxonomy rapid-resolution method
US9678992B2 (en) 2011-05-18 2017-06-13 Microsoft Technology Licensing, Llc Text to image translation
US9501455B2 (en) * 2011-06-30 2016-11-22 The Boeing Company Systems and methods for processing data
US8496165B2 (en) 2011-06-30 2013-07-30 Recall Technology Pty Limited Method and system for RFID label replacement of conventional labels on media in a media storage system
US8612489B2 (en) 2011-07-14 2013-12-17 International Business Machines Corporation LossLess transformation of XBRL instance to XML data model instance
US8898104B2 (en) 2011-07-26 2014-11-25 International Business Machines Corporation Auto-mapping between source and target models using statistical and ontology techniques
US9164990B2 (en) * 2011-12-20 2015-10-20 Sap Portals Israel Ltd Annotating contextual workspaces
US8849843B1 (en) 2012-06-18 2014-09-30 Ez-XBRL Solutions, Inc. System and method for facilitating associating semantic labels with content
US9135327B1 (en) 2012-08-30 2015-09-15 Ez-XBRL Solutions, Inc. System and method to facilitate the association of structured content in a structured document with unstructured content in an unstructured document
US10055600B2 (en) * 2012-10-04 2018-08-21 Tata Consultancy Services Limited Analysis and specification creation for web documents
US20150242961A1 (en) * 2014-02-25 2015-08-27 Sri International Enriched financial transaction records
US10783116B2 (en) 2015-03-10 2020-09-22 Royal Bank Of Canada Systems and methods for managing data
US10803229B2 (en) * 2015-07-16 2020-10-13 Thinxtream Technologies Pte. Ltd. Hybrid system and method for data and file conversion across computing devices and platforms
CN106407188A (en) * 2015-07-16 2017-02-15 中兴通讯股份有限公司 Document error-checking method and device
ES2724001T3 (en) * 2015-11-26 2019-09-05 Tata Consultancy Services Ltd System and procedure for enabling data masking for web documents
CN106997369B (en) * 2016-01-26 2020-11-24 阿里巴巴集团控股有限公司 Data cleaning method and device
US10956469B2 (en) 2017-01-06 2021-03-23 International Business Machines Corporation System and method for metadata correlation using natural language processing
CN107590266B (en) * 2017-09-22 2021-03-30 深圳易嘉恩科技有限公司 Method for replicating XBRL classification criteria
WO2019146090A1 (en) * 2018-01-26 2019-08-01 富士通株式会社 Evaluation program, information processing device, and evaluation method
JP2021009528A (en) * 2019-06-30 2021-01-28 大 益滿 Database and information processing system
KR102249350B1 (en) * 2019-08-23 2021-05-07 넷마블 주식회사 System and method for processing data
KR102558521B1 (en) * 2021-03-30 2023-07-21 주식회사 잇다헬스케어 Server for recommending solution based on user health information and mehtod thereof
CN113377741A (en) * 2021-05-28 2021-09-10 中国铁道科学研究院集团有限公司电子计算技术研究所 Method and device for managing metadata of railway engineering design
CN114863906B (en) * 2022-07-07 2022-10-28 北京中电慧声科技有限公司 Method and device for marking alias of text-to-speech processing

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6049819A (en) * 1997-12-10 2000-04-11 Nortel Networks Corporation Communications network incorporating agent oriented computing environment
US6151624A (en) * 1998-02-03 2000-11-21 Realnames Corporation Navigating network resources based on metadata

Family Cites Families (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4939698A (en) * 1966-04-26 1990-07-03 The United States Of America As Represented By The Secretary Of The Navy Sonar system
US4604607A (en) * 1979-12-27 1986-08-05 Protection Products Corporation Security device simulating currency pack or the like
US4993030A (en) * 1988-04-22 1991-02-12 Amdahl Corporation File system for a plurality of storage classes
US5491495A (en) * 1990-11-13 1996-02-13 Wang Laboratories, Inc. User interface having simulated devices
US5418957A (en) * 1992-03-09 1995-05-23 Narayan; Rom Network data dictionary
US5423034A (en) * 1992-06-10 1995-06-06 Cohen-Levy; Leon Network file management with user determined hierarchical file structures and means for intercepting application program open and save commands for inputting and displaying user inputted descriptions of the location and content of files
CA2099916C (en) * 1992-07-06 2002-12-24 Steven J. Sinofsky Method and system for organizing internal structure of a file
US5978811A (en) * 1992-07-29 1999-11-02 Texas Instruments Incorporated Information repository system and method for modeling data
CA2124752C (en) * 1993-06-30 2005-04-12 Mark Zbikowski Meta-data structure and handling
EP0647909B1 (en) * 1993-10-08 2003-04-16 International Business Machines Corporation Information catalog system with object-dependent functionality
US5590319A (en) * 1993-12-15 1996-12-31 Information Builders, Inc. Query processor for parallel processing in homogenous and heterogenous databases
US5909570A (en) * 1993-12-28 1999-06-01 Webber; David R. R. Template mapping system for data translation
US5586264A (en) * 1994-09-08 1996-12-17 Ibm Corporation Video optimized media streamer with cache management
US5630092A (en) * 1994-10-20 1997-05-13 International Business Machines System and method for transferring compressed and uncompressed data between storage systems
US5873093A (en) * 1994-12-07 1999-02-16 Next Software, Inc. Method and apparatus for mapping objects to a data source
US6181867B1 (en) * 1995-06-07 2001-01-30 Intervu, Inc. Video storage and retrieval system
US5713032A (en) * 1996-01-03 1998-01-27 Eastman Kodak Company Compound document processing system
US5862325A (en) * 1996-02-29 1999-01-19 Intermind Corporation Computer-based communication system and method using metadata defining a control structure
US6035330A (en) * 1996-03-29 2000-03-07 British Telecommunications World wide web navigational mapping system and method
US5721911A (en) * 1996-06-25 1998-02-24 International Business Machines Corporation Mechanism for metadata for an information catalog system
US5857203A (en) * 1996-07-29 1999-01-05 International Business Machines Corporation Method and apparatus for dividing, mapping and storing large digital objects in a client/server library system
US6131100A (en) * 1997-03-04 2000-10-10 Zellweger; Paul Method and apparatus for a menu system for generating menu data from external sources
US5924090A (en) * 1997-05-01 1999-07-13 Northern Light Technology Llc Method and apparatus for searching a database of records
US6282548B1 (en) * 1997-06-21 2001-08-28 Alexa Internet Automatically generate and displaying metadata as supplemental information concurrently with the web page, there being no link between web page and metadata
US6199046B1 (en) * 1997-07-29 2001-03-06 Adsura Pty Ltd. Method system and article of manufacture for performing real time currency conversion
US6163781A (en) * 1997-09-11 2000-12-19 Physician Weblink Technology Services, Inc. Object-to-relational data converter mapping attributes to object instance into relational tables
US6061692A (en) * 1997-11-04 2000-05-09 Microsoft Corporation System and method for administering a meta database as an integral component of an information server
US6014670A (en) * 1997-11-07 2000-01-11 Informatica Corporation Apparatus and method for performing data transformations in data warehousing
US6055543A (en) * 1997-11-21 2000-04-25 Verano File wrapper containing cataloging information for content searching across multiple platforms
US6028605A (en) * 1998-02-03 2000-02-22 Documentum, Inc. Multi-dimensional analysis of objects by manipulating discovered semantic properties
US7079269B2 (en) * 1998-03-19 2006-07-18 Océ-Technologies B.V. Interactive printing with a plurality of printer devices
US6199059B1 (en) * 1998-04-22 2001-03-06 International Computex, Inc. System and method for classifying and retrieving information with virtual object hierarchy
US6240407B1 (en) * 1998-04-29 2001-05-29 International Business Machines Corp. Method and apparatus for creating an index in a database system
US6212524B1 (en) * 1998-05-06 2001-04-03 E.Piphany, Inc. Method and apparatus for creating and populating a datamart
US6256031B1 (en) * 1998-06-26 2001-07-03 Microsoft Corporation Integration of physical and virtual namespace
US6363377B1 (en) * 1998-07-30 2002-03-26 Sarnoff Corporation Search data processor
US6449341B1 (en) * 1998-08-25 2002-09-10 Mci Communications Corporation Apparatus and method for managing a software system via analysis of call center trouble tickets
US6405215B1 (en) * 1998-11-06 2002-06-11 International Business Machines Corp. Workflow agent for a multimedia database system
US6389426B1 (en) * 1999-02-09 2002-05-14 Worldcom, Inc. Central trouble ticket database and system and method for managing same to facilitate ticketing, trending, and tracking processes
US6535949B1 (en) * 1999-04-19 2003-03-18 Research In Motion Limited Portable electronic device having a log-structured file system in flash memory
EP1065584A1 (en) * 1999-06-29 2001-01-03 Telefonaktiebolaget Lm Ericsson Command handling in a data processing system
JP2001034513A (en) * 1999-07-19 2001-02-09 Hitachi Ltd Method and device for data conversion, and storage medium storing data conversion program
US6523027B1 (en) * 1999-07-30 2003-02-18 Accenture Llp Interfacing servers in a Java based e-commerce architecture
US6549916B1 (en) * 1999-08-05 2003-04-15 Oracle Corporation Event notification system tied to a file system
US6430575B1 (en) * 1999-09-10 2002-08-06 Xerox Corporation Collaborative document management system with customizable filing structures that are mutually intelligible
US6593943B1 (en) * 1999-11-30 2003-07-15 International Business Machines Corp. Information grouping configuration for use with diverse display devices
US6940614B2 (en) * 2000-01-05 2005-09-06 Hewlett-Packard Development Company, L.P. Method for defining printer driver settings
US6810429B1 (en) * 2000-02-03 2004-10-26 Mitsubishi Electric Research Laboratories, Inc. Enterprise integration system
US7107229B1 (en) * 2000-02-11 2006-09-12 Claremont Investment Partners, Llc Apparatus and method for creating and managing a financial instrument
US6615312B1 (en) * 2000-02-29 2003-09-02 Western Digital Ventures, Inc. Method for processing file system service requests in a computer having an attached disk drive that can reproduce stream data and non-stream data
US6311194B1 (en) * 2000-03-15 2001-10-30 Taalee, Inc. System and method for creating a semantic web and its applications in browsing, searching, profiling, personalization and advertising
US6760721B1 (en) * 2000-04-14 2004-07-06 Realnetworks, Inc. System and method of managing metadata data
US6925452B1 (en) * 2000-05-22 2005-08-02 International Business Machines Corporation Method and system for recognizing end-user transactions
US7076521B2 (en) * 2000-06-26 2006-07-11 Vertical Computer Systems, Inc. Web-based collaborative data collection system
US6571259B1 (en) * 2000-09-26 2003-05-27 Emc Corporation Preallocation of file system cache blocks in a data storage system
US7047491B2 (en) * 2000-12-05 2006-05-16 Schubert Daniel M Electronic information management system for abstracting and reporting document information
US20020129059A1 (en) * 2000-12-29 2002-09-12 Eck Jeffery R. XML auto map generator
US20030018661A1 (en) * 2001-07-19 2003-01-23 Darugar Parand Tony XML smart mapping system and method
GB2395041A (en) * 2001-08-14 2004-05-12 Bloomberg Lp Distribution and mapping of financial records from data stream

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6049819A (en) * 1997-12-10 2000-04-11 Nortel Networks Corporation Communications network incorporating agent oriented computing environment
US6151624A (en) * 1998-02-03 2000-11-21 Realnames Corporation Navigating network resources based on metadata

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007527058A (en) * 2004-02-06 2007-09-20 ユービーマトリックス・インク Form composition mechanism and method for linking data and meta data
JP2005302008A (en) * 2004-03-22 2005-10-27 Microsoft Corp Method and system for schema-less data mapping using nested table
JP4711708B2 (en) * 2004-03-22 2011-06-29 マイクロソフト コーポレーション System and method for schemaless data mapping using nested tables
JP2006252201A (en) * 2005-03-10 2006-09-21 Fujitsu Ltd Source code analysis program
US8869283B2 (en) 2005-06-09 2014-10-21 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US11799881B2 (en) 2005-06-09 2023-10-24 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US11218495B2 (en) 2005-06-09 2022-01-04 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US10462164B2 (en) 2005-06-09 2019-10-29 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US9516045B2 (en) 2005-06-09 2016-12-06 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US10462163B2 (en) 2005-06-09 2019-10-29 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US10419456B2 (en) 2005-06-09 2019-09-17 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
GB2444514A (en) * 2006-12-04 2008-06-11 Glasswall Electronic file re-generation
US8533824B2 (en) 2006-12-04 2013-09-10 Glasswall (Ip) Limited Resisting the spread of unwanted code and data
US10348748B2 (en) 2006-12-04 2019-07-09 Glasswall (Ip) Limited Using multiple layers of policy management to manage risk
US9038174B2 (en) 2006-12-04 2015-05-19 Glasswall IP Limited Resisting the spread of unwanted code and data
WO2009005964A3 (en) * 2007-06-29 2009-03-12 Microsoft Corp Content-based tagging of rss feeds and e-mail
US8239460B2 (en) 2007-06-29 2012-08-07 Microsoft Corporation Content-based tagging of RSS feeds and E-mail
WO2009005964A2 (en) * 2007-06-29 2009-01-08 Microsoft Corporation Content-based tagging of rss feeds and e-mail
US9729513B2 (en) 2007-11-08 2017-08-08 Glasswall (Ip) Limited Using multiple layers of policy management to manage risk
US9832222B2 (en) 2013-10-04 2017-11-28 Glasswall (Ip) Limited Anti-malware mobile content data management apparatus and method
US10360388B2 (en) 2014-11-26 2019-07-23 Glasswall (Ip) Limited Statistical analytic method for the determination of the risk posed by file based content
US9729564B2 (en) 2014-11-26 2017-08-08 Glasswall (Ip) Limited Statistical analytic method for the determination of the risk posed by file based content
US9330264B1 (en) 2014-11-26 2016-05-03 Glasswall (Ip) Limited Statistical analytic method for the determination of the risk posed by file based content

Also Published As

Publication number Publication date
CN1571968B (en) 2010-04-28
JP2011028753A (en) 2011-02-10
JP5883557B2 (en) 2016-03-15
JP2005500623A (en) 2005-01-06
CN1571968A (en) 2005-01-26
US20050182777A1 (en) 2005-08-18
WO2003017141A9 (en) 2003-09-12
EP1425683A4 (en) 2008-10-08
US20030037038A1 (en) 2003-02-20
EP1425683A1 (en) 2004-06-09
KR100990018B1 (en) 2010-10-26
AU2009201447A1 (en) 2009-05-07
EP2325757A2 (en) 2011-05-25
US6947947B2 (en) 2005-09-20
KR20040058177A (en) 2004-07-03
NZ531175A (en) 2006-02-24
HK1073898A1 (en) 2005-10-21
EP2325757A3 (en) 2017-12-06

Similar Documents

Publication Publication Date Title
US6947947B2 (en) Method for adding metadata to data
US8290977B2 (en) Database system providing methodology for execution of functions in XML queries
US8484210B2 (en) Representing markup language document data in a searchable format in a database system
CN102713834B (en) Management accounts format information
US6799184B2 (en) Relational database system providing XML query support
EP1128277A2 (en) Data processing method and system, program for realizing the method, and computer readable storage medium storing the program
US8417513B2 (en) Representation of objects and relationships in databases, directories, web services, and applications as sentences as a method to represent context in structured data
US8219901B2 (en) Method and device for filtering elements of a structured document on the basis of an expression
US7290003B1 (en) Migrating data using an intermediate self-describing format
US9037613B2 (en) Self-learning data lenses for conversion of information from a source form to a target form
US20090125529A1 (en) Extracting information based on document structure and characteristics of attributes
US20090019015A1 (en) Mathematical expression structured language object search system and search method
WO2005029222A2 (en) Method and system for the specification of interface definitions and business rules
US9754083B2 (en) Automatic creation of clinical study reports
US8180799B1 (en) Dynamically creating tables to store received data
AU2012201150A1 (en) Method for adding metadata to data
JP2002534741A (en) Method and apparatus for processing semi-structured text data
AU2002313739A1 (en) Method for adding metadata to data
Kotásek et al. An XML Framework Proposal for Knowledge Discovery in Databases
CN1332302C (en) Navigation in computer software applications developed in procedural language
WO2001018657A1 (en) Dynamic content identifier for xml web pages
Moens et al. Legislative databases: Current problems and possible solutions
Teng Use of xml for web-based query processing of geospatial data
Ormston et al. Lessons Learned from Creating a Universal Parser for Operational Files
Curry et al. Stratigraphic distribution of brachiopods–a new method of storing and querying loosely-structured biodiversity information

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG US UZ VC VN YU ZA ZM

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
COP Corrected version of pamphlet

Free format text: PAGES 1/7-7/7, DRAWINGS, REPLACED BY NEW PAGES 1/7-7/7; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

WWE Wipo information: entry into national phase

Ref document number: 531175

Country of ref document: NZ

Ref document number: 2003521979

Country of ref document: JP

Ref document number: 1020047002364

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2002313739

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 525/DELNP/2004

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2002753452

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2002820705X

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2002753452

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 531175

Country of ref document: NZ