CN104881464A - Data file format defining method - Google Patents

Data file format defining method Download PDF

Info

Publication number
CN104881464A
CN104881464A CN201510271136.0A CN201510271136A CN104881464A CN 104881464 A CN104881464 A CN 104881464A CN 201510271136 A CN201510271136 A CN 201510271136A CN 104881464 A CN104881464 A CN 104881464A
Authority
CN
China
Prior art keywords
file
data
data file
text
definition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510271136.0A
Other languages
Chinese (zh)
Inventor
冯昱川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Ting Ruan Science And Technology Ltd
Original Assignee
Beijing Ting Ruan Science And Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Ting Ruan Science And Technology Ltd filed Critical Beijing Ting Ruan Science And Technology Ltd
Priority to CN201510271136.0A priority Critical patent/CN104881464A/en
Publication of CN104881464A publication Critical patent/CN104881464A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2291User-Defined Types; Storage management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/221Column-oriented storage; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases

Abstract

The invention belongs to the technical field of computers, and provides a data file format defining method. A format of a data file is defined by utilizing a configuration file comprising a relational database tabling statement. The configuration file comprises the column amount of the data file, the column name of each column, the data type, the data length, the empty or not field, the effective value range and the ordering of the column. The problems of uncertainty of the content in a text file and a hard code and the like are solved.

Description

A kind of document format data define method
Technical field
The invention belongs to a kind of document format data define method in field of computer technology.
Background technology
The content of text can be undertaken checking, revising by the default multiple text editor provided of operating system, and linux, unix system also provide the utility routines such as cat, more, head, tail, wc, grep, diff, sort, uniq, sed, awk, optionally can show text, filter, the operation such as amendment, therefore adopt text files memory data to be convenient to exploitation, the debugging of handling procedure, be also convenient to the maintenance of system.Text files memory is clear data in addition, does not deposit label and descriptor makes data space comparatively save, and the efficiency of transmission, process is higher.Therefore text uses widely under linux, unix environment, uses through being usually used in cross-system Data Migration, usually as the file layout of a system derived data, is also the carrier of data transmission and the data source of a system.
Text extensively, frequent use while, also have that deposit data degree of freedom is large, the non-self-described of data, store data content, form define indefinite problem, namely do not have metadata to define.Text lacks metadata definition and causes programming usually to adopt hard coded mode to realize, this makes the data source of text be difficult to revise as required, application developer changes indigestion, the maintenances of rear text, program, and exploitation New function is extremely difficult further.
Text is usually used in importing and exporting of system data, and the multiplex relevant database of system carries out the longer-term storage of data and supporting business is handled, and therefore also requires that text has consistent structure with the table imported and exported corresponding in system.Text lack metadata definition make the consistance of text structure and list structure ensure difficulty, be unfavorable for data importing, derivation engineering carry out and system cloud gray model time conventional maintenance.
Above-mentioned text describes the problem that causes and brings a lot of uncertain and difficult owing to lacking metadata to application and development, system maintenance, inter-system data migration.
Text, text compression stores, text cryptographic storage and other change store the problem that all there is form definition, and this few class file is referred to as data file in this patent, and the form definition of these data files is all within this patent range of application.
Summary of the invention
The present invention is directed to the problems referred to above in data file use, provide a kind of document format data define method.By providing a configuration file comprising data file definition information, determine the form of store data in data file uniquely, to comprise in data file the row name of each row, data type, length, whether can empty, valid value range, also determine quantity and the ordering of data file row simultaneously.The description that data file and these two files of its configuration file not only carry data itself but also contain data, solves the form, the uncertain problem of business implication that wherein store data when only having a data file.
Configuration file content is used for reference relevant database and is built predicative sentence to realize the definition of document format data, and configuration file itself is also a text.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, describe the embodiment of a kind of document format data define method of the present invention in detail below with reference to example.Whereby to a kind of document format data define method of the present invention how application technology means technical solution problem, and the implementation procedure reaching technique effect can fully understand and implement according to this.
Embodiment
The present embodiment provides a kind of document format data define method, and for root name is yuser, the data file having 32 row realizes document format data definition, comprising:
Step 1: fully understand the data file needing definition.
This step fully understands for needing the data file of definition, understand what is the purposes comprising such as data file? the total how many row of data file? what does is the purposes often arranged? does is which kind of type store data? does is length how many? be empty? can do you span be determined?
If there is corresponding table in relevant database, so list structure definition can help to understand data file.
Step 2: the configuration file writing data file.
This step is need the target data file of clear and definite form definition to create configuration file, and correctly arranges the content of configuration file.Configuration filename can be mapped to Data Filename, be named can traditionally, realize convenience decide.By linux, UNIX custom, might as well to fetch data filename important name by configuration file important name, extension name gets .conf.
Under linux operating system, vi yuser.conf input content is as shown in table 1.
Table 1 configuration file content is illustrated
Column data type is consistent with the definition in relevant database, can use char, varchar, longvar, short, tinyint, smallint, int, integer, bigint, long, num, numeric, real, dec, decimal, float, double, date, time, datetime, timestamp.
Content in this arrangements of steps file also can utilize Database Modeling instrument as ERWin, PowerDesigner, by graphical interfaces and the mutualization operation generation of modeling tool, these instrument annexation type databases also can be utilized to read list structure and generate.Modeling tool must not use, and the object mentioned here is that remind can the definition of multiplexing existing list structure, reduces the workload of generating configuration file content, increases work efficiency.
Step 3: resolve configuration file, obtain the definition of data file.
Configuration file store content be exactly in fact standard build predicative sentence, to be also not difficult the rule that finds wherein by this example, the configuration file of foregoing of can having programmed is resolved, and obtains the definition of data file.
Reading file and string processing is the basic function that any programming language all provides, is also the basic program capability of programmer, for ease of implementing, only extracts the roughly process of C statement parse here, can for reference when realizing.
● open configuration file
● first pass configuration file, logarithmic data file number also distributes data file metadata storage space
● second compiles scan profile, the number arranged in logarithmic data object, and allocated column metadata store space
● the 3rd time scan profile, resolve and record the metadata of all row, comprising row name, data type, length, whether can be empty, valid value range.
Step 4: according to the definition process data file of data file.
Understand according to the document format data that step 3 parses, process data file.

Claims (4)

1. a document format data define method, is characterized in that, comprising:
The form of the configuration file definition data file of building predicative sentence is comprised by content.
2. method as claimed in claim 1, wherein builds the standard DDL statement that predicative sentence refers to relevant database, the one section of text guided with " CREATE TABLE " character, and in text, all character boundaries are write insensitive; The independent word followed closely after " CREATE TABLE " key word is data file important name, and the first character Duan Ze building the definition of predicative sentence is first row defining data file, the like complete the definition of all row in data file; Method as claimed in claim 1, wherein data file refers to text, the storage file that the compression storage file of text, the cryptographic storage file of text and File generate via other conversion process, data file stores text data character by it and determines, and its root name, extended file name have nothing to do.
3. method as claimed in claim 2, wherein data file important name refers to the partial document name in filename before character ". "; The definition of data file row comprise row title, data type, length, whether can the information such as sky, valid value range.
4. method as claimed in claim 1, wherein configuration file need comprise at least one and builds predicative sentence, and many also can be had to build predicative sentence for defining the form of multiple data file simultaneously; Building except predicative sentence except comprising in configuration file, comment statement and other required contents can also be had.
CN201510271136.0A 2015-05-26 2015-05-26 Data file format defining method Pending CN104881464A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510271136.0A CN104881464A (en) 2015-05-26 2015-05-26 Data file format defining method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510271136.0A CN104881464A (en) 2015-05-26 2015-05-26 Data file format defining method

Publications (1)

Publication Number Publication Date
CN104881464A true CN104881464A (en) 2015-09-02

Family

ID=53948957

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510271136.0A Pending CN104881464A (en) 2015-05-26 2015-05-26 Data file format defining method

Country Status (1)

Country Link
CN (1) CN104881464A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105808772A (en) * 2016-03-28 2016-07-27 中国建设银行股份有限公司 Data defining file generation method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101211359A (en) * 2006-12-31 2008-07-02 凌群电脑股份有限公司 Database inputting data inspection method and database management system
US20100017395A1 (en) * 2008-07-16 2010-01-21 Sapphire Information Systems Ltd. Apparatus and methods for transforming relational queries into multi-dimensional queries
CN101917433A (en) * 2010-08-17 2010-12-15 上海交通大学 Network remote registration localization reappearing protection system
CN102479184A (en) * 2010-11-23 2012-05-30 大连兆阳软件科技有限公司 Database management system and input data checking method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101211359A (en) * 2006-12-31 2008-07-02 凌群电脑股份有限公司 Database inputting data inspection method and database management system
US20100017395A1 (en) * 2008-07-16 2010-01-21 Sapphire Information Systems Ltd. Apparatus and methods for transforming relational queries into multi-dimensional queries
CN101917433A (en) * 2010-08-17 2010-12-15 上海交通大学 Network remote registration localization reappearing protection system
CN102479184A (en) * 2010-11-23 2012-05-30 大连兆阳软件科技有限公司 Database management system and input data checking method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
丁涛: "基于领域驱动设计的物流平台系统实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105808772A (en) * 2016-03-28 2016-07-27 中国建设银行股份有限公司 Data defining file generation method and device

Similar Documents

Publication Publication Date Title
Capriolo et al. Programming Hive: Data warehouse and query language for Hadoop
CN111462327B (en) Unstructured data analysis method for three-dimensional inspection model of three-dimensional modeling software
Miller et al. An overview of the CellML API and its implementation
US9959098B1 (en) Data processing systems and methods
TWI411928B (en) Method and system for exposing workbook ranges as data sources
CN101976235B (en) Extensible Word report automatically-generating method based on dynamic web page
AU776139B2 (en) Spreadsheet cell-data source binding
US20120011134A1 (en) Systems and methods for database query translation
Chillón et al. A model-driven approach to generate schemas for object-document mappers
US20100251227A1 (en) Binary resource format and compiler
CN112667563A (en) Document management and operation method and system
CN102004787A (en) Method for combining multiple application scene forms based on office software plugins
Yevsieiev et al. Development of databases interconnection “essences” information model for cyber-physical production systems additive cyber design creation automation
CN108766507A (en) A kind of clinical quality index calculating method based on CQL Yu standard information model openEHR
CN105550176A (en) Basic mapping method for relational database and XML
CN104881464A (en) Data file format defining method
Pikus et al. Semi-automatic ontology-driven development documentation: generating documents from RDF data and DITA templates
Radonić et al. ETLator-a scripting ETL framework
Poinot et al. Seven keys for practical understanding and use of CGNS
Gleim et al. Representing and maintaining large corpora
Jordan Mastering the SAS DS2 Procedure: Advanced Data-Wrangling Techniques, (Hardcover edition)
Bawankule et al. Design and implementation of massive MYSQL data intelligent export system to excel by using Apache–POI libraries
Rakha Simulation Dataset-Input files
Sandvad Hypertext in an object-oriented programming environment
Min et al. The research on the jena-based web page ontology extracting and processing

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20150902