CN104881464A - Data file format defining method - Google Patents
- Publication number: CN104881464A
- Application number: CN201510271136.0A
- Authority: CN (China)
- Prior art keywords: file, data, data file, text, definition
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2291—User-Defined Types; Storage management thereof
- G06F16/221—Column-oriented storage; Management thereof
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
Abstract
The invention belongs to the technical field of computers and provides a method for defining a data file format. The format of a data file is defined by a configuration file containing a relational database table-creation (CREATE TABLE) statement. The configuration file specifies the number of columns in the data file and, for each column, its name, data type, data length, whether it may be empty, its valid value range, and its position in the column order. This solves problems such as the uncertainty of text file content and the reliance on hard coding.
Description
Technical field
The invention belongs to the field of computer technology and relates to a method for defining a data file format.
Background art
The content of a text file can be viewed and modified with the text editors that operating systems provide by default. Linux and Unix systems additionally provide utilities such as cat, more, head, tail, wc, grep, diff, sort, uniq, sed, and awk, which can selectively display, filter, and modify text files. Storing data in text files therefore eases the development and debugging of processing programs as well as system maintenance. In addition, a text file stores pure data, without tags or descriptive information, so it occupies less space and is transmitted and processed more efficiently. Text files are therefore widely used under Linux and Unix: frequently for cross-system data migration, usually as the format in which a system exports data, and also as a carrier for data transmission and a data source for a system.
While text files are used widely and frequently, they also have drawbacks: data can be stored with a great deal of freedom, the data is not self-describing, and the content and format of the stored data are not explicitly defined. In other words, there is no metadata definition. Because text files lack a metadata definition, programs usually handle them by hard coding. This makes the data source hard to modify as needed, makes the text files and programs hard to understand and maintain once the application developers change, and makes developing new features harder still.
Text files are commonly used to import and export system data, and systems mostly use relational databases for long-term data storage and to support business processing. A text file is therefore also required to have the same structure as the corresponding table it is imported into or exported from. Without a metadata definition, it is difficult to guarantee consistency between the text file structure and the table structure, which hinders data import and export work and routine maintenance while the system is running.
The problems described above, all caused by the lack of metadata describing text files, bring considerable uncertainty and difficulty to application development, system maintenance, and data migration between systems.
Plain text files, compressed text files, encrypted text files, and text files stored after other transformations all face this format-definition problem. This patent refers to these kinds of files collectively as data files, and the format definition of all such data files falls within the scope of application of this patent.
Summary of the invention
To address the above problems in the use of data files, the present invention provides a method for defining a data file format. A configuration file containing the data file's definition information uniquely determines the format of the data stored in the data file, including each column's name, data type, length, whether it may be empty, and its valid value range; it also determines the number and order of the columns. Together, the data file and its configuration file carry both the data itself and a description of the data, which resolves the uncertainty about the format and business meaning of the stored data that arises when only the data file is available.
The content of the configuration file borrows the table-creation (CREATE TABLE) statement of relational databases to define the data file format, and the configuration file is itself a text file.
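As a rough illustration of the idea, the following Python sketch holds a small, hypothetical configuration file as a string and lists its column definitions in file order, which is also the column order of the data file. The table name, the column names, the `--` comment syntax, and the helper `column_lines` are assumptions made for this example; they are not taken from the patent.

```python
# Hypothetical configuration file content for a data file named "orders".
# The table name, column names and "--" comment style are invented for
# illustration only.
EXAMPLE_CONF = """\
-- orders.conf: describes the columns of the data file orders.txt
CREATE TABLE orders (
    order_id   integer       NOT NULL,
    customer   varchar(40)   NOT NULL,
    amount     decimal(10,2),
    created_on date
);
"""


def column_lines(conf_text):
    """Return the raw column definition lines, in the order they are written."""
    lines = [line.strip() for line in conf_text.splitlines()]
    # The statement starts at the line beginning with CREATE TABLE
    # (matched case-insensitively) and ends at the first line starting with ")".
    start = next(i for i, text in enumerate(lines)
                 if text.upper().startswith("CREATE TABLE"))
    end = next(i for i, text in enumerate(lines)
               if i > start and text.startswith(")"))
    return [text.rstrip(",") for text in lines[start + 1:end] if text]


# The position of a definition in the statement is the position of the
# column in the data file.
for position, definition in enumerate(column_lines(EXAMPLE_CONF), start=1):
    print(position, definition)
```

This sketch assumes one column definition per line; a fuller parser would not depend on line layout.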
Detailed description
To make the objectives, technical solutions, and advantages of the present invention clearer, an embodiment of the data file format defining method is described in detail below with reference to an example, so that the way the invention applies technical means to solve the technical problem and achieve the technical effect can be fully understood and implemented accordingly.
Embodiment
This embodiment provides a data file format defining method and uses it to define the format of a data file whose root name is yuser and which has 32 columns. The method comprises:
Step 1: Thoroughly understand the data file to be defined.
This step is to gain a thorough understanding of the data file to be defined, including, for example: What is the data file used for? How many columns does it have? What is the purpose of each column? What type of data does each column store? What is its length? May it be empty? Can its value range be determined?
If a corresponding table exists in a relational database, its table structure definition can help in understanding the data file.
Step 2: Write the configuration file for the data file.
This step creates a configuration file for the target data file whose format needs to be explicitly defined, and sets the content of the configuration file correctly. The only requirement on the configuration file name is that it can be mapped to the data file name; the naming can follow convention and implementation convenience. Following Linux and Unix custom, the configuration file may simply take the root name of the data file as its own root name, with the extension .conf.
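The naming convention can be sketched in a few lines of Python. The function name `config_name_for` and the example paths are hypothetical, and the sketch assumes that the root name is everything before the first "." in the file name, so that a compressed file such as `yuser.txt.gz` still maps to `yuser.conf`.

```python
import os


def config_name_for(data_file_path):
    """Map a data file path to its configuration file name (root name + .conf)."""
    file_name = os.path.basename(data_file_path)
    # Assumption: the root name is the part before the FIRST "." so that
    # stacked extensions (.txt.gz, .txt.enc) are ignored.
    root_name = file_name.split(".", 1)[0]
    return root_name + ".conf"


print(config_name_for("/data/export/yuser.txt"))
print(config_name_for("yuser.txt.gz"))
```

Both calls yield the same configuration file name, so one configuration file can describe a data file regardless of how it is stored.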
Under Linux, run vi yuser.conf and enter the content shown in Table 1.
Table 1: Example configuration file content
Column data types are consistent with those defined in relational databases. The following may be used: char, varchar, longvar, short, tinyint, smallint, int, integer, bigint, long, num, numeric, real, dec, decimal, float, double, date, time, datetime, timestamp.
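A parser needs to recognise these type names and any length that follows them. The Python sketch below is one possible way to do that; the names `SUPPORTED_TYPES`, `TYPE_PATTERN`, and `parse_type` are hypothetical, and the handling of a second number as a scale (as in `decimal(10,2)`) follows common relational database usage rather than anything stated in the patent.

```python
import re

# The type names listed in the description.
SUPPORTED_TYPES = {
    "char", "varchar", "longvar", "short", "tinyint", "smallint", "int",
    "integer", "bigint", "long", "num", "numeric", "real", "dec", "decimal",
    "float", "double", "date", "time", "datetime", "timestamp",
}

# A type name, optionally followed by "(length)" or "(length, scale)".
TYPE_PATTERN = re.compile(
    r"^\s*([A-Za-z]+)\s*(?:\(\s*(\d+)\s*(?:,\s*(\d+)\s*)?\))?\s*$"
)


def parse_type(type_text):
    """Split e.g. 'VARCHAR(20)' into (name, length, scale); raise if unknown."""
    match = TYPE_PATTERN.match(type_text)
    if match is None:
        raise ValueError(f"cannot parse type: {type_text!r}")
    name = match.group(1).lower()  # type names are treated case-insensitively
    if name not in SUPPORTED_TYPES:
        raise ValueError(f"unsupported type: {name}")
    length = int(match.group(2)) if match.group(2) else None
    scale = int(match.group(3)) if match.group(3) else None
    return name, length, scale


print(parse_type("VARCHAR(20)"))
print(parse_type("decimal(10, 2)"))
print(parse_type("date"))
```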
The configuration file content in this step can also be generated with database modeling tools such as ERWin or PowerDesigner, either through the tool's graphical, interactive operations or by having the tool connect to a relational database and read the table structure. Using a modeling tool is not mandatory; it is mentioned here as a reminder that existing table structure definitions can be reused, which reduces the work of producing the configuration file content and improves efficiency.
Step 3: Parse the configuration file to obtain the definition of the data file.
The content stored in the configuration file is essentially a standard table-creation statement, and its rules are easy to see from this example. A program can be written to parse a configuration file with the above content and obtain the definition of the data file.
Reading files and processing strings are basic functions that every programming language provides and basic skills of any programmer. For ease of implementation, only the rough procedure of a parser written in C is outlined here for reference:
● Open the configuration file.
● First pass over the configuration file: count the data files and allocate storage for the data file metadata.
● Second pass over the configuration file: count the columns in each data file and allocate storage for the column metadata.
● Third pass over the configuration file: parse and record the metadata of every column, including column name, data type, length, whether it may be empty, and valid value range.
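The passes above can be sketched in Python, where no explicit storage allocation is needed, so the counting passes reduce to building lists. This is a minimal sketch, not the C implementation the description refers to. It assumes that comments use the `--` line syntax, that every statement ends with `;`, and that each column definition has the form `name type[(length)] [NOT NULL]`. The valid value range is left out because the description does not say how it is written. All function and variable names are hypothetical.

```python
import re

# One CREATE TABLE statement: table name, then the parenthesised body up to ");".
STATEMENT = re.compile(r"CREATE\s+TABLE\s+(\w+)\s*\((.*?)\)\s*;",
                       re.IGNORECASE | re.DOTALL)
# One column definition: name, type, optional (length...), remaining text.
COLUMN = re.compile(r"^(\w+)\s+([A-Za-z]+)\s*(?:\(\s*(\d+)[^)]*\))?(.*)$",
                    re.DOTALL)


def strip_comments(text):
    """Drop everything after "--" on each line (assumed comment syntax)."""
    return "\n".join(line.split("--", 1)[0] for line in text.splitlines())


def split_top_level(body):
    """Split on commas that are not nested inside parentheses."""
    parts, depth, current = [], 0, []
    for ch in body:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    tail = "".join(current).strip()
    if tail:
        parts.append(tail)
    return parts


def parse_config(text):
    """Return one dict per data file, each with its ordered column metadata."""
    # Pass 1: find (and thereby count) the data files, one per statement.
    statements = STATEMENT.findall(strip_comments(text))
    data_files = []
    for table_name, body in statements:
        # Pass 2: find (and thereby count) the columns of this data file.
        definitions = split_top_level(body)
        # Pass 3: record the metadata of every column.
        columns = []
        for definition in definitions:
            match = COLUMN.match(definition)
            if match is None:
                raise ValueError(f"cannot parse column: {definition!r}")
            name, type_name, length, rest = match.groups()
            columns.append({
                "name": name,
                "type": type_name.lower(),
                "length": int(length) if length else None,
                "nullable": "NOT NULL" not in " ".join(rest.upper().split()),
            })
        data_files.append({"name": table_name, "columns": columns})
    return data_files
```

Calling `parse_config` on the text of a configuration file returns the column definitions in the order they were written, which is the order of the columns in the data file.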
Step 4: Process the data file according to its definition.
Process the data file according to the data file format obtained by parsing in Step 3.
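One kind of processing is checking each line of the data file against the parsed definition. The Python sketch below is an illustration under several assumptions that the patent does not specify: fields are separated by `|`, an empty field means "no value", and dates are written as YYYY-MM-DD. The column list `COLUMNS` and the function `check_line` are hypothetical.

```python
import re
from datetime import datetime

# Hypothetical column definitions, in the shape a configuration parser
# might produce; they are not taken from the patent.
COLUMNS = [
    {"name": "id", "type": "integer", "length": None, "nullable": False},
    {"name": "name", "type": "varchar", "length": 20, "nullable": True},
    {"name": "joined", "type": "date", "length": None, "nullable": True},
]

INTEGER_TYPES = {"short", "tinyint", "smallint", "int", "integer", "bigint", "long"}


def check_line(line, columns, delimiter="|"):
    """Return a list of problems found in one data line (empty list: valid)."""
    fields = line.rstrip("\n").split(delimiter)
    if len(fields) != len(columns):
        return [f"expected {len(columns)} columns, found {len(fields)}"]
    problems = []
    for value, column in zip(fields, columns):
        name, type_name = column["name"], column["type"]
        if value == "":
            # Assumption: an empty field represents a missing value.
            if not column["nullable"]:
                problems.append(f"{name}: empty but NOT NULL")
            continue
        if type_name in INTEGER_TYPES and not re.fullmatch(r"[+-]?\d+", value):
            problems.append(f"{name}: not an integer")
        if (type_name in {"char", "varchar"} and column["length"] is not None
                and len(value) > column["length"]):
            problems.append(f"{name}: longer than {column['length']}")
        if type_name == "date":
            try:
                datetime.strptime(value, "%Y-%m-%d")  # assumed date layout
            except ValueError:
                problems.append(f"{name}: not a valid date")
    return problems


print(check_line("1|alice|2015-05-26", COLUMNS))
print(check_line("|bob|", COLUMNS))
```

Because the checks are driven entirely by the column list, changing the configuration file changes the validation without touching the program, which is the alternative to hard coding that the description argues for.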
Claims (4)
1. A data file format defining method, characterized by comprising:
defining the format of a data file by means of a configuration file whose content comprises a table-creation statement.
2. The method of claim 1, wherein the table-creation statement is a standard DDL statement of a relational database, namely a section of text beginning with the characters "CREATE TABLE", in which all characters are case-insensitive; the single word immediately following the "CREATE TABLE" keyword is the root name of the data file; the first field definition in the table-creation statement defines the first column of the data file, and so on until all columns of the data file are defined. The method of claim 1, wherein the data file is a text file, a compressed text file, an encrypted text file, or a storage file generated from a text file through other transformation; whether a file is a data file is determined by the fact that it stores text data, regardless of its root name or file extension.
3. The method of claim 2, wherein the root name of the data file is the part of the file name before the character "."; the definition of a data file column comprises information such as column name, data type, length, whether it may be empty, and valid value range.
4. The method of claim 1, wherein the configuration file must contain at least one table-creation statement and may contain several table-creation statements in order to define the formats of several data files at once; besides table-creation statements, the configuration file may also contain comments and other required content.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510271136.0A CN104881464A (en) | 2015-05-26 | 2015-05-26 | Data file format defining method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510271136.0A CN104881464A (en) | 2015-05-26 | 2015-05-26 | Data file format defining method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104881464A (en) | 2015-09-02 |
Family
ID=53948957
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510271136.0A Pending CN104881464A (en) | 2015-05-26 | 2015-05-26 | Data file format defining method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104881464A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105808772A (en) * | 2016-03-28 | 2016-07-27 | 中国建设银行股份有限公司 | Data defining file generation method and device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101211359A (en) * | 2006-12-31 | 2008-07-02 | 凌群电脑股份有限公司 | Database inputting data inspection method and database management system |
US20100017395A1 (en) * | 2008-07-16 | 2010-01-21 | Sapphire Information Systems Ltd. | Apparatus and methods for transforming relational queries into multi-dimensional queries |
CN101917433A (en) * | 2010-08-17 | 2010-12-15 | 上海交通大学 | Network remote registration localization reappearing protection system |
CN102479184A (en) * | 2010-11-23 | 2012-05-30 | 大连兆阳软件科技有限公司 | Database management system and input data checking method |
Non-Patent Citations (1)
Title |
---|
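丁涛 (Ding Tao): "基于领域驱动设计的物流平台系统实现" (Implementation of a logistics platform system based on domain-driven design), 《中国优秀硕士学位论文全文数据库 信息科技辑》 (China Master's Theses Full-text Database, Information Science and Technology) * |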
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150902 |