SYSTEMS, PROCESSES, AND PRODUCTS FOR STORAGE AND
RETRIEVAL OF PHYSICAL PAPER DOCUMENTS,
ELECTRO-OPTICALLY GENERATED DOCUMENTS, AND
COMPUTER GENERATED ELECTRONIC DOCUMENTS BACKGROUND OF THE INVENTION
Field of the Invention
The present invention relates to systems, processes and products for the storage and retrieval of documentary information, including: physical paper documents; corresponding electro-optically generated electronic documents, e.g. digital images produced by scanning or photography; and computer generated electronic documents, e.g. digital text produced by word processing and/or digital graphics produced by computer aided design. The Prior Art
Numerous systems and processes have been proposed for the storage and retrieval of documentary information. Traditional practices over the centuries, of course, have involved storage and retrieval by manually accessing indexed arrangements of original papers and "hard copies" in folders, boxes, shelving and cabinets. Later practices have involved photographically reducing the original papers to produce indexed miniaturizations in microfilm spools or microfiche sheets, storing the spools or sheets in indexed containers, folders or other physical repositories, and retrieving images or hard copies of the miniaturizations by optical projection or photographic reproduction.
Now there are a proliferation of proposals for digital computer systems that opto-electronically scan original papers: to create electronic representations in computer memory; to store digital records of these representations in magnetic and/or optical media; and to retrieve images or hard copies corresponding to these records electro-magnetically or electro-optically. It is common experience that effective
paper filing systems have required unerring care by trustworthy persons who acquire some understanding of the business or other activity involved. Considerable reliance has been placed on the memory of such persons, who have a tendency to become "indispensable" in mission critical situations. Moreover, even a generally effective system often is not conducive to physical and logical arrangements capable of implementing prompt storage and retrieval without constant inquiries to and guidance by professional level personnel. Finally, of course, a paper file is available to only one person at a time unless hard copies are made, in which case undesired paper proliferation occurs.
Although a few, but not all, of these problems are alleviated by the use of microfilm and microfiche, new and different problems arise. It is true that a large number of photographic miniaturizations can be stored in a much smaller space than an equivalent number of original papers. However, problems of storing and retrieving containers and folders of microfilm and microfiche are not unlike problems of storing and retrieving batches of their paper counterparts. More important, producing, imaging and retrieving photographic miniaturizations often are costly procedures requiring unwieldy hardware. As a practical matter, microfilm and microfiche may be limited to archival-type documentation, which may not be compatible with a dynamic work environment. A critical advantage of electronic files over paper and microfilm/microfiche files is that electronic files can be readily generated, sorted, retrieved and reorganized. Also, electronic media are a readily available source of paper (hard) copies whenever desired.
Recent advances in low cost computer architecture, particularly, faster clock speeds, higher resolution displays, and denser storage media, have generated
discussions of whether or not a "paperless office" is possible, and, indeed, what is the meaning of "paperless office." It is apparent that electronic files are capable of obviating tedious manipulation of original paper files when there may be no further need ever to refer to them.
Nevertheless, despite the rapid growth of electronic filing, paper documents continue to proliferate. One possible reason for this phenomenon is a natural reluctance on the part of many to discard original paper files even after they have been imaged and stored in electronic media. For many, it simply will be difficult to overcome a habitual predilection to keep paper documents indefinitely, and to accept the idea that electronic media, in many cases, are a secure and relatively versatile substitute. Furthermore, in many cases, reference to original papers or other original physical documents, even if infrequent, may be essential. For example, original physical documents may be required or desired: (1) in litigation where specific rules of evidence pertain; (2) as backup in the event that lost or faulty images are discovered later; (3) in major commercial, financial and insurance dealings where physical signatures may be important or merely may be perceived to be important; (4) in Internet transactions where the parties do not meet legal requirements for dispensing with paper records or do not agree to do so; (5) as backup for technical graphics, precision photographs, medical diagnostics, and the like, where differences in resolution between images and originals may become significant; (6) as backup in the event of corruption of electronic files; (7) as complete copies of voluminous treatises in which only the table of contents and/or other selected portions have been imaged for convenience; and (8) in large organizations, where careful planning requires the periodic destruction of selected
corresponding paper and electronic records that may bear dates prior to some legally mandated or permitted threshold, or where unnecessary paper proliferation may become a source of confusion. Modern society demands knowledge work that is appropriate for the information age. The requirements are that: tedious, relatively low paying manual work, like filing paper documents, must be phased out; and more interesting, relatively high paying knowledge work, like processing electronic documents, must be phased in. The fact is that total elimination of paper documents may be unachievable or undesirable. Rather, the present invention is based on the insight that there is an essential interaction between minimizing the inherent inefficiency of paper documents, and maximizing the inherent efficiency of electronic documents.
Much confusion has been encountered in the implementation of systems that are based on: original paper documents themselves; imaged electronic documents of the type that are created by scanning or photographing paper documents and the like; and computer generated electronic documents of the type that are created by word processors and the like. This confusion compounds a variety of conflicts, among which are: whether to store bit maps based on simple scanning, or text resulting from optical character recognition, or both; whether or not to mix optically generated electronic documents and computer generated electronic documents; whether or not the storage of optically generated electronic image files should mirror the storage of the original paper documents; whether or not an original paper filing system should be retained without change; whether or not a system is so sophisticated that computer literate professional level operators and/or supervisors are required; and whether or not
a system is so rudimentary that even entry level operators may be sufficiently trustworthy.
The foregoing problems are greatly magnified in networked systems, where difficulties may be encountered particularly in standardizing, at disparate locations, the storage and retrieval of original physical paper documents, and, sometimes, hard paper copies of electro-optically generated and computer generated documents . There is a requirement for systems: in which the physical filing of paper documents is radically simplified; in which access to infrequently needed paper documents is precisely indicated; and in which optically generated electronic documents and computer generated electronic documents are clearly differentiated.
DEFINITIONS In order to maintain clarity, the present application adopts the following definitions :
(I) physical documents (sometimes abbreviated to physical files) are defined herein as actual paper documents or actual photographs or the like, of the type that are directly visually perceptible.
(II) electronic image documents (sometimes abbreviated to image documents or image files) are defined herein as electronic files of the type that are generated electro- optically by scanning or photographing physical documents such as actual paper documents, photographs, and the like. In one form, these electronic image documents are bitmap documents .
(III) electronic work documents (sometimes abbreviated to work documents or work files) are defined herein as electronic files of the type that are generated in a computer by a word processor program, a spreadsheet, a computer aided design program, or the like. In one form, these electronic
work documents are PostScript documents . PostScript is a page description language originated by Adobe Systems, Inc.
(IV) electronic documents (sometimes referred to as electronic files) include electronic image documents and electronic work documents .
SUMMARY OF THE INVENTION The primary object of the present invention is to provide, for the storage and retrieval of original physical documents (printed paper and photographic presentations) , electronic image documents (electro-optically generated electronic documents) , and electronic work documents (computer generated electronic documents): systems, processes and products, which facilitate the use of electronic documents rather than physical documents whenever feasible or acceptable, but which also facilitate the use of physical documents rather than electronic documents whenever necessary or preferred. The result is less organizational confusion and emotional frustration, and more creative use of time and resources. The systems, processes and products of the present invention are particularly advantageous in connection with simultaneously archiving and later simultaneously disposing of corresponding physical and electronic documents. More specifically, the present invention contemplates the following features. (1) Electronic image documents and electronic work documents , which may be completely unrelated in form or content, are acquired or produced to provide one or more real time sequences of electronic documents in one or more real time sequences of date/time instances. (2) Acquired or produced physical documents, for ultimate convenience, simply are added to the front (or the back) of one or more cumulative physical stacks. (3) The resulting electronic records, with assured precision, uniquely identify the
physical locations of the physical documents and the electronic locations of the electronic documents, both in terms of their associated date/time instances. (4) In effect, these date/time instances guarantee that virtual sets of related physical documents may be conveniently accessed and physically retrieved, and that corresponding sets of related electronic documents may be immediately displayed and suitably presented. (5) Also, pursuant to any particular archive policy, these date/time instances can guarantee the joint disposal of corresponding physical and electronic documents that have been received and/or created prior to any selected date .
BRIEF DESCRIPTION OF THE DRAWINGS For a fuller understanding of the nature and objects of the present invention, reference is made to the following specification, which is to be taken in connection with the accompanying drawings wherein:
Fig. 1 illustrates a system embodying the present invention; Fig. la is a flow diagram of the system of Fig. 1.
Fig. 2 illustrates a graphical user interface displaying a form with date/time indexing features and a window that presents thumbnail displays of images and views, the entries of which fall within a specific date/time interval; Fig. 3 illustrates a graphical user interface displaying a form with date/time indexing features and an imaging window that displays a blow-up view corresponding to one of the thumbnail views of Fig. 2, the entries of which fall within a specific date/time interval; Fig. 4 illustrates labels that are affixed to a first stream of primary and secondary repositories in accordance with the present invention;
Fig. 5 illustrates a file folder of the first stream having visual indicia specifying a date/time interval that bounds a sequence of physical paper documents of the type corresponding to the thumbnail images of Fig. 2; Fig. 6 illustrates a box of the first stream having visual indicia specifying a date/time interval that bounds a sequence of file folders of the type corresponding to the file folder of Fig. 5;
Fig. 7 illustrates labels that are affixed to a second stream of primary and secondary repositories in accordance with the present invention;
Fig. 8 illustrates a file folder of the second stream having visual indicia specifying a date/time interval that bounds a sequence of physical paper documents of the type corresponding to the thumbnail images of Fig. 2;
Fig. 9 illustrates a box of the second stream having visual indicia specifying a date/time interval that bounds a sequence of file folders of the type corresponding to the file folder of Fig. 8; Fig. 10 illustrates a plurality of stacks of boxes that are organized pursuant to one application of the system of the present invention;
Fig. 11 illustrates a table of a relational database that supports the system, process and product of Fig. 1; Fig. 12 illustrates another table of the relational database referred to in Fig. 11;
Fig. 13 illustrates a further table of the relational database referred to in Figs . 11 and 12 ;
Fig. 14 represents an electronic diagram that indicates relationships among the tables of Figs. 11, 12 and 13;
Fig. 15 illustrates an electronic query based upon the electronic diagram of Fig. 14; and
Fig. 16 illustrates another electronic query based upon the electronic diagram of Fig. 14.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT Introduction As shown in -Fig. 1 and la, the illustrated embodiment of the present invention comprises a master system 25 that includes a physical system 30 and an electronic system 32. The physical and electronic systems share a scanner 34 and a printer 36. Scanner 34 processes random sequences of paper documents 38 to produce sequences of electronic image documents 27 in sequences of date/time instances. In the present embodiment, scanner 34 processes two document streams 29 and 31 to produce two streams of electronic image documents 33 and 35. Physical documents 29 and electronic image documents 33 of the first stream are archived permanently. Physical documents 31 and electronic image documents 35 of the second stream are archived for limited date/time periods. The Physical System In the first document stream 29, date/time sequences of physical folders 42 store date/time sequences of paper documents 38 from the scanner 34, and date/time sequences of boxes 44 in turn store date/time sequences of folders 42. In the second document stream 31, date/time sequences of physical folders 46 store date/time sequences of paper documents 38 from the scanner 34, and date/time sequences of boxes 48 in turn store date/time sequences of physical folders 46. The destination of folders 42 and boxes 44 of the first stream 29 is location 37, designated Location A. The destination of folders 46 and boxes 48 of the second stream 31 is location 39, designated Location B. The physical folders and boxes typically are constructed from
inexpensive materials, for example, cardboard, plastic or the like.
As is shown in Fig. 4, the physical visual indicia are in the form of pressure sensitive labels 52 and 54, which are removably mounted on the plastic coated faces of carrier sheets 56 and 58, respectively. As shown in Fig. 5, when label 52 is peeled from carrier sheet 56 and applied to folder 42, printed indicia 60 and 62 indicate the start and finish of the range of date/time instances of the paper documents therewithin, and additionally their physical destination, Location A. As shown in Fig. 6, when label 54 is peeled from carrier sheet 58 and applied to box 44, printed indicia 64 and 66 indicate the start and finish of the range of date/time instances of the folders therewithin, and additionally their physical destination, Location B.
As is shown in Fig. 7, the physical visual indicia are in the form of pressure sensitive labels 70 and 72, which are removably mounted on the plastic coated faces of carrier sheets 74 and 76, respectively. As shown in Fig. 8, when label 70 is peeled from carrier sheet 74 and applied to folder 46, printed indicia 78 and 80 indicate the beginning and ending of the range of date/time instances of the paper documents therewithin, and additionally the physical destination, location B 37, of the relevant document stream. As shown in Fig. 9, when label 72 is peeled from carrier sheet 76 and applied to box 48, printed indicia 82 and 84 indicate the beginning and ending of the range of date/time instances of the folders therewithin, and additionally the physical destination, location B 37, of the relevant document stream.
As shown in Fig. 10, distinguishably marked boxes 44 and 48 are stored on shelves 50 and 51 in physical locations A and B, respectively. Location A receives those physical
documents that are to be archived permanently, and location B receives those documents that are to be archived for limited periods of time.
In one embodiment of the present invention, the different documents of the sequential batches are separated by slip sheets, which are imprinted with applicable date/time instances by printer 36. The integrity of any designated stack is maintained by monitoring the physical storage and retrieval of any processed document to and from that stack. The Electronic System
As shown in Fig. 1, electronic system 32 includes a workstation 86 and an electronic storage 88. Workstation 86 is operatively connected to scanner 34 for transmission to the scanner of control signals that synchronize the production of date/time instances, and that receive, from the scanner, data signals representing electronic image documents and their date/time identifiers. Workstation 86 is operatively connected to electronic storage 88 for transmission to and retrieval from electronic storage 84 of control and data signals that represent electronic documents, and their date/time identifiers.
As shown in Fig. 2, electronic image documents 33 and 35 are assembled by electronic system 32 as a date time sequence of electronic image documents 41. Also, under the control of any of a variety of word processor, fax, computer aided design (CAD) , and e-mail programs, computer 86 produces a date/time sequence of electronic work documents 43. Electronic image documents 42 and electronic work documents 43 are assembled by electronic system 32 as a date/time sequence of electronic documents 47. Electronic documents 45 are assembled by electronic system 32 in a relational database 47, to be described in detail below. Electronic work documents 43 include electronic image documents in any of
various bitmap, vector, text or other formats, such as the PDF (Portable Document Format) format promulgated by Adobe Systems, Inc., or the XML (Extensible Markup Language) format promulgated by Microsoft Corporation, among others. As shown in Figs. 2 and 3, workstation 86 can present the electronic documents 45, generated by scanner 34 and computer 86, as thumbnail views 90 or blow-up views 92. Workstation 86 is operatively connected to printer 36 for transmission to and receipt from the printer of control signals, and for the production of visual materials. These visual materials include labels 52, 54, 70 and 72, as well as read-outs of logical sets of records that are retrieved from relational database 47.
Relational database 47 includes tables, which are shown in Figs. 11, 12 and 13. The fields of these tables are described in the following schedules.
SCHEDULE 1 (See Table 1, Fig. 11)
Field Name Data Type Field Type - Description
Entity_Code Text Primary - unique identifier of associated organization or individual
Entity_Name Text Secondary - name of organization or individual
Entity_Address Text Secondary - address of organization or individual
Entity_Info Text Secondary - relevant notes
Entity_Tel_No Text Secondary - telephone number of organization or individual
Entity_Fax__No Text Secondary - fax number of organization or individual
Entity_Emai1 Text Secondary - uniform resource locator of organization or individual
Reference_Code Text Secondary - entity code of contact or other associated individual or organization
Reference_Kind Text Secondary - nature of association
Reference_Info Text Secondary - relevant notes
SCHEDULE 2 (See Table 2, Fig. 12)
Field Name Data Type Field Type - Description
Pro ect_No Text Primary - unique identifier of project
Project_Kind Text Secondary - select from categories [1], [2], ..., [n]
Proj ect_Name Text Secondary -formal project name
Security_Level Text Secondary - select from categories [1], [2], ..., [n]
Archive_Term Date/Time Secondary - document stream selection - term finish date
Task_Entry_Date Date/Time Secondary - review date
Task_Due_Date Date/Time Secondary - due date
Task_Done_Date Date/Time Secondary - done date
Review_Due_Date Date/Time Secondary - due date
Review_Done_Date Date/Time Secondary - done date
SCHEDULE 3 (See Table 3, Fig. 13;
As will be seen in Fig. 14, the following relational links exist among Tables 1, 2 and 3 of Figs. 11, 12 and 13, above. The Entity_Code MANY (secondary) field of Table 1 is linked to the Entity_Code ONE (primary) field of Table 2. The Project_No MANY (secondary) field of Table 1 is linked to the Pro ect_No ONE (primary) field of Table 3.
Preferably, a date/time instance value in the Entry_Date/Time field of Table 1 is automatically generated by the system during scanning or computer generation. In one form, this value is in terms of year, month, day, hour, minute, second, and fractions of second, i.e., yy, mm, dd, hh, nn, ss, ff . In another form, this value is in terms of year, month, day, hour, minute and sequential integers, i.e., yy, mm, dd, hh, nn, ii. The timing is arranged so that each electronic document (image or work) is uniquely identified at the moment of its acquisition or creation by entry of a date/time instance or a date/time instance that includes an integer.
OPERATION The system, processes and products of the present invention operate as follows. Electronic image documents 41 and electronic work documents 43 are acquired or produced to provide one or more real time sequences of electronic documents 45 in one or more real time sequences of date/time instances 52, 54, 70, 72. Acquired or produced physical documents simply are added to the front (or the back) of one or more cumulative physical stacks 29, 31. The resulting electronic records, as shown in Tables 1, 2 and 3 (Figs. 11, 12 and 13), uniquely identify the physical locations of the physical documents 38 and the electronic locations of the electronic documents 45, both in terms of their associated date/time instances. These date/time instances guarantee that virtual sets of related physical documents 38 may be
readily located and conveniently retrieved, and that corresponding sets of related electronic documents 45 may be immediately displayed and suitably reported. Pursuant to any particular archive policy, these date/time instances guarantee the joint disposal of corresponding physical and electronic documents, which are scheduled for disposal on any selected archival termination date.