US20090164881A1 - Scan-to-Redact Searchable Documents - Google Patents
Scan-to-Redact Searchable Documents Download PDFInfo
- Publication number
- US20090164881A1 US20090164881A1 US12/182,334 US18233408A US2009164881A1 US 20090164881 A1 US20090164881 A1 US 20090164881A1 US 18233408 A US18233408 A US 18233408A US 2009164881 A1 US2009164881 A1 US 2009164881A1
- Authority
- US
- United States
- Prior art keywords
- redaction
- document
- text
- redacted
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/12—Detection or correction of errors, e.g. by rescanning the pattern
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/1444—Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2221/00—Indexing scheme relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F2221/21—Indexing scheme relating to G06F21/00 and subgroups addressing additional information or applications relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F2221/2143—Clearing memory, e.g. to prevent the data from being stolen
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present invention relates to image processing, and more specifically, to scanning and processing of documents.
- Redaction is the process of blacking out or removing confidential information from documents so that they can be shared. Confidential information may include the name of a person or place of business, the address, telephone number or other identifying information including Social Security or customer numbers. Redaction is most commonly used in legal or governmental organizations, but is also widely used in other industries including finance, insurance, and manufacturing. The purpose of redaction is to keep sensitive information from being distributed with documents that may need to be shared with the public outside of a closed business, governmental or legal process.
- redaction starts by making a paper copy of the confidential document. The copy is then given to a proofreader whose job is to identify sensitive areas of information based on a list of keywords or subjects.
- the proofreader redacts the words directly on a copy of the document.
- the initial proofreader will highlight candidates for redaction which will then be reviewed by one or more informed or senior proofreaders who will make the final decision on redaction.
- notes or comments often need to be attached to the document to identify why areas should be redacted, who did the initial review of the redaction, and who approved the final redaction so follow-up questions or investigations into the redaction can be made.
- Redaction in a paper process uses a black marker applied to the paper document to completely hide or destroy the contents or text in the document. Due to the differences in the density of the marker compared to the original toner or ink on the document and the paper used, complete removal of the information may not be achieved in a satisfactory way. To ensure that the areas are completely removed and can not be identified upon closer examination the redacted document is usually photocopied again before distribution. This achieves a uniform black area over the information to be hidden so that it is impossible to detect the underlying text. Absolute destruction of the confidential information is critical in the redaction process. A paper redaction process requires skilled proofreaders and is quite inefficient compared to a digital process.
- a proofreader uses a search tool to identify words required for redaction. Upon finding the word in the document, the appropriate tool is used to highlight, comment or redact the words. To further speed the process, the search function may be combined with an automatic redaction or highlight for redaction feature. If the document is redacted directly, the final document is then saved as a final redacted copy ready for distribution. If words are highlighted or commented for redaction, the document is then forwarded to the appropriate person or persons, either through email or by a document management process, for final redaction. As in a paper process, the absolute removal of confidential information must be achieved in all areas of the document including hidden layers or properties of the document.
- OCR technology can be used to simply identify the areas on the scanned image document that need to be redacted and then apply those changes directly to the image. This is also not an optimum process because the text in the final TIFF image file still remains image only with no way to search the remaining text for important information. And the ability is lost to attach notes or codes to the redacted areas for future investigation or understanding of how and why those areas of redaction where applied.
- the PDF file format provides a suitable framework to address short comings of current processes while still providing an original image that is an accurate representation of the original document, a text layer to search, and annotation fields for notes and comments.
- Embodiments of the present invention are directed to an automatic scan-to-redacted electronic document process.
- a user input is received which identifies a scanned document.
- the scanned document is automatically processed to produce a corresponding redacted document which having searchable document text and a document image.
- the searchable document text includes coded redaction text satisfying defined redaction parameters.
- the document image includes redacted image areas corresponding to redacted elements.
- the redaction text may be visually highlighted for review.
- the redaction text may be removed from the redacted document, and placeholder characters substituted for the redaction text.
- An annotation field may be provided for recording post-redaction comments associated with selected redaction text.
- the processing may use pre-established redaction criteria to identify and produce the redaction text.
- the redaction criteria may include a set of keywords associated with redaction and/or use of redaction patterns representing patterns of text associated with redaction.
- the searchable document text may include bookmarks identifying segments of redaction text.
- the redacted image areas may be visually distinctive of redaction, such as by highlighting or by having a uniform non-text appearance.
- the user-input may be produced from a user selectable button on a computer network device or from a user selectable button on a computer display.
- the redacted document may specifically be a PDF format document or an editable word processor document such as a Microsoft Word document.
- FIG. 1 shows various steps in the functional flow of an automatic scan-to-redact process according to one embodiment of the present invention.
- Embodiments of the present invention are based on scanning and OCR technology, combined with some recognition intelligence, pre-defined keywords with redaction codes, and the PDF file format to produce an automated scan to redacted PDF file process that can be reviewed, annotated and corrected at anytime.
- the entire process called a “workflow” can be defined using the application and applied with a single press of a button either within the application itself or assigned to a physical or virtual button on a device.
- the end result is a “Searchable PDF” file.
- FIG. 1 shows various steps in the functional flow of an automatic scan-to-redact process according to one embodiment of the present invention.
- the source document is scanned, step 101 , including initial defining of the scanning process for some image enhancement or corrections.
- Scanning enhancements improve the OCR recognition capabilities and increase accuracy. Examples of such corrections include without limitation de-skew, despeckle, and auto-rotate. Settings for b/w, grayscale, color and resolution are also set in this step.
- an OCR process is applied to the scanned document, step 102 .
- Layout or formatting guidance and language detection is used at this point. For example, as the text is recognized the words are checked against internal dictionaries so that any words not found in the dictionary can be marked as suspect during the OCR text proofreading process. Professional dictionaries including legal, financial and medical are used for even greater accuracy.
- step 103 When text is checked against the user and professional dictionaries possible mistakes are highlighted during the next step of proofreading the OCR document, step 103 .
- the user checks the accuracy of recognized words in the context of the actual sentence in the document. Corrections may be applied using the proofreading tool. As the words are verified or corrected the user dictionaries “learn” and are updated accordingly to improve the correction process of additionally processed documents.
- the document redaction process can begin, step 104 .
- a user input is received that identifies a scanned document for redaction.
- the user input may be produced by a user selectable button—either a physical button on a computer network device such as a scanner, etc., or a virtual button produced by software on a computer display.
- the scanned document is automatically processed to produce a corresponding redacted document which includes both searchable document text and a document image.
- the searchable document text includes coded redaction text which satisfies various defined redaction parameters.
- the document image includes redacted image areas corresponded to the redacted elements.
- Redaction rules may also include use of pre-established redaction criteria to identify and produce the redaction text.
- redaction patterns may be used which correspond to patterns of text associated with redaction, including automatic markup of information that fits a specific pattern or “looks like” specific information.
- the pattern could be the format of a number such as a Social Security number, date, currency value or telephone number.
- a specific list of redaction keywords, redaction codes and notes are loaded from a comma or tab delimited text file or directly from an ODBC database file.
- redaction text may be made immediately and directly, or redaction text may be highlighted for further review in a PDF editing application such as PDF Converter Professional.
- the identified redaction text may usefully be visually distinctive of redaction, for example, having a uniform non-text appearance.
- highlighting redaction text candidates for redaction can be customized, for example, with highlight color or an alternative method for identification such as outline or strikethrough can be used.
- redacted areas can have a highlight color applied to a preset number of words or characters before and/or after the redacted areas for easy visual identification. Redacted or highlighted areas can be created with or as annotation fields in the final document.
- An additional post-redaction proofreading, step 105 may be employed similar to the post-OCR proofreading step. This checks that possible redaction candidates defined as a pattern or “looks like” are accurately removed or marked as text that must be redacted. Identified words may be displayed in the context of the actual sentence in the document. Tools may be provided to mark or unmark redaction text as required.
- An alternative to an interactive “redact proofreading” step would be to automatically highlight suspect words in a pre-defined color for review in a PDF editing tool.
- the redacted (or redaction highlighted) document can be saved in a propriety application format, step 106 , for later review, adjustments at a later date, and or archiving and storage.
- An embodiment of the application may also read marked PDF files that have been reviewed and marked with highlights in other applications so that the redaction can be automatically applied.
- redaction and PDF preferences are applied. Redaction preferences include redaction color, the default being “black”, the width and height tolerance for the redacted area on the image, and a tolerance to objects or text close to the redacted areas so they are not inadvertently corrupted.
- a PDF file includes various main components or layers, the most relevant being the image and text layers.
- the image layer comprises the original scanned document where the redacted areas are replaced with the redacted color and the pixel information is “destroyed”.
- the text layer is a hidden layer above or below the image that can be indexed and/or searched by commercial search products.
- the text layer precisely follows the formatting of the original document so that the text selectable in PDF viewing or editing applications in context of the original.
- the redaction text is removed from the redacted document, and placeholder characters are substituted to form equally spaced text “lines” that, although they can be copied and pasted into other application, do not represent the original text, thereby destroying that information.
- Metadata can be applied to the PDF file properties so it can be searched using commercially available search tools.
- PDF Bookmarks can be automatically created to identify redacted pages.
- Custom stamps, headers and footers can also be applied to the document to display relevant information such the creator name, creation date, and/or terms and conditions for use of the redacted document.
- Annotations as notes or redaction/exemption codes are created and applied to the final PDF document.
- Standard PDF security tools are also set and applied at this step. This includes password definition in order to view or print a document. 40-bit and 128-bit encryption can also be applied to a PDF file with the user able to control whether the document can be viewed, printed, edited, copied or annotated. The application will also include everything necessary to digitally sign documents for protection and authentication of information. No certificate from a third-party vendor is necessary.
- Redaction workflows combine all of the above described steps into a single workflow file that can be named, exported and imported into other copies of the application program so the process can be replicated.
- Workflows can be initiated as a single step within the application or assigned to physical or virtual buttons on a device for one-button process automation.
- the application can also batch process files stored on a network utilizing folder watching or automatically process incoming files from email applications. Scanned, recognized and redacted Searchable PDF files can be automatically saved to local hard drives, be saved to document management systems or emailed to specific recipients.
- the application can also save the files as editable Microsoft Word documents.
- Embodiments of the invention may be implemented in any conventional computer programming language.
- preferred embodiments may be implemented in a procedural programming language (e.g., “C”) or an object oriented programming language (e.g., “C++”, Python).
- Alternative embodiments of the invention may be implemented as pre-programmed hardware elements, other related components, or as a combination of hardware and software components.
- Embodiments can be implemented as a computer program product for use with a computer system.
- Such implementation may include a series of computer instructions fixed either on a tangible medium, such as a computer readable medium (e.g., a diskette, CD-ROM, ROM, or fixed disk) or transmittable to a computer system, via a modem or other interface device, such as a communications adapter connected to a network over a medium.
- the medium may be either a tangible medium (e.g., optical or analog communications lines) or a medium implemented with wireless techniques (e.g., microwave, infrared or other transmission techniques).
- the series of computer instructions embodies all or part of the functionality previously described herein with respect to the system.
- Such computer instructions can be written in a number of programming languages for use with many computer architectures or operating systems. Furthermore, such instructions may be stored in any memory device, such as semiconductor, magnetic, optical or other memory devices, and may be transmitted using any communications technology, such as optical, infrared, microwave, or other transmission technologies. It is expected that such a computer program product may be distributed as a removable medium with accompanying printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin board over the network (e.g., the Internet or World Wide Web). Of course, some embodiments of the invention may be implemented as a combination of both software (e.g., a computer program product) and hardware. Still other embodiments of the invention are implemented as entirely hardware, or entirely software (e.g., a computer program product).
Abstract
An automatic scan-to-redacted electronic document is described. A user input is received which identifies a scanned document. Then the scanned document is automatically processed to produce a corresponding redacted document which having searchable document text and a document image. The searchable document text includes coded redaction text satisfying defined redaction parameters. The document image includes redacted image areas corresponding to redacted elements.
Description
- This application claims priority from U.S. Provisional Patent Application 60/952,653, filed Jul. 30, 2007, which is incorporated herein by reference.
- The present invention relates to image processing, and more specifically, to scanning and processing of documents.
- Redaction is the process of blacking out or removing confidential information from documents so that they can be shared. Confidential information may include the name of a person or place of business, the address, telephone number or other identifying information including Social Security or customer numbers. Redaction is most commonly used in legal or governmental organizations, but is also widely used in other industries including finance, insurance, and manufacturing. The purpose of redaction is to keep sensitive information from being distributed with documents that may need to be shared with the public outside of a closed business, governmental or legal process.
- When documents exist as paper, redaction starts by making a paper copy of the confidential document. The copy is then given to a proofreader whose job is to identify sensitive areas of information based on a list of keywords or subjects. In a simplified process, the proofreader redacts the words directly on a copy of the document. In a more complex process the initial proofreader will highlight candidates for redaction which will then be reviewed by one or more informed or senior proofreaders who will make the final decision on redaction. Furthermore, notes or comments often need to be attached to the document to identify why areas should be redacted, who did the initial review of the redaction, and who approved the final redaction so follow-up questions or investigations into the redaction can be made. These notes or comments are often coded so that when the final document is redacted and the notes travel with the document to an unauthorized recipient they will not be able to discern why the areas were redacted or guess as to contents of the redacted information. Authorized participants in the redaction process will be able to use the comments or notes and trace the history and purpose of the redaction as may be required in an investigation involving the documents.
- Redaction in a paper process uses a black marker applied to the paper document to completely hide or destroy the contents or text in the document. Due to the differences in the density of the marker compared to the original toner or ink on the document and the paper used, complete removal of the information may not be achieved in a satisfactory way. To ensure that the areas are completely removed and can not be identified upon closer examination the redacted document is usually photocopied again before distribution. This achieves a uniform black area over the information to be hidden so that it is impossible to detect the underlying text. Absolute destruction of the confidential information is critical in the redaction process. A paper redaction process requires skilled proofreaders and is quite inefficient compared to a digital process.
- In a digital process, similar tools to highlight, comment, and redact document are provided in various software applications where the digital document was either created or can be edited. To speed the process, a proofreader uses a search tool to identify words required for redaction. Upon finding the word in the document, the appropriate tool is used to highlight, comment or redact the words. To further speed the process, the search function may be combined with an automatic redaction or highlight for redaction feature. If the document is redacted directly, the final document is then saved as a final redacted copy ready for distribution. If words are highlighted or commented for redaction, the document is then forwarded to the appropriate person or persons, either through email or by a document management process, for final redaction. As in a paper process, the absolute removal of confidential information must be achieved in all areas of the document including hidden layers or properties of the document.
- It is recognized that sharing of information is much easier, more productive, and less expensive when done digitally. Therefore, the need to scan and convert paper to a digital document is ever increasing as paper is migrated into digital processes. Redaction in this sense is usually achieved after the document is converted to a suitable digital format with reliance on the currently available digital tools to accomplish the redaction. The currently available paper to digital tools either produce a text based PDF file from a converted editable document, or a TIFF based image file from the original scanned image file. Each approach has benefits and short comings.
- When converting a document to an editable format for redaction from a scanned file, the integrity of the original document is most likely lost in the conversion process. While the text may be 100% accurate, the format, graphics and other elements of the document may be distorted, rearranged or lost altogether. Therefore, this method of converting and redacting a paper document is not optimum.
- To prevent compromising the original document, OCR technology can be used to simply identify the areas on the scanned image document that need to be redacted and then apply those changes directly to the image. This is also not an optimum process because the text in the final TIFF image file still remains image only with no way to search the remaining text for important information. And the ability is lost to attach notes or codes to the redacted areas for future investigation or understanding of how and why those areas of redaction where applied. The PDF file format provides a suitable framework to address short comings of current processes while still providing an original image that is an accurate representation of the original document, a text layer to search, and annotation fields for notes and comments.
- Embodiments of the present invention are directed to an automatic scan-to-redacted electronic document process. A user input is received which identifies a scanned document. Then the scanned document is automatically processed to produce a corresponding redacted document which having searchable document text and a document image. The searchable document text includes coded redaction text satisfying defined redaction parameters. The document image includes redacted image areas corresponding to redacted elements.
- In further specific embodiments, the redaction text may be visually highlighted for review. In addition or alternatively, the redaction text may be removed from the redacted document, and placeholder characters substituted for the redaction text. An annotation field may be provided for recording post-redaction comments associated with selected redaction text.
- In some embodiments, the processing may use pre-established redaction criteria to identify and produce the redaction text. For example, the redaction criteria may include a set of keywords associated with redaction and/or use of redaction patterns representing patterns of text associated with redaction. In some embodiments, the searchable document text may include bookmarks identifying segments of redaction text. The redacted image areas may be visually distinctive of redaction, such as by highlighting or by having a uniform non-text appearance. The user-input may be produced from a user selectable button on a computer network device or from a user selectable button on a computer display. The redacted document may specifically be a PDF format document or an editable word processor document such as a Microsoft Word document.
-
FIG. 1 shows various steps in the functional flow of an automatic scan-to-redact process according to one embodiment of the present invention. - Embodiments of the present invention are based on scanning and OCR technology, combined with some recognition intelligence, pre-defined keywords with redaction codes, and the PDF file format to produce an automated scan to redacted PDF file process that can be reviewed, annotated and corrected at anytime. The entire process, called a “workflow” can be defined using the application and applied with a single press of a button either within the application itself or assigned to a physical or virtual button on a device. The end result is a “Searchable PDF” file.
-
FIG. 1 shows various steps in the functional flow of an automatic scan-to-redact process according to one embodiment of the present invention. First, the source document is scanned, step 101, including initial defining of the scanning process for some image enhancement or corrections. Scanning enhancements improve the OCR recognition capabilities and increase accuracy. Examples of such corrections include without limitation de-skew, despeckle, and auto-rotate. Settings for b/w, grayscale, color and resolution are also set in this step. - Once the document is scanned, an OCR process is applied to the scanned document,
step 102. Layout or formatting guidance and language detection is used at this point. For example, as the text is recognized the words are checked against internal dictionaries so that any words not found in the dictionary can be marked as suspect during the OCR text proofreading process. Professional dictionaries including legal, financial and medical are used for even greater accuracy. - When text is checked against the user and professional dictionaries possible mistakes are highlighted during the next step of proofreading the OCR document, step 103. The user checks the accuracy of recognized words in the context of the actual sentence in the document. Corrections may be applied using the proofreading tool. As the words are verified or corrected the user dictionaries “learn” and are updated accordingly to improve the correction process of additionally processed documents.
- Once the text has been verified after the OCR proofreading step, the document redaction process can begin,
step 104. A user input is received that identifies a scanned document for redaction. For example, the user input may be produced by a user selectable button—either a physical button on a computer network device such as a scanner, etc., or a virtual button produced by software on a computer display. In response to the user input, the scanned document is automatically processed to produce a corresponding redacted document which includes both searchable document text and a document image. The searchable document text includes coded redaction text which satisfies various defined redaction parameters. The document image includes redacted image areas corresponded to the redacted elements. - Established process rules are utilized such as the ability to automatically bookmark document pages for easy identification of redacted text in long documents. Redaction rules may also include use of pre-established redaction criteria to identify and produce the redaction text. For example, redaction patterns may be used which correspond to patterns of text associated with redaction, including automatic markup of information that fits a specific pattern or “looks like” specific information. The pattern could be the format of a number such as a Social Security number, date, currency value or telephone number. A specific list of redaction keywords, redaction codes and notes are loaded from a comma or tab delimited text file or directly from an ODBC database file.
- The redaction may be made immediately and directly, or redaction text may be highlighted for further review in a PDF editing application such as PDF Converter Professional. The identified redaction text may usefully be visually distinctive of redaction, for example, having a uniform non-text appearance. In addition, highlighting redaction text candidates for redaction can be customized, for example, with highlight color or an alternative method for identification such as outline or strikethrough can be used. Additionally, redacted areas can have a highlight color applied to a preset number of words or characters before and/or after the redacted areas for easy visual identification. Redacted or highlighted areas can be created with or as annotation fields in the final document.
- An additional post-redaction proofreading,
step 105, may be employed similar to the post-OCR proofreading step. This checks that possible redaction candidates defined as a pattern or “looks like” are accurately removed or marked as text that must be redacted. Identified words may be displayed in the context of the actual sentence in the document. Tools may be provided to mark or unmark redaction text as required. An alternative to an interactive “redact proofreading” step would be to automatically highlight suspect words in a pre-defined color for review in a PDF editing tool. - The redacted (or redaction highlighted) document can be saved in a propriety application format,
step 106, for later review, adjustments at a later date, and or archiving and storage. An embodiment of the application may also read marked PDF files that have been reviewed and marked with highlights in other applications so that the redaction can be automatically applied. When the PDF searchable image files are created, redaction and PDF preferences are applied. Redaction preferences include redaction color, the default being “black”, the width and height tolerance for the redacted area on the image, and a tolerance to objects or text close to the redacted areas so they are not inadvertently corrupted. - A PDF file includes various main components or layers, the most relevant being the image and text layers. The image layer comprises the original scanned document where the redacted areas are replaced with the redacted color and the pixel information is “destroyed”. The text layer is a hidden layer above or below the image that can be indexed and/or searched by commercial search products. The text layer precisely follows the formatting of the original document so that the text selectable in PDF viewing or editing applications in context of the original. The redaction text is removed from the redacted document, and placeholder characters are substituted to form equally spaced text “lines” that, although they can be copied and pasted into other application, do not represent the original text, thereby destroying that information.
- Metadata can be applied to the PDF file properties so it can be searched using commercially available search tools. PDF Bookmarks can be automatically created to identify redacted pages. Custom stamps, headers and footers can also be applied to the document to display relevant information such the creator name, creation date, and/or terms and conditions for use of the redacted document. Annotations as notes or redaction/exemption codes are created and applied to the final PDF document.
- Standard PDF security tools are also set and applied at this step. This includes password definition in order to view or print a document. 40-bit and 128-bit encryption can also be applied to a PDF file with the user able to control whether the document can be viewed, printed, edited, copied or annotated. The application will also include everything necessary to digitally sign documents for protection and authentication of information. No certificate from a third-party vendor is necessary.
- Redaction workflows combine all of the above described steps into a single workflow file that can be named, exported and imported into other copies of the application program so the process can be replicated. Workflows can be initiated as a single step within the application or assigned to physical or virtual buttons on a device for one-button process automation. The application can also batch process files stored on a network utilizing folder watching or automatically process incoming files from email applications. Scanned, recognized and redacted Searchable PDF files can be automatically saved to local hard drives, be saved to document management systems or emailed to specific recipients. The application can also save the files as editable Microsoft Word documents.
- Embodiments of the invention may be implemented in any conventional computer programming language. For example, preferred embodiments may be implemented in a procedural programming language (e.g., “C”) or an object oriented programming language (e.g., “C++”, Python). Alternative embodiments of the invention may be implemented as pre-programmed hardware elements, other related components, or as a combination of hardware and software components.
- Embodiments can be implemented as a computer program product for use with a computer system. Such implementation may include a series of computer instructions fixed either on a tangible medium, such as a computer readable medium (e.g., a diskette, CD-ROM, ROM, or fixed disk) or transmittable to a computer system, via a modem or other interface device, such as a communications adapter connected to a network over a medium. The medium may be either a tangible medium (e.g., optical or analog communications lines) or a medium implemented with wireless techniques (e.g., microwave, infrared or other transmission techniques). The series of computer instructions embodies all or part of the functionality previously described herein with respect to the system. Those skilled in the art should appreciate that such computer instructions can be written in a number of programming languages for use with many computer architectures or operating systems. Furthermore, such instructions may be stored in any memory device, such as semiconductor, magnetic, optical or other memory devices, and may be transmitted using any communications technology, such as optical, infrared, microwave, or other transmission technologies. It is expected that such a computer program product may be distributed as a removable medium with accompanying printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin board over the network (e.g., the Internet or World Wide Web). Of course, some embodiments of the invention may be implemented as a combination of both software (e.g., a computer program product) and hardware. Still other embodiments of the invention are implemented as entirely hardware, or entirely software (e.g., a computer program product).
- Although various exemplary embodiments of the invention have been disclosed, it should be apparent to those skilled in the art that various changes and modifications can be made which will achieve some of the advantages of the invention without departing from the true scope of the invention.
Claims (15)
1. A method of producing a redacted electronic document, the method comprising:
receiving a user input identifying a scanned document; and
automatically processing the scanned document to produce a corresponding redacted document which includes:
i. searchable document text including coded redaction text satisfying defined redaction parameters, and
ii. a document image having redacted image areas corresponding to redacted elements.
2. A method according to claim 1 , wherein the redaction text is visually highlighted for review.
3. A method according to claim 2 , further comprising:
removing the redaction text from the redacted document; and
substituting placeholder characters for the redaction text.
4. A method according to claim 1 , wherein the redaction text comprises placeholder characters which substitute for text in the scanned document which satisfies the defined redaction parameters.
5. A method according to claim 1 , further comprising:
providing an annotation field for recording post-redaction comments associated with selected redaction text.
6. A method according to claim 1 , wherein the processing uses pre-established redaction criteria to identify and produce the redaction text.
7. A method according to claim 6 , wherein the redaction criteria includes a set of keywords associated with redaction.
8. A method according to claim 6 , wherein the redaction criteria includes redaction patterns representing patterns of text associated with redaction.
9. A method according to claim 1 , wherein the searchable document text includes bookmarks identifying segments of redaction text.
10. A method according to claim 1 , wherein the redacted image areas are visually distinctive of redaction.
11. A method according to claim 10 , wherein the redacted image areas have a uniform non-text appearance.
12. A method according to claim 1 , wherein the user-input is produced from a user selectable button on a computer network device.
13. A method according to claim 1 , wherein the user-input is produced from a user selectable button on a computer display.
14. A method according to claim 1 , wherein the redacted document is a PDF format document.
15. A method according to claim 1 , wherein the redacted document is an editable word processor document.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/182,334 US20090164881A1 (en) | 2007-07-30 | 2008-07-30 | Scan-to-Redact Searchable Documents |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US95265307P | 2007-07-30 | 2007-07-30 | |
US12/182,334 US20090164881A1 (en) | 2007-07-30 | 2008-07-30 | Scan-to-Redact Searchable Documents |
Publications (1)
Publication Number | Publication Date |
---|---|
US20090164881A1 true US20090164881A1 (en) | 2009-06-25 |
Family
ID=39822006
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/182,334 Abandoned US20090164881A1 (en) | 2007-07-30 | 2008-07-30 | Scan-to-Redact Searchable Documents |
Country Status (4)
Country | Link |
---|---|
US (1) | US20090164881A1 (en) |
EP (1) | EP2171643A1 (en) |
CN (1) | CN101802840A (en) |
WO (1) | WO2009018328A1 (en) |
Cited By (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090067414A1 (en) * | 2007-09-09 | 2009-03-12 | Francis Toscano | Systems and Methods for Communicating Documents |
US20090128861A1 (en) * | 2007-09-09 | 2009-05-21 | Xpedite Systems, Llc | Systems and Methods for Communicating Multimodal Messages |
US20090190159A1 (en) * | 2008-01-30 | 2009-07-30 | Francis Toscano | Systems and Methods for Generating and Communicating Enhanced Portable Document Format Files |
US20100033753A1 (en) * | 2008-08-06 | 2010-02-11 | Kenneth Stephenson | System and method for selective redaction of scanned documents |
US20110055932A1 (en) * | 2009-08-26 | 2011-03-03 | International Business Machines Corporation | Data Access Control with Flexible Data Disclosure |
US20110066606A1 (en) * | 2009-09-15 | 2011-03-17 | International Business Machines Corporation | Search engine with privacy protection |
US20110119576A1 (en) * | 2009-11-16 | 2011-05-19 | Yehonatan Aumann | Method for system for redacting and presenting documents |
US20110162084A1 (en) * | 2009-12-29 | 2011-06-30 | Joshua Fox | Selecting portions of computer-accessible documents for post-selection processing |
JP2013109728A (en) * | 2011-11-24 | 2013-06-06 | Jfe Steel Corp | Mill sheet browsing method and mill sheet browsing system |
JP2013130997A (en) * | 2011-12-21 | 2013-07-04 | Kyocera Document Solutions Inc | Image forming apparatus |
US20130185634A1 (en) * | 2012-01-15 | 2013-07-18 | International Business Machines Corporation | Automated document redaction |
US20130215474A1 (en) * | 2011-11-04 | 2013-08-22 | Document Security Systems, Inc. | System and Method for Printing Documents Containing Dynamically Generated Security Features |
US8548280B2 (en) | 2011-02-14 | 2013-10-01 | Hewlett-Packard Development Company, L.P. | Systems and methods for replacing non-image text |
US20140053231A1 (en) * | 2012-08-16 | 2014-02-20 | Berkeley Information Technology Pty Ltd | Streamlined security-level determination of an electronic document and selective release into an information system |
US8719083B2 (en) | 2010-12-06 | 2014-05-06 | Xpedite Systems, Inc. | On net faxing |
US8737583B2 (en) | 2002-04-08 | 2014-05-27 | Open Text S.A. | Document transmission and routing with recipient control |
US20140212040A1 (en) * | 2013-01-31 | 2014-07-31 | Longsand Limited | Document Alteration Based on Native Text Analysis and OCR |
US20140281871A1 (en) * | 2013-03-15 | 2014-09-18 | Meditory Llc | Method for mapping form fields from an image containing text |
US9007604B2 (en) | 2010-06-30 | 2015-04-14 | Xpedite Systems, Llc | System, method, and apparatus for an interactive virtual fax machine |
US20150378973A1 (en) * | 2014-06-24 | 2015-12-31 | Abbyy Development Llc | Electronic document content redaction |
JP2016197463A (en) * | 2016-08-16 | 2016-11-24 | Jfeスチール株式会社 | Mill sheet browsing method, mill sheet browsing system, mill sheet disclosing method, and mill sheet disclosing system |
US9892278B2 (en) | 2012-11-14 | 2018-02-13 | International Business Machines Corporation | Focused personal identifying information redaction |
US20180061074A1 (en) * | 2016-08-31 | 2018-03-01 | Canon Kabushiki Kaisha | Apparatus, method, and storage medium |
RU2648636C2 (en) * | 2014-03-31 | 2018-03-26 | Общество с ограниченной ответственностью "Аби Девелопмент" | Storage of the content in converted documents |
US20180374007A1 (en) * | 2017-06-27 | 2018-12-27 | Canon Kabushiki Kaisha | Workflow generation apparatus, workflow generation method, and recording medium |
US20190018983A1 (en) * | 2017-07-17 | 2019-01-17 | Microsoft Technology Licensing, Llc | Removing Sensitive Content from Documents while Preserving their Usefulness for Subsequent Processing |
US20190199878A1 (en) * | 2017-12-26 | 2019-06-27 | Kyocera Document Solutions Inc. | Image processing apparatus |
US20190246009A1 (en) * | 2017-03-31 | 2019-08-08 | Kyocera Document Solutions Inc. | Scan privacy tool and methods using sensors or control buttons |
US10733434B2 (en) | 2018-09-24 | 2020-08-04 | International Business Machines Corporation | Method and system for accurately detecting, extracting and representing redacted text blocks in a document |
US10878186B1 (en) | 2017-09-18 | 2020-12-29 | University Of South Florida | Content masking attacks against information-based services and defenses thereto |
US10976899B2 (en) * | 2014-02-03 | 2021-04-13 | Bluebeam, Inc. | Method for automatically applying page labels using extracted label contents from selected pages |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8838554B2 (en) | 2008-02-19 | 2014-09-16 | Bank Of America Corporation | Systems and methods for providing content aware document analysis and modification |
US9104659B2 (en) | 2010-01-20 | 2015-08-11 | Bank Of America Corporation | Systems and methods for providing content aware document analysis and modification |
US9378379B1 (en) | 2011-01-19 | 2016-06-28 | Bank Of America Corporation | Method and apparatus for the protection of information in a device upon separation from a network |
CN104537026B (en) * | 2014-12-22 | 2018-08-24 | 福建亿榕信息技术有限公司 | Archives of paper quality document handling method based on local cache |
US9870484B2 (en) * | 2015-01-30 | 2018-01-16 | Konica Minolta Laboratory U.S.A., Inc. | Document redaction |
JP6690170B2 (en) * | 2015-09-28 | 2020-04-28 | 富士ゼロックス株式会社 | Image processing device and program |
CN105302506B (en) * | 2015-11-10 | 2018-03-20 | 武汉启明泰和软件服务有限公司 | A kind of sorting bulk print method called automatically based on data |
CN108009143A (en) * | 2017-12-04 | 2018-05-08 | 上海互盾信息科技有限公司 | A kind of PDF document real-time edition scan tool |
EP3895068A4 (en) * | 2018-12-12 | 2022-07-13 | Hewlett-Packard Development Company, L.P. | Scanning devices with zonal ocr user interfaces |
CN117649673A (en) * | 2024-01-30 | 2024-03-05 | 陕西巨微图书文化传播有限公司 | Picture processing method in book editing and correction |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5581682A (en) * | 1991-06-28 | 1996-12-03 | International Business Machines Corporation | Method for storing and retrieving annotations and redactions in final form documents |
US20020083079A1 (en) * | 2000-11-16 | 2002-06-27 | Interlegis, Inc. | System and method of managing documents |
US20040260569A1 (en) * | 2000-09-07 | 2004-12-23 | Cyber Legal Solutions, Inc. | Expert legal task management |
US20070005637A1 (en) * | 2005-07-01 | 2007-01-04 | Juliano Elizabeth B | System for Litigation Management |
US20070030528A1 (en) * | 2005-07-29 | 2007-02-08 | Cataphora, Inc. | Method and apparatus to provide a unified redaction system |
US20070094594A1 (en) * | 2005-10-06 | 2007-04-26 | Celcorp, Inc. | Redaction system, method and computer program product |
US20080112620A1 (en) * | 2006-10-26 | 2008-05-15 | Hubin Jiang | Automated system for understanding document content |
US20080204788A1 (en) * | 2004-10-14 | 2008-08-28 | Onstream Systems Limited | Process for Electronic Document Redaction |
US7428701B1 (en) * | 1998-12-18 | 2008-09-23 | Appligent Inc. | Method, system and computer program for redaction of material from documents |
US20090019379A1 (en) * | 2007-07-12 | 2009-01-15 | Pendergast Brian S | Document Redaction in a Web-Based Data Analysis and Document Review System |
US7802305B1 (en) * | 2006-10-10 | 2010-09-21 | Adobe Systems Inc. | Methods and apparatus for automated redaction of content in a document |
-
2008
- 2008-07-30 WO PCT/US2008/071571 patent/WO2009018328A1/en active Application Filing
- 2008-07-30 CN CN200880107998A patent/CN101802840A/en active Pending
- 2008-07-30 US US12/182,334 patent/US20090164881A1/en not_active Abandoned
- 2008-07-30 EP EP08782516A patent/EP2171643A1/en not_active Withdrawn
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5581682A (en) * | 1991-06-28 | 1996-12-03 | International Business Machines Corporation | Method for storing and retrieving annotations and redactions in final form documents |
US7428701B1 (en) * | 1998-12-18 | 2008-09-23 | Appligent Inc. | Method, system and computer program for redaction of material from documents |
US20040260569A1 (en) * | 2000-09-07 | 2004-12-23 | Cyber Legal Solutions, Inc. | Expert legal task management |
US20020083079A1 (en) * | 2000-11-16 | 2002-06-27 | Interlegis, Inc. | System and method of managing documents |
US20080204788A1 (en) * | 2004-10-14 | 2008-08-28 | Onstream Systems Limited | Process for Electronic Document Redaction |
US20070005637A1 (en) * | 2005-07-01 | 2007-01-04 | Juliano Elizabeth B | System for Litigation Management |
US20070030528A1 (en) * | 2005-07-29 | 2007-02-08 | Cataphora, Inc. | Method and apparatus to provide a unified redaction system |
US7805673B2 (en) * | 2005-07-29 | 2010-09-28 | Der Quaeler Loki | Method and apparatus to provide a unified redaction system |
US20070094594A1 (en) * | 2005-10-06 | 2007-04-26 | Celcorp, Inc. | Redaction system, method and computer program product |
US7802305B1 (en) * | 2006-10-10 | 2010-09-21 | Adobe Systems Inc. | Methods and apparatus for automated redaction of content in a document |
US20080112620A1 (en) * | 2006-10-26 | 2008-05-15 | Hubin Jiang | Automated system for understanding document content |
US20090019379A1 (en) * | 2007-07-12 | 2009-01-15 | Pendergast Brian S | Document Redaction in a Web-Based Data Analysis and Document Review System |
Cited By (64)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9635199B2 (en) | 2002-04-08 | 2017-04-25 | Open Text Sa Ulc | System and method for document transmission and routing with recipient control |
US8737583B2 (en) | 2002-04-08 | 2014-05-27 | Open Text S.A. | Document transmission and routing with recipient control |
US9160881B2 (en) | 2002-04-08 | 2015-10-13 | Open Text S.A. | System and method for document transmission and routing with recipient control |
US20090128861A1 (en) * | 2007-09-09 | 2009-05-21 | Xpedite Systems, Llc | Systems and Methods for Communicating Multimodal Messages |
US20090067414A1 (en) * | 2007-09-09 | 2009-03-12 | Francis Toscano | Systems and Methods for Communicating Documents |
US8395795B2 (en) | 2007-09-09 | 2013-03-12 | Xpedite Systems, Llc | Systems and methods for communicating documents |
US9191527B2 (en) | 2007-09-09 | 2015-11-17 | Xpedite Systems, Llc | Systems and methods for communicating documents |
US11445072B2 (en) | 2008-01-30 | 2022-09-13 | Open Text Holdings, Inc. | Systems and methods for generating and communicating enhanced portable document format files |
US10003701B2 (en) * | 2008-01-30 | 2018-06-19 | Xpedite Systems, Llc | Systems and methods for generating and communicating enhanced portable document format files |
US10887474B2 (en) | 2008-01-30 | 2021-01-05 | Open Text Holdings, Inc. | Systems and methods for generating and communicating enhanced portable document format files |
US10440199B2 (en) | 2008-01-30 | 2019-10-08 | Open Text Holdings, Inc. | Systems and methods for generating and communicating enhanced portable document format files |
US20090190159A1 (en) * | 2008-01-30 | 2009-07-30 | Francis Toscano | Systems and Methods for Generating and Communicating Enhanced Portable Document Format Files |
US20100033753A1 (en) * | 2008-08-06 | 2010-02-11 | Kenneth Stephenson | System and method for selective redaction of scanned documents |
US20110055932A1 (en) * | 2009-08-26 | 2011-03-03 | International Business Machines Corporation | Data Access Control with Flexible Data Disclosure |
US10169599B2 (en) | 2009-08-26 | 2019-01-01 | International Business Machines Corporation | Data access control with flexible data disclosure |
US20110066606A1 (en) * | 2009-09-15 | 2011-03-17 | International Business Machines Corporation | Search engine with privacy protection |
US10454932B2 (en) | 2009-09-15 | 2019-10-22 | International Business Machines Corporation | Search engine with privacy protection |
US9224007B2 (en) | 2009-09-15 | 2015-12-29 | International Business Machines Corporation | Search engine with privacy protection |
US10902202B2 (en) * | 2009-11-16 | 2021-01-26 | Refinitiv Us Organization Llc | Method for system for redacting and presenting documents |
US20110119576A1 (en) * | 2009-11-16 | 2011-05-19 | Yehonatan Aumann | Method for system for redacting and presenting documents |
US20110162084A1 (en) * | 2009-12-29 | 2011-06-30 | Joshua Fox | Selecting portions of computer-accessible documents for post-selection processing |
US9886159B2 (en) | 2009-12-29 | 2018-02-06 | International Business Machines Corporation | Selecting portions of computer-accessible documents for post-selection processing |
US9600134B2 (en) | 2009-12-29 | 2017-03-21 | International Business Machines Corporation | Selecting portions of computer-accessible documents for post-selection processing |
US9143631B2 (en) | 2010-06-30 | 2015-09-22 | Xpedite Systems, Llc | System, method and apparatus for an interactive virtual fax machine |
US9007604B2 (en) | 2010-06-30 | 2015-04-14 | Xpedite Systems, Llc | System, method, and apparatus for an interactive virtual fax machine |
US10154159B2 (en) | 2010-06-30 | 2018-12-11 | Open Text Holdings, Inc. | System, method, and apparatus for an interactive virtual fax machine |
US10277754B2 (en) | 2010-06-30 | 2019-04-30 | Open Text Holdings, Inc. | System, method, and apparatus for an interactive virtual fax machine |
US9641708B2 (en) | 2010-06-30 | 2017-05-02 | Xpedite Systems, Llc | System, method and apparatus for an interactive virtual fax machine |
US9854123B2 (en) | 2010-12-06 | 2017-12-26 | Xpedite Systems, Llc | On net faxing |
US8719083B2 (en) | 2010-12-06 | 2014-05-06 | Xpedite Systems, Inc. | On net faxing |
US10694067B2 (en) | 2010-12-06 | 2020-06-23 | Open Text Holdings, Inc. | On net faxing |
US11178305B2 (en) | 2010-12-06 | 2021-11-16 | Open Text Holdings, Inc. | On net faxing |
US8548280B2 (en) | 2011-02-14 | 2013-10-01 | Hewlett-Packard Development Company, L.P. | Systems and methods for replacing non-image text |
US20130215474A1 (en) * | 2011-11-04 | 2013-08-22 | Document Security Systems, Inc. | System and Method for Printing Documents Containing Dynamically Generated Security Features |
JP2013109728A (en) * | 2011-11-24 | 2013-06-06 | Jfe Steel Corp | Mill sheet browsing method and mill sheet browsing system |
JP2013130997A (en) * | 2011-12-21 | 2013-07-04 | Kyocera Document Solutions Inc | Image forming apparatus |
US20130185634A1 (en) * | 2012-01-15 | 2013-07-18 | International Business Machines Corporation | Automated document redaction |
US9195853B2 (en) * | 2012-01-15 | 2015-11-24 | International Business Machines Corporation | Automated document redaction |
US20140053231A1 (en) * | 2012-08-16 | 2014-02-20 | Berkeley Information Technology Pty Ltd | Streamlined security-level determination of an electronic document and selective release into an information system |
US9892278B2 (en) | 2012-11-14 | 2018-02-13 | International Business Machines Corporation | Focused personal identifying information redaction |
US9904798B2 (en) | 2012-11-14 | 2018-02-27 | International Business Machines Corporation | Focused personal identifying information redaction |
US20140212040A1 (en) * | 2013-01-31 | 2014-07-31 | Longsand Limited | Document Alteration Based on Native Text Analysis and OCR |
US9256798B2 (en) * | 2013-01-31 | 2016-02-09 | Aurasma Limited | Document alteration based on native text analysis and OCR |
US20140281871A1 (en) * | 2013-03-15 | 2014-09-18 | Meditory Llc | Method for mapping form fields from an image containing text |
US20210200397A1 (en) * | 2014-02-03 | 2021-07-01 | Bluebeam, Inc. | Method for automatically indexing an electronic document |
US10976899B2 (en) * | 2014-02-03 | 2021-04-13 | Bluebeam, Inc. | Method for automatically applying page labels using extracted label contents from selected pages |
US11592967B2 (en) * | 2014-02-03 | 2023-02-28 | Bluebeam, Inc. | Method for automatically indexing an electronic document |
RU2648636C2 (en) * | 2014-03-31 | 2018-03-26 | Общество с ограниченной ответственностью "Аби Девелопмент" | Storage of the content in converted documents |
US10108815B2 (en) * | 2014-06-24 | 2018-10-23 | Abbyy Development Llc | Electronic document content redaction |
US20150378973A1 (en) * | 2014-06-24 | 2015-12-31 | Abbyy Development Llc | Electronic document content redaction |
JP2016197463A (en) * | 2016-08-16 | 2016-11-24 | Jfeスチール株式会社 | Mill sheet browsing method, mill sheet browsing system, mill sheet disclosing method, and mill sheet disclosing system |
US10803308B2 (en) * | 2016-08-31 | 2020-10-13 | Canon Kabushiki Kaisha | Apparatus for deciding whether to include text in searchable data, and method and storage medium thereof |
US20180061074A1 (en) * | 2016-08-31 | 2018-03-01 | Canon Kabushiki Kaisha | Apparatus, method, and storage medium |
US10812682B2 (en) * | 2017-03-31 | 2020-10-20 | Kyocera Document Solutions Inc. | Scan privacy tool and methods using sensors or control buttons |
US20190246009A1 (en) * | 2017-03-31 | 2019-08-08 | Kyocera Document Solutions Inc. | Scan privacy tool and methods using sensors or control buttons |
US10453006B2 (en) * | 2017-06-27 | 2019-10-22 | Canon Kabushiki Kaisha | Workflow generation apparatus, workflow generation method, and recording medium |
US20180374007A1 (en) * | 2017-06-27 | 2018-12-27 | Canon Kabushiki Kaisha | Workflow generation apparatus, workflow generation method, and recording medium |
US10713390B2 (en) * | 2017-07-17 | 2020-07-14 | Microsoft Technology Licensing, Llc | Removing sensitive content from documents while preserving their usefulness for subsequent processing |
US20190018983A1 (en) * | 2017-07-17 | 2019-01-17 | Microsoft Technology Licensing, Llc | Removing Sensitive Content from Documents while Preserving their Usefulness for Subsequent Processing |
US10878186B1 (en) | 2017-09-18 | 2020-12-29 | University Of South Florida | Content masking attacks against information-based services and defenses thereto |
US11775749B1 (en) | 2017-09-18 | 2023-10-03 | University Of South Florida | Content masking attacks against information-based services and defenses thereto |
US10630854B2 (en) * | 2017-12-26 | 2020-04-21 | Kyocera Document Solutions Inc. | Image processing apparatus |
US20190199878A1 (en) * | 2017-12-26 | 2019-06-27 | Kyocera Document Solutions Inc. | Image processing apparatus |
US10733434B2 (en) | 2018-09-24 | 2020-08-04 | International Business Machines Corporation | Method and system for accurately detecting, extracting and representing redacted text blocks in a document |
Also Published As
Publication number | Publication date |
---|---|
EP2171643A1 (en) | 2010-04-07 |
CN101802840A (en) | 2010-08-11 |
WO2009018328A1 (en) | 2009-02-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20090164881A1 (en) | Scan-to-Redact Searchable Documents | |
US7796309B2 (en) | Integrating analog markups with electronic documents | |
US7568104B2 (en) | Method and apparatus for adding signature information to electronic documents | |
US8179556B2 (en) | Masking of text in document reproduction | |
US9471800B2 (en) | Securing visual information on images for document capture | |
US20240012913A1 (en) | Systems and methods for creating enhanced documents for perfect automated parsing | |
Garfinkel | Leaking Sensitive Information in Complex Document Files--and How to Prevent It | |
US11188707B1 (en) | Systems and methods for creating enhanced documents for perfect automated parsing | |
US20190361962A1 (en) | A method and a system for providing an extract document | |
US11157639B2 (en) | Systems, processes, and computer program products for authentication of documents based on invisible information in documents | |
JP2012159987A (en) | Document browsing confirmation device, document browsing confirmation method, and program | |
Liu et al. | Hidden information in Microsoft word | |
Klindt | PDF/A considered harmful for digital preservation. | |
US20160259957A1 (en) | System And Method For Monitoring And Protecting Healthcare Data | |
JP2003317075A (en) | Processor and method for masking electronized document | |
Ewan et al. | Race to the Virtual Courthouse-How Standards Drive Electronic Recording of Real Property Documents | |
JP2022144768A (en) | Information processing device and information processing program | |
JP2020126322A (en) | Electronic document proof reading work management method using electronic tag and management method thereof | |
Hricik et al. | Metadata: Lawyers Need to Know What it is and What to do About It1 | |
Hart-Davis | Printing, Finalizing, and Sharing Documents | |
Cosgrove | Forensic Engineers and the New Federal Rules Regarding Electronically Stored Information (ESI). | |
BERMAN et al. | E-Discovery | |
TC | IMPACT FACTOR | |
Mathematicians et al. | IMPACT FACTOR |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: NUANCE COMMUNICATIONS, INC.,MASSACHUSETTS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SEGARRA, JEFFREY;CSATADI, GYORGY;DUDAS, CSABA;AND OTHERS;SIGNING DATES FROM 20090224 TO 20090305;REEL/FRAME:022361/0072 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |