US20170075766A1 - Selective processing of file system objects for image level backups - Google Patents

Info

Publication number
US20170075766A1
Authority
US
United States
Prior art keywords
backup
file system
disk
fat
blocks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/359,128
Inventor
Ratmir TIMASHEV
Anton GOSTEV
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Veeam Software Group GmbH
Original Assignee
Veeam Software AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Veeam Software AG
Priority to US15/359,128
Publication of US20170075766A1
Assigned to VEEAM SOFTWARE AG. Assignment of assignors' interest (see document for details). Assignors: GOSTEV, ANTON; TIMASHEV, RATMIR
Priority to US16/197,644 (patent US11068349B2)
Assigned to VEEAM SOFTWARE GROUP GMBH. Change of name (see document for details). Assignors: VEEAM SOFTWARE AG
Assigned to JPMORGAN CHASE N.A. Security interest (see document for details). Assignors: VEEAM SOFTWARE GROUP GMBH
Priority to US17/380,523 (patent US11789823B2)
Legal status: Abandoned

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • G06F11/1451Management of the data involved in backup or backup restore by selection of backup contents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/148File search processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • G06F16/184Distributed file systems implemented as replicated file system
    • G06F16/1844Management specifically adapted to replicated file systems
    • G06F17/30106
    • G06F17/30215
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/80Database-specific techniques

Definitions

  • the present invention is related to methods for backing up physical and virtual machine data into image level backups and replicas.
  • the present invention relates to methods, systems, and computer program products for reducing the amount of data that needs to be backed up or replicated at the image level by limiting processing to disk image blocks belonging to file system objects that represent value to applications and users.
  • Image level backups used for disaster recovery present new challenges as compared to legacy file system level backups.
  • the size of the disk images that need to be backed up means that backups take much longer to complete. Backups of large disk images also significantly increase backup file storage requirements.
  • image level backups save complete images of backed up disks.
  • conventional image level backups typically include unnecessary data blocks belonging to file system objects that are of no value to users, deleted file system objects, file system objects marked for deletion, unallocated space, and unused space.
  • although tools such as VEEAM™ Backup and Replication from Veeam Software International Ltd. are able to efficiently remove white space (e.g., by using compression and deduplication), the other unneeded data blocks mentioned above are still processed as part of image level backups. This slows down backup performance and requires additional backup storage space.
  • conventional image level backup techniques process and store significant amounts of data that are unnecessary in backup files.
  • conventional methods process and store the disk image data blocks corresponding to the contents of swap files, hibernation files, the contents of temporary (‘temp’) folders, recycling bin folders; and/or data such as Windows operating system (OS) system files which either do not need to be backed up at all, or can be easily restored from multiple other readily available sources.
  • certain OS file system objects such as directories and files for a server or computer can be readily restored from other similar servers or computers with the same OS installed.
  • Conventional image-level backup optimization methods fail to take this into account and as a result consume valuable time and storage space processing data blocks that correspond to contents of files of no value to users.
  • Embodiments of the present invention include methods, systems, and computer program products for efficient processing of image level backups.
  • the methods described herein with reference to image level backups can also be applied to other image level disaster recovery techniques, such as creating replicas via replication and simple copying of images.
  • a system for selective processing of file system objects for an image level backup comprises a backup engine which includes a receiving module, a connection module, a file allocation table processing module, and a block processing module.
  • the receiving module is configured to receive backup parameters for the image level backup, wherein the backup parameters include a selection of a machine to backup, and a selection of at least one file system object to include in the image level backup.
  • the connection module is configured to connect to production storage corresponding to the selected machine, wherein the connection module is further configured to obtain data from a source disk corresponding to the selected at least one file system object, and wherein the source disk is in the production storage.
  • the file allocation table (FAT) processing module is configured to fetch FAT blocks from the source disk, search the fetched FAT blocks to determine a selected set of data blocks of the source disk, wherein the selected set of data blocks corresponds to the selected at least one file system object, and create a backup FAT from the fetched FAT blocks, wherein the backup FAT comprises only records corresponding to the selected at least one file system object.
  • the block processing module is configured to read the determined selected set of data blocks, and save the backup FAT and the determined selected set of data blocks to a reconstructed disk image.
  • Embodiments of the invention achieve at least five key improvements over conventional image level backup optimization techniques.
  • embodiments of the present invention do not rely on determining and skipping processing of deleted data blocks. This improvement enables reduction of the amount of data to be backed up during image level backup even in cases when there are no blocks containing deleted data on the disk being backed up.
  • embodiments of the invention achieve significant improvements in processing speed and reduction in the size of backups.
  • Embodiments of the invention achieve backup performance improvements not just for full backups, but also for incremental and differential image block level backups that backup file system objects that have been changed or added since the last backup.
  • embodiments of the present invention also enable improvements for file systems that natively wipe (zero out) data blocks belonging to deleted file system objects, such as, but not limited to the Linux ext3 and ext4 file systems.
  • embodiments of the invention enable filtering out unimportant data blocks from processing and storing as a part of image level backups, such as data blocks occupied by swap files. This is because in accordance with embodiments of the invention, swap files, temporary (‘temp’) files, and other data blocks that are not important to users and applications are not backed up, even in cases when full backups are being processed.
  • embodiments of the present invention filter out paging and virtual memory files used by Windows server and workstation operating systems (OSs) and only backup data blocks used for applications and corresponding to application executables.
  • embodiments of the invention also optimize incremental and differential backups. This is important because embodiments of the present invention are compatible with commercially available disaster recovery tools, such as VEEAM™ Backup and Replication from Veeam Software International Ltd., that only require that a full backup be performed once, with subsequent backups being forever-incremental with only changed and new blocks being processed.
  • FIG. 1 illustrates a system architecture for object-selective backup processing, in accordance with an embodiment of the present invention.
  • FIG. 2 is a flowchart illustrating steps by which object-selective processing of image level backups is performed, in accordance with an embodiment of the present invention.
  • FIG. 3 illustrates an exemplary graphical user interface (GUI), wherein objects can be selected for image level backup processing, in accordance with an embodiment of the invention.
  • FIG. 4 depicts an example computer system in which an embodiment of the present invention may be implemented.
  • the terms “user,” “backup operator,” and “administrator” are used interchangeably herein to identify a human user, a software agent, or a group of users and/or software agents.
  • a software application or agent may sometimes process image level backups. Accordingly, unless specifically stated, the terms “backup operator,” “administrator,” and “user” as used herein are not limited to a human being.
  • the term “server” encompasses computing devices that are designed to function as one or more of file servers, email servers, Domain Name System (DNS) servers, Domain Controller (DC) servers, application servers, database servers, web servers, firewall servers, other enterprise servers, and back end servers.
  • a server may comprise one or more server machines.
  • a server may be implemented as a collection of servers such as a server farm or server cluster.
  • web servers may be commercially available server machines with one or more central processing units (CPUs).
  • these web servers may comprise multiple computing devices and/or computing functionality hosted on multiple server machines (i.e., a server farm).
  • the present invention relates to systems, methods, and computer program products for object-selective processing of image level backups.
  • FIG. 1 depicts system architecture 100 for processing object-selective image level backups, in accordance with an embodiment of the invention.
  • An operator console 110 includes a user interface (UI) 115 for backup operators and administrators.
  • the UI 115 may be displayed on computer display 430 shown in FIG. 4 .
  • UI 115 can be used to add and select individual file system objects to be included in, or excluded from an image level backup.
  • an image level backup is a backup of the disk images of a physical or virtual machine (VM) corresponding to a server or computer. Because any physical machine can be backed up at the image level (for example, by leveraging an agent), the invention applies to image level backups of both virtual and physical machines.
  • a “virtual machine” is a software implementation of a machine such as a server, computer, or other computing device that supports the execution of a complete operating system (OS) and executes application programs like a physical machine.
  • a VM duplicates the functionality of a physical machine implemented in hardware and software.
  • Software applications and the OS running on a VM are limited to the resources and abstractions provided by the VM.
  • virtual machines (VMs) are viewable within an overall virtual infrastructure.
  • the backup file system objects selected to be backed up can be located in production storage 130 , which includes one or more disks 140 which form parts of a production disk storage.
  • embodiments of the invention read data 135 to be backed up by either attaching an image of disk 140 to a backup engine 120 (in the case of a virtual machine), or by leveraging an agent inside each processed machine to get data from disk 140 (in the case of a physical or virtual machine).
  • the term “source disk” is used to refer to storage in production storage 130 to be backed up, such as disk 140 , which may be a disk of a physical machine or a disk image of a virtual machine.
  • UI 115 can also be used to remove a previously selected file system object from an image level backup to be processed. Operator console 110 can also be used to enter and configure other backup parameters 125 for an image level backup. For example, in the exemplary embodiment depicted in FIG. 3 , UI 115 can be used to disable object-selective image level processing for a backup, to process all but a selected subset of file system objects in a backup, or to include (copy) only selected file system objects in an image level backup.
  • operator console 110 includes a backup object selection interface 300 for selecting a machine's file system objects to back up for an image level backup of the machine. Selections of file system objects to include and exclude are received by backup engine 120 as backup parameters 125 . According to an embodiment, the file system objects to be included may be programmatically determined based upon the file system objects selected to be excluded. After acquiring backup parameters 125 , backup engine 120 connects to production storage 130 and initiates block level access to read data 135 from the corresponding disk 140 .
  • backup engine 120 is an application comprising modules configured to process an object-selective image level backup.
  • backup engine 120 is configured to receive backup parameters 125 from backup operator console 110 .
  • the received backup parameters 125 are acquired by a receiving module (not shown).
  • Backup engine 120 comprises a module configured to read data 135 from production storage 130 in order to retrieve and parse file allocation table (FAT) 150 of disk 140 , which in turn forms part of production disk storage.
  • FAT 150 data can be retrieved directly from storage, by reading the disk data blocks corresponding to FAT data location.
  • the FAT data can be retrieved by an agent (not shown) installed in the processed virtual machine or physical computer.
  • Backup engine 120 further comprises a module configured to create a reconstructed disk image 170 comprising a modified backup FAT 160 .
  • Backup engine 120 also includes a module configured to write an image level backup 175 to backup file storage 180 corresponding to reconstructed disk image 170 . Additional functionalities and features of backup engine 120 are discussed below with continued reference to FIG. 1 .
  • production storage 130 can comprise one or more disks 140 (or disk images, in the case of virtual machines), one for each disk used by the machine being backed up.
  • Operator console 110 can be used to select file system objects such as, but not limited to, directories, applications, data files, log files, and other file system objects associated with a machine's applications.
  • the term “disk image” refers to logical storage that has been abstracted and separated from physical storage, such as network-attached storage (NAS), file servers, disks, and other physical storage devices.
  • a disk image is implemented via virtual storage logic and is viewable within a virtual infrastructure as a storage device containing one or more virtual disks, which are separated from physical storage disks.
  • backup engine 120 is an application that functions as a backup agent. According to an embodiment, backup engine 120 is configured to retrieve the disk blocks that store the file system's file allocation table (FAT) 150 .
  • the term “FAT” refers to a file allocation table used in a variety of file system architectures for various Operating Systems (OSs) and is not limited to the FAT file system used in MICROSOFT™ Windows.
  • the contents of FAT 150 are parsed to determine the location (on the disk 140 ) of blocks of file system objects selected for inclusion in the image level backup, as specified by a backup operator using operator console 110 . In this way, only the blocks of disk 140 corresponding to selected file system objects need to be read from disk or disk image 140 .
  • a copy of the contents of FAT 150 is made as backup FAT 160 , which is optionally modified.
  • the optional modification may include removing references to file system objects that have been excluded from backup per selections made in operator console 110 .
  • FAT 160 remains as an unmodified copy of FAT 150 .
  • certain unimportant files, such as temporary files, virtual memory files (e.g., pagefile.sys and other paging files), and hibernation files (e.g., hiberfil.sys), will still be represented in the file system of restored backup 175 , but will have empty content (zeroed out data blocks).
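  • The two backup FAT variants described above (a modified copy with excluded records removed, or an unmodified copy) can be sketched as follows. This is a minimal illustration, not the patented implementation: the dictionary-based FAT model, the function name build_backup_fat, and the example paths are all assumptions introduced here.

```python
from typing import Dict, List, Set

# Hypothetical in-memory model of a file allocation table (an assumption):
# each file system object path maps to the disk block numbers it occupies.
Fat = Dict[str, List[int]]


def build_backup_fat(source_fat: Fat, excluded: Set[str], modify: bool = True) -> Fat:
    """Copy the source FAT and, optionally, drop records of excluded objects."""
    # Copy every record so the source FAT is never mutated.
    backup_fat = {path: list(blocks) for path, blocks in source_fat.items()}
    if modify:
        # Optional modification: remove references to excluded objects.
        for path in excluded:
            backup_fat.pop(path, None)
    return backup_fat


source_fat = {
    "/data/report.doc": [10, 11],
    "/pagefile.sys": [20, 21, 22],
}
modified = build_backup_fat(source_fat, {"/pagefile.sys"})
unmodified = build_backup_fat(source_fat, {"/pagefile.sys"}, modify=False)
```

  • With modify=False the excluded file keeps its record, which matches the case where the file is still represented in the restored file system even though its data blocks are zeroed.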
  • embodiments of the present invention do not look up or process deleted data information.
  • Backup engine 120 is configured to effectively reconstruct a modified, reconstructed disk image 170 on the fly, while simultaneously compressing and saving backup data 175 to backup file storage 180 .
  • backup engine 120 replicates the reconstructed disk image 170 to a replica VM (not shown).
  • reconstructed disk image 170 can be replicated to remote file storage.
  • Backup engine 120 can also copy reconstructed disk image 170 to another local or remote storage device.
  • in cases where backup 175 will be used to perform a restoration onto a replica of a VM, such as a standby VM or failover VM, reconstructed disk image 170 is replicated to the backup storage accessible from a hypervisor host running a replica VM.
  • reconstructed disk image 170 is created by using modified data blocks corresponding to backup FAT 160 , and then retrieving and applying only those image blocks of disk 140 that correspond to the file system objects selected for backup in UI 115 . Instead of including all sequential blocks of disk image 140 , reconstructed disk image 170 skips blocks corresponding to file system objects that were selected for exclusion based on settings provided in UI 115 of operator console 110 .
  • exclusions can be pre-configured. For example, it may be pre-configured that files such as paging and virtual memory files (e.g., swap files), are always excluded from the backup.
  • as reconstructed disk image 170 is being created using backup FAT 160 , it is simultaneously compressed and stored in backup file storage 180 as backup data 175 .
  • disk image data blocks containing data that is to be excluded from processing are substituted by zeroed data blocks in reconstructed disk image 170 .
  • zeroed data blocks are written to reconstructed disk image 170 instead of actual data blocks belonging to objects selected for exclusion in UI 115 .
  • the storage space needed in backup file storage 180 to store reconstructed disk image 170 is reduced in cases when data is compressed and/or deduplicated before saving it to a backup file.
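  • A rough illustration of why zeroed blocks reduce stored size once compression is applied. The sketch assumes zlib as a stand-in compressor and pseudo-random bytes as stand-in file contents; the source does not name a compression algorithm, and the block size and block counts are arbitrary.

```python
import random
import zlib

BLOCK_SIZE = 4096  # arbitrary block size chosen for this sketch
rng = random.Random(0)  # fixed seed so the example is repeatable


def pseudo_random_block() -> bytes:
    """Stand-in for a block of real, poorly compressible file data."""
    return bytes(rng.getrandbits(8) for _ in range(BLOCK_SIZE))


kept_blocks = [pseudo_random_block() for _ in range(4)]    # selected objects
swap_blocks = [pseudo_random_block() for _ in range(12)]   # excluded objects

# Conventional image: every block is stored with its real contents.
full_image = b"".join(kept_blocks + swap_blocks)
# Object-selective image: excluded blocks are replaced by zeroed blocks.
selective_image = b"".join(kept_blocks) + bytes(BLOCK_SIZE * len(swap_blocks))

full_size = len(zlib.compress(full_image))
selective_size = len(zlib.compress(selective_image))
```

  • Both images have the same uncompressed length, so block positions are preserved; only the compressed size differs.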
  • backup data 175 can be made available to data consuming processes as a local volume so that the reconstructed disk image 170 can later be used for additional processing, verification, and/or restoration of the backed up file system objects.
  • backup file storage 180 is made available to data consuming processes as remote storage via public or proprietary storage access protocols such as, but not limited to the Network File System (NFS), Common Internet File System (CIFS), and Internet Small Computer System Interface (iSCSI).
  • examples of additional processing include mounting reconstructed disk image 170 to a server as a volume, creating, updating, or deleting file system objects using native OS and third party tools, and committing the changes to backup data 175 .
  • Example methods for restoring file system objects and items from an image level backup are described in U.S.
  • FIG. 2 is a flowchart 200 illustrating steps by which a method is used to process object-selective image level backups, in accordance with an embodiment of the present invention.
  • flowchart 200 illustrates the steps by which file system object-selective image level backups are performed using a reconstructed disk image, such as reconstructed disk image 170 , according to an embodiment of the present invention.
  • FIG. 2 is described with continued reference to the embodiment illustrated in FIG. 1 . However, FIG. 2 is not limited to that embodiment. Note that the steps in the flowchart do not necessarily have to occur in the order shown.
  • steps of flowchart 200 described below may be accomplished via execution of computer executable instructions that, in response to execution by a computing device, perform an algorithm for creating an object-selective image level backup.
  • the method begins at step 210 .
  • a backup application is started in step 210 .
  • backup engine 120 and backup operator console 110 may be started in this step.
  • the method proceeds to step 220 .
  • Backup parameters 125 may include one or more physical or virtual machines (VMs) to back up, and a list of file system objects to either include in or exclude from an image level backup.
  • the file system objects may include directories and files, specified individually or using file name masks.
  • when a directory is selected to be included in an image level backup, all data files in the directory and in subdirectories below the selected directory are automatically selected for inclusion in the image level backup.
  • when a directory is selected to be excluded, all dependent file system objects, such as files within the excluded directory and all of its subdirectories, will not be processed in the image level backup.
  • the list of data items to be included in the backup may be programmatically determined based upon the one or more data items selected by the user to be excluded from the backup. For example, it may programmatically be determined that all files or a predetermined subset of files, except for the user selected one or more files to be excluded, are enumerated and included in the list of file system objects to be backed up.
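  • The derivation of an include list from user-selected exclusions can be sketched as below. The rule format (a directory path or a file name mask), the helper name is_excluded, and the example paths are assumptions made for illustration; the source does not specify how rules are represented.

```python
from fnmatch import fnmatchcase
from typing import Iterable, List


def is_excluded(path: str, exclusions: Iterable[str]) -> bool:
    """Return True if a path is covered by any exclusion rule."""
    name = path.rsplit("/", 1)[-1]
    for rule in exclusions:
        # Directory rule: excludes the directory and everything beneath it.
        if path == rule or path.startswith(rule.rstrip("/") + "/"):
            return True
        # File name mask rule: matched against the file name only.
        if fnmatchcase(name, rule):
            return True
    return False


all_objects: List[str] = [
    "/home/a.doc",
    "/home/cache/x.tmp",
    "/tmp/scratch/b.bin",
    "/var/log/app.log",
]
exclusions = ["/tmp", "*.tmp"]

# Everything not excluded is enumerated into the include list.
included = [p for p in all_objects if not is_excluded(p, exclusions)]
```

  • Here the "/tmp" rule removes the whole directory subtree and the "*.tmp" mask removes matching files anywhere on the disk.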
  • backup parameters 125 are received via user input in UI 115 within operator console 110 . After receiving backup parameters 125 , the method proceeds to step 230 .
  • backup engine 120 connects to production storage 130 used by the computer selected to be backed up in step 220 .
  • production storage 130 comprises one or more disks (or disk images) 140 of the machine to be backed up.
  • backup engine 120 attaches to the required disk 140 .
  • block level read access is initialized in order to be able to retrieve and process the data blocks of objects selected in step 220 .
  • an agent inside the processed physical machine can be leveraged to provide backup engine 120 with the processed disk's data.
  • backup engine 120 fetches content of disk blocks containing FAT 150 , and parses the contents of FAT 150 to determine the locations of data blocks of all file system objects selected for backup in step 220 . After backup engine 120 parses FAT 150 , the method proceeds to step 260 .
  • in step 260 , backup engine 120 makes a copy of FAT 150 data blocks into backup FAT 160 , and saves FAT 160 data blocks as part of reconstructed disk image 170 .
  • Step 260 includes optionally modifying backup FAT 160 data records to remove pointers to any file system objects not selected for backup in step 220 , and saving data blocks representing FAT 160 to reconstructed disk image 170 after modification is completed.
  • in step 265 , a disk block counter is examined and a determination is made as to whether the previously processed disk block was the last block of disk 140 (i.e., if the end of disk 140 has been reached). Step 265 is performed by comparing the current block number to the total number of blocks in disk 140 . If it is determined that the last block has not been processed, control is passed to step 270 . If it is determined that the last block has been processed, the processing of disk 140 completes and control is passed to step 290 .
  • steps 265 - 285 are repeated as a loop or cycle until each block of disk 140 has been sequentially processed.
  • the first time step 265 is performed, the number, N, of blocks in disk 140 is determined, and steps 265 - 285 are repeated for N cycles to process all blocks of disk 140 sequentially.
  • the current block number (corresponding to the current cycle step) in disk 140 is compared to N to determine if the last block has been reached.
  • the current block is looked up in FAT 150 or FAT 160 to see if it belongs to a file system object that needs to be processed.
  • FAT 150 or 160 contents can be cached in memory (RAM) for better lookup performance.
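  • One way to cache FAT contents in memory for fast per-block lookups is to invert the table into a block-to-object index. The sketch below assumes a simplified FAT model (object path mapped to block numbers); the function name build_block_index and the example data are hypothetical.

```python
from typing import Dict, List, Optional


def build_block_index(fat: Dict[str, List[int]]) -> Dict[int, str]:
    """Invert a path -> blocks table into a block -> path lookup index."""
    index: Dict[int, str] = {}
    for path, blocks in fat.items():
        for block in blocks:
            index[block] = path
    return index


fat = {
    "/data/report.doc": [10, 11],
    "/pagefile.sys": [20, 21, 22],
}
block_index = build_block_index(fat)

# Lookup for the current block; None means no object owns the block.
owner_of_block_20: Optional[str] = block_index.get(20)
owner_of_block_99: Optional[str] = block_index.get(99)
```

  • Building the index once makes each per-block lookup a single dictionary access instead of a scan over all FAT records.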
  • processing of a block corresponding to a selected file system object is performed by completing steps 270 - 285 , which are described below.
  • in step 270 , backup engine 120 looks up the current data block in FAT 150 or 160 .
  • the current block from step 265 is looked up in FAT 150 or 160 to determine which file system object the block belongs to in disk 140 .
  • the method proceeds to step 275 .
  • in step 275 , a determination is made as to whether the block contents looked up in step 270 form part of a file system object selected to be backed up in step 220 .
  • this step is performed by deciding if the block looked up in step 270 corresponds to a file selected for the backup, by correlating the actual block location (address) to FAT 150 or FAT 160 records and determining whether the block belongs to a file selected to be backed up in step 220 .
  • step 275 is performed by determining if the block contents do not correspond to a file system object selected to be excluded from the backup in step 220 . If it is determined that the block contents correspond to a file system object selected for the backup, the method proceeds to step 278 . If it is determined that the block contents do not correspond to a file system object selected to be backed up, the method proceeds to step 285 .
  • in step 278 , the block contents corresponding to the current block are retrieved from disk 140 within production storage 130 . After the block contents are read from production storage 130 , the method proceeds to step 280 .
  • in step 280 , the block contents retrieved in step 278 are saved to reconstructed disk image 170 .
  • the block contents are saved to the same position in reconstructed disk image 170 as the position of that content in disk or disk image 140 of production disk storage 130 .
  • in steps 265 - 280 , block contents for files that were selected to be processed in step 220 are fetched from disk 140 and saved to reconstructed disk image 170 .
  • control is passed back to step 265 so that the next block in the disk can be processed.
  • in step 285 , backup engine 120 writes a zeroed block to backup data 175 if the block was determined in step 275 to not correspond to a file system object selected for processing.
  • this step is performed by saving a zeroed data block in reconstructed disk image 170 instead of saving block contents from disk image 140 that correspond to a file system object not selected to be backed up in step 220 .
  • Step 285 reduces the amount of time needed to process backup 175 by not fetching from disk 140 block contents that do not need to be processed. Instead, step 285 saves zeroed blocks to reconstructed disk image 170 .
  • step 285 also writes zeroed blocks for file system objects which will not be restored from the backup, such as, but not limited to, temporary files, virtual memory files, and hibernation files.
  • the method saves storage space used to subsequently store backup 175 in backup file storage 180 without sacrificing the usefulness of backup 175 .
  • control is passed back to step 265 so that the next block in the disk can be processed.
  • a zeroed data block may not actually be written to backup data 175 ; instead, a pointer to a previously stored block is written.
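  • Writing a pointer to a previously stored block instead of the zeroed block itself resembles content-addressed deduplication. The sketch below is one possible way to do this, assuming SHA-256 digests as pointers; the class name BlockStore and its layout are hypothetical and are not taken from the source.

```python
import hashlib
from typing import Dict, List


class BlockStore:
    """Hypothetical deduplicating store: unique blocks plus ordered pointers."""

    def __init__(self) -> None:
        self.blocks: Dict[str, bytes] = {}  # digest -> block, stored once
        self.pointers: List[str] = []       # one digest per image block, in disk order

    def append(self, data: bytes) -> None:
        digest = hashlib.sha256(data).hexdigest()
        # Store the block only if an identical block is not already present.
        self.blocks.setdefault(digest, data)
        # Always record a pointer so the image can be reassembled in order.
        self.pointers.append(digest)

    def reassemble(self) -> bytes:
        return b"".join(self.blocks[d] for d in self.pointers)


store = BlockStore()
for block in [b"AAAA", bytes(4), bytes(4), bytes(4)]:
    store.append(block)
```

  • Three zeroed blocks occupy the space of one stored block plus three pointers, while reassembly still yields the full image.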
  • in step 290 , backup engine 120 is shut down and the process ends. Step 290 is performed after it has been determined in step 265 that the last block of disk 140 has been reached.
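  • The block loop of steps 265-285 can be sketched end to end as follows. This is a simplified illustration under stated assumptions: in-memory byte buffers stand in for disk 140 and reconstructed disk image 170, the FAT is reduced to a block-to-owner dictionary, the FAT blocks themselves are omitted, and the tiny block size and all names are hypothetical.

```python
import io
from typing import Dict, Set

BLOCK_SIZE = 4  # tiny block size, for illustration only


def reconstruct_image(source: io.BytesIO, total_blocks: int,
                      block_owner: Dict[int, str], selected: Set[str]) -> bytes:
    """Build a reconstructed image, zeroing blocks of unselected objects."""
    image = io.BytesIO()
    zero_block = bytes(BLOCK_SIZE)
    for block_no in range(total_blocks):        # step 265: stop after last block
        owner = block_owner.get(block_no)       # step 270: look up block owner
        if owner in selected:                   # step 275: selected for backup?
            source.seek(block_no * BLOCK_SIZE)  # step 278: read source block
            data = source.read(BLOCK_SIZE)
        else:
            data = zero_block                   # step 285: no source read needed
        image.seek(block_no * BLOCK_SIZE)       # step 280: same position as source
        image.write(data)
    return image.getvalue()


disk = io.BytesIO(b"AAAA" b"BBBB" b"SSSS" b"FFFF")
owner = {0: "/data/a.doc", 1: "/data/a.doc", 2: "/pagefile.sys"}
image = reconstruct_image(disk, 4, owner, {"/data/a.doc"})
```

  • Blocks 2 and 3 (the paging file and an unowned block) are never read from the source; they appear as zeroed blocks at their original offsets.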
  • FIG. 3 illustrates a graphical user interface (GUI), according to an embodiment of the present invention.
  • the GUI depicted in FIG. 3 is described with reference to the embodiments of FIGS. 1 and 2 .
  • the GUI is not limited to those example embodiments.
  • the GUI may be the UI 115 within operator console 110 used to select object-selective backup parameters 125 , as described in step 220 above with reference to FIG. 2 .
  • although the GUI is shown as an interface running on a computer terminal, it is understood that the GUI can be readily adapted to execute on a display of other platforms, such as mobile device platforms running various operating systems, or on another display of a computing device.
  • GUI illustrated in FIG. 3 can be displayed on a mobile device having an input device and a display.
  • FIG. 3 illustrates an exemplary backup object selection interface 300, wherein one or more file system objects from production storage 130 of a physical or virtual machine to be backed up can be displayed and selected by a backup operator. As described below and illustrated in FIG. 3, backup object selection interface 300 can be used to select file system objects for either inclusion in or exclusion from backup data 175.
  • By clicking include button 306 using an input device (not shown), a backup operator can browse a list of displayed file system objects from the selected machine's production storage 130.
  • a backup operator using an input device (not shown), selects Add button 308 to select one or more of the displayed file system objects to be included in backup 175 .
  • Add button 308 selects one or more file system objects to be processed from production storage 130 and included in reconstructed disk image 170 .
  • a backup operator can select one or more file system objects (e.g., “d:\Share\Home Folders” in the exemplary embodiment of FIG. 3) by either typing in the object name(s) or browsing to the location of the file system object(s) within production storage 130.
  • a backup operator can remove previously added file system objects from a backup by clicking on Remove button 310 .
  • One or more file system objects can be selected for inclusion in backup 175 by clicking on the file system objects displayed within backup object selection interface 300 and clicking Add button 308 .
  • backup parameters are saved by clicking on OK button 312 .
  • backup parameters 125 are saved as VM processing settings to be used by backup engine 120 .
  • the current file system object selections can be canceled by clicking on Cancel button 314 .
  • file system objects to be excluded from backup 175 can be selected by clicking on exclude button 304 .
  • a backup operator can browse a list of displayed file system objects from the selected machine's production storage 130 .
  • Add button 308 allows a backup operator to add one or more file system objects or environment variables (e.g., “c:\pagefile.sys,” “c:\hyberfil.sys,” and “%TEMP%” in the exemplary embodiment of FIG. 3) to a list of file system objects to be excluded from backup 175.
  • a backup operator, using an input device (not shown), selects Add button 308 to select one or more of the displayed file system objects to be excluded from backup 175. For example, by moving a pointer or cursor within the file system objects displayed as a result of clicking exclude button 304 and subsequently selecting Add button 308, a backup operator selects one or more file system objects that will not be read from production storage 130 and will be excluded from reconstructed disk image 170. A backup operator can remove previously added file system objects from the backup exclusion list by clicking on Remove button 310.
  • disable button 302 can be selected if the backup operator does not wish to select individual file system objects to be included in or excluded from backup 175 .
  • an object-selective image level backup is subsequently performed based upon backup parameters 125 selected and saved in backup object selection interface 300 .
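  • For illustration only, the selections saved from backup object selection interface 300 might be represented and evaluated as sketched below. The class `BackupParameters`, the functions `expand_entry` and `is_backed_up`, the three mode names, the machine name, and the value given to `TEMP` are hypothetical; the source does not describe how backup parameters 125 are encoded.

```python
import ntpath
from dataclasses import dataclass, field


@dataclass
class BackupParameters:
    """Hypothetical container for the selections made in interface 300."""
    machine: str
    mode: str = "disabled"  # "disabled", "include", or "exclude"
    objects: list = field(default_factory=list)


def expand_entry(entry, environment):
    """Replace %NAME% variables using the machine's environment mapping."""
    for name, value in environment.items():
        entry = entry.replace("%" + name + "%", value)
    return ntpath.normcase(entry)


def is_backed_up(path, params, environment):
    """Decide whether a file system object falls inside the backup."""
    if params.mode == "disabled":
        return True  # no object-level selection: process the whole image
    candidate = ntpath.normcase(path)
    matched = False
    for entry in params.objects:
        root = expand_entry(entry, environment)
        # Match the entry itself or anything stored beneath it.
        if candidate == root or candidate.startswith(root.rstrip("\\") + "\\"):
            matched = True
            break
    return matched if params.mode == "include" else not matched


# Exclusion list resembling the one shown in FIG. 3 (values are assumed).
params = BackupParameters(
    machine="fileserver01",
    mode="exclude",
    objects=["c:\\pagefile.sys", "c:\\hyberfil.sys", "%TEMP%"],
)
environment = {"TEMP": "C:\\Windows\\Temp"}
```

  • With these assumed values, a file under `C:\Windows\Temp` is left out of the backup, while `d:\Share\Home Folders` and its contents are processed.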
  • the display may be a computer display 430 shown in FIG. 4
  • backup object selection interface 300 may be display interface 402 .
  • the input device can be, but is not limited to, for example, a touch screen, a keyboard, a pointing device, a track ball, a touch pad, a joy stick, a voice activated control system, or other input devices used to provide interaction between a backup operator and backup object selection interface 300 .
  • FIG. 4 illustrates an example computer system 400 in which the present invention, or portions thereof, can be implemented as computer-readable code.
  • the methods illustrated by the flowchart 200 of FIG. 2 can be implemented in system 400 .
  • Object-selective backup processing architecture 100 of FIG. 1 can also be implemented in system 400 .
  • Various embodiments of the invention are described in terms of this example computer system 400 . After reading this description, it will become apparent to a person skilled in the relevant art how to implement the invention using other computer systems and/or computer architectures.
  • Computer system 400 includes one or more processors, such as processor 404 .
  • Processor 404 can be a special purpose or a general-purpose processor.
  • Processor 404 is connected to a communication infrastructure 406 (for example, a bus, or network).
  • Computer system 400 also includes a main memory 408 , preferably random access memory (RAM), and may also include a secondary memory 410 .
  • Secondary memory 410 may include, for example, a hard disk drive 412 , a removable storage drive 414 , flash memory, a memory stick, and/or any similar non-volatile storage mechanism.
  • Removable storage drive 414 may comprise a floppy disk drive, a magnetic tape drive, an optical disk drive, a flash memory, or the like.
  • the removable storage drive 414 reads from and/or writes to a removable storage unit 418 in a well-known manner.
  • Removable storage unit 418 may comprise a floppy disk, magnetic tape, optical disk, etc. which is read by and written to by removable storage drive 414 .
  • removable storage unit 418 includes a non-transitory computer usable storage medium having stored therein computer software and/or data.
  • secondary memory 410 may include other similar means for allowing computer programs or other instructions to be loaded into computer system 400 .
  • Such means may include, for example, a removable storage unit 422 and an interface 420 .
  • Examples of such means may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units 422 and interfaces 420 which allow software and data to be transferred from the removable storage unit 422 to computer system 400 .
  • Computer system 400 may also include a communications interface 424 .
  • Communications interface 424 allows software and data to be transferred between computer system 400 and external devices.
  • Communications interface 424 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, or the like.
  • Computer system 400 may additionally include computer display 430 .
  • computer display 430 in conjunction with display interface 402 , can be used to display UI 115 on operator console 110 .
  • Computer display 430 may also be used to display backup object selection interface 300 depicted in FIG. 3 .
  • In this document, the terms “computer program medium,” “non-transitory computer readable medium,” and “computer usable medium” are used to generally refer to media such as removable storage unit 418, removable storage unit 422, and a hard disk installed in hard disk drive 412.
  • Computer program medium, computer readable storage medium, and computer usable medium can also refer to memories, such as main memory 408 and secondary memory 410 , which can be memory semiconductors (e.g. DRAMs, etc.). These computer program products are means for providing software to computer system 400 .
  • Computer programs are stored in main memory 408 and/or secondary memory 410 . Computer programs may also be received via communications interface 424 . Such computer programs, when executed, enable computer system 400 to implement the present invention as discussed herein. In particular, the computer programs, when executed, enable processor 404 to implement the processes of the present invention, such as the steps in the methods illustrated by flowchart 200 of FIG. 2 and system architecture 100 of FIG. 1 discussed above. Accordingly, such computer programs represent controllers of the computer system 400 . Where the invention is implemented using software, the software may be stored in a computer program product and loaded into computer system 400 using removable storage drive 414 , interface 420 , hard drive 412 , or communications interface 424 .
  • the invention is also directed to computer program products comprising software stored on any computer useable medium.
  • Such software, when executed in one or more data processing devices, causes the data processing device(s) to operate as described herein.
  • Embodiments of the invention employ any computer useable or readable medium, known now or in the future.
  • Examples of computer useable mediums include, but are not limited to, primary storage devices (e.g., any type of random access memory), secondary storage devices (e.g., hard drives, floppy disks, CD ROMS, ZIP disks, tapes, magnetic storage devices, optical storage devices, MEMS, nanotechnological storage device, etc.), and communication mediums (e.g., wired and wireless communications networks, local area networks, wide area networks, intranets, etc.).

Abstract

Systems, methods, and computer program products are provided for reducing the size of image level backups. An example method receives backup parameters identifying a physical or virtual machine (VM) to back up and at least one file system object to include in the backup. The method connects to production storage corresponding to the selected physical or virtual machine and obtains access to data stored on a disk corresponding to the selected file system object(s). The method fetches file allocation table (FAT) blocks from the disk and parses contents of the FAT blocks to determine which disk blocks correspond to the selected file system object(s). The method creates a backup disk image FAT comprising blocks corresponding to the selected file system object(s). The method creates a reconstructed disk image in which FAT blocks corresponding to the backup FAT and disk image data blocks belonging to the selected file system object(s) are saved, and all other disk image data blocks are saved as zero blocks. The reconstructed disk image is compressed and stored in a backup file on backup storage, or replicated (copied) intact to other storage.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • The present application is a continuation of U.S. patent application Ser. No. 13/159,229, filed on Jun. 13, 2011, which claims the benefit of U.S. Provisional Patent Application No. 61/354,529, filed on Jun. 14, 2010, entitled “Selective Processing of File System Objects for Image Level Backups,” all of which are incorporated by reference herein in their entireties.
  • FIELD OF THE INVENTION
  • The present invention is related to methods for backing up physical and virtual machine data into image level backups and replicas. In particular, the present invention relates to methods, systems, and computer program products for reducing the amount of data that needs to be backed up or replicated at the image level by limiting processing to disk image blocks belonging to file system objects that represent value to applications and users.
  • BACKGROUND OF THE INVENTION
  • Image level backups used for disaster recovery present new challenges as compared to legacy file system level backups. In particular, the large size of the disk images that need to be backed up results in much longer backup times. Backups of large disk images also significantly increase backup file storage requirements.
  • As compared to file level backups, which are typically set to backup only required file system objects, image level backups save complete images of backed up disks. Thus, unlike file-level backups, conventional image level backups typically include unnecessary data blocks belonging to file system objects that are of no value to users, deleted file system objects, file system objects marked for deletion, unallocated space, and unused space. While currently available commercial backup solutions such as VEEAM™ Backup and Replication from Veeam Software International Ltd. are able to efficiently remove white spaces (e.g., by using compression and deduplication), other unneeded data blocks mentioned above are still processed as part of image-level backups. This slows down backup performance and requires additional backup storage space. Thus, there is a need for methods of excluding unnecessary data from image level backups.
  • Conventional methods for reducing the amount of data which needs to be retrieved from a source disk and stored in the backup include querying a specific part of file system's FAT (File Allocation Table) to identify disk blocks which contain deleted data. The identified data blocks are then skipped during backup activities. For example, in systems using the MICROSOFT™ Windows New Technology File System (NTFS), deleted data blocks can be identified by querying and parsing a Master File Table (MFT), which is a part of the NTFS FAT. Some currently available disaster recovery tools, such as vRanger and vReplicator from QUEST SOFTWARE™, implement this technique.
  • Conventional methods for optimizing image level backups have significant drawbacks. Some of these shortcomings are discussed below.
  • First, conventional backup optimization techniques do not provide significant benefits unless a disk being backed up has a significant number of blocks with deleted data (i.e., blocks marked as containing deleted data). However, many disks, such as disks used by newly-provisioned servers and computers with newly-installed applications and file system objects, do not have significant numbers of blocks with deleted data. In fact, using conventional techniques, additional processing is required to determine which disk blocks have deleted data. This additional processing may result in slow backup times.
  • Second, conventional backup optimization techniques provide little or no benefit during “incremental” backups, and may only be effective for “full” backups. Currently available technologies that facilitate efficient incremental backup, such as VMware Changed Block Tracking (CBT), allow backup solutions to determine data blocks in which content has changed since a previous backup, so that only those blocks are backed up during the incremental backup cycle. However, deleting data in file systems like NTFS does not actually change the blocks corresponding to deleted file system objects. Thus, these unchanged data blocks will not be picked up by the CBT for inclusion in an incremental backup without requiring some special processing. For example, ‘deleted’ NTFS file system objects like directories and files are merely marked for deletion in the MFT until the storage space is needed, at which point the corresponding blocks are filled with the new content.
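  • The effect described above can be shown with a toy model. The sketch below is not VMware's CBT interface and not an NTFS parser; the function `changed_blocks`, the four-block layout, and the text format of the metadata block are assumptions chosen only to show that marking an object as deleted alters a metadata block while leaving the object's data blocks unchanged.

```python
import hashlib


def changed_blocks(before, after):
    """Indexes of blocks whose content differs between two snapshots."""
    return [
        i for i, (old, new) in enumerate(zip(before, after))
        if hashlib.sha256(old).digest() != hashlib.sha256(new).digest()
    ]


# Block 0 plays the role of the metadata table; blocks 1-3 hold file data.
before = [b"a.txt:live;b.txt:live", b"AAAA", b"AAAA", b"BBBB"]

# "Deleting" a.txt only rewrites its metadata record; blocks 1-2 are untouched.
after = list(before)
after[0] = b"a.txt:deleted;b.txt:live"

print(changed_blocks(before, after))  # prints [0]
```

  • Because only the metadata block appears in the changed set, a technique that skips deleted data blocks has nothing to remove from the incremental pass in this model.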
  • Third, conventional backup techniques provide little benefit for incremental backups due to the nature of modern server workloads, which result in relatively little data being deleted, and primarily result in new data being added. This results in deleted data blocks being almost instantly reused by new data, leading to relatively few performance or storage benefits for incremental backups.
  • Fourth, conventional backup methods are not effective at optimizing backups of file systems which natively wipe deleted blocks (i.e., zero out) upon file system object deletion, such as the Linux third and fourth extended file systems (ext3 and ext4).
  • Finally, and perhaps most importantly, conventional image level backup techniques process and store significant amounts of data that are unnecessary in backup files. For example, conventional methods process and store the disk image data blocks corresponding to the contents of swap files, hibernation files, the contents of temporary (‘temp’) folders, recycling bin folders; and/or data such as Windows operating system (OS) system files which either do not need to be backed up at all, or can be easily restored from multiple other readily available sources. For example, certain OS file system objects, such as directories and files for a server or computer can be readily restored from other similar servers or computers with the same OS installed. Conventional image-level backup optimization methods fail to take this into account and as a result consume valuable time and storage space processing data blocks that correspond to contents of files of no value to users.
  • Therefore, there is a need for efficient techniques for optimizing image level backups which address the shortcomings of the image level backup optimization techniques described above.
  • SUMMARY OF THE INVENTION
  • Accordingly, what is needed are tools which enable backup operators and administrators to selectively reduce the amount of data that needs to be read from a source disk and stored in a corresponding image level backup. What is further needed are systems, methods, and computer program products for selective processing of objects (i.e., object-selective processing) within image level backups.
  • Embodiments of the present invention include methods, systems, and computer program products for efficient processing of image level backups. As would be understood by one skilled in the relevant art(s), the methods described herein with reference to image level backups can also be applied to other image level disaster recovery techniques, such as creating replicas via replication and simple copying of images.
  • According to an exemplary embodiment, a system for selective processing of file system objects for an image level backup is disclosed. The system comprises a backup engine which includes a receiving module, a connection module, a file allocation table processing module, and a block processing module. The receiving module is configured to receive backup parameters for the image level backup, wherein the backup parameters include a selection of a machine to back up, and a selection of at least one file system object to include in the image level backup. The connection module is configured to connect to production storage corresponding to the selected machine, wherein the connection module is further configured to obtain data from a source disk corresponding to the selected at least one file system object, and wherein the source disk is in the production storage. The file allocation table (FAT) processing module is configured to fetch FAT blocks from the source disk, search the fetched FAT blocks to determine a selected set of data blocks of the source disk, wherein the selected set of data blocks corresponds to the selected at least one file system object, and create a backup FAT from the fetched FAT blocks, wherein the backup FAT comprises only records corresponding to the selected at least one file system object. The block processing module is configured to read the determined selected set of data blocks, and save the backup FAT and the determined selected set of data blocks to a reconstructed disk image.
  • Embodiments of the invention achieve at least five key improvements over conventional image level backup optimization techniques.
  • First, in contrast to traditional solutions, embodiments of the present invention do not rely on determining and skipping processing of deleted data blocks. This improvement enables reduction of the amount of data to be backed up during image level backup even in cases when there are no blocks containing deleted data on the disk being backed up.
  • Secondly, embodiments of the invention achieve significant improvements in processing speed and reduction in the size of backups. Embodiments of the invention achieve backup performance improvements not just for full backups, but also for incremental and differential image block level backups that backup file system objects that have been changed or added since the last backup.
  • Thirdly, embodiments of the present invention also enable improvements for file systems that natively wipe (zero out) data blocks belonging to deleted file system objects, such as, but not limited to the Linux ext3 and ext4 file systems.
  • Fourthly, embodiments of the invention enable filtering out unimportant data blocks from processing and storing as a part of image level backups, such as data blocks occupied by swap files. This is because in accordance with embodiments of the invention, swap files, temporary (‘temp’) files, and other data blocks that are not important to users and applications are not backed up, even in cases when full backups are being processed. For example, embodiments of the present invention filter out paging and virtual memory files used by Windows server and workstation operating systems (OSs) and only backup data blocks used for applications and corresponding to application executables.
  • Finally, in contrast to traditional backup techniques that only optimize full backups, embodiments of the invention also optimize incremental and differential backups. This is important because embodiments of the present invention are compatible with commercially available disaster recovery tools, such as VEEAM™ Backup and Replication from Veeam Software International Ltd., that only require that a full backup be performed once, with subsequent backups being forever-incremental with only changed and new blocks being processed.
  • BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES
  • The accompanying drawings, which are incorporated herein and form a part of the specification, illustrate the present invention and, together with the description, further serve to explain the principles of the invention and to enable a person skilled in the relevant art to make and use the invention.
  • FIG. 1 illustrates a system architecture for object-selective backup processing, in accordance with an embodiment of the present invention.
  • FIG. 2 is a flowchart illustrating steps by which object-selective processing of image level backups is performed, in accordance with an embodiment of the present invention.
  • FIG. 3 illustrates an exemplary graphical user interface (GUI), wherein objects can be selected for image level backup processing, in accordance with an embodiment of the invention.
  • FIG. 4 depicts an example computer system in which an embodiment of the present invention may be implemented.
  • The present invention will now be described with reference to the accompanying drawings. In the drawings, generally, like reference numbers indicate identical or functionally similar elements. Additionally, generally, the left-most digit(s) of a reference number identifies the drawing in which the reference number first appears.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The following detailed description of the present invention refers to the accompanying drawings that illustrate exemplary embodiments consistent with this invention. Other embodiments are possible, and modifications can be made to the embodiments within the spirit and scope of the invention. Therefore, the detailed description is not meant to limit the invention. Rather, the scope of the invention is defined by the appended claims.
  • It would be apparent to one of skill in the art that the present invention, as described below, can be implemented in many different embodiments of software, hardware, firmware, non-transitory computer readable media having instructions stored thereon, and/or the entities illustrated in the figures. Any actual software code with the specialized control of hardware to implement the present invention is not limiting of the present invention. Thus, the operational behavior of the present invention will be described with the understanding that modifications and variations of the embodiments are possible, given the level of detail presented herein.
  • Unless specifically stated differently, a user, a backup operator, and an administrator are interchangeably used herein to identify a human user, a software agent, or a group of users and/or software agents. Besides a human user who may perform object-selective backups, a software application or agent may sometimes process image level backups. Accordingly, unless specifically stated, the terms “backup operator,” “administrator,” and “user” as used herein are not limited to a human being.
  • As used herein, in an embodiment, the term “server” encompasses computing devices that are designed to function as one or more of file servers, email servers, Domain Name System (DNS) servers, Domain Controller (DC) servers, application servers, database servers, web servers, firewall servers, other enterprise servers, and back end servers. A server may comprise one or more server machines. A server may be implemented as a collection of servers such as a server farm or server cluster. For example, web servers may be commercially available server machines with one or more central processing units (CPUs). Alternatively, these web servers may comprise multiple computing devices and/or computing functionality hosted on multiple server machines (i.e., a server farm).
  • The present invention relates to systems, methods, and computer program products for object-selective processing of image level backups.
  • Object-Selective Backup System Architecture
  • FIG. 1 depicts system architecture 100 for processing object-selective image level backups, in accordance with an embodiment of the invention. An operator console 110 includes a user interface (UI) 115 for backup operators and administrators. In an embodiment, the UI 115 may be displayed on computer display 430 shown in FIG. 4. UI 115 can be used to add and select individual file system objects to be included in, or excluded from, an image level backup. As used herein, an image level backup is a backup of the disk images of a physical or virtual machine (VM) corresponding to a server or computer. Because any physical machine can be backed up at the image level (for example, by leveraging an agent), the invention applies to image level backups of both virtual and physical machines.
  • As used herein, a “virtual machine” (VM) is a software implementation of a machine such as a server, computer, or other computing device that supports the execution of a complete operating system (OS) and executes application programs like a physical machine. A VM duplicates the functionality of a physical machine implemented in hardware and software. Software applications and the OS running on a VM are limited to the resources and abstractions provided by the VM. In an embodiment, virtual machines (VMs) are viewable within an overall virtual infrastructure. According to an embodiment of the invention, the backup file system objects selected to be backed up can be located in production storage 130, which includes one or more disks 140 which form parts of a production disk storage. As described in detail below, embodiments of the invention read data 135 to be backed up by either attaching an image of disk 140 to a backup engine 120 (in case of virtual machine), or by leveraging an agent inside each processed machine to get data from disk 140 (in case of physical or virtual machine). Herein, the phrase “source disk” is used to refer to storage in production storage 130 to be backed up, such as disk 140, which may be a disk of a physical machine or a disk image of a virtual machine.
  • UI 115 can also be used to remove a previously selected file system object from an image level backup to be processed. Operator console 110 can also be used to enter and configure other backup parameters 125 for an image level backup. For example, in the exemplary embodiment depicted in FIG. 3, UI 115 can be used to disable object-selective image level processing for a backup, to process all but a selected subset of file system objects in a backup, or to include (copy) only selected file system objects in an image level backup.
  • In the exemplary embodiments illustrated in FIGS. 1 and 3, operator console 110 includes a backup object selection interface 300 for selecting a machine's file system objects to back up for an image level backup of the machine. Selections of file system objects to include and exclude are received by backup engine 120 as backup parameters 125. According to an embodiment, the file system objects to be included may be programmatically determined based upon the file system objects selected to be excluded. After acquiring backup parameters 125, backup engine 120 connects to production storage 130 and initiates block level access to read data 135 from the corresponding disk 140.
  • In accordance with an embodiment of the invention, backup engine 120 is an application comprising modules configured to process an object-selective image level backup. In the exemplary embodiment depicted in FIG. 1, backup engine 120 is configured to receive backup parameters 125 from operator console 110. In an embodiment, the received backup parameters 125 are acquired by a receiving module (not shown). Backup engine 120 comprises a module configured to read data 135 from production storage 130 in order to retrieve and parse file allocation table (FAT) 150 of disk 140, which in turn comprises part of production disk storage. In one embodiment, FAT 150 data can be retrieved directly from storage, by reading the disk data blocks corresponding to the FAT data location. In another embodiment, the FAT data can be retrieved by an agent (not shown) installed in the processed virtual machine or physical computer. Backup engine 120 further comprises a module configured to create a reconstructed disk image 170 comprising a modified backup FAT 160. Backup engine 120 also includes a module configured to write an image level backup 175 to backup file storage 180 corresponding to reconstructed disk image 170. Additional functionalities and features of backup engine 120 are discussed below with continued reference to FIG. 1.
  • As illustrated in FIG. 1, production storage 130 can comprise one or more disks (or disk images, in the case of virtual machines) 140 corresponding to each disk used by the production disk storage of a machine being backed up. Operator console 110 can be used to select file system objects such as, but not limited to, directories, applications, data files, log files, and other file system objects associated with a machine's applications.
  • As used herein, “disk image” refers to logical storage that has been abstracted and separated from physical storage, such as network-attached storage (NAS), file servers, disks, and other physical storage devices. In an embodiment, a disk image is implemented via virtual storage logic and is viewable within a virtual infrastructure as a storage device containing one or more virtual disks, which are separated from physical storage disks.
  • In an embodiment, backup engine 120 is an application that functions as a backup agent. According to an embodiment, backup engine 120 is configured to retrieve disk blocks that store file systems' file allocation table (FAT) 150. As used herein, FAT refers to a file allocation table used in a variety of file system architectures for various Operating Systems (OSs) and is not limited to a FAT file system used in MICROSOFT™ Windows. According to an embodiment of the invention, the contents of FAT 150 are parsed to determine the location (on the disk 140) of blocks of file system objects selected for inclusion in the image level backup, as specified by a backup operator using operator console 110. In this way, only the blocks of disk 140 corresponding to selected file system objects need to be read from disk or disk image 140.
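  • As a rough illustration of how parsed FAT contents could yield the set of blocks to read, consider the sketch below. It assumes a simplified table in which each record lists an object's path and block numbers; the `FatRecord` type, the `selected_blocks` function, the slash-separated paths, and the sample block numbers are hypothetical, and real structures such as the NTFS MFT are considerably more involved.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FatRecord:
    """Hypothetical simplified record: one file system object and its blocks."""
    path: str
    blocks: tuple


def selected_blocks(fat_records, selected_paths):
    """Collect the disk block numbers that belong to the selected objects."""
    prefixes = [p.rstrip("/") for p in selected_paths]
    wanted = set()
    for record in fat_records:
        for prefix in prefixes:
            # An object is selected if it is the entry itself or lies under it.
            if record.path == prefix or record.path.startswith(prefix + "/"):
                wanted.update(record.blocks)
                break
    return sorted(wanted)


fat = [
    FatRecord("/share", (5,)),
    FatRecord("/share/home/a.doc", (10, 11, 12)),
    FatRecord("/share/home/b.doc", (20,)),
    FatRecord("/pagefile.sys", (100, 101, 102, 103)),
]
to_read = selected_blocks(fat, ["/share/home"])
```

  • Only the blocks returned in `to_read` would then need to be fetched from disk 140; the blocks of the paging file are never requested.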
  • In accordance with an embodiment of the invention, a copy of the contents of FAT 150 is made as backup FAT 160, which is optionally modified. The optional modification may include removing references to file system objects that have been excluded from backup per selections made in operator console 110. In another implementation, FAT 160 remains as an unmodified copy of FAT 150. In this case, certain unimportant files, such as temporary files, virtual memory files (i.e., pagefile.sys and other paging files) and hibernation files (i.e., hyberfil.sys), will still be represented in the file system of restored backup 175, but will have empty content (zeroed out data blocks). Unlike conventional techniques, embodiments of the present invention do not look up or process deleted data information.
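  • The two variants of backup FAT 160 described above can be contrasted in a short sketch. The dictionary-based table, the functions `make_backup_fat` and `blocks_to_zero`, and the sample paths and block numbers are assumptions for illustration and do not reflect any on-disk format.

```python
import copy


def make_backup_fat(fat, excluded_paths, modify=True):
    """Build a backup FAT from a parsed copy of the source FAT.

    fat: dict mapping object path -> list of block numbers (toy model).
    modify=True drops records of excluded objects; modify=False keeps every
    record, so excluded objects remain listed but their blocks read as zeros.
    """
    backup_fat = copy.deepcopy(fat)  # never mutate the source FAT
    if modify:
        for path in excluded_paths:
            backup_fat.pop(path, None)
    return backup_fat


def blocks_to_zero(fat, excluded_paths):
    """Blocks whose contents are replaced by zeros in either variant."""
    zeroed = set()
    for path in excluded_paths:
        zeroed.update(fat.get(path, []))
    return zeroed


source_fat = {
    "c:\\pagefile.sys": [100, 101],
    "c:\\hyberfil.sys": [200],
    "d:\\data\\report.doc": [10, 11],
}
excluded = ["c:\\pagefile.sys", "c:\\hyberfil.sys"]

modified = make_backup_fat(source_fat, excluded, modify=True)
unmodified = make_backup_fat(source_fat, excluded, modify=False)
```

  • In the modified variant the excluded files no longer appear in the restored file system; in the unmodified variant they still appear, with the same block lists, but the blocks they reference contain zeros.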
  • Backup engine 120 is configured to effectively reconstruct a modified, reconstructed disk image 170 on the fly, while simultaneously compressing and saving backup data 175 to backup file storage 180. In an alternative embodiment, backup engine 120 replicates the reconstructed disk image 170 to a replica VM (not shown). For example, reconstructed disk image 170 can be replicated to remote file storage. Backup engine 120 can also copy reconstructed disk image 170 to another local or remote storage device. For example, in cases where backup 175 will be used to perform a restoration onto a replica of a VM, such as a standby VM or failover VM, reconstructed disk image 170 is replicated to the backup storage accessible from a hypervisor host running a replica VM.
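  • By way of a non-limiting illustration, compressing a reconstructed image on the fly, as its blocks are produced, might be sketched as follows in Python. The function name stream_compress, the use of the zlib library, and the block iterator are assumptions made for this sketch and are not taken from the embodiments.

```python
import zlib


def stream_compress(blocks):
    """Yield compressed chunks while blocks are still being produced.

    `blocks` is any iterable of bytes objects (hypothetical stand-in for
    the blocks of a reconstructed disk image).
    """
    compressor = zlib.compressobj()
    for block in blocks:
        chunk = compressor.compress(block)
        if chunk:  # the compressor may buffer input and emit nothing yet
            yield chunk
    # Emit whatever the compressor still holds once the last block is seen.
    yield compressor.flush()


# Example: two data blocks followed by two zeroed blocks.
example_blocks = [b"A" * 512, b"B" * 512, bytes(512), bytes(512)]
compressed = b"".join(stream_compress(iter(example_blocks)))
```

Because the input is consumed as an iterator, the full image never has to exist in memory or on disk before compression starts, which is one way to read "on the fly" in the paragraph above.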
  • According to an embodiment, reconstructed disk image 170 is created by using modified data blocks corresponding to backup FAT 160, and then retrieving and applying only those image blocks of disk 140 that correspond to the file system objects selected for backup in UI 115. Instead of including all sequential blocks of disk image 140, reconstructed disk image 170 skips blocks corresponding to file system objects that were selected for exclusion based on settings provided in UI 115 of operator console 110. According to an embodiment, exclusions can be pre-configured. For example, it may be pre-configured that files such as paging and virtual memory files (e.g., swap files) are always excluded from the backup.
  • In accordance with an embodiment, as reconstructed disk image 170 is being created using backup FAT 160, it is simultaneously compressed and stored in backup file storage 180 as backup data 175. In an embodiment, disk image data blocks containing data that is to be excluded from processing are substituted by zeroed data blocks in reconstructed disk image 170. Thus, zeroed data blocks are written to reconstructed disk image 170 instead of actual data blocks belonging to objects selected for exclusion in UI 115. In this way, the storage space needed in backup file storage 180 to store reconstructed disk image 170 is reduced in cases when data is compressed and/or deduplicated before saving it to a backup file.
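  • The space saving from zeroed blocks can be illustrated with a small, hypothetical experiment. In the Python sketch below, the block size, the block count, the random test data, and the choice of zlib are all assumptions made for illustration only.

```python
import os
import zlib

BLOCK_SIZE = 4096  # assumed block size; the description does not fix one


def compressed_size(blocks):
    """Return the zlib-compressed size, in bytes, of the concatenated blocks."""
    return len(zlib.compress(b"".join(blocks)))


# Hypothetical source image: 64 blocks of incompressible (random) data.
original = [os.urandom(BLOCK_SIZE) for _ in range(64)]

# Suppose blocks 32..63 belong to excluded objects: substitute zeroed blocks.
excluded_blocks = set(range(32, 64))
reconstructed = [
    bytes(BLOCK_SIZE) if i in excluded_blocks else block
    for i, block in enumerate(original)
]

size_original = compressed_size(original)
size_reconstructed = compressed_size(reconstructed)
```

The reconstructed image has the same uncompressed length as the original, so block positions are preserved, while its compressed form is smaller because long runs of zero bytes compress to almost nothing.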
  • In an embodiment of the invention, backup data 175 can be made available to data consuming processes as a local volume so that the reconstructed disk image 170 can be later used for additional processing, verification, and/or restoration of the backed up file system objects. In alternative embodiments, backup file storage 180 is made available to data consuming processes as remote storage via public or proprietary storage access protocols such as, but not limited to, the Network File System (NFS), Common Internet File System (CIFS), and Internet Small Computer System Interface (iSCSI). Examples of additional processing include mounting reconstructed disk image 170 to a server as a volume, creating, updating or deleting some file system objects using native OS and third party tools, and committing the changes to backup data 175. Example methods for restoring file system objects and items from an image level backup are described in U.S. patent application Ser. No. 12/901,233, filed on Oct. 8, 2010 entitled “Item-Level Restoration from Image Level Backups,” which incorporates by reference and claims priority to U.S. Patent Provisional Application No. 61/250,586, filed on Oct. 12, 2009 entitled “Item-Level Restoration from Image Level Backup.” U.S. patent application Ser. No. 12/901,233 and U.S. Patent Provisional Application No. 61/250,586 are incorporated by reference herein in their entireties. Example methods for displaying and verifying file system objects from an image level backup without fully extracting, decompressing, or decrypting the image level backup are described in U.S. patent application Ser. No. 12/901,233, which incorporates by reference and claims priority to U.S. Provisional Patent Application No. 61/302,743, filed on Feb. 9, 2010 and entitled “Systems, Methods, and Computer Program Products for Verification of Image Level Backups,” which are incorporated herein by reference in their entireties.
  • Example methods for recovering file system objects from an image level backup without requiring the restoration process to be executed on a computer running an operating system (OS) that supports the virtual disk file system type backed up in the image level backup are described in U.S. patent application Ser. No. 13/021,312 filed on Feb. 4, 2011 and entitled “Cross-Platform Object Level Restoration From Image Level Backups,” which incorporates by reference and claims priority to U.S. Provisional Patent Application No. 61/302,877, filed on Feb. 9, 2010, and entitled “Cross-Platform Object Level Restoration From Image Level Backups.” U.S. patent application Ser. No. 13/021,312 and U.S. Provisional Patent Application No. 61/302,877 are both incorporated herein by reference in their entireties.
  • Object-Selective Image Level Backup Methods
  • FIG. 2 is a flowchart 200 illustrating steps by which a method is used to process object-selective image level backups, in accordance with an embodiment of the present invention.
  • More particularly, flowchart 200 illustrates the steps by which file system object-selective image level backups are performed using a reconstructed disk image, such as reconstructed disk image 170, according to an embodiment of the present invention. FIG. 2 is described with continued reference to the embodiment illustrated in FIG. 1. However, FIG. 2 is not limited to that embodiment. Note that the steps in the flowchart do not necessarily have to occur in the order shown.
  • As would be understood by one of skill in the relevant art(s), the steps of flowchart 200 described below may be accomplished via execution of computer executable instructions that, in response to execution by a computing device, perform an algorithm for creating an object-selective image level backup.
  • The method begins at step 210. In an embodiment, a backup application is started in step 210. For example, backup engine 120 and backup operator console 110 may be started in this step. After the backup application is started, the method proceeds to step 220.
  • In step 220, object-selective backup parameters 125 are received. Backup parameters 125 may include one or more physical or virtual machines (VMs) to back up, and a list of file system objects to either include in or exclude from an image level backup. The file system objects may include directories and files, specified individually or using file name masks. In an embodiment, if a directory is selected to be included in an image level backup, all data files in the directory and subdirectories below the selected directory are automatically selected for inclusion in the image level backup. In another embodiment of the invention, if a file system object such as a directory is selected to be excluded from an image level backup, all dependent file system objects, such as files within the excluded directory and all of its subdirectories, will not be processed in the image level backup. According to an embodiment, the list of data items to be included in the backup may be programmatically determined based upon the one or more data items selected by the user to be excluded from the backup. For example, it may programmatically be determined that all files or a predetermined subset of files, except for the user selected one or more files to be excluded, are enumerated and included in the list of file system objects to be backed up. According to an embodiment, backup parameters 125 are received via user input in UI 115 within operator console 110. After receiving backup parameters 125, the method proceeds to step 230.
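  • One possible way to resolve include and exclude selections, including file name masks and directory recursion, is sketched below in Python. The helper names (is_under, matches, select_objects), the case-insensitive matching, and the rule that an empty include list means "everything except the exclusions" are assumptions of this sketch rather than requirements of any embodiment.

```python
import fnmatch
from pathlib import PureWindowsPath


def is_under(path, directory):
    """True if `path` is `directory` itself or lies anywhere below it."""
    p, d = PureWindowsPath(path), PureWindowsPath(directory)
    return p == d or d in p.parents


def matches(path, rule):
    """A rule is either a file/directory path or a file name mask."""
    return is_under(path, rule) or fnmatch.fnmatchcase(path.lower(), rule.lower())


def select_objects(all_paths, include=(), exclude=()):
    """Return the paths to back up.

    With no include rules, every path is a candidate, so the inclusion
    list is derived programmatically from the exclusions.
    """
    selected = []
    for path in all_paths:
        included = not include or any(matches(path, r) for r in include)
        excluded = any(matches(path, r) for r in exclude)
        if included and not excluded:
            selected.append(path)
    return selected


# Hypothetical file system listing.
paths = [
    r"c:\pagefile.sys",
    r"c:\Temp\a.tmp",
    r"d:\Share\Home Folders\alice\notes.txt",
    r"d:\Share\Other\b.doc",
]
by_include = select_objects(paths, include=[r"d:\Share\Home Folders"])
by_exclude = select_objects(paths, exclude=[r"c:\pagefile.sys", "*.tmp"])
```

Selecting a directory pulls in everything beneath it, and excluding a directory or mask removes every dependent object, mirroring the two embodiments described in step 220.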
  • In step 230, backup engine 120 connects to production storage 130 used by the computer selected to be backed up in step 220. As discussed above with reference to FIG. 1, production storage 130 comprises one or more disks (or disk images) 140 of the machine to be backed up.
  • In step 240, backup engine 120 attaches to the required disk 140. In this step, block level read access is initialized in order to be able to retrieve and process the data blocks of objects selected in step 220. In case of backup processing for a physical machine, according to an embodiment, an agent inside the processed physical machine can be leveraged to provide backup engine 120 with the processed disk's data. After backup engine 120 is attached to production storage 130 and disk(s) 140 containing the selected file system objects, the method proceeds to step 250.
  • In step 250, backup engine 120 fetches content of disk blocks containing FAT 150, and parses the contents of FAT 150 to determine the locations of data blocks of all file system objects selected for backup in step 220. After backup engine 120 parses FAT 150, the method proceeds to step 260.
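  • As a non-limiting illustration of step 250, the Python sketch below assumes a simple linked-chain allocation table, in which each table entry names the next block of the same object, together with a directory that records the first block of each object. This layout, the END_OF_CHAIN marker, and the function names are hypothetical; as noted above, FAT 150 may take other forms in other file systems.

```python
END_OF_CHAIN = -1  # hypothetical marker for the last block of an object


def follow_chain(table, first_block):
    """Return the ordered block numbers of one object."""
    blocks, seen = [], set()
    current = first_block
    while current != END_OF_CHAIN:
        if current in seen:
            raise ValueError("allocation table contains a cycle")
        seen.add(current)
        blocks.append(current)
        current = table[current]  # entry points at the next block
    return blocks


def locate_selected_blocks(table, directory, selected):
    """Map each selected object to the blocks that hold its data."""
    return {path: follow_chain(table, directory[path]) for path in selected}


# Hypothetical parsed table: a.txt occupies blocks 2 -> 3 -> 7.
table = {2: 3, 3: 7, 7: END_OF_CHAIN, 4: 5, 5: END_OF_CHAIN}
directory = {"a.txt": 2, "pagefile.sys": 4}
locations = locate_selected_blocks(table, directory, ["a.txt"])
```

Only the allocation table and directory data are consulted here; no data block of any object has to be read in order to learn where the selected objects reside.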
  • In step 260, backup engine 120 makes a copy of FAT 150 data blocks into backup FAT 160, and saves FAT 160 data blocks as part of reconstructed disk image 170. Step 260 includes optionally modifying backup FAT 160 data records to remove pointers to any file system objects not selected for backup in step 220, and saving data blocks representing FAT 160 to reconstructed disk image 170 after modification is completed.
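  • The copy-and-optionally-modify behavior of step 260 might be sketched as follows in Python. The representation of the table as a mapping from object name to block list, and the function name make_backup_fat, are assumptions of this sketch.

```python
import copy


def make_backup_fat(fat, selected, modify=True):
    """Copy the source table; optionally drop records of unselected objects.

    `fat` is a hypothetical {object_name: [block numbers]} mapping.
    The source table itself is never changed.
    """
    backup = copy.deepcopy(fat)
    if modify:
        for name in list(backup):
            if name not in selected:
                del backup[name]  # remove the pointer to an excluded object
    return backup


fat_150 = {"a.txt": [2, 3, 7], "pagefile.sys": [4, 5]}
fat_160_modified = make_backup_fat(fat_150, {"a.txt"})
fat_160_unmodified = make_backup_fat(fat_150, {"a.txt"}, modify=False)
```

With modify=False the backup table still lists the excluded objects, which corresponds to the implementation in which such files remain visible after restore but have zeroed content.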
  • In step 265, a disk block counter is examined and a determination is made as to whether the previously processed disk block was the last block of disk 140 (i.e., if the end of disk 140 has been reached). Step 265 is performed by comparing the current block number to the total number of blocks in disk 140. If it is determined that the last block has not been processed, control is passed to step 270. If it is determined that the last block has been processed, the processing of disk 140 completes and control is passed to step 290.
  • As shown in FIG. 2, steps 265-285 are repeated as a loop or cycle until each block of disk 140 has been sequentially processed. In accordance with an embodiment, the first time step 265 is performed, a number, N, of blocks in disk 140 is determined, and steps 265-285 are repeated for N cycles sequentially to process all blocks of disk 140. When step 265 is subsequently repeated after the initial performance, the current block number (corresponding to the current cycle step) in disk 140 is compared to N to determine if the last block has been reached. During processing of the block in steps 270-280, the current block is looked up in FAT 150 or FAT 160 to see if it belongs to a file system object that needs to be processed. According to an embodiment, FAT 150 or 160 contents can be cached in memory (RAM) for better lookup performance.
  • In accordance with an embodiment of the invention, processing of a block corresponding to a selected file system object is performed by completing steps 270-285, which are described below.
  • In step 270, backup engine 120 looks up the current data block in FAT 150 or 160. In this step, the current block from step 265 is looked up in FAT 150 or 160 to obtain information on what file system object this block belongs to in disk 140. After the block is looked up, the method proceeds to step 275.
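  • The lookup of step 270 can be made fast by inverting the parsed table once and keeping the result in memory. The Python sketch below assumes the table has already been parsed into a mapping from object name to block list; that representation and the name build_block_owner_index are hypothetical.

```python
def build_block_owner_index(fat):
    """Invert {object: [blocks]} into {block: object} for per-block lookup."""
    index = {}
    for name, blocks in fat.items():
        for block_no in blocks:
            index[block_no] = name
    return index


# Hypothetical parsed table, cached in RAM as a dictionary.
owner_of = build_block_owner_index(
    {"a.txt": [2, 3, 7], "pagefile.sys": [4, 5]}
)
```

Each per-block lookup is then a single dictionary access, and a block that belongs to no listed object simply has no entry.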
  • In step 275, a determination is made as to whether the block contents looked up in step 270 form part of a file system object selected to be backed up in step 220. According to an embodiment, this step is performed by deciding if the block contents looked up in step 270 correspond to a file selected for the backup by correlating the actual block location (address) to FAT 150 or FAT 160 records, and determining whether it belongs to a file selected to be backed up in step 220. In another embodiment, step 275 is performed by determining if the block contents do not correspond to a file system object selected to be excluded from the backup in step 220. If it is determined that the block contents correspond to a file system object selected for the backup, the method proceeds to step 278. If it is determined that the block contents do not correspond to a file system object selected to be backed up, the method proceeds to step 285.
  • In step 278, the block contents corresponding to the read block are retrieved from disk 140 within production storage 130. After the block contents are read from production storage 130, the method proceeds to step 280.
  • In step 280, the block contents retrieved in step 278 are saved to reconstructed disk image 170. The block contents are saved to the same position in reconstructed disk image 170 as the position of that content in disk or disk image 140 of production disk storage 130. By repeating steps 265-280, block contents for files that were selected to be processed in step 220 are fetched from disk 140 and saved to reconstructed disk image 170. After the block contents are saved to reconstructed disk image 170, control is passed back to step 265 so that the next block in the disk can be processed.
  • In step 285, backup engine 120 writes a zeroed block to backup data 175 if the block was determined in step 275 to not correspond to a file system object selected for processing. According to an embodiment, this step is performed by saving a zeroed data block in reconstructed disk image 170 instead of saving block contents from disk image 140 that correspond to a file system object not selected to be backed up in step 220. Step 285 reduces the amount of time needed to process backup 175 by not fetching block contents from disk 140 that do not need to be processed. Instead, step 285 saves zeroed blocks to reconstructed disk image 170. In an embodiment, step 285 also writes zeroed blocks for file system objects which will not be restored from the backup, such as, but not limited to, temporary files, virtual memory files, and hibernation files. In this way, the method saves storage space used to subsequently store backup 175 in backup file storage 180 without sacrificing the usefulness of backup 175. After the zeroed data block has been written to reconstructed disk image 170, control is passed back to step 265 so that the next block in the disk can be processed. In one embodiment, such as when the backup solution features built-in block level deduplication and/or compression, a zeroed data block may not actually be written to backup data 175, and instead a pointer to a previously stored block is written.
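  • The deduplication variant mentioned at the end of step 285 can be illustrated with a minimal content-addressed store. In the Python sketch below, the class name DedupStore, the use of SHA-256 digests as pointers, and the in-memory dictionaries are assumptions for illustration only; they do not describe any particular backup product.

```python
import hashlib


class DedupStore:
    """Store each distinct block once; record one pointer per image position."""

    def __init__(self):
        self.blocks = {}    # digest -> block contents (stored once)
        self.pointers = []  # image position -> digest

    def append(self, data):
        digest = hashlib.sha256(data).hexdigest()
        if digest not in self.blocks:
            self.blocks[digest] = data  # first occurrence: store contents
        self.pointers.append(digest)    # every occurrence: store a pointer

    def read(self, position):
        return self.blocks[self.pointers[position]]


BLOCK_SIZE = 512  # assumed block size
store = DedupStore()
for block in [b"D" * BLOCK_SIZE, bytes(BLOCK_SIZE), bytes(BLOCK_SIZE), bytes(BLOCK_SIZE)]:
    store.append(block)
```

After the first zeroed block has been stored, every later zeroed block costs only a pointer, while reads at any position still return a full block of zeros.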
  • In step 290, backup engine 120 is shut down and the process ends. Step 290 is performed after it has been determined in step 265 that the last block of disk 140 has been reached.
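  • Taken together, the loop of steps 265-285 might be sketched as follows in Python. The in-memory source disk, the block size, the block_owner mapping, and the always_copy parameter (a hypothetical way to carry over blocks such as those holding the backup FAT) are assumptions of this sketch, not details of any embodiment.

```python
import io

BLOCK_SIZE = 512  # assumed block size


def reconstruct_image(source, total_blocks, block_owner, selected,
                      always_copy=frozenset()):
    """Return (image_bytes, number_of_blocks_read_from_source).

    source       seekable binary stream standing in for the source disk
    block_owner  {block number: object name}, e.g. built from the FAT
    selected     set of object names chosen for backup
    """
    image = io.BytesIO()
    reads = 0
    for block_no in range(total_blocks):              # step 265: until last block
        owner = block_owner.get(block_no)             # step 270: look up the block
        if block_no in always_copy or owner in selected:  # step 275
            source.seek(block_no * BLOCK_SIZE)        # step 278: read from source
            data = source.read(BLOCK_SIZE)
            reads += 1
        else:
            data = bytes(BLOCK_SIZE)                  # step 285: zeroed block, no read
        image.write(data)                             # step 280: same position as source
    return image.getvalue(), reads


# Hypothetical 6-block disk in which block i is filled with byte value i + 1.
disk = io.BytesIO(b"".join(bytes([i + 1]) * BLOCK_SIZE for i in range(6)))
owners = {1: "a.txt", 2: "a.txt", 3: "pagefile.sys", 4: "pagefile.sys"}
image, reads = reconstruct_image(disk, 6, owners, {"a.txt"}, always_copy={0})
```

Because blocks are emitted sequentially and every block is exactly BLOCK_SIZE bytes, each block lands at the same offset in the output as in the source, and the source is read only for blocks that are actually kept.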
  • Example Selective Processing User Interface
  • FIG. 3 illustrates a graphical user interface (GUI), according to an embodiment of the present invention. The GUI depicted in FIG. 3 is described with reference to the embodiments of FIGS. 1 and 2. However, the GUI is not limited to those example embodiments. For example, the GUI may be the UI 115 within operator console 110 used to select object-selective backup parameters 125, as described in step 220 above with reference to FIG. 2.
  • Although in the exemplary embodiment depicted in FIG. 3 the GUI is shown as an interface running on a computer terminal, it is understood that the GUI can be readily adapted to execute on a display of other platforms such as mobile device platforms running various operating systems, or another display of a computing device. For example, in an embodiment of the invention, the GUI illustrated in FIG. 3 can be displayed on a mobile device having an input device and a display.
  • FIG. 3 illustrates an exemplary backup object selection interface 300, wherein one or more file system objects from production storage 130 of a physical or virtual machine to be backed up can be displayed and selected by a backup operator. As described below and illustrated in FIG. 3, backup object selection interface 300 can be used to select file system objects for either inclusion in or exclusion from backup data 175.
  • According to an embodiment, by clicking, using an input device (not shown), include button 306, a backup operator can browse a list of displayed file system objects from the selected machine's production storage 130. In an embodiment, a backup operator, using an input device (not shown), selects Add button 308 to select one or more of the displayed file system objects to be included in backup 175. For example, through moving a pointer or cursor within file system objects displayed as a result of clicking include button 306 and subsequently selecting Add button 308, a backup operator selects one or more file system objects to be processed from production storage 130 and included in reconstructed disk image 170. According to an embodiment of the present invention, a backup operator can select one or more file system objects (e.g., “d:\Share\Home Folders” in the exemplary embodiment of FIG. 3) by either typing in the object name(s) or browsing to the location of the file system object(s) within production storage 130. A backup operator can remove previously added file system objects from a backup by clicking on Remove button 310.
  • One or more file system objects can be selected for inclusion in backup 175 by clicking on the file system objects displayed within backup object selection interface 300 and clicking Add button 308. Once the backup operator has finished selecting file system objects, backup parameters are saved by clicking on OK button 312. According to an embodiment, once the backup operator clicks OK button 312, backup parameters 125 are saved as VM processing settings to be used by backup engine 120. The current file system object selections can be canceled by clicking on Cancel button 314.
  • In an embodiment, file system objects to be excluded from backup 175 can be selected by clicking on exclude button 304. By clicking, using an input device (not shown), exclude button 304, a backup operator can browse a list of displayed file system objects from the selected machine's production storage 130. Add button 308 allows a backup operator to add one or more file system objects or environment variables (e.g., “c:\pagefile.sys,” “c:\hyberfil.sys,” and “%TEMP%” in the exemplary embodiment of FIG. 3) to a list of file system objects to be excluded from backup 175. In an embodiment, a backup operator, using an input device (not shown), selects Add button 308 to select one or more of the displayed file system objects to be excluded from backup 175. For example, through moving a pointer or cursor within file system objects displayed as a result of clicking exclude button 304 and subsequently selecting Add button 308, a backup operator selects one or more file system objects that will not be read from production storage 130 and will be excluded from reconstructed disk image 170. A backup operator can remove previously added file system objects from the backup exclusion list by clicking on Remove button 310.
  • According to an embodiment, disable button 302 can be selected if the backup operator does not wish to select individual file system objects to be included in or excluded from backup 175.
  • As described above with reference to FIGS. 1 and 2, an object-selective image level backup is subsequently performed based upon backup parameters 125 selected and saved in backup object selection interface 300. In an embodiment, the display may be a computer display 430 shown in FIG. 4, and backup object selection interface 300 may be display interface 402. According to embodiments of the present invention, the input device can be, but is not limited to, for example, a touch screen, a keyboard, a pointing device, a track ball, a touch pad, a joy stick, a voice activated control system, or other input devices used to provide interaction between a backup operator and backup object selection interface 300.
  • Example Computer System Implementation
  • Various aspects of the present invention can be implemented by software, firmware, hardware, or a combination thereof. FIG. 4 illustrates an example computer system 400 in which the present invention, or portions thereof, can be implemented as computer-readable code. For example, the methods illustrated by the flowchart 200 of FIG. 2 can be implemented in system 400. Object-selective backup processing architecture 100 of FIG. 1 can also be implemented in system 400. Various embodiments of the invention are described in terms of this example computer system 400. After reading this description, it will become apparent to a person skilled in the relevant art how to implement the invention using other computer systems and/or computer architectures.
  • Computer system 400 includes one or more processors, such as processor 404. Processor 404 can be a special purpose or a general-purpose processor. Processor 404 is connected to a communication infrastructure 406 (for example, a bus, or network).
  • Computer system 400 also includes a main memory 408, preferably random access memory (RAM), and may also include a secondary memory 410. Secondary memory 410 may include, for example, a hard disk drive 412, a removable storage drive 414, flash memory, a memory stick, and/or any similar non-volatile storage mechanism. Removable storage drive 414 may comprise a floppy disk drive, a magnetic tape drive, an optical disk drive, a flash memory, or the like. The removable storage drive 414 reads from and/or writes to a removable storage unit 418 in a well-known manner. Removable storage unit 418 may comprise a floppy disk, magnetic tape, optical disk, etc. which is read by and written to by removable storage drive 414. As will be appreciated by persons skilled in the relevant art(s), removable storage unit 418 includes a non-transitory computer usable storage medium having stored therein computer software and/or data.
  • In alternative implementations, secondary memory 410 may include other similar means for allowing computer programs or other instructions to be loaded into computer system 400. Such means may include, for example, a removable storage unit 422 and an interface 420. Examples of such means may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units 422 and interfaces 420 which allow software and data to be transferred from the removable storage unit 422 to computer system 400.
  • Computer system 400 may also include a communications interface 424. Communications interface 424 allows software and data to be transferred between computer system 400 and external devices. Communications interface 424 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, or the like.
  • Computer system 400 may additionally include computer display 430. According to an embodiment, computer display 430, in conjunction with display interface 402, can be used to display UI 115 on operator console 110. Computer display 430 may also be used to display backup object selection interface 300 depicted in FIG. 3.
  • In this document, the terms “computer program medium,” “non-transitory computer readable medium,” and “computer usable medium” are used to generally refer to media such as removable storage unit 418, removable storage unit 422, and a hard disk installed in hard disk drive 412. Computer program medium, computer readable storage medium, and computer usable medium can also refer to memories, such as main memory 408 and secondary memory 410, which can be memory semiconductors (e.g. DRAMs, etc.). These computer program products are means for providing software to computer system 400.
  • Computer programs (also called computer control logic) are stored in main memory 408 and/or secondary memory 410. Computer programs may also be received via communications interface 424. Such computer programs, when executed, enable computer system 400 to implement the present invention as discussed herein. In particular, the computer programs, when executed, enable processor 404 to implement the processes of the present invention, such as the steps in the methods illustrated by flowchart 200 of FIG. 2 and system architecture 100 of FIG. 1 discussed above. Accordingly, such computer programs represent controllers of the computer system 400. Where the invention is implemented using software, the software may be stored in a computer program product and loaded into computer system 400 using removable storage drive 414, interface 420, hard drive 412, or communications interface 424.
  • The invention is also directed to computer program products comprising software stored on any computer useable medium. Such software, when executed in one or more data processing devices, causes the data processing device(s) to operate as described herein. Embodiments of the invention employ any computer useable or readable medium, known now or in the future. Examples of computer useable mediums include, but are not limited to, primary storage devices (e.g., any type of random access memory), secondary storage devices (e.g., hard drives, floppy disks, CD ROMS, ZIP disks, tapes, magnetic storage devices, optical storage devices, MEMS, nanotechnological storage devices, etc.), and communication mediums (e.g., wired and wireless communications networks, local area networks, wide area networks, intranets, etc.).
  • CONCLUSION
  • While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example only, and not limitation. It will be understood by those skilled in the relevant art(s) that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined in the appended claims. It should be understood that the invention is not limited to these examples. The invention is applicable to any elements operating as described herein. Accordingly, the breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.

Claims (1)

What is claimed is:
1. A system for selective processing of file system objects for an image level backup, comprising:
a backup engine including:
a receiving module configured to receive backup parameters for the image level backup, wherein the backup parameters include a selection of a machine to backup, and a selection of at least one file system object to include in the image level backup;
a connection module configured to connect to production storage corresponding to the selected machine, wherein the connection module is further configured to obtain data from a source disk corresponding to the selected at least one file system object, and wherein the source disk is in the production storage;
a file allocation table (FAT) processing module configured to:
fetch FAT blocks from the source disk;
search the fetched FAT blocks to determine a selected set of data blocks of the source disk, wherein the selected set of data blocks corresponds to the selected at least one file system object; and
create a backup FAT from the fetched FAT blocks, wherein the backup FAT comprises only records corresponding to the selected at least one file system object; and
a block processing module configured to:
read the determined selected set of data blocks; and
save the backup FAT and the determined selected set of data blocks to a reconstructed disk image.
US15/359,128 2010-06-14 2016-11-22 Selective processing of file system objects for image level backups Abandoned US20170075766A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US15/359,128 US20170075766A1 (en) 2010-06-14 2016-11-22 Selective processing of file system objects for image level backups
US16/197,644 US11068349B2 (en) 2010-06-14 2018-11-21 Selective processing of file system objects for image level backups
US17/380,523 US11789823B2 (en) 2010-06-14 2021-07-20 Selective processing of file system objects for image level backups

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US35452910P 2010-06-14 2010-06-14
US13/159,229 US9507670B2 (en) 2010-06-14 2011-06-13 Selective processing of file system objects for image level backups
US15/359,128 US20170075766A1 (en) 2010-06-14 2016-11-22 Selective processing of file system objects for image level backups

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/159,229 Continuation US9507670B2 (en) 2010-06-14 2011-06-13 Selective processing of file system objects for image level backups

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/197,644 Continuation US11068349B2 (en) 2010-06-14 2018-11-21 Selective processing of file system objects for image level backups

Publications (1)

Publication Number Publication Date
US20170075766A1 true US20170075766A1 (en) 2017-03-16

Family

ID=45097180

Family Applications (4)

Application Number Title Priority Date Filing Date
US13/159,229 Active 2032-05-03 US9507670B2 (en) 2010-06-14 2011-06-13 Selective processing of file system objects for image level backups
US15/359,128 Abandoned US20170075766A1 (en) 2010-06-14 2016-11-22 Selective processing of file system objects for image level backups
US16/197,644 Active US11068349B2 (en) 2010-06-14 2018-11-21 Selective processing of file system objects for image level backups
US17/380,523 Active US11789823B2 (en) 2010-06-14 2021-07-20 Selective processing of file system objects for image level backups

Country Status (5)

Country Link
US (4) US9507670B2 (en)
EP (1) EP2580662B1 (en)
DK (1) DK2580662T3 (en)
ES (1) ES2634990T3 (en)
WO (1) WO2011159701A2 (en)

US9053216B1 (en) 2013-08-09 2015-06-09 Datto, Inc. CPU register assisted virtual machine screenshot capture timing apparatuses, methods and systems
US9798791B1 (en) * 2013-12-04 2017-10-24 Ca, Inc. System and method for filtering files during data replication
US9747309B1 (en) * 2013-12-23 2017-08-29 EMC IP Holding Company LLC Auto-determining backup level
US9594636B2 (en) 2014-05-30 2017-03-14 Datto, Inc. Management of data replication and storage apparatuses, methods and systems
US8943105B1 (en) 2014-06-02 2015-01-27 Storagecraft Technology Corporation Exposing a proprietary disk file to a hypervisor as a native hypervisor disk file
US10970179B1 (en) * 2014-09-30 2021-04-06 Acronis International Gmbh Automated disaster recovery and data redundancy management systems and methods
US9075649B1 (en) 2015-01-26 2015-07-07 Storagecraft Technology Corporation Exposing a proprietary image backup to a hypervisor as a disk file that is bootable by the hypervisor
US11386060B1 (en) 2015-09-23 2022-07-12 Amazon Technologies, Inc. Techniques for verifiably processing data in distributed computing systems
US10157103B2 (en) * 2015-10-20 2018-12-18 Veeam Software Ag Efficient processing of file system objects for image level backups
CN111045859B (en) * 2018-10-12 2023-11-03 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for backing up virtual machines
KR102159835B1 (en) * 2019-05-02 2020-09-24 김덕우 Apparatus and method for managing recovery information of auxiliary storage device
US11847333B2 (en) * 2019-07-31 2023-12-19 EMC IP Holding Company, LLC System and method for sub-block deduplication with search for identical sectors inside a candidate block
US11922043B2 (en) * 2021-06-08 2024-03-05 EMC IP Holding Company LLC Data migration between storage systems
DE102021206374A1 (en) * 2021-06-22 2022-12-22 Robert Bosch Gesellschaft mit beschränkter Haftung Procedure for editing a system image
US11687416B2 (en) 2021-09-27 2023-06-27 Kyndryl, Inc. Data backup optimization

Citations (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5247618A (en) * 1989-06-30 1993-09-21 Digital Equipment Corporation Transferring data in a digital data processing system
US5357475A (en) * 1992-10-30 1994-10-18 Intel Corporation Method for detaching sectors in a flash EEPROM memory array
US5539837A (en) * 1992-04-29 1996-07-23 Lindmark; Richard C. Apparatus and method for measuring curved surfaces
US5742818A (en) * 1995-12-15 1998-04-21 Microsoft Corporation Method and system of converting data from a source file system to a target file system
US5765173A (en) * 1996-01-11 1998-06-09 Connected Corporation High performance backup via selective file saving which can perform incremental backups and exclude files and uses a changed block signature list
US5778395A (en) * 1995-10-23 1998-07-07 Stac, Inc. System for backing up files from disk volumes on multiple nodes of a computer network
US5890169A (en) * 1996-06-24 1999-03-30 Sun Microsystems, Inc. Disk fragmentation reduction using file allocation tables
US5907672A (en) * 1995-10-04 1999-05-25 Stac, Inc. System for backing up computer disk volumes with error remapping of flawed memory addresses
US6189081B1 (en) * 1996-05-24 2001-02-13 Nec Corporation Non-volatile semiconductor storage with memory requirement and availability comparison means and method
US20010051954A1 (en) * 2000-06-06 2001-12-13 Kazuhiko Yamashita Data updating apparatus that performs quick restoration processing
US20020051260A1 (en) * 2000-02-29 2002-05-02 Keihiro Kurakata Image pickup apparatus, storing method of image data and storage medium thereof
US6453383B1 (en) * 1999-03-15 2002-09-17 Powerquest Corporation Manipulation of computer volume segments
US20030031095A1 (en) * 2001-08-09 2003-02-13 Lg Electronics Inc. Portable hard disk system for an internet protocol network
US6580804B1 (en) * 1998-08-07 2003-06-17 Ricoh Co., Ltd Pixel-based digital watermarks located near edges of an image
US20030142960A1 (en) * 2000-12-07 2003-07-31 Teppei Yokota Reproduction apparatus and reproducing method
US20030185111A1 (en) * 2002-04-01 2003-10-02 Sony Corporation Track management method and apparatus for managing tracks on a storage medium
US6665779B1 (en) * 1998-12-24 2003-12-16 Roxio, Inc. Image backup method for backing up disk partitions of a storage device
US6742147B1 (en) * 1998-10-22 2004-05-25 Matsushita Electric Industrial Co., Ltd. Information recording medium, and method and apparatus for managing defect thereof
US20040153689A1 (en) * 1999-09-23 2004-08-05 Mahmoud Assaf System and method for storage of device performance data
US20050165853A1 (en) * 2004-01-22 2005-07-28 Altiris, Inc. Method and apparatus for localized protected imaging of a file system
US20050177777A1 (en) * 2004-01-23 2005-08-11 Seaburg Gunnar P. Cluster-based disk backup and restoration
US20050228836A1 (en) * 2004-04-08 2005-10-13 Bacastow Steven V Apparatus and method for backing up computer files
US20050259542A1 (en) * 2002-08-13 2005-11-24 Teruhiko Mochizuki Reproduction device and method, recording medium, and program
US20060029230A1 (en) * 1999-03-03 2006-02-09 Nobuyuki Kihara Recording apparatus, recording method, reproducing apparatus, and reproducing method
US20060218435A1 (en) * 2005-03-24 2006-09-28 Microsoft Corporation Method and system for a consumer oriented backup
US7254682B1 (en) * 2004-07-28 2007-08-07 Symantec Corporation Selective file and folder snapshot image creation
US20070186070A1 (en) * 2006-02-03 2007-08-09 Neoware, Inc. Computer operating system with selective restriction of memory write operations
US20070261056A1 (en) * 2006-02-17 2007-11-08 Hitachi, Ltd. Method for constructing job operation environment
US20080028004A1 (en) * 2004-06-04 2008-01-31 Chang-Ju Lee Apparatus and Method for Protecting System Data on Computer Hard-Disk
US20080069522A1 (en) * 2006-09-04 2008-03-20 Sony Corporation Apparatus, method and computer program for processing information
US20080109616A1 (en) * 2006-10-31 2008-05-08 Taylor James A System and method for optimizing write operations in storage systems
US20080228841A1 (en) * 2007-03-16 2008-09-18 Jun Mizuno Information processing system, data storage allocation method, and management apparatus
US20080307333A1 (en) * 2007-06-08 2008-12-11 Mcinerney Peter Deletion in Electronic Backups
US20090055452A1 (en) * 2007-08-21 2009-02-26 Sunplus Mmobile Inc. Journaling FAT file system and accessing method thereof
US20090106484A1 (en) * 2007-10-19 2009-04-23 Phison Electronics Corp. Data writing method for non-volatile memory and controller using the same
US20090123124A1 (en) * 2007-11-14 2009-05-14 Ham Young Jun Method for editing digital moving picture files in set-top box
US20090164529A1 (en) * 2007-12-21 2009-06-25 Mccain Greg Efficient Backup of a File System Volume to an Online Server
US20100070725A1 (en) * 2008-09-05 2010-03-18 Anand Prahlad Systems and methods for management of virtualization data
US20100076932A1 (en) * 2008-09-05 2010-03-25 Lad Kamleshkumar K Image level copy or restore, such as image level restore without knowledge of data object metadata
US20100138406A1 (en) * 2008-03-12 2010-06-03 Samsung Electronics Co., Ltd. File access method and system
US7831789B1 (en) * 2005-10-06 2010-11-09 Acronis Inc. Method and system for fast incremental backup using comparison of descriptors
US20100332534A1 (en) * 2009-06-30 2010-12-30 Robert Chang File system and method of file access
US20110016089A1 (en) * 2009-07-16 2011-01-20 Apple Inc. Restoring data to a mobile device
US8127089B1 (en) * 2007-02-14 2012-02-28 Marvell International Ltd. Hard disk controller which coordinates transmission of buffered data with a host
US20120060006A1 (en) * 2008-08-08 2012-03-08 Amazon Technologies, Inc. Managing access of multiple executing programs to non-local block data storage
US8156165B2 (en) * 2002-10-22 2012-04-10 Microsoft Corporation Transaction-safe FAT files system
US8200637B1 (en) * 2008-09-30 2012-06-12 Symantec Operating Corporation Block-based sparse backup images of file system volumes
US8682862B2 (en) * 2009-04-10 2014-03-25 Phd Virtual Technologies Inc. Virtual machine file-level restoration
US9507670B2 (en) * 2010-06-14 2016-11-29 Veeam Software Ag Selective processing of file system objects for image level backups

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4961134A (en) * 1988-07-15 1990-10-02 International Business Machines Corporation Method for minimizing locking and reading in a segmented storage space
US5634052A (en) * 1994-10-24 1997-05-27 International Business Machines Corporation System for reducing storage requirements and transmission loads in a backup subsystem in client-server environment by transmitting only delta files from client to server
KR100424654B1 (en) * 1999-08-02 2004-03-24 삼성전자주식회사 Apparatus and method for retransmitting data according to radio link protocol in mobile communication system
US6931558B1 (en) 2000-11-29 2005-08-16 Veritas Operating Corporation Computer restoration systems and methods
CA2453070A1 (en) 2001-07-06 2003-01-16 Computer Associates Think, Inc. Systems and methods of information backup
US20040054877A1 (en) * 2001-10-29 2004-03-18 Macy William W. Method and apparatus for shuffling data
US7093086B1 (en) 2002-03-28 2006-08-15 Veritas Operating Corporation Disaster recovery and backup using virtual machines
US7191299B1 (en) 2003-05-12 2007-03-13 Veritas Operating Corporation Method and system of providing periodic replication
CN100583278C (en) * 2005-03-04 2010-01-20 松下电器产业株式会社 Data processing apparatus
CN101243413B (en) 2005-06-24 2013-08-14 信科索尔特公司 System and method for virtualizing backup images
US8065273B2 (en) * 2006-05-10 2011-11-22 Emc Corporation Automated priority restores
US7805631B2 (en) 2007-05-03 2010-09-28 Microsoft Corporation Bare metal recovery from backup media to virtual machine
US8949585B2 (en) 2007-10-09 2015-02-03 Vmware, Inc. In-place conversion of virtual machine state
US20110040812A1 (en) * 2007-12-20 2011-02-17 Virtual Computer, Inc. Layered Virtual File System
US8631217B2 (en) 2008-02-26 2014-01-14 International Business Machines Corporation Apparatus, system, and method for virtual machine backup
US8577845B2 (en) 2008-06-13 2013-11-05 Symantec Operating Corporation Remote, granular restore from full virtual machine backup
JP5346536B2 (en) * 2008-10-02 2013-11-20 株式会社日立ソリューションズ Information backup / restore processing device and information backup / restore processing system
US8499297B2 (en) 2008-10-28 2013-07-30 Vmware, Inc. Low overhead fault tolerance through hybrid checkpointing and replay
US20110061046A1 (en) * 2008-12-18 2011-03-10 Virtual Computer, Inc. Installing Software Applications in a Layered Virtual Workspace
US8874529B2 (en) * 2009-03-16 2014-10-28 Bert A. Silich User-determinable method and system for manipulating and displaying textual and graphical information
JP2012116127A (en) * 2010-12-01 2012-06-21 Canon Inc Printing device, data storage method and program
US8996467B2 (en) * 2011-12-29 2015-03-31 Druva Inc. Distributed scalable deduplicated data backup system
US9742838B2 (en) * 2014-01-09 2017-08-22 Red Hat, Inc. Locked files for cartridges in a multi-tenant platform-as-a-service (PaaS) system
US10032062B2 (en) * 2015-04-15 2018-07-24 Samsung Electronics Co., Ltd. Method and apparatus for recognizing fingerprint
US10157103B2 (en) * 2015-10-20 2018-12-18 Veeam Software Ag Efficient processing of file system objects for image level backups
US11692839B2 (en) * 2020-05-20 2023-07-04 Here Global B.V. Methods and apparatuses for providing navigation instructions

Patent Citations (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5247618A (en) * 1989-06-30 1993-09-21 Digital Equipment Corporation Transferring data in a digital data processing system
US5539837A (en) * 1992-04-29 1996-07-23 Lindmark; Richard C. Apparatus and method for measuring curved surfaces
US5357475A (en) * 1992-10-30 1994-10-18 Intel Corporation Method for detaching sectors in a flash EEPROM memory array
US5907672A (en) * 1995-10-04 1999-05-25 Stac, Inc. System for backing up computer disk volumes with error remapping of flawed memory addresses
US5778395A (en) * 1995-10-23 1998-07-07 Stac, Inc. System for backing up files from disk volumes on multiple nodes of a computer network
US5742818A (en) * 1995-12-15 1998-04-21 Microsoft Corporation Method and system of converting data from a source file system to a target file system
US5765173A (en) * 1996-01-11 1998-06-09 Connected Corporation High performance backup via selective file saving which can perform incremental backups and exclude files and uses a changed block signature list
US6189081B1 (en) * 1996-05-24 2001-02-13 Nec Corporation Non-volatile semiconductor storage with memory requirement and availability comparison means and method
US5890169A (en) * 1996-06-24 1999-03-30 Sun Microsystems, Inc. Disk fragmentation reduction using file allocation tables
US6580804B1 (en) * 1998-08-07 2003-06-17 Ricoh Co., Ltd Pixel-based digital watermarks located near edges of an image
US6742147B1 (en) * 1998-10-22 2004-05-25 Matsushita Electric Industrial Co., Ltd. Information recording medium, and method and apparatus for managing defect thereof
US6665779B1 (en) * 1998-12-24 2003-12-16 Roxio, Inc. Image backup method for backing up disk partitions of a storage device
US20060029230A1 (en) * 1999-03-03 2006-02-09 Nobuyuki Kihara Recording apparatus, recording method, reproducing apparatus, and reproducing method
US6453383B1 (en) * 1999-03-15 2002-09-17 Powerquest Corporation Manipulation of computer volume segments
US20040153689A1 (en) * 1999-09-23 2004-08-05 Mahmoud Assaf System and method for storage of device performance data
US20020051260A1 (en) * 2000-02-29 2002-05-02 Keihiro Kurakata Image pickup apparatus, storing method of image data and storage medium thereof
US20010051954A1 (en) * 2000-06-06 2001-12-13 Kazuhiko Yamashita Data updating apparatus that performs quick restoration processing
US20030142960A1 (en) * 2000-12-07 2003-07-31 Teppei Yokota Reproduction apparatus and reproducing method
US20030031095A1 (en) * 2001-08-09 2003-02-13 Lg Electronics Inc. Portable hard disk system for an internet protocol network
US20030185111A1 (en) * 2002-04-01 2003-10-02 Sony Corporation Track management method and apparatus for managing tracks on a storage medium
US20050259542A1 (en) * 2002-08-13 2005-11-24 Teruhiko Mochizuki Reproduction device and method, recording medium, and program
US8156165B2 (en) * 2002-10-22 2012-04-10 Microsoft Corporation Transaction-safe FAT files system
US20050165853A1 (en) * 2004-01-22 2005-07-28 Altiris, Inc. Method and apparatus for localized protected imaging of a file system
US20050177777A1 (en) * 2004-01-23 2005-08-11 Seaburg Gunnar P. Cluster-based disk backup and restoration
US20050228836A1 (en) * 2004-04-08 2005-10-13 Bacastow Steven V Apparatus and method for backing up computer files
US20080028004A1 (en) * 2004-06-04 2008-01-31 Chang-Ju Lee Apparatus and Method for Protecting System Data on Computer Hard-Disk
US7254682B1 (en) * 2004-07-28 2007-08-07 Symantec Corporation Selective file and folder snapshot image creation
US20060218435A1 (en) * 2005-03-24 2006-09-28 Microsoft Corporation Method and system for a consumer oriented backup
US7831789B1 (en) * 2005-10-06 2010-11-09 Acronis Inc. Method and system for fast incremental backup using comparison of descriptors
US20070186070A1 (en) * 2006-02-03 2007-08-09 Neoware, Inc. Computer operating system with selective restriction of memory write operations
US20070261056A1 (en) * 2006-02-17 2007-11-08 Hitachi, Ltd. Method for constructing job operation environment
US20080069522A1 (en) * 2006-09-04 2008-03-20 Sony Corporation Apparatus, method and computer program for processing information
US20080109616A1 (en) * 2006-10-31 2008-05-08 Taylor James A System and method for optimizing write operations in storage systems
US8127089B1 (en) * 2007-02-14 2012-02-28 Marvell International Ltd. Hard disk controller which coordinates transmission of buffered data with a host
US20080228841A1 (en) * 2007-03-16 2008-09-18 Jun Mizuno Information processing system, data storage allocation method, and management apparatus
US20080307333A1 (en) * 2007-06-08 2008-12-11 Mcinerney Peter Deletion in Electronic Backups
US20090055452A1 (en) * 2007-08-21 2009-02-26 Sunplus Mmobile Inc. Journaling FAT file system and accessing method thereof
US20090106484A1 (en) * 2007-10-19 2009-04-23 Phison Electronics Corp. Data writing method for non-volatile memory and controller using the same
US20090123124A1 (en) * 2007-11-14 2009-05-14 Ham Young Jun Method for editing digital moving picture files in set-top box
US20090164529A1 (en) * 2007-12-21 2009-06-25 Mccain Greg Efficient Backup of a File System Volume to an Online Server
US20100138406A1 (en) * 2008-03-12 2010-06-03 Samsung Electronics Co., Ltd. File access method and system
US20120060006A1 (en) * 2008-08-08 2012-03-08 Amazon Technologies, Inc. Managing access of multiple executing programs to non-local block data storage
US20100070725A1 (en) * 2008-09-05 2010-03-18 Anand Prahlad Systems and methods for management of virtualization data
US20100076932A1 (en) * 2008-09-05 2010-03-25 Lad Kamleshkumar K Image level copy or restore, such as image level restore without knowledge of data object metadata
US20140250093A1 (en) * 2008-09-05 2014-09-04 Commvault Systems, Inc. Systems and methods for management of virtualization data
US8200637B1 (en) * 2008-09-30 2012-06-12 Symantec Operating Corporation Block-based sparse backup images of file system volumes
US8682862B2 (en) * 2009-04-10 2014-03-25 Phd Virtual Technologies Inc. Virtual machine file-level restoration
US20100332534A1 (en) * 2009-06-30 2010-12-30 Robert Chang File system and method of file access
US20110016089A1 (en) * 2009-07-16 2011-01-20 Apple Inc. Restoring data to a mobile device
US9507670B2 (en) * 2010-06-14 2016-11-29 Veeam Software Ag Selective processing of file system objects for image level backups

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11068349B2 (en) 2010-06-14 2021-07-20 Veeam Software Ag Selective processing of file system objects for image level backups
US11789823B2 (en) 2010-06-14 2023-10-17 Veeam Software Ag Selective processing of file system objects for image level backups

Also Published As

Publication number Publication date
US11789823B2 (en) 2023-10-17
EP2580662A4 (en) 2016-01-20
EP2580662A2 (en) 2013-04-17
EP2580662B1 (en) 2017-05-03
ES2634990T3 (en) 2017-10-02
DK2580662T3 (en) 2017-08-28
WO2011159701A3 (en) 2012-04-19
US11068349B2 (en) 2021-07-20
WO2011159701A2 (en) 2011-12-22
US20190332489A1 (en) 2019-10-31
US20220156155A1 (en) 2022-05-19
US20110307657A1 (en) 2011-12-15
US9507670B2 (en) 2016-11-29

Similar Documents

Publication Publication Date Title
US11789823B2 (en) Selective processing of file system objects for image level backups
US11513926B2 (en) Systems and methods for instantiation of virtual machines from backups
US9015129B2 (en) Cross-platform object level restoration from image level backups
US9104624B2 (en) Systems, methods, and computer program products for instant recovery of image level backups
AU2012347883B2 (en) System and method for restoring application data
US8898114B1 (en) Multitier deduplication systems and methods
US9235474B1 (en) Systems and methods for maintaining a virtual failover volume of a target computing system
US8996468B1 (en) Block status mapping system for reducing virtual machine backup storage
US20140289566A1 (en) Item-Level Restoration and Verification of Image Level
US8949829B1 (en) Virtual machine disaster recovery
US9183130B2 (en) Data control system for virtual environment
EP3159797B1 (en) Efficient processing of file system objects for image level backups
US20150106334A1 (en) Systems and methods for backing up a live virtual machine
US10877854B2 (en) Partial restore from tape backup
US20230126234A1 (en) Independent object data backup between clusters

Legal Events

Date Code Title Description
AS Assignment
Owner name: VEEAM SOFTWARE AG, SWITZERLAND
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TIMASHEV, RATMIR;GOSTEV, ANTON;REEL/FRAME:042662/0036
Effective date: 20120427

STCB Information on status: application discontinuation
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment
Owner name: VEEAM SOFTWARE GROUP GMBH, SWITZERLAND
Free format text: CHANGE OF NAME;ASSIGNOR:VEEAM SOFTWARE AG;REEL/FRAME:052690/0914
Effective date: 20200504

AS Assignment
Owner name: JPMORGAN CHASE N.A., NEW YORK
Free format text: SECURITY INTEREST;ASSIGNOR:VEEAM SOFTWARE GROUP GMBH;REEL/FRAME:052790/0483
Effective date: 20200528