Skip to main content

How can we help you?

Druva Documentation

Data usage discrepancy between inSync Cloud and Google Drive

 

Problem description

The size of inSync’s backup for Google Drive is extremely higher than the total size of the end user’s Google Drive in Google Workspace.

Cause

 

Deleted Data in Google:

  1. End-users have a behavior of uploading large data to their Drives -> sharing them with others -> then removing them from their drives later on when no longer needed.

  2. inSync may end up backing up the entire data before it gets deleted.

  3. This leads to a discrepancy between inSync’s backup size and Google Drive’s usage after the data was deleted.

 

Modified Files in Google Drive:

  1. Frequently modified files in Google Drive -> will only show up the size of the latest file in Google Workspace.

  2. However, in inSync, even with incremental backups, we do add up the modified data to our previously backed up data -> leading to a larger size of backup in inSync than the data size in Google Drive.

 

Google Native Apps:

  1. Google Native files like Google Sheets, Google Docs, Google Sides, etc. do not show in the storage calculation in Google Workspace.

  2. Supporting document for this point from Google: https://support.google.com/a/answer/9214707?hl=enzippy=%2Cwhat-are-the-drive-storage-and-file-size-limits%2Cwhat-counts-towards-storage%2Chow-do-i-know-how-much-storage-is-being-used-by-my-organization%2Chow-much-storage-do-google-docs-editors-files-use

  3. But in inSync, each byte is accounted for. This will create some discrepancy between inSync backup data and Google Workspace stats.

 

Trash Data:

  1. In Google Workspace, default retention of items in trash is 30 days, post which the data is erased from Google.
    Thus, the data that got permanently erased, will not show up in Google Workspace storage usage.

  2. However, inSync backs up the trash as well, leading to some discrepancy between inSync backup data and Google Workspace stats.

 

G-Drive Compression Vs inSync's compression:

  1. Google has more compression for files that get uploaded.

  2. Each file, including image, audio, videos, and all others, gets compressed on Google Storage.

  3. However, inSync's compression is lesser than that of Google.

  4. Also, compression of video, audio, and image files is extremely less on inSync cloud storage by design.

  5. This explains further the discrepancy between inSync backup data and Google Workspace stats.

 

The “Back up Shared Data owned by other users” setting is enabled in inSync Profile.

  1. When the above setting is enabled in the inSync profile, inSync will back up the “Shared with me” data from Google Drive for each user that belongs to that profile.

  2. For example; if UserA has 10GB of data in his Google Drive and User B shared some 2GB of data with UserA -> then inSync will back up the 10 + 2 = 12 GB of data in total; which will lead to a discrepancy of data size between inSync and Google Drive for UserA.

Resolution

Deleting selective folders/files from the backup snapshots is not possible. Thus, if shared data was backed up with the above-explained settings, then that cannot be deleted selectively from the backup snapshots.

  1. In the above-mentioned causes, the only controllable cause is the setting for “Back up Shared Data owned by other users”.

  2. This is present under inSync Profile -> SaaS Apps -> Backup Cloud Apps -> Edit -> Google Workspace -> Google Drive

 

  1. Ensure to disable this check box unless absolutely necessary.

  2. If you want to backup shared data by other users for a specific user or set of users, then ensure to put them in one profile where the above checkbox is enabled → and move out all the other users into a new profile where the above checkbox is disabled.

  3. For data that has already been backed up; you will need to get rid of all those older snapshots, to reduce the storage usage, OR discuss with your Druva Account Manager to add more storage to your instance.

  4. For deleting older snapshots, you may do it manually from inSync admin console -> Users -> select the users one by one -> click the Backups tab -> select the data source as “Google Drive” -> select the snapshots that may have the shared data backed up -> to click on “Delete” button to delete them- 

  1. If you wish to do it in bulk for a large set of users -> then the best option is to reduce the retention period under the inSync Profile for Google Drive.

  2. Click on Profiles -> click on the profile in which these users reside -> click on SaaS Apps tab -> Schedule & Retention -> edit -> Data Retention for Files -> reduce the “Daily”, “Weekly” and “Monthly” numbers to target deletion of all the snapshots that may have backed up the shared data so far.

 

After manual deletion of snapshots or via retention reduction; the targeted snapshots will be removed from the user’s and admin’s UI immediately and will no longer be available. However, the deleted data may reside in the storage in the cloud as “garbage” data. This “garbage” data may take approximately 7 - 10 days to be completely cleaned out from the storage; as the process of “Compaction” runs in the cloud, gradually cleaning out the garbage data. This is called “Storage Housekeeping” and it requires some time to complete its process.

Please reach out to Druva Support or your Druva Account Manager if the Storage Housekeeping delay is causing your backup operations to fail or if the process is taking longer than the above-explained time frame.