Data Deduplication on a Distributed File System

PublishedOctober 13, 2020

Assigneenot available in USPTO data we have

InventorsRajiv Desai Nathan E. Rosenblum

Technical Abstract

Patent Claims

15 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method for deduplicating data on a distributed file system, the method comprising: transmitting a write request from a client to a metadata server (“MDS”), wherein the write request comprises an object identifier associated with a data object, wherein the MDS maintains metadata identifying locations of data objects stored in object stores included in the distributed file system; receiving an object store location for an object store from the MDS and a first object designator assigned to the data object by the MDS, wherein the object store is separate from the MDS and wherein the object store stores data objects, wherein both the first object designator and the object identifier uniquely identify the data object and wherein the MDS server maps object designators to object identifiers; deduplicating the data object by: transmitting a metadata request to the object store using the object store location, wherein the metadata request includes the object identifier; receiving a metadata response from the object store; determining whether the metadata response contains a second object designator; transmitting a commit request to the MDS that includes the second object designator in response to determining the metadata response contains the second object designator, wherein the second object designator allows a number of instances of the data object in the distributed file system to be determined; and transmitting the data object that includes the first object designator to the object store in response to determining the metadata response does not contain any object designator and transmitting a commit request to the MDS that includes the first object designator.

2. The method of claim 1 , wherein the metadata request is a HEAD request.

3. The method of claim 1 , further comprising, when the metadata response does not include any object designator, transmitting the commit request to the MDS includes the first designator after transmitting the data object to the object store.

4. The method of claim 1 , wherein the first object designator uniquely identifies the data object and wherein the second object designator uniquely identifies the data object.

5. The method of claim 1 , wherein the object store is a cloud object store.

6. A non-transitory computer readable storage medium comprising processor instructions for deduplicating data on a distributed file system, the instructions comprising: transmitting a write request from a client to a metadata server (“MDS”), wherein the write request comprises an object identifier associated with a data object, wherein the MDS maintains metadata identifying locations of data objects stored in object stores included in the distributed file system; receiving an object store location for an object store from the MDS and a first object designator assigned to the data object by the MDS, wherein the object store is separate from the MDS and wherein the object store stores data objects, wherein both the first object designator and the object identifier uniquely identify the data object and wherein the MDS server maps object designators to object identifiers; deduplication the data object by: transmitting a metadata request to the object store using the object store location, wherein the metadata request includes the object identifier; receiving a metadata response from the object store; determining whether the metadata response contains a second object designator; transmitting a commit request to the MDS that includes the second object designator in response to determining the metadata response contains the second object designator, wherein the second object designator allows a number of instances of the data object in the distributed file system to be determined; and transmitting the data object that includes the first object designator to the object store in response to determining the metadata response does not contain any object designator and transmitting a commit request to the MDS that includes the first object designator.

7. The non-transitory computer readable storage medium of claim 6 , wherein the metadata request is a HEAD request.

8. The non-transitory computer readable storage medium of claim 6 , further comprising transmitting, when the metadata response does not include any object designator, the commit request that includes the first object designator to the MDS after transmitting the data object to the object store.

9. The non-transitory computer readable storage medium of claim 6 , wherein the first and second object designators uniquely identifies the data object.

10. The non-transitory computer readable storage medium of claim 6 , wherein the object store is a cloud object store.

11. A system for deduplicating data on a distributed file system, the system comprising a non-transitory computer readable medium and processor enabled to execute instructions for: transmitting a write request from a client to a metadata server (“MDS”), wherein the write request comprises an object identifier associated with a data object, wherein the MDS maintains metadata identifying locations of data objects stored in object stores included in the distributed file system; receiving an object store location for an object store from the MDS and a first object designator assigned to the data object by the MDS, wherein the object store is separate from the MDS and wherein the object store stores data objects, wherein both the first object designator and the object identifier uniquely identify the data object and wherein the MDS server maps object designators to object identifiers; deduplication the data object by: transmitting a metadata request to the object store using the object store location, wherein the metadata request includes the object identifier; receiving a metadata response from the object store; determining whether the metadata response contains a second object designator; transmitting a commit request to the MDS that includes the second object designator in response to determining the metadata response contains the second object designator, wherein the second object designator allows a number of instances of the data object in the distributed file system to be determined; and transmitting the data object that includes the first object designator to the object store in response to determining the metadata response does not contain any object designator and transmitting a commit request to the MDS that includes the first object designator.

12. The system of claim 11 , wherein the metadata request is a HEAD request.

13. The system of claim 11 , further comprising transmitting, when the metadata response does not include any object designator, the commit request that includes the first object designator to the MDS after transmitting the data object to the object store.

14. The system of claim 11 , wherein the first and second object designators uniquely identifies the data object.

15. The system of claim 11 , wherein the object store is a cloud object store.

Patent Metadata

Filing Date

Unknown

Publication Date

October 13, 2020

Inventors

Rajiv Desai

Nathan E. Rosenblum

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search