Architectures and mechanisms for rebuilding data on a node in a multi-node system providing replicated data stores. A primary node provides a primary data store and a secondary node provides a replicated data store; replication to the secondary node occurs substantially synchronously during periods of normal operation for the secondary node. When the secondary node becomes available after a period of unavailability, a copy of the previously-replicated data on the secondary node is made and stored before the secondary node is rebuilt to recover from the period of unavailability. The copy of the previously-replicated data can be used as a backup copy to the primary node during the rebuilding. The data on the secondary node is updated after the copy has been made. The updating comprises asynchronously writing data to the secondary node. The secondary node synchronously replicates data from the primary node after the rebuilding.
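The sequence described in the abstract can be illustrated with a minimal in-memory sketch. This is not the patented implementation: the names `Node`, `ReplicatedPair`, `in_sync`, `missed`, and `rebuild_secondary` are hypothetical, dictionaries stand in for the data storage devices, and the catch-up loop runs inline where a real system would write asynchronously.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class Node:
    """Hypothetical stand-in for a node's data store (LBA -> data)."""
    blocks: Dict[int, bytes] = field(default_factory=dict)
    backup: Optional[Dict[int, bytes]] = None  # copy taken before a rebuild
    in_sync: bool = True  # False while unavailable or still being rebuilt


class ReplicatedPair:
    """Illustrative primary/secondary pair; all names are assumptions."""

    def __init__(self) -> None:
        self.primary = Node()
        self.secondary = Node()
        self.missed: Set[int] = set()  # LBAs written while secondary was out

    def write(self, lba: int, data: bytes) -> None:
        self.primary.blocks[lba] = data
        if self.secondary.in_sync:
            # Normal operation: replicate as part of the same write.
            self.secondary.blocks[lba] = data
        else:
            # Secondary unavailable: remember what it missed.
            self.missed.add(lba)

    def rebuild_secondary(self) -> None:
        # 1. Preserve the previously replicated data before modifying it.
        self.secondary.backup = dict(self.secondary.blocks)
        # 2. Bring the secondary up to date (inline here; asynchronous
        #    in a real system).
        for lba in sorted(self.missed):
            self.secondary.blocks[lba] = self.primary.blocks[lba]
        self.missed.clear()
        # 3. Resume synchronous replication.
        self.secondary.in_sync = True
```

In this sketch the `backup` dictionary holds the secondary's state as it was when the node came back, so an older but self-consistent copy still exists if the primary fails partway through step 2.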
Legal claims defining the scope of protection, as filed with the USPTO.
2. The computing network of claim 1 wherein the backup copy of data on the second data storage device after the second node becomes available after the period of unavailability and before rebuilding the data stored on the second data storage device to replicate data stored on the first data storage device comprises a crash-consistent copy of the second set of data.
3. The computing network of claim 1 wherein the rebuilding the data stored on the second data storage device is performed in a non-chronological order, the non-chronological write order comprises a logical block address (LBA) order.
4. The computing network of claim 1 wherein the first node requests that the second data storage device store the backup copy of data on the second data storage device after the second node becomes available after the period of unavailability and before rebuilding the data stored on the second data storage device to replicate data stored on the first data storage device.
5. The computing network of claim 1 wherein the control plane comprises at least a virtual controller to at least detect availability and unavailability of the second node.
6. The computing network of claim 5 wherein the virtual controller further functions to detect availability and unavailability of the first node and, in response to unavailability of the first node, causes the second node to service requests directed to the first node.
8. The computer-implemented method of claim 7 wherein the backup copy of data on the second data storage device after the second node becomes available after the period of unavailability and before rebuilding the data stored on the second data storage device to replicate data stored on the first data storage device comprises a crash-consistent copy of the second set of data.
9. The computer-implemented method of claim 7 wherein the rebuilding of data from the primary node to the secondary node is performed in a non-chronological order.
10. The computer-implemented method of claim 9 wherein the non-chronological order comprises a logical block address (LBA) order.
11. The computer-implemented method of claim 7 wherein the functionality is provided by a control plane that comprises at least a virtual controller to at least detect availability and unavailability of the secondary node.
12. The computer-implemented method of claim 11 wherein the virtual controller further functions to detect availability and unavailability of the primary node and, in response to unavailability of the primary node, causes the secondary node to service requests directed to the primary node.
13. The computer-implemented method of claim 7 wherein the unavailability of the secondary node results from failure of all or a portion of the secondary node.
14. The computer-implemented method of claim 7 wherein the unavailability of the secondary node results from a network connectivity failure.
16. The non-transitory computer-readable medium of claim 15 wherein the backup copy of data on the second data storage device after the second node becomes available after the period of unavailability and before rebuilding the data stored on the second data storage device to replicate data stored on the first data storage device comprises a crash-consistent copy of the second set of data.
17. The non-transitory computer-readable medium of claim 15 wherein the replication of data from the primary node to the secondary node is performed in a non-chronological order.
18. The non-transitory computer-readable medium of claim 17 wherein the non-chronological write order comprises a logical block address (LBA) order.
19. The non-transitory computer-readable medium of claim 15 wherein the functionality is provided by a control plane that comprises at least a virtual controller to at least detect availability and unavailability of the secondary node.
20. The non-transitory computer-readable medium of claim 15 wherein the unavailability of the secondary node results from failure of all or a portion of the secondary node.
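Several of the claims above recite rebuilding in a non-chronological, logical block address (LBA) order. The following sketch shows one way such an ordering could be derived from a chronological write log. It is an assumption-laden illustration, not the claimed method: the `Write` tuple, `coalesce_in_lba_order`, and `rebuild` are hypothetical names, and a dictionary stands in for the second data storage device.

```python
from typing import Dict, Iterable, List, Tuple

Write = Tuple[int, bytes]  # (logical block address, data)


def coalesce_in_lba_order(log: Iterable[Write]) -> List[Write]:
    """Keep only the latest write per LBA, then order by LBA."""
    latest: Dict[int, bytes] = {}
    for lba, data in log:   # the log is assumed to be chronological
        latest[lba] = data  # later writes overwrite earlier ones
    return sorted(latest.items())


def rebuild(target: Dict[int, bytes], log: Iterable[Write]) -> None:
    """Apply the coalesced writes to the target in ascending LBA order."""
    for lba, data in coalesce_in_lba_order(log):
        target[lba] = data
```

Because writes are applied out of their original order, the target may pass through states that never existed on the source. That is one reason a copy taken before rebuilding can be useful as a fallback while the rebuild is in progress.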
March 9, 2020
August 16, 2022