8392434

Random Sampling from Distributed Streams

PublishedMarch 5, 2013
Assigneenot available in USPTO data we have
Technical Abstract

Patent Claims
20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1

1. A method for distributed sampling on a network with a plurality of sites and a coordinator, comprising: receiving at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; comparing the weight of the data element received with a global value stored at the coordinator; and performing one of: updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites.

2

2. The method of claim 1 , wherein the weight randomly associated with the data element at the site is deemed reportable by the site if the weight randomly associated with the data element is less than the locally stored global value at the site.

3

3. The method of claim 2 , wherein the communicating the global value stored at the coordinator back to the site of the plurality of sites is performed responsive to a determination that the weight randomly associated with the data element received is greater than the global value stored at the coordinator.

4

4. The method of claim 3 , wherein the updating the global value stored at the coordinator to the weight of the data element received is performed responsive to a determination that the weight randomly associated with the data element received is less than the global value stored at the coordinator.

5

5. The method of claim 1 , wherein the weight randomly associated with the data element at the site is deemed reportable by the site if the weight randomly associated with the data element is greater than the locally stored global value.

6

6. The method of claim 5 , wherein the communicating the global value stored at the coordinator back to the site of the plurality of sites is performed responsive to a determination that the weight randomly associated with the data element received less than the global value stored at the coordinator.

7

7. The method of claim 6 , wherein the updating the global value stored at the coordinator to the weight of the data element received is performed responsive to a determination that the weight randomly associated with the data element received is greater than the global value stored at the coordinator.

8

8. The method of claim 1 , further comprising: at each site: receiving a plurality of data elements from a data stream; associating a random weight to each of the plurality of elements; and responsive to a data element being associated with a random weight below the locally stored global value at a receiving site, sending the data element to the coordinator.

9

9. The method of claim 1 , further comprising storing, at the coordinator, at least one received data element from the plurality of sites.

10

10. The method of claim 1 , further comprising estimating a number of distinct elements using the global value stored at the coordinator.

11

11. The method of claim 1 , further comprising: receiving a query at the coordinator; and answering said query using a random sample based on at least one data element stored at the coordinator.

12

12. The method of claim 1 , wherein said plurality of sites are network routers, and further wherein said network routers are connected to said coordinator via at least one network connection.

13

13. A computer program product for distributed sampling on a network with a plurality of sites and a coordinator, comprising: a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising: computer readable program code configured to receive, at the coordinator, a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; computer readable program code configured to compare the weight of the data element received with a global value stored at the coordinator; and computer readable program code configured to perform one of: updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites.

14

14. The computer program product of claim 13 , wherein the computer readable program code configured to communicate the global value stored at the coordinator back to the site of the plurality of sites is further configured to communicate the global value responsive to a determination that the weight randomly associated with the data element received is greater than the global value stored at the coordinator.

15

15. The computer program product of claim 14 , wherein computer readable program code configured to update the global value stored at the coordinator to the weight of the data element received is further configured to update the global value stored at the coordinator responsive to a determination that the weight randomly associated with the data element received is less than the global value stored at the coordinator.

16

16. The computer program product of claim 13 , wherein the computer readable program code configured to communicate the global value stored at the coordinator back to the site of the plurality of sites is further configured to communicate the global value responsive to a determination that the weight randomly associated with the data element received less than the global value stored at the coordinator.

17

17. The computer program product of claim 16 , wherein the computer readable program code configured to update the global value stored at the coordinator to the weight of the data element received is further configured to update the global value at the coordinator responsive to a determination that the weight randomly associated with the data element received is greater than the global value stored at the coordinator.

18

18. The computer program product of claim 13 , further comprising computer readable program code configured to: store, at the coordinator, at least one received data element from the plurality of sites; receive a query at the coordinator; and answer said query using a random sample based on at least one data element stored at the coordinator.

19

19. The computer program product of claim 13 , wherein said plurality of sites are network routers, and further wherein said network routers are connected to said coordinator via at least one network connection.

20

20. A system comprising: at least one processor; and a memory device operatively connected to the at least one processor; wherein, responsive to execution of program instructions accessible to the at least one processor, the at least one processor is configured to: receive at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; compare the weight of the data element received with a global value stored at the coordinator; and perform one of: updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites.

Patent Metadata

Filing Date

Unknown

Publication Date

March 5, 2013

Inventors

David P. Woodruff
Srikanta N. Tirthapura

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “RANDOM SAMPLING FROM DISTRIBUTED STREAMS” (8392434). https://patentable.app/patents/8392434

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.