Legal claims defining the scope of protection, as filed with the USPTO.
1. A method operable on a computer for inferring a user's interests from user generated tags, the method comprising: collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time; electronically, calculating, via a processing device, a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time; retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution; comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
2. The method of claim 1 , further comprising the step of sorting the plurality of user generated tags based on the plurality of user generated tags' corresponding z-score.
3. The method of claim 1 , wherein the threshold has an associated confidence value that is determined on the basis of a desired statistical confidence for inferring the given user's interests.
4. The method of claim 1 , wherein the collecting step collects the plurality of user generated tags from one or more web-based content.
5. The method of claim 1 , wherein the predetermined time period is a configurable parameter.
6. The method of claim 1 , wherein the predetermined time period is a continuous time period.
7. The method of claim 1 , wherein the predetermined time period is a non-continuous time period.
9. A non-transitory computer-readable medium having computer executable instructions for performing the method of claim 1 .
10. A system for inferring a user's interests from user generated tags, the system comprising: a processor; a memory connected to the processor and storing instructions for operating the processor to perform steps of: collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time; calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time; retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution; comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
11. The system of claim 10 , wherein the processor further performs the step of sorting the plurality of user generated tags based on the plurality of user generated tags' corresponding z-score.
12. The system of claim 10 , wherein threshold has an associated confidence value that is determined on the basis of a desired statistical confidence for inferring the given user's interests.
13. The system of claim 10 , wherein the collecting step collects the plurality of user generated tags from one or more web-based content.
14. The system of claim 10 , wherein the predetermined time period is a configurable parameter.
15. The system of claim 10 , wherein the predetermined time period is a continuous time period.
16. The system of claim 10 , wherein the predetermined time period is a non-continuous time period.
17. A system for inferring a user's interests from user generated tags, the system comprising: means for collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time; means for calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time; means for retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution; means for comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and means for inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one more tags of the one or more users in the social network of the given user.
18. The system of claim 17 , further comprising means for sorting the plurality of user generated tags based on the plurality of user generated tags' corresponding z-score.
19. The system of claim 17 , further comprising means for comparing tags from two or more users.
20. A program stored on non-transitory computer readable media for making a computer execute steps of: collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time; calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time; retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution; comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
21. The program of claim 20 , wherein the computer further executes the step of sorting the plurality of user generated tags based on the plurality of user generated tags' corresponding z-score.
22. The program of claim 20 , wherein the threshold has an associated confidence value that is determined on the basis of a desired statistical confidence for inferring the given user's interests.
23. A non-transitory recording medium recording a program for making a computer execute steps of: collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time; calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time; retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution; comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
24. The non-transitory recording medium of claim 23 , wherein the computer further executes the step of sorting the plurality of user generated tags based on the plurality of user generated tags' corresponding z-score.
25. The non-transitory recording medium of claim 23 , wherein the threshold has an associated confidence value that is determined on the basis of a desired statistical confidence for inferring the given user's interests.
Unknown
April 22, 2014
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.