A computer identifies and ranks URL hyperlinks to possible pirated media content by searching a web page from a first website for one or more indicator keywords, wherein a strength of an indicator keyword is related to a likelihood of pirated media content. Responsive to locating a plurality of instances of the one or more indicator keywords, identifying a plurality of hyperlinks respectively associated with one or more of the plurality of instances. Weighting, the identified plurality of hyperlinks based on at least one of: a strength of associated indicator keywords, number of associated indicator keywords, number of times each hyperlink was identified, and date of posting. Ranking the plurality of hyperlinks according to weight indicating a relative likelihood that respective hyperlinks point to pirated media content in a ranked list.
Legal claims defining the scope of protection, as filed with the USPTO.
1. A method for identifying and ranking URI, hyperlinks to possible pirated media content, the method comprising: searching on a computer, by one or more processors, a web page from a first website for one or more indicator keywords, wherein a strength of an indicator keyword is related to a likelihood of pirated media content; responsive to the computer locating a plurality of instances of the one or more indicator keywords, the computer identifying, by one or more processors, a plurality of hyperlinks respectively associated with one or more of the plurality of instances; weighting on the computer, by one or more processors, the identified plurality of hyperlinks based on at least one of: a strength of associated indicator keywords, number of associated indicator keywords, number of times each hyperlink was identified, and date of posting; ranking on the computer, by one or more processors, the plurality of hyperlinks according to weight indicating a relative likelihood that respective hyperlinks point to pirated media content in a ranked list; determining, by one or more processors on the computer, whether each of the identified plurality of hyperlinks associated with one or more of the plurality of instances has met one or more thresholds, wherein the one or more thresholds are based on at least one or more of: a ratio of strong indicator keywords to weak indicator keywords, a number of strong indicator keywords, and a proximity of each of the plurality of hyperlinks to the one or more indicator keywords; responsive to determining one or more of the identified plurality of hyperlinks has not met the one or more thresholds, the computer removing the one or more of the identified plurality of hyperlinks from the ranked list; displaying the ranked list on a display screen of said computer to direct a user to one or more web pages that may contain pirated media content of copyrighted media; and said user identifying those web pages providing illegal access to said copyrighted media for protection thereof.
2. The method of claim 1 , further comprising: displaying, by one or more processors, a first portion of the rankings of the plurality of hyperlinks according to weight in a first view; receiving, by one or more processors, an input to display a second portion of the rankings of the plurality of hyperlinks according to weight in a second view; and displaying, by one or more processors, the second portion of the rankings of the plurality of hyperlinks according to weight in the second view.
3. The method of claim 1 , wherein weighting the identified plurality of hyperlinks further composes: identifying, by one or more processors, a primary hyperlink for each of the plurality of hyperlinks; and determining, by one or more processors, a number of times respective primary hyperlinks have been identified.
4. A method for identifying and ranking URL hyperlinks to possible pirated media content, the method comprising: searching on a computer, by one or more processors, a web page from a first website for one or more indicator keywords, wherein a strength of an indicator keyword is related to a likelihood of pirated media content; responsive to the computer locating a plurality of instances of the one or more indicator keywords, the computer identifying, by one or more processors, a plurality of hyperlinks respectively associated with one or more of the plurality of instances; weighting on the computer, by one or more processors, the identified plurality of hyperlinks based on at least one of a strength of associated indicator keywords, number of associated indicator keywords, number of times each hyperlink was identified, and date of posting; identifying a primary hyperlink for each of the plurality of hyperlinks and determining a number of times respective primary hyperlinks have been identified; ranking on the computer, by one or more processors, the plurality of hyperlinks according to weight indicating a relative likelihood that respective hyperlinks point to pirated media content in a ranked list; determining, by one or more processors on the computer, whether each of the identified plurality of hyperlinks associated with one or more of the plurality of instances has met one or more thresholds, wherein the one or more thresholds are based on at least one or more of: a ratio of strong indicator keywords to weak indicator keywords, a number of strong indicator keywords, and a proximity of each of the plurality of hyperlinks to the one or more indicator keywords; and responsive to determining one or more of the identified plurality of hyperlinks has not met the one or more thresholds, removing, by one or more processors, the one or more of the identified plurality of hyperlinks from the ranked list; displaying on a display screen of said computer a first portion of the rankings of the plurality of hyperlinks according to weight in a first view; receiving, by one or more processors on the computer, an input to display a second portion of the rankings of the plurality of hyperlinks according to weight in a second view; displaying on the display screen of said computer the second portion of the rankings of the plurality of hyperlinks according to weight in the second view; and a user viewing said rankings of the plurality of hyperlinks to direct the user to one or more web pages that may contain pirated media content of copyrighted media and identifying those web pages providing illegal access to said copyrighted media for protection thereof.
5. The method of claim 4 , further comprising: searching on the computer, by one or more processors, a second website for one or more indicator keywords in text posted to one or more web pages of the second website; responsive to locating a second plurality of instances of the one or more indicator keywords, identifying on the computer, by one or more processors, a second plurality of hyperlinks associated with one or more of the second plurality of instances; weighting on the computer, by one or more processors, the identified the second plurality of hyperlinks based on at least one of: a strength of associated keywords, number of associated keywords, number of times each hyperlink was identified, and date of posting; and ranking on the computer the second plurality of hyperlinks according to weight.
6. The method of claim 5 , further comprising: weighting on the computer, by one or more processors, the first and second website based on at least one of: an average of the number of the plurality of hyperlinks identified per web page of each of the websites and a number of the plurality of hyperlinks provided by the web pages of each website; and ranking on the computer, by one or more processors, the first and second website according to weight.
7. A non-transitory computer program product for identifying and ranking URL hyperlinks to possible pirated media content, the non-transitory computer program product comprising: one or more non-transitory computer-readable storage media; program instructions stored on the one or more non-transitory computer readable storage media which when executed by one or more processors, perform: searching on a computer, by one or more processors, a web page from a first website for one or more indicator keywords, wherein a strength of an indicator keyword is related to a likelihood of pirated media content; responsive to the computer locating a plurality of instances of the one or more indicator keywords, the computer identifying, by one or more processors, a plurality of hyperlinks associated with one or more of the plurality of instances; weighting on the computer, by one or more processors, the identified plurality of hyperlinks based on at least one of: a strength of associated keywords, number of associated keywords, number of times each hyperlink was identified, and date of posting; ranking on the computer, by one or more processors the plurality of hyperlinks according to weight indicating a relative likelihood that respective hyperlinks point to pirated media content in a ranked list; determining, by one or more processors on the computer, whether each of the identified plurality of hyperlinks associated with one or more of the plurality of instances has met one or more thresholds, wherein the one or more thresholds are based on at least one or more of: a ratio of strong indicator keywords to weak indicator keywords, a number of strong indicator keywords, and a proximity of each of the plurality of hyperlinks to the one or more indicator keywords; responsive to determining one or more of the identified plurality of hyperlinks has not met the one or more thresholds, the computer removing the one or more of the identified plurality of hyperlinks from the ranked list; displaying the ranked list on a display screen of said computer to direct a user to one or more web pages that may contain pirated media content of copyrighted media; and said user identifying those web pages providing illegal access to said copyrighted media for protection thereof.
8. The non-transitory computer program product of claim 7 , further comprising program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: displaying, by one or more processors, a first portion of the rankings of the plurality of hyperlinks according to weight in a first view; receiving on the computer, by one or more processors, an input to display a second portion of the rankings of the plurality of hyperlinks according to weight in a second view; and displaying, by one or more processors, the second portion of the rankings of the plurality of hyperlinks according to weight in the second view.
9. The non-transitory computer program product of claim 8 , wherein weighting the identified plurality of hyperlinks further comprises program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: identifying on the computer, by one or more processors, a primary hyperlink for each of the plurality of hyperlinks; and determining, by one or more processors, a number of times respective primary hyperlinks have been identified.
10. The non-transitory computer program product of claim 7 , further comprising program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: searching on the computer, by one or more processors, a second website for one or more indicator keywords in text posted to one or more web pages of the second website; responsive to locating a second plurality of instances of the one or more indicator keywords, identifying, by one or more processors, a second plurality of hyperlinks associated with one or more of the second plurality of instances; weighting on the computer, by one or more processors, the identified the second plurality of hyperlinks based on at least one of: a strength of associated keywords, number of associated keywords, number of times each hyperlink was identified, and date of posting; and ranking on the computer, by one or more processors, the second plurality of hyperlinks according to weight.
11. The non-transitory computer program product of claim 10 , further comprising program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: weighting on the computer, by one or more processors, the first and second website based on at least one of: an average of the number of the plurality of hyperlinks identified per web page of each of the websites and a number of the plurality of hyperlinks provided by the web pages of each website; and ranking, by one or more processors, the first and second website according to weight.
12. The non-transitory computer program product of claim 7 , wherein the program instructions stored on the one or more computer-readable storage media were downloaded over a network from a remote computing system.
13. The computer program product of claim 7 , wherein the program instructions stored on the one or more non-transitory computer-readable storage media are stored on at least one computer-readable storage medium of a server computing system, and wherein the program instructions can be downloaded over a network by a remote computing system for storage on at least one computer-readable storage medium of the remote computing system.
14. A computer system for identifying and ranking URL hyperlinks to possible pirated media content, the computer system comprising: one or more computer processors; one or more computer-readable storage media; program instructions stored on the one or more computer-readable storage media for execution by at least one of the one or more computer processors, which when executed, perform: searching on a computer, by one or more processors, a web page from a first website for one or more indicator keywords, wherein a strength of an indicator keyword is related to a likelihood of pirated media content; responsive to the computer locating a plurality of instances of the one or more indicator keywords, the computer identifying, by one or more processors, a plurality of hyperlinks associated with one or more of the plurality of instances; weighting on the computer, by one or more processors, the identified plurality of hyperlinks based on at least one of: a strength of associated keywords, number of associated keywords, number of times each hyperlink was identified, and date of posting; ranking on the computer, by one or more processors, the plurality of hyperlinks according to weight indicating a relative likelihood that respective hyperlinks point to pirated media content in a ranked list; determining, by one or more processors on the computer, whether each of the identified plurality of hyperlinks associated with one or more of the plurality of instances has met one or more thresholds, wherein the one or more thresholds are based on at least one or more of: a ratio of strong indicator keywords to weak indicator keywords, a number of strong indicator keywords, and a proximity of each of the plurality of hyperlinks to the one or more indicator keywords; responsive to determining one or more of the identified plurality of hyperlinks has not met the one or more thresholds, the computer removing the one or more of the identified plurality of hyperlinks from the ranked list; displaying the ranked list on a display screen of said computer to direct a user to one or more web pages that may contain pirated media content of copyrighted media; and said user identifying those web pages providing illegal access to said copyrighted media for protection thereof.
15. The computer system of claim 14 , further comprising program instructions, stored on the one or more computer-readable storage mcdia, which when executed by one or more processors, perform: displaying, by one or more processors, a first portion of the rankings of the plurality of hyperlinks according to weight in a first view; receiving, by one or more processors, an input to display second portion of the rankings of the plurality of hyperlinks according to weight in a second view; and displaying, by one or more processors, the second portion of the rankings of the plurality of hyperlinks according to weight in the second view.
16. The computer system of claim 15 , wherein weighting the identified plurality of hyperlinks further comprises program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: identifying, by one or more processors, a primary hyperlink for each of the plurality of hyperlinks; and determining, by one or more processors, a number of times respective primary hyperlinks have been identified.
17. The computer system of claim 14 , further comprising program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: searching, by one or more processors, a second website for one or more indicator keywords in text posted to one or more web pages of the second website; responsive to locating a second plurality of instances of the one or more indicator keywords, identifying, by one or more processors, a second plurality of hyperlinks associated with one or more of the second plurality of instances; weighting, by one or more processors, the identified the second plurality of hyperlinks based on at least one of: a strength of associated keywords, number of associated keywords, number of times each hyperlink was identified, and date of posting; and ranking the second plurality of hyperlinks according to weight.
18. The computer system of claim 17 , further comprising program instructions, stored on the one or more computer-readable storage media, which when executed by one or more processors, perform: weighting, by one or more processors, the first and second website based on at least one of: an average of the number of the plurality of hyperlinks identified per web page of each of the websites and a number of the plurality of hyperlinks provided by the web pages of each website; and ranking, by one or more processors, the first and second website according to weight.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
September 30, 2013
December 15, 2015
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.