Legal claims defining the scope of protection, as filed with the USPTO.
1. A recommendation method using rough-set and multiple features mining integrally, said method comprising a training session and a prediction session, said training session building association rules, user clusters and rating tables, said training session comprising steps of: (a) providing data including user profiles, user rating logs and item contents; (b) pre-processing said data to obtain a transaction table; (c) associating transactions in said transaction table to obtain a plurality of associations to further obtain a plurality of association rules through data mining to be saved in an association rule database; (d) obtaining said user rating logs to divide users in said user rating logs into user clusters through a clustering algorithm to be saved in a user cluster database; and (e) analyzing said transactions in said transaction table to re-symbolize items into item categories through a statistical analysis and reorganizing user rating logs to obtain rating averages of said item categories and to further obtain a rating table of said re-symbolized item categories, said prediction session applying rough-set and statistical analysis prediction to obtain predicted rating values from said user rating logs, said prediction session comprising steps of: (f) finding a user cluster of related users to a target user from said user clusters to obtain a rating table of said related users and said target user; (g) based on said association rules, predicting unknown values in said rating table other than rating value of a target item of said target user to obtain a complete sub-matrix; (h) obtaining a class item, a referred item and a plurality of item sets in said sub-matrix, obtaining a plurality of first elementary sets by dividing said users with said class item, obtaining a plurality of second elementary sets by dividing said users with said item sets, and comparing said first elementary sets and said second elementary sets to obtain a lower approximation through a rough-set algorithm using a user cardinality constraint and an item cardinality constraint to further obtain a predicted rating value of said target item of said target user; (i) obtaining predicted rating values of said item categories in said rating table obtained through said statistical analysis prediction in said training session to further obtain another predicted rating value of said target item of said target user; and (j) obtaining a final predicted rating value of said target item of said target user through a switch-based mixing, wherein a first standard deviation is pre-set as a threshold; wherein said predicted rating value obtained through said statistical analysis prediction is obtained as said final predicted rating value of said target item on obtaining a second standard deviation bigger than said threshold, said second standard deviation being a standard deviation of past rating values of the same item category as that of said target item; and wherein said predicted rating value obtained through said rough-set algorithm is obtained as said final predicted rating value on obtaining said second standard deviation not bigger than said threshold.
2. The recommendation method according to claim 1 , wherein, in step (d), a Pearson correlation coefficient is used in said clustering algorithm to divide users into said user clusters based on similarities of said users to said target user.
3. The recommendation method according to claim 1 , wherein said clustering algorithm is a K-means algorithm.
4. The recommendation method according to claim 1 , wherein, in step (f), similarities of centers of said user clusters to said target user on rating logs are obtained through a distance formula of Pearson correlation coefficient to obtain a plurality of said user clusters having a nearest distance to said target user.
5. The recommendation method according to claim 1 , wherein, in step (g), a distance formula of Pearson correlation coefficient is used to obtain a similarity of each item in said sub-matrix to said target item and said distance formula is as follows: γ = ∑ XY - ∑ X ∑ Y N ( ∑ X 2 - ( ∑ X ) 2 N x ) ( ∑ Y 2 - ( ∑ Y ) 2 N y ) .
6. The recommendation method according to claim 1 , wherein said user cardinality constraint is used to limit a size of said lower approximation.
7. The recommendation method according to claim 1 , wherein said item cardinality constraint is used to limit a number of said item sets on building said second elementary sets.
8. The recommendation method according to claim 1 , wherein step (h) further comprises steps of: (h 1 ) obtaining said sub-matrix built in step (g); (h 2 ) obtaining similarities of items in said sub-matrix to said target item and obtaining an item having the highest similarity as a class item; (h 3 ) obtaining similarities of items in said sub-matrix other than said class item to said target item to obtain a referred item by setting an item cardinality constraint in said rough-set algorithm and to obtain item sets most related to said target item through sorting; (h 4 ) based on said class item, dividing said users into user clusters to obtain a plurality of first elementary sets and, based on said item sets, dividing said users other than said target user having rating logs of equivalent class into user clusters to obtain a plurality of second elementary sets; (h 5 ) comparing said first elementary sets and said second elementary sets to obtain items completely included in both groups of sets as a lower approximation; (h 6 ) obtaining related items in the lower approximation by setting a user cardinality constraint in said rough-set algorithm, wherein step (h 3 ) is went back on number of said related items in the lower approximation bigger than said user cardinality constraint; and (h 7 ) based on said lower approximation, obtaining predicted rating value of said target item of said target user while rating value of said target item is similar to rating value of said item set.
9. The recommendation method according to claim 1 , wherein said standard deviation used in said switch-based mixing has a formula as follows: α = 1 N ∑ i = 1 N ( x i - x _ ) 2 .
Unknown
January 1, 2013
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.