{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-8472728","patent":{"patent_number":"US-8472728","title":"System and method for identifying and characterizing content within electronic files using example sets","assignee":null,"inventors":[],"filing_date":"2009-10-30T00:00:00.000Z","publication_date":"2013-06-25T00:00:00.000Z","cpc_codes":["G06V","G06F","G06V"],"num_claims":26,"abstract":"A system and method for determining inappropriate content within images. A plurality of training images are used to teach the machine. The training images are converted into numerical data and stored along with its human judged label in a BigMatrix. Through the BigMatrix, a RandomForest is created to discern patterns among the training images and human-judged labels. To determine whether an image contains inappropriate content, the image is converted into numerical data. The numerical data is fed to the RandomForest generated from the plurality of training images and known content. The numerical data is fed down each tree within the RandomForest. When the numerical data is routed down through the branches of the trees and terminated at a leaf node, a vote for the leaf node is obtained. The overall response of the RandomForest is given by a majority rules vote for each tree within the RandomForest."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"System and method for identifying and characterizing content within electronic files using example sets","description":"A system and method for determining inappropriate content within images. A plurality of training images are used to teach the machine. The training images are converted into numerical data and stored ","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-8472728","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-8472728","citation_suggestion":"Patentable. \"System and method for identifying and characterizing content within electronic files using example sets\" (US-8472728). https://patentable.app/patents/US-8472728","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-8472728","json":"https://patentable.app/api/llm-context/US-8472728","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-31T14:01:26.853Z"}