{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11249876","patent":{"patent_number":"US-11249876","title":"System and method for predicting application performance for large data size on big data cluster","assignee":null,"inventors":[],"filing_date":"2018-08-21T00:00:00.000Z","publication_date":"2022-02-15T00:00:00.000Z","cpc_codes":["G06F","G06F","G06F","G06F","G06F","G06F","G06F"],"num_claims":5,"abstract":"A system and method for estimating execution time of an application with Spark™ platform in a production environment. The application on Spark™ platform is executed as a sequence of Spark jobs. Each Spark job is executed as a directed acyclic graph (DAG) consisting of stages. Each stage has multiple executors running in parallel and the each executor has set of concurrent tasks. Each executor spawns multiple threads, one for each task. All jobs in the same executor share the same JVM memory. The execution time for each Spark job is predicted as summation of the estimated execution time of all its stages. The execution time constitutes scheduler delay, serialization time, de-serialization time, and JVM overheads. The JVM time estimation depends on type of computation hardware system and number of threads."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"System and method for predicting application performance for large data size on big data cluster","description":"A system and method for estimating execution time of an application with Spark™ platform in a production environment. The application on Spark™ platform is executed as a sequence of Spark jobs. Each S","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11249876","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11249876","citation_suggestion":"Patentable. \"System and method for predicting application performance for large data size on big data cluster\" (US-11249876). https://patentable.app/patents/US-11249876","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11249876","json":"https://patentable.app/api/llm-context/US-11249876","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T11:14:18.839Z"}