HPCC Systems

  • hpcc-systems
  • HPCC_System_Diagram
  • HPCC-system

Place Category: Software DevelopmentPlace Tags: HPCC Systems and Tier 2

  • Profile

    HPCC Systems is a powerful, open-source, enterprise-proven big data analytics platform. It helps businesses of all sizes find the answers they need by making data easier to process, analyze, and understand.

    Born from the deep data analytics history of LexisNexis® Risk Solutions, HPCC Systems provides high-performance, parallel processing and delivery for applications using big data.

    The open-source platform incorporates a software architecture implemented on commodity shared-nothing computing clusters for resilience and scalability. It is configurable to support both parallel batch data processing and high-performance data delivery applications using indexed data files. The platform includes a high-level, implicitly parallel data-centric declarative programming language that adds to its flexibility and efficiency.

    Developers, data scientists and technology leaders adopt HPCC Systems because it is cost-effective, comprehensive, fast, powerful and scalable.

    Ultimately, it makes managing big data easier.

    The HPCC Systems Platform

    The HPCC Systems platform is a set of easy-to-use software features, enabling developers and data scientists to process and analyze data at any scale. With a strong commitment to the open source community, the HPCC Systems platform is available free of licensing and service costs.

    HPCC Systems provides all the functionality to execute a data project. Specifically, the HPCC Systems stack comprises of:

    Thor: The Data Refinery Cluster

    Known as “Thor” after the hammer-wielding god of thunder, this cluster is designed to execute big data workflows including extraction, loading, cleansing, transformations, linking and indexing.

    Data Management Tools

    Data Profiling, Data Cleansing, Snapshot Data Updates and consolidation, Job Scheduling and automation are some of the key features.

    ROXIE: The Data Delivery Engine

    Rapid data delivery cluster provides separate high-performance online query delivery for big data. ROXIE (Rapid Online XML Inquiry Engine) utilizes highly optimized distributed B-tree indexed data structures conceived for high concurrent use.

    Predictive Modeling Tools

    In place (supporting distributed linear algebra) predictive modeling functionality to perform Linear Regression, Logistic Regression, Decision Trees and Random Forests.
  • Photos
  • Video