Hamburger Hamburger Hamburger

Pentaho Platform: Pentaho Business Analytics - Datasheet

End-to-End Data Integration and Analytics at Enterprise Scale

Download PDF

Pentaho data integration and analytics technology enables organizations to access, prepare, and analyze all data from any source, in any environment. Pentaho Business Analytics is part of Pentaho Platform Data Integration and Analytics, which provides intelligent data management for digital innovation.

The Open, Embeddable Technology of Pentaho Business Analytics

Supports flexible analytics that both leverage existing data infrastructure and future-proof deployments against tomorrow’s inevitable change (see Figure 1).

By tightly coupling data integration with business analytics, this Pentaho technology brings together IT and business users to ingest, prepare, blend and analyze all data that impacts business results. Its intuitive data integration and preparation capabilities drastically reduce the hand coding required to bring data together for insight. At the same time, Pentaho Business Analytics provides a spectrum of analytics for all user roles, from visual data analysis for business analysts to tailored dashboards for all audiences—from business executives to front line workers. Pentaho software is fast to deploy, easy to use, and purpose-built for the future of analytics.

Data Integration

Organizations face an increasing challenge to manage and extract value from a growing variety and volume of data across their edge-to-cloud infrastructure. With Pentaho Data Integration, organizations can access data from complex and heterogeneous sources and blend it with existing relational data to produce high-quality, ready-to-analyze information— all without writing a line of code. A rich graphical user interface paired with a powerful multithreaded transformation engine offers high-performance ETL (extract, transform and load) capabilities that cover all data integration needs, including big data ingestion and processing.

Pentaho Data Integration Features

  • Intuitive drag-and-drop interface to simplify the creation of analytic data pipelines (see Figure 2).
  • Broad connectivity to virtually any data source, either on premises or in the cloud, including flat files, relational database management systems (RDBMS), APIs and more.
  • Integration with transactional databases, including Oracle, IBM® DB2®, PostgreSQL, MySQL and others.
  • Access to data in enterprise applications, including SAP,, Google Analytics and more.
  • Rich library of prebuilt components to access, prepare, blend and cleanse data.
  • Direct access to complete analytics, including charts, visualizations and reporting from any step of Pentaho Data Integration.
  • Robust orchestration capabilities to coordinate complex workflows, including scheduling and alerts.
  • Support for advanced analytic model development in R, Python, Scala and Weka that incorporate libraries, such as scikit-learn, Spark MLib, Tensorflow and Keras, into the data flow.
  • Enterprise-grade administration, scalability, load balancing, and security capabilities.
  • Native integration with the Data Catalog, a part of Pentaho Platform.

Big Data

Pentaho technology enables companies to realize business value from large volumes of diverse data by dramatically reducing the time and complexity required to design, develop and deploy big data analytics. Pentaho technology covers the entire big data life cycle, from data extraction and preparation of diverse data, to scalable processing on Spark and Hadoop, leading to end-to-end analytics solutions.

The Leading Solution for Big Data Integration and Analytics

  • Visual design environment for blending multiple big data sources (see Figure 3) and processing data at scale.
  • Integration with leading Hadoop distributions, object stores, NoSQL stores and analytic databases, as well as log file data and JSON/XML formats.
  • Code-free data transformation design that empowers 15 times faster productivity versus hand coding and executes Spark or Hadoop jobs in clusters for high performance.
  • Operationalize with Spark stream and batch job execution, SQL on Spark connectivity, Kafka access and more.
  • Seamlessly switch between execution engines such as Spark and Pentaho software’s native engine to fit data volume, velocity and transformation complexity.
  • Template-based approach to rapidly onboard data sources into Hadoop via metadata injection feature set.
  • Adaptive big data layer that enables smooth portability of transformations across different Hadoop and Spark distributions.
  • Pragmatic solutions to deliver on-demand data marts in a big data environment.

“Using Pentaho, we are now helping clients blend a 360-degree view of all equipment data sources to enable early prediction of potential machinery failure.”

– Caterpillar Marine Asset Intelligence

Multicloud Support

Extend the benefits of the open, extensible Pentaho technology to address a wide range of needs in multi, hybrid and private cloud deployments. Modern Pentaho data architecture simplifies management of your increasingly distributed data architecture with a single data management tool.

End-to-End Data Platform

  • Connectivity to cloud storage and computing in AWS, Google Cloud and Microsoft Azure.
  • Robust support, including bulk loading, for popular cloud data warehouses, like Amazon Redshift, Snowflake and Google BigQuery.
  • Big data processing in Amazon EMR, Google DataProc and Azure HDInsight environments.
  • Filtering and contextual analysis of streaming data in AWS Kinesis and Kafka.

Business Analytics

Pentaho Business Analytics provides a modern, highly interactive, and intuitive web-based interface for business users to discover and explore virtually any data. With a full spectrum of analytics tools, users can create reports and dashboards as well as visualize and analyze data across multiple dimensions without dependence on IT or developers. Meanwhile, IT benefits from secure, scalable and governed analytics for the whole enterprise. Pentaho software can be deployed on premises or in the cloud and can be seamlessly embedded into other software applications. Pentaho Business Analytics provides:

Ad Hoc Analysis and Visualization

  • A rich library of interactive visualizations such as geographic maps, heat grids, bubble charts and more (see Figure 4).
  • Extreme scale of in-memory data caching for speedof- thought analysis on large data volumes using a drag-and-drop paradigm.
  • Visual lasso filtering and zooming to understand or exclude outliers.
  • Attribute highlighting for better visual contrast among data displays.
  • The ability to drill down into supporting reports for detailed data.


CERN Turns to Pentaho to Optimize Operations - Read the case study

“After reviewing five different proprietary and open source platforms, Pentaho emerged as the best. I believe it is a future-proof solution that will help us to ensure data governance, ‘one version of the truth,’ and a great user experience.”

– CERN, the European Organization for Nuclear Research

Flexible Dashboards

  • Web-based drag-and-drop dashboard designer for business users.
  • Portal and mashup integration to seamlessly embed business analytics with other web applications.
  • Rich visualizations with navigation, drill-down capabilities and a library of filter controls.
  • Advanced dashboard framework for 100% tailored user experiences.

Enterprise Reporting

  • Full support for operational reports, parameterized reports, and interactive self-service reporting against transactional databases.
  • Intuitive web-based interactive reporting for business users.
  • Rich graphical pixel-perfect report designer for power users.


Embedded Analytics

Flexible, cloud-ready Pentaho technology is purpose-built for embedding into and integrating with your applications, portals and processes. Our powerful analytics and extensible architecture ensure that you can get to market quickly and delight your customers. Our embedded analytics solution offers:

  • The ability to seamlessly embed real-time visualizations reports and dashboards into existing applications (See Figures 5 and 6). 
  • Highly customizable web-based user interface and robust web APIs that offer maximum control over the look, feel and user experience.
  • Flexible capabilities for multitenant deployment as well as rich single sign-on and security integration.
  • Tailored training and access to architect-level staff who have made hundreds of organizations successful. 
Any Analytics, Any Data, Simplified

Pentaho technology addresses the barriers that block organizations from getting value from all of their data. It simplifies the process of preparing and blending any data and includes a spectrum of tools to easily analyze, visualize, explore, report and predict. Open, embeddable and extensible, Pentaho software is architected to ensure that each member of your team—from developers to business users—can easily translate data into value.

Pentaho and Lumada

Pentaho is now part of Pentaho Platform, which provides intelligent data management for digital innovation through advanced insights based on trusted data. Pentaho Platform is open and modular to deliver AI-driven automation and collaboration, and includes: Data Integration and Analytics, Data Catalog and Data Optimizer for Hadoop. Pentaho Platform is built with Pentaho technology that includes Pentaho Business Analytics and Pentaho Data Integration.

Learn more about Pentaho Platform: Pentaho Business Analytics.

Learn More Learn More

We Are Hitachi Vantara

Hitachi Vantara solves digital challenges by guiding you from what’s now to what’s next. Our unmatched industrial and digital capabilities benefit both business and society

{ "FirstName": "First Name", "LastName": "Last Name", "Email": "Business Email", "Title": "Job Title", "Company": "Company Name", "Address": "Address", "City": "City", "State":"State", "Country":"Country", "Phone": "Business Telephone", "LeadCommentsExtended": "Additional Information(optional)", "LblCustomField1": "What solution area are you wanting to discuss?", "ApplicationModern": "Application Modernization", "InfrastructureModern": "Infrastructure Modernization", "Other": "Other", "DataModern": "Data Modernization", "GlobalOption": "If you select 'Yes' below, you consent to receive commercial communications by email in relation to Hitachi Vantara's products and services.", "GlobalOptionYes": "Yes", "GlobalOptionNo": "No", "Submit": "Submit", "EmailError": "Must be valid email.", "RequiredFieldError": "This field is required." }