Press Release

May 21, 2017

Pentaho Scales Spark Across the Enterprise

Pentaho 7.1 leapfrogs industry with innovative approach to support Spark for data integration, expands to Microsoft Azure, adds Hortonworks security

May 22, 2017, San Francisco, CA - Building on the company's vision of creating a single consistent experience across the entire data pipeline, Pentaho, a Hitachi Group Company, today announced Pentaho Business Analytics 7.1. Highlights of this release include adaptive execution on any engine for big data processing, starting with Spark; expanded cloud integration with Microsoft Azure HDInsight; enterprise-level security for Hortonworks; and improved in-line visualizations.

Build Once, Execute on Any Engine, Starting with Spark
Unlike offerings from other vendors, Pentaho 7.1 supports Spark with virtually all of its data integration steps in a visual drag-and-drop environment, and provides the freedom to choose an execution engine at run-time. Other vendors require users to create Spark-specific data integration logic, often requiring advanced Java programming skills, at a time when developer talent shortages are a reality. With adaptive execution, Pentaho 7.1 makes big data developers twice as productive and expands the profile of technology talent who can work with Spark across the enterprise. While this release starts with Spark support, the architecture sets the stage for users to execute on the best engine for any given data workload in the future, insulating customers from changes in emerging technologies.

"Big data will continue to create complexity, but that shouldn't inhibit enterprise success," said Donna Prlich, Chief Product Officer, Pentaho, a Hitachi Group Company. "Teams of data engineers, data scientists and analysts can now work in a single environment that eliminates multiple tools, complex coding and provides a consistent user experience across the data pipeline. This release significantly advances our vision for a single analytic data workflow."

Leverage Additional Cloud Deployments with Microsoft Azure HDInsight
Pentaho 7.1 recognizes the growing momentum of enterprise cloud adoption alongside the continuing need for flexible on-premises deployment and processing, especially in big data and IoT environments that use machine learning and AI. Building on current cloud support for Amazon EMR, the new version supports Microsoft Azure HDInsight, Azure SQL and Azure SQL Server, offering more options to store and process big data in hybrid, on-premises, and public cloud environments.

Ensure Enterprise-Level Security for Hortonworks
Concerns over the lack of comprehensive security and authentication for big data environments are a reality. Pentaho, a leader in big data governance, builds on its existing enterprise-level security for Cloudera in version 7.1 by adding similar security for Hortonworks, with Kerberos impersonation support to protect clusters from intrusion. Pentaho 7.1 also adds Apache Ranger support for authorization and role-based access to specific data sets and applications in Hortonworks deployments. This ensures that business access rules are enforced across Hadoop data and components, extends security support to protect vital customer resources, and reduces risk. Providing similar enterprise-level security for both Cloudera and Hortonworks also gives Pentaho customers more options.

Enhanced Data Visualization Across the Pipeline
Pentaho 7.1 provides even more access to visualizations during data preparation, allowing users to spot-check data for quality issues and prototype analytic data at every stage of the data pipeline, without switching in and out of tools or waiting until the very end to find data quality problems. With Pentaho 7.1, users can now interact with heat grids, geo maps, and sunbursts, as well as drill down into data sets for further exploration. Users can leverage an easy-to-use, flexible, and fully documented API to bring in visualizations from third-party libraries such as D3 or FusionCharts, making third-party visualizations reusable across the entire Pentaho platform.


  • Learn more about Pentaho 7.1
  • Register for the live demo of Pentaho 7.1 on June 7
  • Visit us at booth #K208 at Strata Data Conference in London the week of May 22nd
  • Register for PentahoWorld on October 25-27, 2017. Don't miss your chance to hear Pentaho experts, customers and partners share best practices on how big data can drive growth for your business
  • Follow us on LinkedIn, Twitter and Facebook

About Pentaho, a Hitachi Group company
Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho's unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho's mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, EMC, NASDAQ and Sears Holdings Corporation. For more information visit
