Pentaho Announces New Data Quality Solution

Pentaho Offers Better Data for Better Analytics by Integrating Human Inference Data Quality

April 17, 2012, — Delivering the future of business analytics, Pentaho Corporation today announced the availability of new data quality capabilities for Pentaho Business Analytics that brings consistent, accurate, and trustworthy data to business applications. Pentaho has partnered with Human Inference, a visionary data quality company, to tightly integrate their platform EasyDQ with Pentaho Business Analytics to help companies manage data quality enterprise-wide.

Tightly integrated into the data integration capability of Pentaho Business Analytics and delivered via the cloud or on-premise, Human Inference enables customers to quickly build business intelligence applications with more accurate data, driving better and faster decisions.

Data Quality is a major step in Pentaho’s roadmap to build the future of analytics. This data quality component includes:

  • Data Profiling
  • Name Validation, Standardization and Cleansing
  • Address Validation, Standardization and Cleansing
  • E Mail and Telephone Validation, Standardization and Cleansing
  • Duplicate Detection and Merge Duplicates

The solution is available immediately and can be downloaded as a plug-in for the existing Pentaho Data Integration / Kettle releases 4.2.x and later.

“Dirty data remains a major barrier in providing accurate and timely business analytics to end users. Yet until now, the cost and complexity of existing data quality solutions meant that many companies simply could not integrate or include data quality as part of their overall business analytics operations,” said Barry Godthelp, VP Sales, Human Inference. “With Pentaho Business Analytics integration with Human Inference, Pentaho takes analytics to the next level, on-ramping customers get better data and faster decisions, combined with the freedom of choice in delivery method and pricing.”

“Data quality is often requested functionality by our customers,” said Matt Casters, Founder and Chief Architect Pentaho Kettle Project, Pentaho. “With this integration, Pentaho adds another major component to Pentaho Business Analytics and in doing so knocks down the barriers of cost, integration and deployment flexibility.”

About Pentaho, a Hitachi Group company
Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho’s mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, EMC, NASDAQ and Sears Holdings Corporation. For more information visit

You’re in the Right Place!

Hitachi Data Systems, Pentaho and Hitachi Insight Group have merged into one company: Hitachi Vantara.

The result? More data-driven solutions and innovation from the partner you can trust.

You’re in the Right Place!

REAN Cloud is now a part of Hitachi Vantara.
The result? Robust data-driven solutions and innovation, with industry-leading expertise in cloud migration and modernization.

You’re in the Right Place!

Hitachi Consulting and Hitachi Vantara have integrated into a new company under the Hitachi Vantara brand. We help you connect what’s now to what’s next.

You’re in the Right Place!

Waterline Data is now Lumada Data Catalog, provided by Hitachi Vantara. Lumada Data Catalog, available stand-alone, is now part of the Lumada Data Services portfolio.