By preparing data from any source and automating your data pipeline, Pentaho Data Integration helps you curate data for your business users. The software delivers business analytics to end users faster, with visual tools that reduce time and complexity without requiring SQL, Java or Python coding. Organizations can draw value from data in files, relational databases, Hadoop and other sources, whether in the cloud or on premises.
Pentaho Data Integration’s adaptive big data layer allows you to plug into popular big data stores with flexibility and insulation from change. Data can be accessed once, then processed, combined and consumed anywhere. The adaptive big data layer includes plug-ins for Hadoop distributions and object stores from Cloudera, Hortonworks, MapR (HPE Ezmeral Data Fabric), Amazon Web Services, Google Cloud and Microsoft Azure; for object stores such as Hitachi Content Platform; and for popular NoSQL databases such as MongoDB and Cassandra.
With broad connectivity to any data type and high-performance Spark and MapReduce execution, Pentaho technology simplifies and speeds the process of integrating existing databases with new sources of data. Pentaho Data Integration’s graphical designer includes:
Pentaho Data Integration improves performance, reduces the complexity of integrating big data sources, and provides:
Pentaho Data Integration offers broad connectivity to a wide variety of data, including all popular structured, unstructured and semi-structured data sources. Some examples include:
To increase the performance of data extraction, loading and delivery processes, Pentaho offers the following capabilities:
Pentaho technology provides data profiling capabilities, such as row counts, mathematical functions and identification of null values, as well as data quality operators, such as string manipulators, mapping functions, filtering and sorting. For name and address verification capabilities, Pentaho technology integrates with leading data quality vendors, such as Human Inference and Melissa Data. Pentaho data profiling and data quality capabilities help:
Pentaho Data Integration provides out-of-the-box capabilities for managing operations for data integration projects. These capabilities include: