With Pentaho Data Integration(PDI), a Lumada DataOps Suite product, managing the enormous volumes and increased variety and velocity of data entering organizations is simplified. PDI delivers analytics-ready data to end users faster with visual tools that reduce time and complexity. Without writing SQL or coding in Java or Python, organizations immediately gain real value from their data, from sources like files, relational databases, Hadoop and more, which are in the cloud or on premises.
Turn Big Data Into Actionable Analytics
Pentaho’s adaptive big data layer allows you to plug into popular big data stores with flexibility and insulation from change. Data can be accessed once, then processed, combined and consumed anywhere. Pentaho’s adaptive big data layer includes plug-ins for Hadoop distributions and object stores from Cloudera, Hortonworks, MapR (HPE Ezmeral Data Fabric), Amazon Web Services, Google Cloud and Microsoft Azure, object stores such as Hitachi Content Platform, as well as popular NoSQL databases like MongoDB and Cassandra.
Integrate and Blend Big Data With Existing Enterprise Data
With broad connectivity to any data type and high-performance Spark and MapReduce execution, Pentaho simplifies and speeds the process of integrating existing databases with new sources of data. Pentaho Data Integration’s graphical designer includes:
Big Data Processing Performance and Productivity
Pentaho speeds performance time and reduces the complexity of integrating big data sources. Pentaho provides:
Broad Connectivity and Data Delivery
Pentaho Data Integration offers broad connectivity to a variety of diverse data, including all popular structured, unstructured and semi-structured data sources. Some examples include:
To increase the performance of data extraction, loading and delivery processes, Pentaho offers the following capabilities:
Data Profiling and Data Quality
Pentaho provides data profiling capabilities, such as row counts, mathematical functions and identification of null values, as well as data quality operators, such as string manipulators, mapping functions, filtering and sorting. For name and address verification capabilities, Pentaho integrates with leading data quality vendors, such as Human Inference and Melissa Data. Pentaho data profiling and data quality capabilities help:
Powerful Administration and Management
Pentaho Data Integration provides out-of-the box capabilities for managing operations for data integration projects. These capabilities include:
– Warren Chang, VP of Engineering, Borderfree