Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum

 
CONTINUE READING
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi Vantara Overview
Pentaho 8.0 and 8.1 Roadmap

Pedro Alves
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Safe Harbor Statement

 The forward-looking statements contained in this document represent an outline of
 our current intended product direction. It is provided for information purposes only
 and is not a commitment to deliver any new or enhanced product or functionality, or
 that we will pursue the product direction described. Facts and circumstances may
 occur which may impact current plans, resulting in changes to the information in this
 presentation. This information is current only as of the date it is made and should not
 be relied upon in making purchasing decisions. The development, release (if at all),
 and timing of any features or functionality described for the Pentaho products
 remains at the sole discretion of Pentaho.
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi Vantara
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Advanced                     X as a                       Data
Analytics                    Service                    Scientists
                                           OT/IT
               Software
                                          Expertise

Hitachi Data
  Systems
            Hitachi VantaraLumada
                 Pentaho

                                           Solutions
            Infrastructure
 Partner                      Vertical
                                         and Services   Machine
Ecosystem                    Expertise                  Learning
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi Vantara:
A New Company With 100+ Years of Heritage

         HITACHI              RECOGNIZED
                                                                  HITACHI, LTD.
        VANTARA                CULTURE
                                                               Founded in 1910
   10,000 customers                                           US$81.8B FY16
   7,000 employees                                            950+ subsidiaries
   2,000 partners                                             304,000 employees
   100 countries                                              #1 worldwide for
                        “I’d posit that the reason these
                                                                patent applications
   Trusted by 85% of                                           in big data analysis
                        three values have lasted for
    FORTUNE® Global     more than 100 years at Hitachi –        foundation technology
    100                 and will likely last for 100 more –
                        is because of the emphasis they        #71 in the 2017
                        place on human relationships.”          FORTUNE® Global
                                        – Inc. Magazine
                                                                500
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi Vantara Is Trusted by 85% of the
 FORTUNE Global 100 and 10,000 Customers
     FINANCE             I . T. S E R V I C E S     H E A LT H C A R E              R E TA I L

                                                   First People’s
                                                  Hospital of Yibin

M AN U FAC T U R I N G   SERVICES/OTHER                      MEDIA                  TELECOM

                                                                        Xinhua
                                                                      News Agency
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi Vantara Is Trusted by 85% of the
FORTUNE Global 100 and 10,000+ Customers
  H I TA C H I       CORE INFRASTRUCTURE              H I TA C H I
A N A LY T I C S                                      CONTENT
PORTFOLIO/                                           PORTFOLIO
 P E N TA H O

                                 First People’s
                                 Hospital of Yibin

                                                              Xinhua
                                                              News Agency

                   Xinhua
                   News Agency
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Rely on Real, Ongoing Innovation

  4,900 R&D employees,             Over $5.4B revenue
    $2.8 billion annually           annually from IoT
     committed to R&D
                                  Computerworld:
 2,500+ patents awarded
                                 Hitachi Is #1 Most
    in technologies for
                               Powerful IoT Company
     big data analysis
                                   Thomson Reuters:
119,000+ patents worldwide             “Top 100
                                   Global Innovators”
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Hitachi’s Unique Value

                       CITY            CLOUD

                                               INFRASTRUCTURE

        Only Hitachi Vantara combines
         INDUSTRIAL

       deep IT, OT    OT IoT IT
                108 andISdomain
                             59 expertise
                                                       ANALYTICS

                         IN
          BUSINESS
                      YEARS      OUR   YEARS
                                                  APPLICATIONS
                                 DNA

                      CONSUMER         BIG DATA
Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap - Pedro Alves - it-novum
Every Company Must Now
Be a Technology Company

           We all need to find the most value in our data.
         That’s with big data,
    That means fundamental change to business and operations.

   but Many
       what have
            abouta REALLY
                   long waybig  data?
                            to go:
     Only 40% of industries are truly
 There will be nearly 21 billion connected
           digitized (McKinsey)
       “things” by 2020 (Gartner)
Swimming in Data But Searching for Insight

         The full potential of data is elusive.
     A precious few companies have discovered
        how to find the full value of their data.
           Hitachi Vantara is a partner with
          data-driven solutions and services
     for people who see data as an opportunity.
We Help the Two Basic Business Directives:
Make Money and Save Money

Top-Line Benefit                                                      INTERNET OF
                                                                         THINGS
                                                 BUSINESS
                                                                      Asset Avatars
                                                  INSIGHT             Edge to Outcomes
                                               Data Orchestration    Co-Creation
                                               Embedded
                        DIGITAL
                                                Analytics
                       WORKPLACE               Data Ingest and
INFRASTRUCTURE        Enterprise Mobility      Blending
 MODERNIZATION        Content Intelligence
                      Object
 Private, Public,
  Hybrid
 Flash
 Software-Defined

                                                  Bottom-Line Benefit
Our Core Beliefs

  Data is at the center of all    Your data will outlive the
  you do and is the key to        application that created it and
  your transformation.            the underlying infrastructure.

  We want you to own your data.   We enable you to have your
  Not us.                         data and insight when, where
                                  and how you need it.
  Metadata is the new data.

                                               © Hitachi Vantara Corporation 2018. All Rights Reserved
Our Commitment:
Deliver Outcomes That Matter
  for Business And Society
Pentaho 8
Future Vision: A Single Consistent Experience

          Data Engineering                        Data Prep                                  Analytics

      Ingestion        Processing           Blending           Data Delivery           Data              Analysis
                                                                                     Discovery        & Dashboards

  Administration    Security         Lifecycle            Data        Dynamic Data       Monitoring       Automation
                                    Management         Provenance       Pipeline
Pentaho 8.0

                              Pentaho 8.0             • Connect to Kafka streams
Challenge #1
                              Broadens connectivity
Data volumes and velocity                             • Stream processing with Spark
                              to streaming data
are growing exponentially                             • Big data security with Knox
                              sources

Challenge #2                  Pentaho 8.0             • Enhanced Adaptive Execution (AEL)
Processing and storage        Optimizes processing    • Native Avro and Parquet handling
resources are constrained     resources               • Worker nodes for “Scale-out”

                              Pentaho 8.0             • Data explorer filters
Challenge #3
                              Boosts team
Shortage of Big Data talent                           • Improved repository UX
                              productivity across
and lack of productivity                              • Extended operations mart
                              the pipeline
Platform and Scalability
     Worker Nodes
     New theme
Worker Nodes for Scaling Out

Scale work items across multiple nodes
(containers)                                                      Worker Node (a)

                                           Distribute and Scale     Worker Node (b)
 Easily add and remove resources as
                                                                      Worker Node (c…)
  required

 Monitor and balance changing workloads
                                                 NEW in Pentaho 8.0
 Deploy on premise, cloud and hybrid
                                                    Container framework
                                                    Orchestration framework
                                                    Node monitoring
                                                    Enhanced HA implementation
Worker Nodes Architecture

                                       WORKER NODES
                                 Orchestration Framework
                                         Orchestration
                             (Scheduler, monitoring, security, etc.)     Powered by …

                                                 Controller (HA)
Pentaho Clients          Master
                        (Working)           Master           Master
                                           (Standby)        (Standby)

    Pentaho Server                  Container Framework
                             WN 1            WN 2            WN …n
  Pentaho Repository
                            e.g. KJB        e.g. KTR        “Executor”
New Theme - Ruby
Data Integration
     Streaming support!
     Run configurations for Jobs
     Filters in Data Explorer
     New Open / Save experience
Streaming for Time Sensitive Insight

Enable use cases that require real-
time processing, monitoring and
aggregation
 Real-time device monitoring
 Log-file aggregation
 Notifications
 And more…                           NEW in Pentaho 8.0
                                         Kafka Producer Step
                                         Kafka Consumer Step
                                         Get records from stream Step
                                         Spark streaming via AEL
Run configurations for Jobs

Run Configuration

 Execute on server

 Leverage worker nodes
Pentaho 7.0 – Data Explorer

     Access visualizations during data prep for inspection and prototyping
Data Explorer Filters

Enhanced data inspection in PDI

 Identify data to be cleaned or
  removed

 Deliver data to the business more
  quickly

    New in Pentaho 8.0
     Numeric filters
     String filters
     Include/Exclude data points
New PDI Repository Dialogs

  Enhanced user experience for
  content repository
    ‒   Consolidated open and save dialogs
    ‒   Enhanced search
    ‒   “Recently used” files option
    ‒   “Sticky” directories

  Business Benefits
   Increased productivity
Big Data
     Improvements on AEL
     Big Data File Formats - Avro and Parquet
     Big Data Security - Support for Knox
     VFS improvements for Hadoop Clusters
Pentaho 7.1 – Adaptive Execution for Spark

       PDI

                                          No Coding
             Pentaho
              Kettle
                                          Build Once

                                          Execute on Any*
                                           Engine

        *Currently Available Engines
Enhanced Adaptive Execution

Simplified setup                                                        HADOOP CLUSTER

   Eliminated “Zookeeper” component
                                                                          Spark/Hadoop Processing Nodes
   Reduced number of setup steps              PDI     AEL-Spark
                                              Client   Daemon on
                                                       Edge Nodes                              Spark
Hardened deployment                                                                          Executors

   Fail-over at the edge
                                                                             Hadoop/Spark Compatible
                                                                                 Storage Cluster
   Kerberos impersonation for client
                                                                          HDFS    Azure    Amazon
                                                       AEL-Spark                 Storage     S3
                                                        Engine                                      Etc…
More flexible                                          (Spark Driver)

   Support multiple run configurations

   Customize cluster settings per job type
Native Avro and Parquet Handling

Visual handling of data files with
common big data file formats

 Reading and writing files with specific
  steps

 Natively executes in Spark via AEL

  NEW in Pentaho 8.0
   Avro Input and Output Steps
   Parquet Input and Output Steps
Knox Gateway for More Secure Clusters

Pentaho can interact with data services in
Knox gateway protected Hadoop clusters*

Benefits

• Highly secured Knox-protected Hadoop
  clusters can now be integrated in ETL
  pipeline

*Knox is only available for Hortonworks HDP distro.
VFS improvements for Hadoop Clusters

In order to simplify the overall lifecycle of
jobs and transformations we made the
hadoop clusters available through VFS, on
the format

         hc://hadoop_cluster/.
Others
     Ops Mart for Oracle, MySQL, SQL Server
     Platform password security improvements
     PDI mavenization
     Documentation changes on help.pentaho.com
     Feature Removals:
           Analyzer on MongoDB
           Mobile Plug-in (Deprecated in 7.1)
Product Documentation Updates

MindTouch Documentation Improvements (continued)

   Streamlining of site content pages:
    ‒   Simpler site structure and click paths to accelerate user content access
    ‒   Eliminates pages with little content value
    ‒   Fun fact: 7.1 contained 1200+ pages; 8.0 contains 900+ pages with no elimination of valuable content

   Ongoing migration of big data steps from the wiki pages to the MindTouch Doc site

Business Benefits

 Improved access to on-line documentation
Pentaho 8.0 - Detailed

Data Integration                                                         Big Data
•   Drag and drop numeric and non-numeric filters in Data Explorer       •   Simplified Spark AEL setup
•   Chart actions in Data Explorer to include or exclude data            •   Spark AEL fail-over and load balancing
•   Cleaner and easier design/UX for PDI Repository dialogs              •   Support for Spark AEL on Hortonworks
•   Consolidated PDI Repository dialogs, replacing multiple pop-ups      •   Enhanced security for Spark AEL including
•   PDI Repository Dialogs remember last opened directory                    Kerberos impersonation and secure connectivity
•   Jobs can leverage Run Configurations for seamless user               •   Customization of Spark AEL processing (i.e.
    experience                                                               memory and other settings)
•   Run Configuration option to run jobs on Pentaho Server               •   Parquet input and output steps
                                                                         •   Avro input and output steps
Streaming
                                                                         •   Big Data Security with HDP Knox Gateway
•   Kafka Producer step for outputting streaming data
                                                                         •   VFS Improvements for named Hadoop clusters
•   Kafka Consumer step for streaming data input
•   Get Records from Stream step for stream processing
•   Ability to run AEL on Spark Streaming for transformations that use   Additional Items
    Kafka streaming ingest                                               •  Big Data Sandbox VM updates
                                                                         •  Cumulative service pack framework
Enterprise Platform                                                      •  Documentation improvements on
•   Worker Nodes Scale-Out solution to drive superior agility               help.pentaho.com
•   Ops Mart for Oracle, MySQL, SQL Server
•   Ruby Theme – new platform branding for browser and client tools
•   Platform password security improvements
•   PDI Mavenization for infra alignment
Pentaho 8.1
Pentaho 8.1 – Investment themes

   Focus on Cloud
    ‒   Entry to Google Cloud Platform
                                                    “Must-have” response to
    ‒   Make AWS Easy
                                                       market demand
    ‒   Validate WN for cloud, hybrid deploy

   Harden Core Capabilities
    ‒   Continue establishing recent innovations
                                                   Targeted improvement for
    ‒   Update overlooked platform areas           unfinished/neglected areas
    ‒   20% “Continuous Improvement”

   Begin the Next Phase
    ‒   “All-in” on Foundry
                                                       Get started now.
    ‒   Web-based design and data exploration
                                                    Release vehicles TBD.
    ‒   Vantara “solution” integrations
Q&A
You can also read