Register
Mon 4 Nov, 2019 - Amsterdam
Mon 25 Nov, 2019 - London

Description

This one-day course is aimed at getting Data Scientists, BI and Data Warehousing professionals up to scratch on Big Data technologies such as Hadoop, Storm, Spark, Flink, Analytical SQL, NoSQL DBMSs and Multi-Platform Analytics. What is Big Data? How can you make use of it? How does it integrate with a traditional analytical environment? How do you re-define your architecture to create a stronger analytical foundation for your company? What skills do you need to develop for Big Data Analytics? All of these questions are addressed in this new knowledge packed course.

Why attend

You will learn:

  • How Big Data creates several new types of analytical workload
  • Big Data technology platforms beyond the data warehouse
  • Big Data analytical techniques and front-end tools
  • Understand when to use what where - business use cases for different Big Data technologies
  • How to create a stronger data and analytical architecture by integrating Big Data, data science, data warehouses and BI
  • How to integrate real-time data into your data warehouse
  • How to analyse un-modelled, multi-structured data using Hadoop, MapReduce & Spark
  • How to leverage predictive analytics in BI reports & dashboards

Who should attend

IT directors, CIO’s, CDO’s, IT Managers, BI Managers, BI and data warehousing professionals, data scientists, enterprise architects, data architects

Register
Mon 4 Nov, 2019 - Amsterdam
Mon 25 Nov, 2019 - London

Need custom training for your team?

Get a quote

Inquire about this course

CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share

Outline

 
 
 
 
 
 
 
 
 
 

An Introduction To Big Data

This module defines big data and looks at why business wants to use Big Data technology. It looks at Big Data use cases and the difference between Big Data, traditional BI and Data Warehousing

  • The demand for data?
  • Types of Big Data
  • Why analyse Big Data?
  • Industry use cases – Popular big data analytic applications
  • What is Data Science?
  • Data Warehousing and BI versus Big Data
  • Popular patterns for Big Data technologies
  • Types of Big Data analytical workloads
  • Architecture: The Extended Analytical Ecosystem

Big Data Technology

This module looks at Big Data platforms and storage options.

  • The new multi-platform analytical ecosystem
  • Analytical RDBMSs and NoSQL options
  • An introduction to the Hadoop Stack
  • Apache Spark Framework
  • The Big Data Hadoop Marketplace
  • The Cloud Analytics option – Cloud storage Versus Hadoop, Amazon (Data Lake Formation, Kenisis, Elastic MapReduce and Redshift), Google (Pub/Sub, Data Fusion, DataProc and BigQuery), Microsoft Azure (Event Hub, Stream Analytics, Data Lake, HDInsight, Data Factory and SQL Data Warehouse, ML Service, Power BI), IBM (Streaming Analytics, Analytics Engine, Db2 Warehouse on Cloud, Cloud Pak for Data), Qubole, Oracle Analytics Cloud, SAP 
  • Accessing Big Data via SQL on Hadoop or SQL on cloud storage
  • The increasing power of Analytical Relational DBMSs
  • Analyzing Big Data – What’s in The Data Scientist’s Toolkit
    • Streaming, natural language processing, classic machine learning at scale, deep learning, graph analytics

Integrating Big Data Analytics Into The Enterprise

This module looks at how new Big Data platforms can be integrated with traditional Data Warehouses and Data Marts to create a new data and analytics architecture for the data driven enterprise. It looks at stream processing, cloud storage, Hadoop, NoSQL databases and Data Warehouse and shows how to put them together in an end-to-end architecture to maximise business value from Big Data.

  • Beyond data warehouse – a new analytical architecture and ecosystem for the data driven enterprise
  • Integrated management of the analytical ecosystem
  • Integrating stream processing, cloud storage, Hadoop, Data Warehouses and MDM
  • Dimplifying access to a multiplatform analytical ecosystem using data virtualisation
  • Multi-platform optimisation – the final frontier

Ingest, Prepare, Analyze And Govern Big Data

This module will look at the challenge of integrating and governing Big Data and the unique issues it raises. How do you deal with very large data volumes and different varieties of data? How does loading data into Hadoop differ from loading data into analytical relational databases? What about NoSQL databases? How should low-latency data be handled? It also looks at tools and techniques available to data scientists, business analysts and traditional DW/BI professionals to analyze big data. Topics that will be covered include:

  • Connecting to Big Data sources
  • Data ingestion into cloud storage or Hadoop
    • Data Ingestion options
    • Challenges of capturing different types of Big Data
    • Streaming data ingest
    • Parsing Unstructured Data
    • Change data capture – what’s possiblity?
  • ELT data preparation, transformation and integration at scale using Spark
  • Managing data scientist and business analyste self-service data preparation –Alteryx, Azure Data Factory, Google Cloud Data Fusion, IBM Cloud Pak for Data, Paxata, Trifacta, Tamr, MicroStrategy, Tableau Data Prep Builder
  • Unified data delivery – A common data integration supply chain for the entire analytical ecosystem
  • Multi-platform data and analytical pipelines from data lake to enterprise data marketplace
  • Data governance in a big data environment
    • The importance of an Information catalog Data
    • Organising data in a data lake
    • Governing data privacy
  • Governing data in a Data Science environment
  • Analyzing Big Data
  • Supervised and Unsupervised Machine Learning
  • Cloud Machine Learning Analytics Marketplace
  • Natural language processing & Sentiment Analysis
  • Search, BI & Big Data
  • Graph Analytics
  • Analysing Data in Motion Using Streaming Analytics
  • Integrating it all with Self-Service BI tools

Instructor

Mike Ferguson

Mike Ferguson

Mike is Managing Director of Intelligent Business Strategies Limited.  As an analyst and consultant he specialises in business intelligence and enterprise business integration. With over 35 years of IT experience, Mike has consulted for dozens of companies. He has spoken at events all over the world and written numerous articles.  Mike is Chairman of Big Data LDN – the fastest growing Big Data conference in Europe, and chairman of the CDO Exchange.  Formerly he was a principal and co-founder of Codd and Date Europe Limited – the inventors of the Relational Model, a Chief Architect at Teradata on the Teradata DBMS and European Managing Director of Database Associates.  He teaches popular master classes in Analytics, Big Data, Data Governance & MDM, Data Warehouse Modernisation and Data Lake operations.

Dates

04 Nov04 Nov
Amsterdam
25 Nov25 Nov
London

Pricing

The fee for this one-day course is EUR 725 per person. This includes one day of instruction, lunch and morning/afternoon snacks and course materials.

We offer the following discounts.

  • 10% discount for groups of 2 or more students from the same company registering at the same time.
  • 20% discount for groups of 5 or more students from the same company registering at the same time.

Note: Groups that register at a discounted rate must retain the minimum group size or the discount will be revoked. Discounts cannot be combined.

Copyright ©2019 Quest for Knowledge