0345 4506120

Introduction to Big Data Blended Learning

Overview

This Premium course offers both classroom training and video learning to enhance your learning experience. Please see below for full details.

This hands-on Introduction to Big Data training provides a unique approach to help you act on data for real business gain. The focus is not what a tool can do, but what you can do with the output from the tool. Gain the skills you need to store, manage, process, and analyze massive amounts of unstructured data to create an appropriate data lake.

What is big data?

Big data is a term used to define data sets that have the potential to rapidly grow so large that they become unmanageable. The Big Data movement includes new tools and ways of storing information that allow efficient processing and analysis for informed business decision-making.

What is the difference between big data and machine learning?

Big data refers to the data set that has huge, and growing, volume that can quickly become unwieldy. Machine learning is a subsection of Artificial Intelligence (AI) that can help you extract value from big data to solve problems.

How does big data help businesses?

Understanding how to work with big data can help you glean useful insights from large amounts of data, which can help you and your organisation make better business decisions.

Who Should Attend

  • Anyone needing to implement, enhance your big data environment and looking to advance their analytics career by ensuring foundational knowledge
  • Typical job roles include: Project Managers and IT Managers, Database Administrators & Data Architects, Developers & SQL Developers, Data Scientists & Business Intelligence

What's Included

Included in this course - Unlimited annual access to:

  • 2 on-demand courses
  • 5 eBooks
  • 1-day instructor-led training course
  • 3-day instructor-led training course
  • One-on-one after-course instructor coaching
  • End-of-course exam included
  • After-course computing sandbox included

This product offers access to 2 on-demand courses and 5 eBooks that have been mapped directly to the objectives of the 3-day course. At any time during your annual access to this offering, you may attend one of our 1-day course events focused specifically on Big Data Insights, Technologies & Trends.

More Information

Pre-requisites

What background do I need?

Working knowledge of the Microsoft Windows platform and basic database concepts.

Typical job roles include: Project and IT Managers, Database Administrators & Data Architects, Developers & SQL Developers, Data Scientists & Business Intelligence.

Learning Objectives

You Will Learn How To:

  • Store, manage, and analyse unstructured data
  • Select the correct big data stores for disparate data sets
  • Process large data sets using Hadoop to extract value
  • Query large data sets in near real time with Pig and Hive
  • Plan and implement a big data strategy for your organisation

Introduction to Big Data

Defining Big Data

  • The four dimensions of Big Data: volume, velocity, variety, veracity
  • Introducing the Storage, MapReduce and Query Stack

Delivering business benefit from Big Data

  • Establishing the business importance of Big Data
  • Addressing the challenge of extracting useful data
  • Integrating Big Data with traditional data

Storing Big Data

Analysing your data characteristics

  • Selecting data sources for analysis
  • Eliminating redundant data
  • Establishing the role of NoSQL

Overview of Big Data stores

  • Data models: key value, graph, document, column–family
  • Hadoop Distributed File System
  • HBase
  • Hive
  • Cassandra
  • Hypertable
  • Amazon S3
  • BigTable
  • DynamoDB
  • MongoDB
  • Redis
  • Riak
  • Neo4J

Selecting Big Data stores

  • Choosing the correct data stores based on your data characteristics
  • Moving code to data
  • Implementing polyglot data store solutions
  • Aligning business goals to the appropriate data store

Processing Big Data

Integrating disparate data stores

  • Mapping data to the programming framework
  • Connecting and extracting data from storage
  • Transforming data for processing
  • Subdividing data in preparation for Hadoop MapReduce

Employing Hadoop MapReduce

  • Creating the components of Hadoop MapReduce jobs
  • Distributing data processing across server farms
  • Executing Hadoop MapReduce jobs
  • Monitoring the progress of job flows

The building blocks of Hadoop MapReduce

  • Distinguishing Hadoop daemons
  • Investigating the Hadoop Distributed File System
  • Selecting appropriate execution modes: local, pseudo–distributed and fully distributed

Handling streaming data

  • Comparing real–time processing models
  • Leveraging Storm to extract live events
  • Lightning–fast processing with Spark and Shark

Tools and Techniques to Analyse Big Data

Abstracting Hadoop MapReduce jobs with Pig

  • Communicating with Hadoop in Pig Latin
  • Executing commands using the Grunt Shell
  • Streamlining high–level processing

Performing ad hoc Big Data querying with Hive

  • Persisting data in the Hive MegaStore
  • Performing queries with HiveQL
  • Investigating Hive file formats

Creating business value from extracted data

  • Mining data with Mahout
  • Visualising processed results with reporting tools
  • Querying in real time with Impala

Developing a Big Data Strategy

Defining a Big Data strategy for your organisation

  • Establishing your Big Data needs
  • Meeting business goals with timely data
  • Evaluating commercial Big Data tools
  • Managing organisational expectations

Enabling analytic innovation

  • Focusing on business importance
  • Framing the problem
  • Selecting the correct tools
  • Achieving timely results

Implementing a Big Data Solution

  • Selecting suitable vendors and hosting options
  • Balancing costs against business value
  • Keeping ahead of the curve

On-Demand Courses

  • From 0 to 1: Hive for Processing Big Data
  • Building a Big Data Analytics Stack

eBooks

  • Hands-on Big Data Modelling
  • Big Data Architect's Handbook
  • Modern Big Data Processing with Hadoop
  • Big Data Processing with Apache Spark
  • Practical Big Data Analytics

Big Data Training FAQs

Is the on-demand content the same as the 3-day instructor class?

No. While the content selected does map to the objectives of the instructor-led course, it does not include a recorded version of the instructor-led class. The objectives have been re-imagined to be presented in digital, self-guided formats.

What on-demand content will I receive?

An outline of the content you will receive can be seen above. You will also get access to any new on-demand content that becomes available during your annual enrolment period.

Does this include any practical, hands-on learning?

Yes! Each book and video begins with a step by step guide for you to set up a coding environment on your personal computer. The course content is full of examples and practical advice, followed up by the chance to embed your learning through real world tasks. All example code is available to download, copy and use - giving you the chance to work and practise as you read and watch.

How will I access my course materials if I choose this method?

Once payment is received, you will receive an email fromus with all the links and information you need to get started.

How can I sign up for a review session?

Once you are enrolled in the program, specific details and dates will be sent to you.

One Day Instructor-Led Review

You'll be able to register for a Training Review Session at any time after you've placed your order.

Related Courses

Our Customers Include