Course Description
Adopting Hadoop into your business intelligence and analytics architecture is much more than a technology exercise. Geared for enterprise architects and for IT or business leaders responsible for providing and supporting analytics services to other business areas, this Experfy workshop covers a framework to help you avoid getting lost in a soup of technology buzzwords related to Hadoop. You’ll gain an understanding of its usefulness for solving business problems as well as how it fits into the overall landscape of technologies necessary to fully enable its use.
We’ll start with an overview of the Hadoop technology ecosystem and explain the key options that should be considered when devising a value-centric strategy for adopting it.
Once the why and how of your strategy are defined, you need a roadmap to chart your course. Adopting Hadoop for any significant enterprise-wide use requires coordinated effort by a diverse team of technical and business experts over many months. Without a roadmap to guide the way, your team will likely veer off course. In the “From Strategy to Roadmap” module we’ll investigate two broad usage categories: cost savings and insight creation. Although these uses are not mutually exclusive, the adoption roadmaps for them vary enough to consider them separately. Whether your objective for adopting Hadoop is primarily IT cost savings and efficiency or business insight creation, it is important to remain focused on achieving the expected value. Along the way there will be many business decisions regarding risk and rewards, process and procedure, investments and security. An actionable roadmap will include guideposts and milestones to chart a course and measure your progress.
The last two modules of the workshop focus on specific guidance for the business and technology aspects of your adoption roadmap.
By the end of this workshop you’ll be prepared to organize and execute an adoption strategy for Hadoop so that it becomes a fully-enabled capability in your IT architecture for business intelligence and analytics.
What am I going to get from this course?
- Describe the primary ways Hadoop enables you to create business benefits
- Identify the technical hurdles to adopting it
- Organize an approach to effectively adopt Hadoop whether for cost-savings or business-insight purposes
- Describe six categories of use cases for Hadoop, with a high-level technical architecture for each, along with the benefits of each and the technical skills necessary to achieve them.
- Describe the technical and business experts you will need to be successful
- Define processes both for creating business value with Hadoop and for completing technical R&D to deepen your team's practical knowledge of it.
- Identify milestones of progress
- Identify cross-cutting concerns and decisions that impact the approach you take and the pace at which you can adopt Hadoop.
Prerequisites and Target Audience
What will students need to know or do before starting this course?
Students will need to be aware of database capabilities, typical business intelligence and data warehousing approaches, and common approaches to data analysis and reporting.
Who should take this course? Who should not?
This course is targeted to students with a technical background, either in business intelligence and data warehousing application development, or in data and analytics from a business perspective. It assumes that you have an interest in using Hadoop in multiple areas of your business for a variety of ongoing applications. The course requires familiarity with technical concepts related to software application design, but it does not require any specific programming language or platform expertise.
Curriculum
Module 1: Introduction
05:18
Lecture 1
Course Overview & Objectives
03:18
This module describes the objectives of this course along with the background expected of the student.
Lecture 2
Introduction to Your Instructor
02:00
Module 2: Overview of Hadoop
18:32
Lecture 3
Overview of Hadoop
00:24
A brief synopsis of the two main topics of Module 2, Overview of Hadoop. This module provides the necessary background to understand why a strategic approach is necessary to adopt Hadoop for enterprise-wide use.
Lecture 4
What is Hadoop
08:26
This lecture describes Hadoop at its highest level as comprising two primary value-creation capabilities: storing large files and processing them in parallel. Through analogies to similar but more traditional technologies, this lecture gives students enough understanding of Hadoop to predict the benefits of its use.
Lecture 5
The Challenges of Hadoop
09:42
This lecture contrasts using Hadoop for isolated, individual projects with using it broadly for many teams and projects. These two alternatives, called Local and Enterprise Adoption respectively, provide the basis for the challenges that can be encountered. Overcoming these challenges and arriving at a successful use of Hadoop across many teams, projects and usage categories is the motivation behind the strategic framework and architectural patterns that form the heart of this course.
Quiz 1
Overview of Hadoop Quiz
Check your understanding of some of the key capabilities and challenges of Hadoop.
Module 3: The Strategic Framework
01:02:04
Lecture 6
The Strategic Framework
04:20
This lecture covers the components of the strategic framework that is used to construct a long-range roadmap for adopting Hadoop. Beginning with the target of the roadmap, an enterprise data lake, I describe the “lanes” that contain the roadmap activities that drive you there, along with the mile markers along the way that confirm your progress and help you correct course.
Lecture 7
The Business Outcome Track
16:02
Diving into the business outcome lane of the roadmap, in this lecture we discuss the difference between cost-saving and business insight creation opportunities for Hadoop. Some of these opportunities are more focused on IT efficiency, while others are targeted at achieving new business analysis capabilities. As a result, we also discuss the differences in IT processes that support achieving these business outcomes.
Lecture 8
The Technical Capability Track
26:31
Having described the business outcomes that should be sought as a result of adopting Hadoop for widespread use, this lecture describes the technical components of the platform in greater detail in preparation for high-level architecture discussions in the subsequent modules. It also describes the IT process implications of Hadoop being part of your infrastructure.
Lecture 9
Cross-cutting concerns
15:11
Cross-cutting concerns are those topics that affect both lanes of activity in the roadmap. These topics include organizing information and processing in Hadoop, addressing security needs, maintaining metadata, governing use and capturing metrics.
Quiz 2
The Strategic Framework
Check your understanding of the concepts of the strategic framework.
Module 4: A Value-Oriented Roadmap
01:01:07
Lecture 10
Value-Oriented Roadmap
01:56
In this introductory discussion about creating a value-oriented roadmap we discuss again the six use-case categories along with the kinds of value they are intended to create. These categories provide a related set of usage goals for the Hadoop platform that combine to create more value than each could on its own.
Each lecture in this module covers one of the use case categories either related to cost-savings or business insight creation. For each use case, we review a typical architecture for achieving this goal without using Hadoop and the pain points often associated with doing so. After that a high-level architecture using Hadoop technologies is described along with the way it addresses the pain points and achieves the related business value. Then each architecture description is completed with a list of the technical capabilities necessary to achieve it. In this way, you can see how the technical capabilities are highly reusable from use case to use case.
Lecture 11
Accessible Archive
08:20
For the accessible archive use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Lecture 12
Processing Offload
06:10
For the processing offload use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Lecture 13
EDW Optimization
09:05
For the enterprise data warehouse optimization use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Quiz 3
Cost-Savings Opportunities
Check your understanding of the critical issues of the cost-savings opportunities.
Lecture 14
Data Exploration
07:44
For the data exploration use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Lecture 15
360 Degree View
07:47
For the 360 degree view use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Lecture 16
Advanced Analytics
07:34
For the advanced analytics use case, this lecture describes
* The typical architecture used without Hadoop
* Pain points of that approach
* A high-level Hadoop-centric architecture
* How the pain points are addressed
* Technical capabilities necessary to achieve it
Lecture 17
Critical Workflows & Capabilities
04:58
This lecture discusses a handful of technical capabilities that are critical to succeeding with the use case patterns described in prior lectures. These are discussed in a manner that is independent of the technical components so you can use them as a way to measure progress in gaining technical skill with the platform.
Lecture 18
Progressive Milestones
07:33
This lecture identifies a set of milestones that can be used to chart your progress with Hadoop. Grouping these milestones into categories provides you with a method by which you can grade your achievements in a manageable number of areas. These areas include ingesting data to Hadoop, processing data in Hadoop, delivering data for use, operating the platform, and supporting various types of work.
Quiz 4
Business-Insight Opportunities
Check your understanding of the issues and concepts of the business-insight opportunities.
Module 5: Organizing for Value Creation
16:18
Lecture 19
Organizing for Value Creation
00:30
In this lecture, we turn from the technical to the people-related aspects of adopting Hadoop. Creating a team to achieve business value is of primary importance and is discussed in this module along three primary themes: what constitutes the “right” team, how do you get into a rhythm, and how do you track your progress.
Lecture 20
Building the Right Team
00:30
In this lecture we review the kinds of people you need on your team to be successful at creating business value, whether this is related to cost-savings or business insights. The emphasis for this team is on having the people you need to find the right opportunities to pursue and to demonstrate their success.
Lecture 21
Creating a Cadence
07:20
Creating a cadence involves getting into a rhythm of achieving business value through the use of Hadoop. Rhythm creates growing confidence inside and outside the team and can turn the challenge of using new technologies into a virtuous cycle and accelerated adoption.
Lecture 22
Tracking your progress
07:58
As with any road trip, you need to know how much progress you’re making. This lecture describes some specifics about tracking your progress on the value-creation half of the roadmap framework along with some warning signs that you might be off course.
Quiz 5
Organizing for value creation
Check your understanding of organizing and checking your progress on the business track.
Module 6: Organizing for the Learning Curve
23:16
Lecture 23
Building the Right Team
10:07
In this lecture, we expand upon the people-related aspects of adopting Hadoop by diving into the technical R&D capabilities your team must master. It is important to create a team that continues to learn and keep pace with developments in Hadoop and related technologies. As in the last module, we discuss this along three themes: what constitutes the “right” team, how do you get into a rhythm, and how do you track your progress.
Lecture 24
Creating a cadence
05:41
Creating a cadence involves getting into a rhythm of learning and demonstrating mastery of more and more aspects of the Hadoop platform. Rhythm creates growing confidence inside and outside the team and can turn the feeling of being overwhelmed by too much to learn into a track record of success.
Lecture 25
Tracking your progress
07:28
As with any road trip, you need to know how much progress you’re making. This lecture describes some specifics about tracking your progress on the technology learning half of the roadmap framework along with some warning signs that you might be off course.
Quiz 6
Organizing for the learning curve
Check your understanding of the key concepts related to organizing for the technical capabilities track.
Module 7: Decisions, Levers, and Gates
17:57
Lecture 26
Decisions, Levers, and Gates
17:57
We wrap up this course on adopting Hadoop with a lecture called “Decisions, Levers and Gates”. Decisions steer where you head. Levers affect your pace, the size of the problems you can seek to solve, and the cost of the overall platform. Gates give you places to check the work and set up governance.
Now that we’ve covered so much ground, this lecture finishes up with a bird's-eye view of the path we’ve taken and puts many of the tools and guides discussed in the course into a single place.