Oracle Bigdata Analytics with Hadoop

BIG DATA and ANALYTICS

Oracle Bigdata Analytics with Hadoop

This training course is a comprehensive study of Oracle Big Data Administration using Hadoop. The course topics include Introduction to Hadoop and its Architecture, MapReduce and HDFS and MapReduce Abstraction. Making the most of big data means quickly analyzing a high volume of data generated in many different formats. Oracle University training teaches you how to acquire and organize diverse data sources. Learn to analyze these data sources alongside existing data to find new insights and capitalize on hidden relationships. Training is designed for database administrators and developers.

Course Duration: 50 Hrs.            

What you will learn

In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big Data Solution to acquire process, integrate and analyze big data.

Benefits To You

Increase your Big Data technology portfolio by learning to use a wide range of big data acquisition, processing, integration, and analysis techniques. Benefit from a hands-on, case-study approach while learning about Oracle’s Integrated Big Data Solution.

Course Topics

Introduction

Lesson Objectives

Questions About You

Course Objectives Course

Road Map Practice Environment

Connecting to the Course Environment (Oracle Big Data Lite Virtual Machine) Using VNC Starting the Oracle Big Data Lite Virtual Version Machine 4.01

Introducing the Movieplex

Big Data and the Oracle Information Management System

Big Data Opportunities and Challenges Oracle Information Management Architecture Optimizing/Simplifying Architecture with Engineered Systems

Using Oracle Big Data Lite Virtual Machine

Overview of the Big Data product stack

Access methods

Review the Oracle Big Data Virtual Machine Home page

Deep dive into the Oracle case study

Identify the data structures used

Understand the importance of filtering the data

Identify the Hadoop Command Guide URL, and review the fs and version commands that are used in the practice

Introduction to the Big Data Ecosystem

Lesson Objectives Computer Clusters Distributed Computing The Hadoop Ecosystem

Hadoop Core Components

Choosing a Hadoop Distribution and Version

Types of Analysis That Use Hadoop

Cloudera’s Distribution Including Apache Hadoop (CDH) Architecture

Introduction to the Hadoop Distributed File System (HDFS)

Lesson Objectives

Hadoop Distributed Filesystem (HDFS)

Acquire Data using CLI, Fuse-DFS, and Flume

Introducing the CLI Examining Fuse DFS Using Flume

Using and Administering Oracle NoSQL Database

Define Oracle NoSQL Database

List Benefits

Load data into the DB Access NoSQL Data

Plan an Oracle NoSQL Database installation and Node configuration

Configure and Deploy a KVStore

Using the GUI Interface (monitoring the KVStore)

Use the NoSQL Database Table Model (both CLI and Java API)

Introduction to MapReduce Lesson Objectives MapReduce

Interacting with MapReduce

MapReduce Daemons (Services) update based on YARN Interacting With MapReduce

Fault Tolerance

MapReduce Examples

Using YARN to Manage Resources

Job Submission in YARN YARN Features MapReduce 2.0: Overview

YARN Services

Overview of Apache Hive and Apache Pig

Apache Hive

Apache Pig

Overview of Cloudera Impala, Solr, and Apache Spark

Examining Cloudera Impala

Integrating Hadoop and Oracle

What is Apache Solr (Cloudera Serach)?

Cloudera Search: Key Capabilities, Features, Tasks, Indexes, and Collections

Introduction to Spark

Resilient Distributed Datasets (RDD) and Directed Acyclic Graph (DAG) Execution Engine

Overview of Scala Language

Using Oracle XQuery for Hadoop Extensible Markup Language (XML) XML Elements and Attributes

XML Path (XPath) Language: Node Types and Family Relationships

FLWOR Expressions

Oracle XQuery for Hadoop (OXH) Features and Data Flow

OXH Adapters and Configuration Properties XQuery Transformation and Basic Filtering Viewing the Completed OXH Job in YARN

Options for Integrating Your Big Data

Apache Sqoop

Oracle Loader for Hadoop (OLH) Copy To BDA

Oracle SQL Connector for HDFS (OSCH)

Using Oracle Big Data SQL

Context: Exadata and Big Data Appliance

What is Big Data SQL? Configuring Oracle Big Data SQL Create Oracle Tables over HDFS data

Leverage the Hive Metastore to Access Data in Hadoop

Apply Oracle Database Security Policies Over Data in Hadoop

Combine HDFS and Oracle data for analysis (SQL Pattern Matching)

Using Oracle Advanced Analytics

Oracle Data Mining (ODM) Oracle R Enterprise (ORE)

Oracle R Advanced Analytics for Hadoop (ORAAH)


Back to Top
Content