تبلیغات

SQL در Hadoo - آنالیز Big Data با Hive

دسته بندی ها: آموزش هدوپ (Hadoop) ، پایگاه داده ، آموزش های پلورال سایت (Pluralsight) ، آموزش اس کیوال (SQL) ، آموزش Apache Hive

pluralsight-sql-on-hadoop-analyzing-big-data-with-hive

اين دوره آموزشي زبان برنامه نويسي پرس و جوي  Hive را جهت آنالیز داده های بزرگ (Big Data) در Hadoop آموزش مي دهد. و شامل محاسبات توزيع شده، Hadoop ، اصول نگاشت و آخرين ويژگي هاي منتشر شده Hive 0.11 است.

اين دوره آموزشی محصول موسسه  Pluralsight  است.

ليست سرفصل هاي اين دوره:

  • معرفی Hadoop
  • سیستم فایل HDFS
  • معماري Hive
  • پرس و جوهاي فرعي
  • ايجاد پايگاه داده و جداول با HiveQL
  • مديريت Hive و جداول خارجي
  • جداول خارجي و ايجاد جداول جايگزين
  • انواع داده
  • انواع تبديلات
  • جداول مديريت پارتيشن بندي ها
  • درج چندگانه و درج پارتیشن پویا
  • مرتب سازي و كنترل داده هاي سرريز
  • HiveQL پيشرفته
  • كش توزيع شده
  • UDF سفارشي
  • پنجره بندي و آناليز توابع
  • HCatalog
  • Sqoop
  • DistCP
  • و...

عنوان دوره: Pluralsight SQL on Hadoop - Analyzing Big Data with Hive

نويسنده: Ahmad Alkilani

مدت زمان: 4 ساعت و 16 دقيقه

سطح: متوسط

SQL on Hadoop - Analyzing Big Data with Hive
This course will teach you the Hive query language and how to apply it to solve common Big Data problems. This includes an introduction to distributed computing, Hadoop, and MapReduce fundamentals and the latest features released with Hive 0.11

author: Ahmad Alkilani Level: Intermediate

Duration: 4h 16m Released: 08 Oct 2013

Description From developer to analyst, this course tackles a few big questions about big data: Why does this technology exist and why do I need it? How can I get the best out of it utilizing something familiar like SQL and how does this all fit together in an ever-evolving eco-system? This course will introduce the concepts of distributed computing, Hadoop and MapReduce and then goes into great detail into Apache Hive which is an SQL-like query language that can be used with Hadoop and NoSQL databases like HBase and Cassandra. The course presents some challenges you might experience solving real production problems and how Hive makes that task easier to accomplish.

Table of contents Introduction to Hadoop Introduction Motivation for Hadoop Distributed Computing Challenges Hadoop File System (HDFS) MapReduce Word Count Example Demo: Basic Hadoop Commands and Environment Setup Summary Introduction to Hive Introduction Hive Motivation Hive Architecture Hive Principles - Schema on Read Hive Principles - The Hive Warehouse Hive Query Language Basics - SELECT and Sub Queries Creating Databases and Tables with HiveQL Demo: Working with Hive Tables and Loading Data into Warehouse Loading Data - Hive Managed and External Tables Demo: External Tables and Create Table Alternatives Summary Hive Query Language Introduction Data Types Type Conversions Managed Partitioned Tables External Partitioned Tables Demo: Table Partitioning Multi Inserts and Dynamic Partition Inserts Demo: Loading Data Use Case Data Retrieval - Group By and Functions Sorting and Controlling Data Flow The CLI and Variable Substitution Summary Advanced HiveQL Introduction Bucketing Bucket and Block Sampling Joins Joins in Depth and Join Optimizations Map-side Joins for Bucketed Tables Distributed Cache UDTFs, Explode and Lateral View Demo: Extending Hive - Creating Your own UDF Demo: Extending Hive - Compiling and Testing Custom UDF Extending Hive - Custom UDF Recap Demo: Hive Initialization File Accessing The Distributed Cache Hadoop Streaming and Transform() Windowing and Analytics Functions Demo: Putting it All Together Using Transform Demo: Analytics Functions Demo: Ranking Functions Summary Storage and The Eco-System Create Table Statement - File Formats and SerDes HCatalog Sqoop DistCP Hadoop Eco-System Projects References and Resources Summary

حجم فايل: 419MB

آیا این نوشته را دوست داشتید؟
Pluralsight SQL on Hadoop Analyzing Big Data with Hive

پیشنهاد فرادرس