Companies Home Search Profile

Hadoop to GCP migration

Focused View

32:49

0 View
  • 1 - Hadoop to GCP Migration Using DistCp.mp4
    08:02
  • 2 - Capture GCS Events in BIgquery.mp4
    05:46
  • 3 - File from Bucket to BigQuery Table.mp4
    01:37
  • 4 - Overall flow.mp4
    00:54
  • 5 - Google cloud Data engineer certification exam experience.mp4
    00:21
  • 6 - Thank you.mp4
    16:09
  • Description


    Hadoop to GCP Migration Using DistCp

    What You'll Learn?


    • bigquery
    • gcs
    • hadoop
    • cloud functions

    Who is this for?


  • GCP Aspirants
  • What You Need to Know?


  • Basic Hadoop knowledge
  • More details


    Description

    Objective of the course , is to migrate data from On-prem Hadoop(Consider hadoop installed in Windows is considered as on-prem) to GCS(Google Cloud storage) and Google cloud storage to BigQuery. To learn about this course , basic knowledge in Hadoop commands is mandatory. Things you will learn from this course - Hadoop installation in Windows 11 - Load data from local file system to Hadoop - Load file from hdfs to gcs by installing gcs connector - Once data is loaded in bucket, the bucket name and file name is captured in bigquery table by creating a trigger using cloud function gen2. The input to apache beam is from latest file name bigquery table and load the contents of the file from the bucket to the actual bigquery table.    We also tried to create hive external table, which is pointing out GCS bucket and file. Due to errors , we can't able to demo the approach. By creating that , the hive external table is loaded which indirectly loads data in GCS. The apache beam code which loads data from gcs to bigquery will run by direct runner , we faced some errors while running through dataflow runner. The datatype conversion is not handled in migration , considering all the columns as string.

    Who this course is for:

    • GCP Aspirants

    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    Category
    Students take courses primarily to improve job-related skills.Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.
    • language english
    • Training sessions 6
    • duration 32:49
    • Release Date 2024/12/06