Genre: eLearning | MP4 | Video: h264, 1280x720 | Audio: aac, 44100 HzLanguage: English | VTT | Size: 4.85 GB | Duration: 13 hours
======
The course "The Complete Hands-On Introduction to Apache Airflow" can be a nice plus.What you'll learnCoding Production Grade Data pipelines by Mastering Airflow through Hands-on ExamplesHow to Follow Best Practices with Apache AirflowHow to Scale Airflow with the Local, Celery and Kubernetes WxecutorsHow to Set Up Monitoring with Elasticsearch and GrafanaHow to Secure Airflow with authentication, crypto and the RBAC UICore and Advanced Concepts with Pros and LimitationsMastering DAGs with zones, unit testing, backfill and catchupOrganising the DAG folder and keep things cleanRequirementsNotions of Docker and PythonVirtual Box installed (Only for local Kubernetes cluster part)Vagrant installedDescriptionAirflow is a platform created by community to programmatically author, schedule and monitor workflows.It is scalable, dynamic, extensible and modulable.Without any doubts, mastering Airflow is becoming a must-have and an attractive skill for anyone working with data.What you will learn in the course:Fundamentals of Airflow are explained such as what is Airflow, how the scheduler and the web server worksThe Forex Data Pipeline project is incredible way to discover many operators in Airflow and deal with Slack, Spark, Hadoop and moreMastering your DAGs is a top priority and you will be able to play with zones, unit testing your DAGs, how to structure your DAG folder and much moreScaling Airflow through different executors such as the Local Executor, the Celery Executor and the Kubernetes Executor will be explained in details. You will discover how to specialise your workers, how to add new workers, what happens when a node crashes.A Kubernetes cluster of 3 nodes will be set up with Rancher, Airflow and the Kubernetes Executor in local to run your data pipelines.Advanced concepts will be shown through practical examples such as templatating your DAGs, how to make your DAG dependent of another, what are Subdags and deadlocks, and more.You will set up a Kubernetes cluster in the cloud with AWS EKS and Rancher in order to use Airflow along with the Kubernetes ExecutorMonitoring Airflow is extremely important! That's why you will know how to do it with Elasticsearch and Grafana.Security will be also addressed in order to make your Airflow instance compliant with your company. Specifying roles and permissions for your users with RBAC, Prevent from accessing the Airflow UI with authentication and password, data encryption and more.In addition:Many practical exercises are given along the course so that you will have occasions to apply what you learn.Best practices are stated when needed to give you the best ways of using AirflowQuiz are available to assess your comprehension at the end of each section.Answering fast your questions is my top-priority and I will do my best for you.I put a lot of effort in order to give you the best content and I hope you will enjoy it as much as I enjoyed doing it.At the end of the course you will more confident than ever to use AirflowWish you a great success!Marc LambertiWho this course is for:Data EeersInspiring Data EeersDevOpsSoftware EeersData Scientists
评论(0)