In this fourth course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will build upon the data engineering concepts introduced in the first three courses to apply Python, Bash and SQL techniques in tackling real-world problems. First, we will dive deeper into leveraging Jupyter notebooks to create and deploy models for machine learning tasks. Then, we will explore how to use Python microservices to break up your data warehouse into small, portable solutions that can scale. Finally, you will build a powerful command-line tool to automate testing and quality control for publishing and sharing your tool with a data registry.
This course is part of the Python, Bash and SQL Essentials for Data Engineering Specialization
Offered By
About this Course
Beginner level experience in Python and Linux.
What you will learn
Construct Python Microservices with FastAPI
Build a Command-Line Tool in Python using Click
Compare multiple ways to set up and use a Jupyter notebook
Skills you will gain
- Web Application
- Python Programming
- Jupyter notebooks
- Cloud-Hosted Notebooks
- Command-Line Interface
Beginner level experience in Python and Linux.
Offered by
Duke University
Duke University has about 13,000 undergraduate and graduate students and a world-class faculty helping to expand the frontiers of knowledge. The university has a strong commitment to applying knowledge in service to society, both near its North Carolina campus and around the world.
Syllabus - What you will learn from this course
Jupyter Notebooks
This week, you will learn how to install and run Jupyter on your local machine. Additionally, you will explore strategies to use code and text cells in a Jupyter notebook.
Cloud-Hosted Notebooks
This week, you will learn how to create and use a Cloud-based notebook in Google Colab and AWS Sagemaker.
Python Microservices
This week, you will learn how to build a Python Microservice with FastAPI and deploy a containerized machine learning Microservice for data engineering.
Python Packaging and Command Line Tools
This week, you will learn how to organize a Python project so you can build a powerful command-line tool. You will use Click, a useful command-line tool framework to enhance your tool. Finally, you will automate testing and quality control for publishing and sharing your tool with a registry.
About the Python, Bash and SQL Essentials for Data Engineering Specialization
If you are interested in developing the skills needed to be a data engineer, the Python, Bash and SQL Essentials for Data Engineering Specialization is a great place to start. We live in a world that is driven by big data - from what we search online to the route we take to our favorite restaurant, and everything in between. Businesses and organizations use this data to make decisions that impact the ways in which we navigate our lives. How do engineers collect this data? How can this data be organized so that it can be appropriately analyzed? A data engineer is specialized in this initial step of accessing, cleaning and managing big data.
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
More questions? Visit the Learner Help Center.