IBM
IBM Data Engineering Professional Certificate
IBM

IBM Data Engineering Professional Certificate

Launch your new career in Data Engineering. Master SQL, RDBMS, ETL, Data Warehousing, NoSQL, Big Data and Spark with hands-on job-ready skills.

Rav Ahuja
Priya Kapoor
Joseph Santarcangelo

Instructors: Rav Ahuja

40,559 already enrolled

Professional Certificate - 13 course series

Earn a career credential that demonstrates your expertise

4.6

(2,978 reviews)

Beginner level

Recommended experience

5 months at 10 hours a week
Flexible schedule
Learn at your own pace
Earn degree credit

What you'll learn

  • Create, design, and manage relational databases and apply database administration (DBA) concepts to RDBMSes such as MySQL, PostgreSQL, and IBM Db2.

  • Develop and execute SQL queries using SELECT, INSERT, UPDATE, DELETE statements, database functions, stored procedures, Nested Queries, and JOINs. 

  • Demonstrate working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, Spark Streaming. 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards. 

Skills you'll gain

  • Category: Relational Database Management Syste (RDBMS)
  • Category: ETL & Data Pipelines
  • Category: NoSQL and Big Data
  • Category: Apache Spark
  • Category: SQL

Details to know

Shareable certificate

Add to your LinkedIn profile

English
Subtitles: English, Arabic,

Professional Certificate - 13 course series

Earn a career credential that demonstrates your expertise

4.6

(2,978 reviews)

Beginner level

Recommended experience

5 months at 10 hours a week
Flexible schedule
Learn at your own pace
Earn degree credit

See how employees at top companies are mastering in-demand skills

Placeholder

Prepare for a career in Data Engineering

  • Receive professional-level training from IBM
  • Demonstrate your proficiency in portfolio-ready projects
  • Earn an employer-recognized certificate from IBM
  • Qualify for in-demand job titles: Database Engineer, Data Engineer, Junior Data Engineer
Placeholder
$130,000+
median U.S. salary for Data Engineering
¹
74,000+
U.S. job openings in Data Engineering
¹

Get exclusive access to career resources upon completion

  • Soft skills training

    Get free access to IBM’s People and Soft Skills Specialization

  • Resume review

    Improve your resume and LinkedIn with personalized feedback

  • Interview prep

    Practice your skills with interactive tools and mock interviews

  • Career support

    Plan your career move with Coursera’s job search guide

¹Lightcast™ Job Postings Report, United States, 1/1/22-12/31/22. ²Based on program graduate survey responses, United States 2021.

Professional Certificate - 13 course series

Introduction to Data Engineering

Course 114 hours4.7 (1,741 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe and provide examples of data engineering technologies such as Relational Databases, NoSQL Data Stores, Big Data Engines, and others.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Category: Data Science
Category: Database (DBMS)
Category: NoSQL
Category: SQL

Python for Data Science, AI & Development

Course 222 hours4.6 (31,847 ratings)

What you'll learn

  • Describe Python Basics including Data Types, Expressions, Variables, and Data Structures.

  • Apply Python programming logic using Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas, Numpy, and Beautiful Soup.

  • Access web data using APIs and web scraping from Python in Jupyter Notebooks.

Skills you'll gain

Category: Data Science
Category: Python Programming
Category: Data Analysis
Category: Pandas
Category: Numpy

Python Project for Data Engineering

Course 36 hours4.6 (394 ratings)

What you'll learn

  • Demonstrate your skills in Python for data engineering tasks 

  • Implement webscraping and use APIs to collect data in Python

  • Assume the role of a Data Engineer working on a real project

  • Extract, Transform and Load (ETL) data using Jupyter notebooks

Skills you'll gain

Category: Information Engineering
Category: Python Programming
Category: Jupyter notebooks
Category: Web Scraping
Category: Extract Transform Load (ETL)

Introduction to Relational Databases (RDBMS)

Course 418 hours4.6 (360 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Implement different relational model constraints.

Skills you'll gain

Category: Database (DB) Design
Category: Database Architecture
Category: Postgresql
Category: MySQL
Category: Relational Database Management System (RDBMS)

Databases and SQL for Data Science with Python

Course 539 hours4.6 (17,629 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database on Cloud and work with tables.

  • Write SQL statements including SELECT, INSERT, UPDATE, and DELETE.

  • Build more powerful queries with advanced SQL techniques like views, transactions, stored procedures and joins.

Skills you'll gain

Category: Cloud Databases
Category: Python Programming
Category: Jupyter notebooks
Category: Relational Database Management System (RDBMS)
Category: SQL

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

Category: Shell Script
Category: Bash (Unix Shell)
Category: Linux

Relational Database Administration (DBA)

Course 720 hours4.6 (92 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Category: Database (DBMS)
Category: Database Servers
Category: Relational Database
Category: Database Security
Category: database administration

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 815 hours4.5 (198 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement an ETL pipelinethrough shell scripting.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Category: Extraction, Transformation And Loading (ETL)
Category: Apache Kafka
Category: Apache Airflow
Category: Data Pipelines

Getting Started with Data Warehousing and BI Analytics

Course 916 hours4.4 (97 ratings)

What you'll learn

  • Explore the architecture, features, and benefits of data warehouses, data marts, and data lakes and identify popular data warehouse system vendors.

  • Design and populate a data warehouse, and model and querydata using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • Design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Category: Data Warehousing
Category: Cube and Rollup
Category: Business Intelligence (BI)
Category: Star and Snowflake Schema
Category: cognos analytics

Introduction to NoSQL Databases

Course 1017 hours4.6 (169 ratings)

What you'll learn

  • Differentiate between the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operationsin Cassandra.

Skills you'll gain

Category: Mongodb
Category: NoSQL
Category: Cloud Database
Category: Cloudant
Category: Cassandra

Introduction to Big Data with Spark and Hadoop

Course 1112 hours4.4 (212 ratings)

What you'll learn

  • Explainthe impact of Big Data including use cases, tools, and processing methods.

  • ExplainApache Hadoop architecture, ecosystem, and practices, and userelatedapplications including HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics forDataFrames, data sets, and Spark SQL.

  • UseSpark’s RDDsanddata sets, optimizingSparkSQLusing Catalyst and Tungsten, anduseSpark’s development and runtime environment options.

Skills you'll gain

Category: Apache Hadoop
Category: SparkSQL
Category: SparkML
Category: Big Data
Category: Apache Spark

Data Engineering and Machine Learning using Spark

Course 127 hours3.9 (97 ratings)

What you'll learn

  • Explain how streaming data and Spark Structured Streaming empower machine learning and AI tasks.

  • Define graph theory, describe Apache Spark GraphFrames, and identify data suitable for GraphFrames.

  • Describe how ETL processes work with Apache Spark and machine learning and extend that knowledge to Spark MLlib capabilities and related benefits.

  • Explain supervised learning, unsupervised learning, and clustering, and explain how to use the k-means clustering algorithm with Spark MLlib.

Data Engineering Capstone Project

Course 1314 hours4.6 (51 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Category: Relational Databases
Category: Python Programming
Category: NoSQL
Category: Data Pipelines
Category: SQL

Instructors

Rav Ahuja
IBM
39 Courses1,673,968 learners
Priya Kapoor
IBM
1 Course82,996 learners
Joseph Santarcangelo
IBM
26 Courses983,279 learners

Offered by

IBM
Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Placeholder

Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions