
About This Course

Whether you’re building big data applications, developing data pipelines, or working on machine learning projects, it’s essential to manage changes to your code. Although developers and data scientists have used a variety of tools for this over the years, the open-source version control system Git has emerged as the standard tool for thousands of organizations around the world. This course introduces students to Git through a series of lectures, demonstrations, and hands-on exercises.

View the full course outline

Payment and Registration

You can purchase this course on its own, or as part of our Full Library subscription.


Course Length

This course includes over an hour of video content. Students who have purchased this course on its own are allowed up to 5 hours of exercise time. (Subscribers to the full OnDemand library are given 100 hours of exercise time across all courses.)


Course Outline

Students who successfully complete this course will be able to:

  • Understand how Git compares to other revision control systems
  • Create and work with Git repositories, both locally and on GitHub
  • Explore repository contents and add new files
  • Stage and commit modifications to files stored in a repository
  • Compare file modifications and resolve conflicting changes
  • Use branches and tags
  • Collaborate using the Git integration in Cloudera Data Science Workbench (CDSW)


Audience and Prerequisites

This course is best suited to developers and data scientists who are comfortable performing basic operations from the Linux command line. No prior experience with Git or other revision control systems is necessary.