In part one of this course, students will develop hands-on experience manipulating real-world data using a range of data science tools (including the command line, Python, Jupiter, Git, and GitHub). This part of the course will emphasize general coding skills in Python, as well as python packages that are typically used in data science.
In part two, students will learn to develop data science projects to answer meaningful questions via backwards design, and to manage projects from inception to presentation of results. By the end of the course, students will be comfortable working with unsanitized data in numerous forms, collaborating on projects, and designing real data science projects. The course assumes basic familiarity with Python, Git, and statistics, but is meant to serve as a foundation for subsequent more specialized courses that assume deep comfort with data manipulation skills.