Intro to Data Science Online Course

Intro to Data Science Online Course

By Galvanize Remote

Date and time

October 20, 2014 · 12am - October 31, 2014 · 9pm PDT

Location

Online

Refund Policy

Contact the organizer to request a refund.

Description

Data science is a fast-growing field in high demand, and the first step is knowing what you don't know. We run an immersive 12-week program teaching data science, and built this course as an introduction for people who are getting started in data science.

The classes are designed to be work-at-your-own-pace. Feel free to begin on any day. We won't revoke access to the material.

About the classes:

Each 1.5-hour recorded lecture is accompanied by hands-on exercises which allow you to practice the concepts. You can follow along and complete data analysis in Python, data munging in Unix, and data visualization using D3.

Once you've completed the class series, we're happy to connect you with more resources to help further your data science skills.

Below is the class syllabus and FAQ.

Thanks,

Zipfian Academy

Course Syllabus

1. Introduction to Data Science

Data science has been called the sexiest profession of the 21st century, but there are a lot of misconceptions about what exactly it is and why it's so important. Learn - and be able to explain why - in this introduction to the subject. In this class:

  • Learn the short (but intriguing) history of Data Science.
  • Understand what differentiates Data Science from traditional fields (Statistics, Computer Science, Machine Learning, etc.)
  • Examine a case study highlighting each of the many steps (acquisition, preparation, exploration, analysis, visualization/communication) of the data analysis pipeline.
  • Get experience with a real data set and see first hand how to transform it into actionable insights.
  • Learn the next steps to becoming a Data Scientist and what resources are available to help.

2. Introduction to Statistics: Finding the Signal in the Noise

Humorist Evan Esar once quipped that statistics is "the science of producing unreliable facts from reliable figures." There's probably truth in there somewhere, but statistics is actually powerful tool and a cornerstone of data science. In this class:
  • Learn basic logic notation, statistical techniques and concepts, such as Bayes’ Theorem.
  • Understand basic probability theory and how it applies to statistics.
  • Learn how to calculate statistical significance and confidence intervals.
  • Examine real-world use cases and walkthrough a statistical exercise step-by-step.
  • Explore how statistics and probability theory are used by everyday data scientists.

3. Algorithms for the Rest of Us: An Intro to Machine Learning
Machine learning is often thought of as a black-box of complicated algorithms. This class demystifies the magic of machine learning and will help you begin to understand its power. In this class:
  • Learn the difference between Recommenders, Classifiers, and Clustering.
  • Understand how to pick the correct algorithm for your analysis.
  • Examine a case study showing how to decompose a problem to train an algorithm.
  • Learn Validation methods to test whether your algorithm is effective.
  • Compare Supervised vs. Unsupervised learning and their tradeoffs.

4. Data Wrangling 101: Looking for Data, and What to Do Once You've Found It
Getting and cleaning up data is one of the most under-appreciated aspects of data science. The right tools and techniques can reduce the amount of time it takes to get data ready for analysis. In this class:
  • Learn where to find novel data sets for analysis.
  • Get exposure to a typical ETL process and the best practices for each step.
  • Examine a case study highlighting the process of building a data analysis pipeline.
  • Understand the different ways to store your data (MySQL, MongoDB, Hadoop, etc.) and the tradeoffs of each.
  • Learn the best tools available to transform and manipulate data into a suitable format for analysis.

5. Computing at Scale: Behind the Curtain of Big Data
Sometimes it's not the best algorithm that wins, it's who has the most data. That's part of the reason why the amount of data stored worldwide each year is projected to reach over 30 zettabytes (32,212,254,720 terabytes!) by the year 2020. Working at this level of scale presents new problems and requires new tools. In this class:
  • Learn about the history of Big Data and why and how Hadoop was built.
  • Understand the different types of data (structured vs. unstructured) and what makes Big Data so “big”.
  • Become fluent in the language of Hadoop, including MapReduce, HDFS and ecosystem components such as Hive, Pig, Hbase, Flume, and Mahout.
  • Examine real-world use cases and see how Hadoop integrates into existing data workflows.
  • Learn key design requirements of a Big Data solution and how to avoid common pitfalls.

6. Introduction to Visualization: How to Effectively Tell Your Data Story
Your best insight is only as good as your ability to explain it to others. To round out the class:
  • Learn the core concepts of how to visualize data in a meaningful way
  • Understand basic visualization techniques and where to use them
  • Examine how different visualization methods can drastically change meaning
  • Explore the latest visualization technologies like D3.js and Rickshaw

FAQ

There are start and end dates listed above, but the description says you can start and end the class at any time. What's going on?

The classes are available to start and end at any time, whenever it's conveninet for you. The start and end dates listed on the page above are for administrative purposes.

How do the online classes compare with your 12-week immersive program?

This series is designed to be a comprehensive introduction to data science topics for those students who are interested in the field. The curriculum of our 12-week immersive program takes a deep dive into data science skills including programming in Python, Hadoop, and implementing Machine Learning algorithms through an intensive, project-based course. For that, you'd have to come to San Francisco!

Is there an certificate for completing the class series?

These classes are designed to accelerate and improve your self-directed learning. We don't issue a certificate, but you should feel free to discuss the classes and the knowledge you learned from them during instances in which having a certificate would be helpful.

I've purchased access to the series, but I can't access the lectures right away. What's going on?

It can take up to 1-2 business days for your purchase to be approved by our system. If you're in a hurry, contact us at the address below and we'll make sure you get access when you need it.

Zipfian Academy is a school teaching data science through an immersive 12-week course. The program helps students develop the mindset and technical skills they need to launch a data science career. Students receive access to a world-class network of data science experts, personalized instruction featuring hands-on projects, a collaborative learning space in San Francisco, and exclusive recruiting events with top-tier technology companies.

Organized by

Sales Ended