Data Science and Machine Learning with Python

Technology - Frank Kane

Ready to take this Premium course?

Sign up today and continue this course plus 1000 more courses anytime, anywhere.

Enroll Now

This course has two Highlights.

  1. The course will help you learn the techniques used by industry experts and working professionals.
  2. The course is delivered in simple English language. Most of the complicated terminology is avoided to help you understand easily.

At the end of the course, you are expected to finish a final project.

The topics covered in this course include:

  • Deep Learning / Neural Networks (MLP's, CNN's, RNN's)
  • Regression analysis
  • K-Means Clustering
  • Principal Component Analysis
  • Train/Test and cross-validation
  • Bayesian Methods
  • Decision Trees and Random Forests
  • Multivariate Regression
  • Multi-Level Models
  • Support Vector Machines
  • Reinforcement Learning
  • Collaborative Filtering
  • K-Nearest Neighbor
  • Bias/Variance Tradeoff
  • Ensemble Learning
  • Frequency / Inverse Document Frequency
  • Experimental Design and A/B Tests

Prior experience in any coding will be required for this course. However, Python is not mandatory. This course starts with a crash course where you can learn Python easily if you have any coding experience.


Pre – Requisites

  • A desktop computer (Windows, Mac, or Linux) which supports Enthought Canopy 1.6.2 or newer. You will learn the installation process during this course.
  • Coding or scripting experience is required.
  • Minimum of high school level math skills will be required.


Learning Objectives

  • Develop using Python notebooks
  • Understand statistical measures such as standard deviation
  • Visualize data distributions, probability mass functions, and probability density functions
  • Visualize data with matplotlib
  • Use covariance and correlation metrics
  • Apply conditional probability for finding correlated features
  • Use Bayes' Theorem to identify false positives
  • Make predictions using linear regression, polynomial regression, and multivariate regression
  • Understand complex multi-level models
  • Use train/test and K-Fold cross-validation to choose the right model
  • Build a spam classifier using Naive Bayes
  • Use decision trees to predict hiring decisions
  • Cluster data using K-Means clustering and Support Vector Machines (SVM)
  • Build a movie recommender system using item-based and user-based collaborative filtering
  • Predict classifications using K-Nearest-Neighbor (KNN)
  • Apply dimensionality reduction with Principal Component Analysis (PCA) to classify flowers
  • Understand reinforcement learning - and how to build a Pac-Man bot
  • Clean your input data to remove outliers
  • Implement machine learning, clustering, and search using TF/IDF at massive scale with Apache Spark's MLLib
  • Design and evaluate A/B tests using T-Tests and P-Values


Career Opportunities

According to Glassdoor, in 2016 data science was the highest paid field to get into. Of course, this follows the basic laws of economics - supply and demand. The demand for data science is very high, while the supply is too low. 

What are some examples of data science?

  1. Google. They are the definition of data science. Everything they do is data driven by their search engine (, through their YouTube efforts, maximization of ad revenue, etc. 
  2. Amazon. Each product recommendation that you get comes from Amazon’s sophisticated data science algorithms.
  3. Facebook. Facebook is generating ad revenue like crazy since it has all that personal data for all its users. Since you interact with the platform, they know if you prefer cat videos or dog videos, so they know if you are a cat person or a dog person.

“A data scientist is someone who is better at statistics than any software engineer and better at software engineering than any statistician.”

On any given day, a data scientist may be required to:

  • Conduct undirected research and frame open-ended industry questions
  • Extract huge volumes of data from multiple internal and external sources
  • Employ sophisticated analytics programs, machine learning and statistical methods to prepare data for use in predictive and prescriptive modeling
  • Thoroughly clean and prune data to discard irrelevant information
  • Explore and examine data from a variety of angles to determine hidden weaknesses, trends and/or opportunities
  • Devise data-driven solutions to the most pressing challenges
  • Invent new algorithms to solve problems and build new tools to automate work
  • Communicate predictions and findings to management and IT departments through effective data visualizations and reports
  • Recommend cost-effective changes to existing procedures and strategies

Every company will have a different take on job tasks. Some treat their data scientists as glorified Data Analysts or combine their duties with Data Engineers; others need top-level analytics experts skilled in intense machine learning and data visualizations.




Frank Kane

Founder: Sundog Education

Frank spent 9 years at Amazon and IMDb, developing and managing the technology that automatically…

Training 5 or more people?

Get your team access to WIISE top 2,000 courses anytime, anywhere.

Try WIISE for Business

What will you cover?

  •   Introduction 00:02:41
  •   Getting What You Need 00:02:36
  •   Installing Enthought Canopy 00:06:51
  •   Python Basics, Part 1 00:15:58
  •   Python Basics, Part 2 00:09:41
  •   Running Python Scripts 00:03:55
  •   Introducing the Pandas Library 00:10:14


Frank Kane

Founder: Sundog Education

Frank spent 9 years at Amazon and IMDb, developing and managing the technology that automatically…

Training 5 or more people?

Get your team access to WIISE top 2,000 courses anytime, anywhere.

Try WIISE for Business

Project Description


Using the concepts of basics of Python, develop the following projects.

1. Dice Rolling Simulator

The Goal: Like the title suggests, this project involves writing a program that simulates rolling dice. When the program runs, it will randomly choose a number between 1 and 6. (Or whatever other integer you prefer — the number of sides on the die is up to you.) The program will print what that number is. It should then ask you if you’d like to roll again. For this project, you’ll need to set the min and max number that your dice can produce. For the average die, that means a minimum of 1 and a maximum of 6. You’ll also want a function that randomly grabs a number within that range and prints it.


Concepts to keep in mind:

    • Random
    • Integer
    • Print
    • While Loops


A good project for beginners, this project will help establish a solid foundation for basic concepts. And if you already have programming experience, chances are that the concepts used in this project aren’t completely foreign to you. Print, for example, is similar to Javascript’s console.log.

2. Mad Libs Generator

The Goal: Inspired by Summer Son’s Mad Libs project with Javascript. The program will first prompt the user for a series of inputs a la Mad Libs. For example, a singular noun, an adjective, etc. Then, once all the information has been inputted, the program will take that data and place them into a premade story template. You’ll need prompts for user input, and to then print out the full story at the end with the input included.


Concepts to keep in mind:

    • Strings
    • Variables
    • Concatenation
    • Print


A pretty fun beginning project that gets you thinking about how to manipulate user inputted data. Compared to the prior projects, this project focuses far more on strings and concatenating. Have some fun coming up with some wacky stories for this!


No Attachments found ! in this Course

Related Courses

Frank Kane
Data Science and Machine Learning with Python
  5h 56m           12,906 Views
John Bura
Android For Complete Beginners
  11h 28m           57,338 Views
Rob Percival
Web Development for Beginners I
  12h 15m           20,352 Views
Frank Kane
Big Data: Apache Spark and Python
  5h 11m           12,643 Views