Practical Data Science Cookbook

Home » Technology » Big Data & Data Science » Practical Data Science Cookbook

Course Skill Level:

Foundational

Course Duration:

3 day/s

Course Delivery Format:

Live, instructor-led.
Course Category:

Big Data & Data Science
Course Code:

PRDSCBL21E09

Who should attend & recommended skills:

Beginners with basic Python skills

Who should attend & recommended skills

Beginners who want to complete real-world data science projects in R and Python.
Skill-level: Foundation-level Practical Data Science Cookbook skills for Intermediate skilled team members. This is not a basic class.
Python: Basic (1-2 years’ experience)

About this course

As increasing amounts of data are generated each year, the need to analyze and create value out of it is more important than ever. Companies that know what to do with their data and how to do it well will have a competitive advantage over companies that don’t. Because of this, there will be an increasing demand for people that possess both the analytical and technical abilities to extract valuable insights from data and create valuable solutions that put those insights to use. Starting with the basics, this course covers how to set up your numerical programming environment, introduces you to the data science pipeline, and guides you through several data projects in a step-by-step format. By sequentially working through the steps in each lesson, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples using the two most popular programming languages for data analysis – R and Python.

Skills acquired & topics covered

Tackling every step in the data science pipeline and use it to acquire, clean, analyze, and visualize your data
Getting beyond the theory and implement real-world projects in data science using R and Python
Easy-to-follow recipes will help you understand and implement the numerical computing concepts
The installation procedure and environment required for R and Python on various platforms
Preparing data for analysis by implement various data science concepts such as acquisition, cleaning and munging through R and Python
Building a predictive model and an exploratory model
Analyzing the results of your model and create reports on the acquired data
Building various tree-based methods and build random forest

Course breakdown / modules

Understanding the data science pipeline
Installing R on Windows, Mac OS X, and Linux
Installing libraries in R and RStudio
Installing Python on Linux and Mac OS X
Installing Python on Windows
Installing the Python data stack on Mac OS X and Linux
Installing extra Python packages
Installing and using virtualenv

Introduction
Acquiring automobile fuel efficiency data
Preparing R for your first project
Importing automobile fuel efficiency data into R
Exploring and describing fuel efficiency data
Analyzing automobile fuel efficiency over time
Investigating the makes and models of automobiles

Introduction
Preparing for the analysis of top incomes
Importing and exploring the world’s top incomes dataset
Analyzing and visualizing the top income data of the US
Furthering the analysis of the top income groups of the US
Reporting with Jinja2
Repeating the analysis in R

Introduction
Acquiring stock market data
Summarizing the data
Cleaning and exploring the data
Generating relative valuations
Screening stocks and analyzing historical prices

Introduction
Preparing for analysis
Importing employment data into R
Exploring the employment data
Obtaining and merging additional data
Adding geographical information
Extracting state- and county-level wage and employment information
Visualizing geographical distributions of pay
Exploring where the jobs are, by industry
Animating maps for a geospatial time series
Benchmarking performance for some common tasks

Introduction
Getting started with IPython
Exploring Jupyter Notebook
Preparing to analyze automobile fuel efficiencies
Exploring and describing fuel efficiency data with Python
Analyzing automobile fuel efficiency over time with Python
Investigating the makes and models of automobiles with Python

Introduction
Preparing to work with social networks in Python
Importing networks
Exploring subgraphs within a heroic network
Finding strong ties
Finding key players
Exploring the characteristics of entire networks
Clustering and community detection in social networks
Visualizing graphs
Social networks in R

Introduction
Modeling preference expressions
Understanding the data
Ingesting the movie review data
Finding the highest-scoring movies
Improving the movie-rating system
Measuring the distance between users in the preference space
Computing the correlation between users
Finding the best critic for a user
Predicting movie ratings for users
Collaboratively filtering item by item
Building a non-negative matrix factorization model
Loading the entire dataset into the memory
Dumping the SVD-based model to the disk
Training the SVD-based model
Testing the SVD-based model

Introduction
Creating a Twitter application
Understanding the Twitter API v1.1
Determining your Twitter followers and friends
Pulling Twitter user profiles
Making requests without running afoul of Twitter’s rate limits
Storing JSON data to disk
Setting up MongoDB for storing Twitter data
Storing user profiles in MongoDB using PyMongo
Exploring the geographic information available in profiles
Plotting geospatial data in Python

Introduction
The ts object
Visualizing time series data
Simple linear regression models
ACF and PACF
ARIMA models
Accuracy measurements
Fitting seasonal ARIMA models

Introduction
Simple data transformations
Visualizing categorical data
Discriminant analysis
Dividing the data and the ROC
Fitting the logistic regression model
Decision trees and rules
Decision tree for German data

Free Training Courses

Leadership & Professional Development Courses

Microsoft Office Courses

Technology Courses

Practical Data Science Cookbook

Who should attend & recommended skills

About this course

Skills acquired & topics covered

Course breakdown / modules

Browse our programs to take the next step toward advancing yourself, your team, and organization.

Free Training Courses

Leadership & Professional Development Courses

Microsoft Office Courses

Technology Courses

Let us help you find the training program you are looking for.

Practical Data Science Cookbook

Who should attend & recommended skills

About this course

Skills acquired & topics covered

Course breakdown / modules

Preparing Your Data Science Environment

Driving Visual Analysis with Automobile Data with R

Creating Application-Oriented Analyses Using Tax Data and Python

Modeling Stock Market Data

Visually Exploring Employment Data

Driving Visual Analyses with Automobile Data

Working with Social Graphs

Recommending Movies at Scale (Python)

Harvesting and Geolocating Twitter Data (Python)

Forecasting New Zealand Overseas Visitors

German Credit Data Analysis

Browse our programs to take the next step toward advancing yourself, your team, and organization.

View Course Detail