Let us help you find the training program you are looking for.

If you can't find what you are looking for, contact us, we'll help you find it. We have over 800 training programs to choose from.

Learning pandas

  • Course Code: Data Analysis / BI - Learning pandas
  • Course Dates: Contact us to schedule.
  • Course Category: Big Data & Data Science Duration: 4 Days Audience: This course is geared for Python experienced developers, analysts or others who wants to get to grips with pandas—a versatile and high-performance Python library for data manipulation, analysis, and discovery

Course Snapshot 

  • Duration: 4 days 
  • Skill-level: Foundation-level Pandas skills for Intermediate skilled team members. This is not a basic class. 
  • Targeted Audience: This course is geared for Python experienced developers, analysts or others who wants to get to grips with pandas—a versatile and high-performance Python library for data manipulation, analysis, and discovery  
  • Hands-on Learning: This course is approximately 50% hands-on lab to 50% lecture ratio, combining engaging lecture, demos, group activities and discussions with machine-based student labs and exercises. Student machines are required. 
  • Delivery Format: This course is available for onsite private classroom presentation, or remote instructor led delivery, or CBT/WBT (by request). 
  • Customizable: This course may be tailored to target your specific training skills objectives, tools of choice and learning goals. 

You will learn how to use pandas to perform data analysis in Python. You will start with an overview of data analysis and iteratively progress from modeling data, to accessing data from remote sources, performing numeric and statistical analysis, through indexing and performing aggregate analysis, and finally to visualizing statistical data and applying pandas to finance. With the knowledge you gain from this course, you will quickly learn pandas and how it can empower you in the exciting world of data manipulation, analysis and science. 

Working in a hands-on learning environment, led by our Pandas expert instructor, students will learn about and explore: 

  • Get comfortable using pandas and Python as an effective data exploration and analysis tool 
  • Explore pandas through a framework of data analysis, with an explanation of how pandas is well suited for the various stages in a data analysis process 
  • A comprehensive guide to pandas with many of clear and practical examples to help you get up and using pandas 

Topics Covered: This is a high-level list of topics covered in this course. Please see the detailed Agenda below 

  • Understand how data analysts and scientists think about of the processes of gathering and understanding data 
  • Learn how pandas can be used to support the end-to-end process of data analysis 
  • Use pandas Series and DataFrame objects to represent single and multivariate data 
  • Slicing and dicing data with pandas, as well as combining, grouping, and aggregating data from multiple sources 
  • How to access data from external sources such as files, databases, and web services 
  • Represent and manipulate time-series data and the many of the intricacies involved with this type of data 
  • How to visualize statistical information 
  • How to use pandas to solve several common data representation and analysis problems within finance 

Audience & Pre-Requisites 

This course is geared for attendees with Python skills who wish to get to grips with pandas—a versatile and high-performance Python library for data manipulation, analysis, and discovery 

Pre-Requisites:  Students should have  

  • developers with some knowledge of Python.  
  • experienced with spreadsheet software who know the basics of Python. 

Course Agenda / Topics 

  1. pandas and Data Analysis 
  • pandas and Data Analysis 
  • Introducing pandas 
  • Data manipulation, analysis, science, and pandas 
  • The process of data analysis 
  • Relating the book to the process 
  • Concepts of data and analysis in our tour of pandas 
  • Other Python libraries of value with pandas 
  1. Up and Running with pandas 
  • Up and Running with pandas 
  • Installation of Anaconda 
  • IPython and Jupyter Notebook 
  • Introducing the pandas Series and DataFrame 
  • Visualization 
  1. Representing Univariate Data with the Series 
  • Representing Univariate Data with the Series 
  • Configuring pandas 
  • Creating a Series 
  • The .index and .values properties 
  • The size and shape of a Series 
  • Specifying an index at creation 
  • Heads, tails, and takes 
  • Retrieving values in a Series by label or position 
  • Slicing a Series into subsets 
  • Alignment via index labels 
  • Performing Boolean selection 
  • Re-indexing a Series 
  • Modifying a Series in-place 
  1. Representing Tabular and Multivariate Data with the DataFrame 
  • Representing Tabular and Multivariate Data with the DataFrame 
  • Configuring pandas 
  • Creating DataFrame objects 
  • Accessing data within a DataFrame 
  • Selecting rows using Boolean selection 
  • Selecting across both rows and columns 
  1. Manipulating DataFrame Structure 
  • Manipulating DataFrame Structure 
  • Configuring pandas 
  • Renaming columns 
  • Adding new columns with [] and .insert() 
  • Adding columns through enlargement 
  • Adding columns using concatenation 
  • Reordering columns 
  • Replacing the contents of a column 
  • Deleting columns 
  • Appending new rows 
  • Concatenating rows 
  • Adding and replacing rows via enlargement 
  • Removing rows using .drop() 
  • Removing rows using Boolean selection 
  • Removing rows using a slice 
  1. Indexing Data 
  • Indexing Data 
  • Configuring pandas 
  • The importance of indexes 
  • The pandas index types 
  • Working with Indexes 
  • Hierarchical indexing 
  1. Categorical Data 
  • Categorical Data 
  • Configuring pandas 
  • Creating Categoricals 
  • Renaming categories 
  • Appending new categories 
  • Removing categories 
  • Removing unused categories 
  • Setting categories 
  • Descriptive information of a Categorical 
  • Munging school grades 
  1. Numerical and Statistical Methods 
  • Numerical and Statistical Methods 
  • Configuring pandas 
  • Performing numerical methods on pandas objects 
  • Performing statistical processes on pandas objects 
  1. Accessing Data 
  • Accessing Data 
  • Configuring pandas 
  • Working with CSV and text/tabular format data 
  • Reading and writing data in Excel format 
  • Reading and writing JSON files 
  • Reading HTML data from the web 
  • Reading and writing HDF5 format files 
  • Accessing CSV data on the web 
  • Reading and writing from/to SQL databases 
  • Reading data from remote data services 
  1. Tidying Up Your Data 
  • Tidying Up Your Data 
  • Configuring pandas 
  • What is tidying your data? 
  • How to work with missing data 
  • Handling duplicate data 
  • Transforming data 
  1. Combining, Relating, and Reshaping Data 
  • Combining, Relating, and Reshaping Data 
  • Configuring pandas 
  • Concatenating data in multiple objects 
  • Merging and joining data 
  • Pivoting data to and from value and indexes 
  • Stacking and unstacking 
  • Performance benefits of stacked data 
  1. Data Aggregation 
  • Data Aggregation 
  • Configuring pandas 
  • The split, apply, and combine (SAC) pattern 
  • Data for the examples 
  • Splitting data 
  • Applying aggregate functions, transforms, and filters 
  • Transforming groups of data 
  • Filtering groups from aggregation 
  1. Time-Series Modelling 
  • Time-Series Modelling 
  • Setting up the IPython notebook 
  • Representation of dates, time, and intervals 
  • Introducing time-series data 
  • Calculating new dates using offsets 
  • Representing durations of time using Period 
  • Handling holidays using calendars 
  • Normalizing timestamps using time zones 
  • Manipulating time-series data 
  • Time-series moving-window operations 
  1. Visualization 
  • Visualization 
  • Configuring pandas 
  • Plotting basics with pandas 
  • Creating time-series charts 
  • Common plots used in statistical analyses 
  • Manually rendering multiple plots in a single chart 
  1. Historical Stock Price Analysis 
  • Historical Stock Price Analysis 
  • Setting up the IPython notebook 
  • Obtaining and organizing stock data from Google 
  • Plotting time-series prices 
  • Plotting volume-series data 
  • Calculating the simple daily percentage change in closing price 
  • Calculating simple daily cumulative returns of a stock 
  • Resampling data from daily to monthly returns 
  • Analyzing distribution of returns 
  • Performing a moving-average calculation 
  • Comparison of average daily returns across stocks 
  • Correlation of stocks based on the daily percentage change of the closing price 
  • Calculating the volatility of stocks 
  • Determining risk relative to expected returns 
View All Courses

    Course Inquiry

    Fill in the details below and we will get back to you as quickly as we can.

    Interested in any of these related courses?