Let us help you find the training program you are looking for.

If you can't find what you are looking for, contact us and we'll help you find it. We have over 800 training programs to choose from.

Course Skill Level:

Foundational to Intermediate

Course Duration:

5 days

Course Delivery Format:

Live, instructor-led

Course Category:

Change Management

Who should attend & recommended skills:

Those with basic IT and traditional database skills

Course breakdown / modules

  • The properties of data
  • The fact-based model for representing data
  • Graph schemas
  • A complete data model for SuperWebAnalytics.com

  • Why a serialization framework?
  • Apache Thrift
  • Limitations of serialization frameworks

  • Storage requirements for the master dataset
  • Choosing a storage solution for the batch layer
  • How distributed filesystems work
  • Storing a master dataset with a distributed filesystem
  • Vertical partitioning
  • Low-level nature of distributed filesystems
  • Storing the SuperWebAnalytics.com master dataset on a distributed filesystem

  • Using the Hadoop Distributed File System
  • Data storage in the batch layer with Pail
  • Storing the master dataset for SuperWebAnalytics.com