Let us help you find the training program you are looking for.

If you can't find what you are looking for, contact us, we'll help you find it. We have over 800 training programs to choose from.

banner-img

Course Skill Level:

Foundational to Intermediate

Course Duration:

4 day/s

  • Course Delivery Format:

    Live, instructor-led.

  • Course Category:

    Big Data & Data Science

  • Course Code:

    SOLR00L21E09

Who should attend & recommended skills:

Developers and designers with basic IT, Java, & standard database technology skills

Who should attend & recommended skills

  • Those who want to implement scalable search using Apache Solr.
  • Skill-level: Foundation-level Solr skills for Intermediate skilled team members. This is not a basic class.
  • IT skills: Basic to Intermediate (1-5 years’ experience)
  • Java: Basic (1-2 years’ experience)
  • Standard database technology: Basic (1-2 years’ experience)
  • No prior knowledge of Solr or Lucene is required.

About this course

Whether you’re handling big (or small) data, managing documents, or building a website, it is important to be able to quickly search through your content and discover meaning in it. Apache Solr is your tool: a ready-to-deploy, Lucene-based, open source, full-text search engine. Solr can scale across many servers to enable real-time queries and data analytics across billions of documents.
Solr teaches you to implement scalable search using Apache Solr. This easy-to-read guide balances conceptual discussions with practical examples to show you how to implement all of Solr’s core capabilities. You’ll master topics like text analysis, faceted search, hit highlighting, result grouping, query suggestions, multilingual search, advanced geospatial and data operations, and relevancy tuning.

Skills acquired & topics covered

  • Redis expands on the key-value pattern by accepting a wide variety of data types, including hashes, strings, lists, and other structures.
  • It provides lightning-fast operations on in-memory datasets, and also makes it easy to persist to disk on the fly. Plus, it’s free and open source.
  • How to scale Solr for big data
  • Rich real-world examples
  • Solr as a NoSQL data store
  • Advanced multilingual, data, and relevancy tricks
  • Coverage of versions through Solr 4.7

Course breakdown / modules

  • Why do I need a search engine?
  • What is Solr?
  • Why Solr?
  • Features overview

  • Getting started
  • Searching is what it’s all about
  • Tour of the Solr administration console
  • Adapting the example to your needs

  • Searching, matching, and finding content
  • Relevancy
  • Precision and Recall
  • Searching at scale

  • Overview of solrconfig.xml
  • Query request handling
  • Managing searchers
  • Cache management
  • Remaining configuration options

  • Example microblog search application
  • Designing your schema
  • Defining fields in schema.xml
  • Field types for structured nontext fields
  • Sending documents to Solr for indexing
  • Update handler
  • Index management

  • Analyzing microblog text
  • Basic text analysis
  • Defining a custom field type for microblog text
  • Advanced text analysis

  • The anatomy of a Solr request
  • Working with query parsers
  • Queries and filters
  • The default query parser (Lucene query parser)
  • Handling user queries (eDisMax query parser)
  • Other useful query parsers
  • Returning results
  • Sorting results
  • Debugging query results

  • Navigating your content at a glance
  • Setting up test data
  • Field faceting
  • Query faceting
  • Range faceting
  • Filtering upon faceted values
  • Multiselect faceting, keys, and tags
  • Beyond the basics

  • Overview of hit highlighting
  • How highlighting works
  • Improving performance using FastVectorHighlighter
  • PostingsHighlighter

  • Spell-check
  • Autosuggesting query terms
  • Suggesting document field values
  • Suggesting queries based on user activity

  • Result grouping vs. field collapsing
  • Skipping duplicate documents
  • Returning multiple documents per group
  • Grouping by functions and queries
  • Paging and sorting grouped results
  • Grouping gotchas
  • Efficient field collapsing with the Collapsing query parser

  • Developing a Solr distribution
  • Deploying Solr
  • Hardware and server configuration
  • Data acquisition strategies
  • Sharding and replication
  • Solr core management
  • Managing clusters of servers
  • Querying and interacting with Solr
  • Monitoring Solr’s performance
  • Upgrading between Solr versions

  • Getting started with SolrCloud
  • Core concepts
  • Distributed indexing
  • Distributed search
  • Collections API
  • Basic system-administration tasks
  • Advanced topics

  • Why linguistic analysis matters
  • Stemming vs. lemmatization
  • Stemming
  • Handling edge cases
  • Available language libraries in Solr
  • Searching content in multiple languages
  • Language identification

  • Function queries
  • Geospatial search
  • Pivot faceting
  • Referencing external data
  • Cross-document and cross-index joins
  • Big data analytics with Solr

  • The impact of relevancy tuning
  • Debugging the relevancy calculation
  • Relevancy boosting
  • Pluggable Similarity class implementations
  • Personalized search and recommendations
  • Creating a personalized search experience
  • Running relevancy experiments