Using Solr with Big Data

Course Overview

This two-day training course is designed for Solr developers who want to:

  • Learn how to use key open source tools such as Hadoop, Cascading and Cassandra.
  • Process big data using workflows to generate large search indexes.
  • Use Solr 4 as a scalable NoSQL database and analytics engine.

The modules and labs teach participants about the essential technologies for generating both traditional Solr search indexes and NoSQL/analytics “databases” using Hadoop, Cascading, Cassandra and Storm. By the end of the class, students will understand both good and bad use cases for these popular “big data” technologies, and how they can be used to create larger, more sophisticated search and analytics solutions based on Solr. Also see course outline.

Class Schedule and Registration

10% Early Bird discount automatically applied when you register 30 days prior to the class start date.

Who Should Attend?

The class is for Solr developers who want to know how to leverage the flexible search functionality of Apache Solr and the Big Data processing of Apache Hadoop, to create the indexes for both general search and augmented data analytics. Lab exercises and real-world examples will be used to reinforce content.

Prerequisites

To get the most from this course you should have experience developing developing Solr applications and with Java development. We also recommend completing “Solr Unleashed” and relevant work experience before taking this class.

Format

Instructor-led lectures with hands-on lab exercises, examples & demonstrations.

Course Materials

Participants will receive an electronic copy of all slides and handouts, as well as links to other resources and downloads.

More information

If you have questions about this or any other LucidWorks University class, please contact the LucidWorks University team.

Course Outline

 

Big Data

  • Real-world example of a big data problem
  • Real-world solution using Hadoop & Solr
  • Hadoop overview
  • Hadoop distributed file system (HDFS)
  • Conceptual map-reduce
  • Hadoop map-reduce
  • Hadoop streaming
  • Map-reduce lab
  • Hadoop summary
  • Hadoop eco-system
  • Workflows with Cascading
  • NoSQL with Cassandra
  • Continuous processing with Storm
  • Workflow lab
Big Search

  • Why use Hadoop with Solr?
  • Designing workflows
  • Moving Big Data
  • Scalable Solr indexing
  • Solr indexing lab
  • Augmented search
  • Solr as a NoSQL database
  • Solr-based analytics
  • Solr analytics lab
  • Scalable indexes
  • Optimizing indexes lab

Cancellation Policy

Registration for a class can be cancelled up to 14 calendar days in advance of the class date for either a full refund, or credit towards another class. No credit or refund can be given for no-shows, or class registrations cancelled less than 14 calendar days prior to a class date. If a registered participant is unable to attend the course, a substitute is welcome to take their place.

On occasion, LucidWorks has to cancel or reschedule a delivery. If this happens, we will notify you as far in advance of the scheduled course dates as possible. In the event that a course is cancelled, the liability of LucidWorks is limited to the return of paid registration fees.

LucidWorks University

LucidWorks University

Need help choosing the right Solr training?

 

Tan Matosian
Training Manager

SiLK Webinar

Marketplace

Search MarketPlace

Get the latest apps, add-ons,
code snippets and more

Visit the MarketPlace XXX

Blog

LucidWorks Blog

Alea Abed, Aug 21, 2014
Andy Wibbels, Jul 16, 2014

SearchHub.org

SearchHub.org

All Lucene/Solr - All the time.

Check it out XXX

DeveloperXXX