Workshop
Wednesday, April 6, 2016 in San Francisco
Full-day: 8:00am - 4:00pm
Room: Salon 5 & 6
Supercharging Prediction:
Hands-On with Ensemble Models
Intended Audience:
- Practitioners: Analysts who would like to learn theoretical principles of and practical tips for how to build model ensembles.
- Technical Managers: Project leaders and managers who are responsible for developing predictive analytics solutions and want to understand the potential value and limitations of model ensembles.
Knowledge Level: Beginning to intermediate understanding of statistical methods or predictive modeling algorithms.
More statements of testimony:
"Outstanding instruction."
– Robert Lake, Cisco
Workshop Description
Once you know the basics of predictive analytics including data exploration, data preparation, modeling building, and model evaluation, what can be done to improve model accuracy? One key technique is the use of model ensembles, combines several or even thousands of models into a single, new model score. It turns out that model ensembles are usually more accurate than any single model, and they are typically more fault tolerant than single models.
Are model ensembles an algorithm or an approach? How can one understand the influence of key variables in the ensembles? Which options affect the ensembles most? This workshop dives into the key ensemble approaches including Bagging, Random Forests, and Stochastic Gradient Boosting. Attendees will learn "best practices" and attention will be paid to learning and experiencing the influence various options have on ensemble models so that attendees will gain a deeper understanding of how the algorithms work qualitatively and how one can interpret resulting models. Attendees will also learn how to automate the building of ensembles by changing key parameters.
Participant background
Participants are expected to know the principles of predictive analytics and how the most important algorithms in predictive analytics work (like decision trees, neural networks, regression, etc.).
Course Notes and Free Textbook
Course notes will be provided in electronic form and delivered on a thumb drive or available via download.
All participants will also receive an eBook code for Abbott's book Applied Predictive Analytics (Wiley 2014).
Software
The key concepts covered during this workshop can be applied to many predictive analytics projects regardless of the software used. Live demonstrations using Salford Systems SPM and KNIME will be included in the workshop. Participants will receive an evaluation copy of SPM as part of the registration.
Hardware
Laptops are not required for this course, but is recommended to view the course slides and take notes. Additionally, all participants who would like to experiment with ensembles during the demonstrations may do so with the software provided.
Why Attend?
Schedule
- Registration/Breakfast - 7:30am -8:30am
- AM Break 10:00am - 10:15am
- Lunch 12:15-1:00pm
- First PM Break: 2:15- 2:30pm
- End of the Workshop: 4:00pm
Instructor
Dean Abbott, President, Abbott Analytics
Dean Abbott is Co-Founder and Chief Data Scientist of SmarterHQ, and President of Abbott Analytics, Inc. in San Diego, California. Mr. Abbott is an internationally recognized data mining and predictive analytics expert with over two decades of experience applying advanced data mining algorithms, data preparation techniques, and data visualization methods to real-world problems, including fraud detection, risk modeling, text mining, personality assessment, response modeling, survey analysis, planned giving, and predictive toxicology.
Mr. Abbott is the author of Applied Predictive Analytics (Wiley, 2014) and co-author of IBM SPSS Modeler Cookbook (Packt Publishing, 2013). He has taught full-day data mining and predictive analytics training courses and hands-on workshops to thousands . He teaches predictive analytics and text mining courses through UC Irvine extension and UC San Diego extension programs.