TMVA New Features Sergei V. Gleyzer Omar Zapata Mesa Lorenzo Moneta Data Science at the LHC Workshop Nov. 9, 2015.

Slides:



Advertisements
Similar presentations
Dr Andy Pryke - The Data Mine Ltd An Introduction to R Free software for repeatable statistics, visualisation and modeling Dr Andy Pryke, The Data Mine.
Advertisements

This is a subject-based presentation for you as PV to adapt to your subject. The subject content at present is English for Speakers of other Languages.
Scikit-learn: Machine learning in Python
Workshop on Implementing Scriblio The Next-Generation Library Catalog 25 June 2008 The Hong Kong University of Science and Technology Library Interface.
Justin Sun Boston DataCon September 14, Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources.
March 16, Calendar Next week: Thursday meeting instead of Tuesday (May 25) Web update later today.
London, UK 21 May 2012 Janaina Mourao-Miranda, Machine Learning and Neuroimaging Lab, University College London, UK Pattern Recognition for Neuroimaging.
ExaFMM --An open source fast multipole method library aimed for Exascale systems Rio Yokota (KAUST), L. A. Barba (BU)
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
Practical Machine Learning Pipelines with MLlib Joseph K. Bradley March 18, 2015 Spark Summit East 2015.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
JAS3 + AIDA LC Simulations Workshop SLAC 19 th May 2003.
Nightly Releases and Testing Alexander Undrus Atlas SW week, May
Company Overview for GDF Suez December 29, Enthought’s Business Enthought provides products and consulting services for scientific software solutions.
--Caesar Cai TEXT RECOGNITION SENIOR CAPSTONE 2012.
Virtual Cell and CellML The Virtual Cell Group Center for Cell Analysis and Modeling University of Connecticut Health Center Farmington, CT – USA.
1 of 8Prof. Dr. Dr. h.c. Otto Spaniol Computer Science, Informatik 4 Communication and Distributed Systems 25/10/2015 Distributed Applications and Network.
Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop – Part 2 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 29, 2015,
Database Systems Carlos Ordonez. What is “Database systems” research? Input? large data sets, large files, relational tables How? Fast external algorithms;
Information Literacy Using The Learning Manager (TLM)‏ Online Instruction from your Desktop! Sylvie Lafortune and Alain Létourneau.
Chapter12. 1) What are students like today? 2) How are they motivated to learn? 3) How can we, as AGSC teachers, incorporate that into our teaching? 4)
TEXT ANALYTICS - LABS Maha Althobaiti Udo Kruschwitz Massimo Poesio.
Petra III Status Teresa Núñez Hasylab-DESY Tango Meeting DESY,
Weekly Presentation Fabricio Teles Dutra Goncalves 05/22/2015.
Lecture 6: Classification – Boosting and SVMs CAP 5415 Fall 2006.
ROOT-R Interface Minimizer/Optimizer Kirby Hermansen Lorenzo Moneta.
Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package MuQun Yang, Christian Chilan, Albert Cheng, Quincey Koziol, Mike.
Titanic: Machine Learning from Disaster
Come ON TIME to the RIGHT LOCATION. Bring PICTURE ID. Make sure you know your TUTORIAL NUMBER Calculators: any Casio fx991 calculator is ok If you have.
Feature Selection Topics
AliEn central services Costin Grigoras. Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps.
CS 732: Advance Machine Learning
LECTURE 02: EVALUATING MODELS January 27, 2016 SDS 293 Machine Learning.
M M M M 5. Not Listed
New Advanced Higher Subject Implementation Events Computing Science Unit Assessment at Advanced Higher.
Program Update May 9, 2013 Andrew J. Buckler, MS Principal Investigator, QI-Bench WITH FUNDING SUPPORT PROVIDED BY NATIONAL INSTITUTE OF STANDARDS AND.
Mantid: A new approach to data analysis at large scale facilities Jon Taylor Project WIKI (Binary downloads)

Origami: Scientific Distributed Workflow in McIDAS-V Maciek Smuga-Otto, Bruce Flynn (also Bob Knuteson, Ray Garcia) SSEC.
Introduction to FCC Software FCC Istanbul 11 March, 2016 Alice Robson (CERN/UNIGE) on behalf of / with thanks to the FCC software group.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
ILC DMS – 8 th November 2005 Thomas Baron – CERN – IT Managing Events with Indico.
Bootstrap Tutorial Overview Objective Learn how to use the bootstrap for configuring the system. Requirements Installed Version of.
Cloud Analytics Platforms Christian Frey. About AIDA Our mission is to advance knowledge in data analytics through research, education and outreach Our.
Exploring Strategies For Optimizing Knowledge Derivation From Imagery
IML News Sergei Gleyzer (CMS)
Bhakthi Liyanage SQL Saturday Atlanta 15 July 2017
Introduction to Machine Learning
Inter-experimental LHC Machine Learning Working Group Activities
Machine Learning Developments in ROOT Sergei Gleyzer, Lorenzo Moneta
Assurance Scoring: Using Machine Learning and Analytics to Reduce Risk in the Public Sector Matt Thomson 17/11/2016.
Business analytics Lessons from an undergraduate introductory course
Sergei V. Gleyzer University of Florida
VO-Neural / Data Mining Exploration
IML News IML WG Meeting Sergei Gleyzer (CMS)
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Zhe Ye Word2vec Tutorial Zhe Ye
دانشکده سرمایه گذار مبانی و مدل
Spark Software Stack Inf-2202 Concurrent and Data-Intensive Programming Fall 2016 Lars Ailo Bongo
Website Call
Data Science with Python
Database 01.
Call : Website: -
Lecture 15: Data Cleaning for ML
Scaffolding the Generation of Machine Learning Models with SciRise
Python for Data Analysis
The Student’s Guide to Apache Spark
Presentation transcript:

TMVA New Features Sergei V. Gleyzer Omar Zapata Mesa Lorenzo Moneta Data Science at the LHC Workshop Nov. 9, 2015

Outline New TMVA Features Data Loader Feature Selection RMVA PyMVA Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop2

New Features Improved design that allows – Greater flexibility (modularization) – Feature Selection, Cross-Validation, new techniques for data storage and manipulation – Parallelization (OpenMP/MPI/cuda/TBB) – Integration with Python and R (PyMVA/RMVA) Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop3

Data Loader TMVA::DataLoader class allows greater flexibility in dealing with data – Connection of different features to different classifier methods: Useful for optimization Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop4

Feature Selection Based on the FAST stochastic wrapper algorithm – See previous talk and today’s tutorial for details Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop5

RMVA RMVA is a set of plugins for TMVA that allows the use of R’s classification and regression packages Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop6

RMVA Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop7

PyMVA PyMVA is a set of plugins for TMVA based on python api that allows use of python based ML methods Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop8

PyMVA Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop9

Tutorials Try the new features tutorial today and the interfaces tutorial on Thursday More features coming soon. Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop10

More Information Websites: Nov. 9, 2015Sergei V. Gleyzer Data Science at LHC Workshop11