RooFit – tools for ML fits Authors: Wouter Verkerke and David Kirkby.

Slides:



Advertisements
Similar presentations
Testing Relational Database
Advertisements

Statistical Methods for Data Analysis Modeling PDF’s with RooFit
Statistical Methods for Data Analysis Random numbers with ROOT and RooFit Luca Lista INFN Napoli.
S.Towers TerraFerMA TerraFerMA A Suite of Multivariate Analysis tools Sherry Towers SUNY-SB Version 1.0 has been released! useable by anyone with access.
R Packages Davor Cubranic SCARL, Dept. of Statistics.
Wouter Verkerke, NIKHEF RooFit A tool kit for data modeling in ROOT (W. Verkerke, D. Kirkby) RooStats A tool kit for statistical analysis (K. Cranmer,
Effect of b-tagging Scale Factors on M bb invariant mass distribution Ricardo Gonçalo.
 Statistics package  Graphics package  Programming language  Can be used to share/reproduce analyses  Many new packages being created - can be downloaded.
MC365 Ant. Today We Will Cover: Overview of Ant Installing Ant Using the Ant command line tool Various Ant commands available Using Ant in Eclipse.
K. Kinoshita University of Cincinnati Belle Collaboration Statistical distribution of  - zero free parameters impact of free parameters some speculations.
QCD Meeting July 23, 2004 Is it due to the hard collision? Is it due to fragmentation? Strong polarization seen in fixed-target experiments where jet NOT.
RooFit A tool kit for data modeling in ROOT
RooFit Introduction Basic functionality Addition and convolution
D L /dz and  L : Results and Plans J. Thompson, A. Roodman SLAC MDI Meeting - July 22, 2005.
Chapter 3 Planning Your Solution
Normal Distribution Recall how we describe a distribution of quantitative (continuous) data: –plot the data (stemplot or histogram) –look for the overall.
Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.
What is R By: Wase Siddiqui. Introduction R is a programming language which is used for statistical computing and graphics. “R is a language and environment.
Linux Operations and Administration
The CAVES Project Collaborative Analysis Versioning Environment System The CODESH Project COllaborative DEvelopment SHell Dimitri Bourilkov University.
Statistical Methods for Data Analysis Parameter estimates with RooFit Luca Lista INFN Napoli.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
RooFit/RooStats Tutorial CAT Meeting, June 2009
Yannick Patois – CVS and Autobuild tools at CCIN2P3 – hepix - October, n° 1 CVS setup at CC-IN2P3 and Datagrid edg- build tools CVS management,
BaBar MC production BaBar MC production software VU (Amsterdam University) A lot of computers EDG testbed (NIKHEF) Jobs Results The simple question:
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
ADD-IN EXPRESS True RAD tools for Microsoft Office developers.
RooFit A tool kit for data modeling in ROOT
1 GE Fanuc ©2008 GE Fanuc Intelligent Platforms All Rights Reserved Proficy* DataMart v1.0 Barry Lynch Product Manager GE Fanuc Software Puts the “E” in.
Update on Database Issues Peter Chochula DCS Workshop, June 21, 2004 Colmar.
Architectural Styles SE 464 / ECE 452 / CS 446 Chang Hwan Peter Kim Based on slides prepared by Michał Antkiewicz June 24, 2006.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
Wouter Verkerke, NIKHEF RooFit A tool kit for data modeling in ROOT Wouter Verkerke (NIKHEF) David Kirkby (UC Irvine)
1Offline Weekly Meeting May May 2006 AliRoot Build Integration and (Testing) System Peter Hristov Vagner Morais.
Signal and Background Modeling for H → 4l Peter Vankov UK Higgs Meeting, RAL
Fabiola Gianotti, 31/8/’99 PHYSICS and SOFTWARE ATLAS Software Week 31/8/’99 Fabiola Gianotti Software requirements of physics groups What should Detector.
Log-in to the Unix Server A successful log-in allows access to the server through the Secure Shell. SSH, also known as Secure Shocket Shell, is a Unix.
Statistical Methods for Data Analysis Introduction to the course Luca Lista INFN Napoli.
AliRoot survey P.Hristov 11/06/2013. Offline framework  AliRoot in development since 1998  Directly based on ROOT  Used since the detector TDR’s for.
Today’s Topics Playing Deterministic (no Dice, etc) Games –Mini-max –  -  pruning –ML and games? 1997: Computer Chess Player (IBM’s Deep Blue) Beat Human.
Introduction to RooFit W. Verkerke (NIKHEF) 1.Introduction and overview 2.Creation and basic use of models 3.Composing models 4.Working with (profile)
Wouter Verkerke, NIKHEF RooFit – status & plans Wouter Verkerke (NIKHEF)
Met and Normalization Sarah Eno. I wanted to see if we can learn anything about the MET normalization issue using a toy monte carlo. first, we need a.
Discussion Global Analysis of Polarized RHIC and DIS data: How do we get there? Global Analysis Workshop at BNL, Oct. 8, 2007.
1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.
8th December 2004Tim Adye1 Proposal for a general-purpose unfolding framework in ROOT Tim Adye Rutherford Appleton Laboratory BaBar Statistics Working.
Transition to SVN server: follow up P.Hristov 24/01/2008.
Principal Component Analysis
Wouter Verkerke, UCSB RooFitTools A general purpose tool kit for data modeling, developed in BaBar Wouter Verkerke (UC Santa Barbara) David Kirkby (Stanford.
1 G4UIRoot Isidro González ALICE ROOT /10/2002.
Conditional Observables Joe Tuggle BaBar ROOT/RooFit Workshop December 2007.
Database Issues Peter Chochula 7 th DCS Workshop, June 16, 2003.
Getting started – ROOT setup Start a ROOT 5.34/17 or higher session Load the roofit libraries If you see a message that RooFit v3.60 is loaded you are.
RooFit Tutorial – Topical Lectures June 2007
Introduction to RooFit
Why do F77 always spoil the landscape ?. Several ideas … slide 2.
Software - RooFit/RooStats W. Verkerke Wouter Verkerke, NIKHEF What is it Where is it used Experience, Lessons and Issues.
Experience and perspective in parallelization of data analysis Alfio Lazzaro Openlab/CERN SuperB Computing Ferrara March 9 th -12 th, 2010.
Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1.
RooFit A tool kit for data modeling in ROOT
Lesson 8: Basic Monte Carlo integration
Prepared by Kimberly Sayre and Jinbo Bi
RooFit A general purpose tool kit for data modeling

How to add value to EPM Reporting
Statistical Methods for Data Analysis Parameter estimates with RooFit
Java Analysis Studio - Status
Statistical Methods for Data Analysis Modeling PDF’s with RooFit
Statistical Methods for Data Analysis Random numbers with ROOT and RooFit Luca Lista INFN Napoli.
Lesson 9: Basic Monte Carlo integration
Presentation transcript:

RooFit – tools for ML fits Authors: Wouter Verkerke and David Kirkby

-there was always a gap between common tools for handling “binned” data samples (Paw, Mn_Fit, Root..) and tools for doing more complicated unbinned ML analysis for arbitrary number of variables, -few years ago David and Wouter wrote a prototype for BaBar(RooFitTools), and many groups jumped on it, used it, and gave feedback to authors(good and bad..), -based on all the feedback, authors decided to re-write the package from scratch, and call it RooFit -since then, all the most complicated CP-fitting analyses in BaBar were done with this package, which became essentially a standard (~40 variables, per-event-errors, realistic resolution functions,~20-50 signal/background categories ) Some history

-source code, help, tutorials are available from the open- source server: -code was compiled/tested on various flavors of Linux and Unix, -it took few minor changes in the Makefiles to install it on BNL “acas” Linux farm: ~vasia/RooFit-V works “out of the box” for many simple tasks, but can be easily customized for each specific analysis, in fact most of the analyses in BaBar have their own RooFit- based package which archived in CVS ( helps each time you need to go back and re-check stuff ), Documentation

RooRealVar x("x","x",0.0,0, 10.0); RooRealVar mean ("mean","mean", 6.5,0,10); RooRealVar sigma("sigma","sigma",0.4,0,10); RooRealVar nsig("nsig","nsig",90,-10,1000); RooGaussian g("g","Generation Pdf",x,mean,sigma); RooRealVar argpar ("argpar", “Argus shape paramerter",-1.0,-10,10); RooRealVar cutoff ("cutoff", “Argus cutoff",10.0,0,20); RooArgusBG a ("a", "Argus PDF", x, cutoff, argpar); RooRealVar nbkg("nbkg","nbkg",210,-10,1000); RooAddPdf fit ("fit", "g+a", RooArgList(g, a), RooArgList(nsig, nbkg)); RooDataSet *data = fit.generate(x,300); RooPlot* xframe = x.frame(); data->plotOn(xframe); fit->plotOn(xframe); fit->plotOn(xframe,Components(RooArgSet(ge,a))); xframe->Draw(); Example 1(code)

Example 1(results)

RooMCStudy toymc ("fit", fit, “x”, “evh” ); 1000,300 toymc.generateAndFit(1000,300); Example 2(toy studies) RooPlot* xfr = nsig.frame(20,160,25); // 4 - plots toymc.plotParamOn(xfr); RooPlot* exfr = toymc.plotError(nsig,0,30,25); RooPlot* pxfr = toymc.plotPull(nsig,-5,5,25); RooPlot* xll = toymc.plotNLL(-1200,-500,50); ~30 min on 1 “acas” box

1000 “toy MC experiments”

-please try RooFit for yourself ( notice, PDF normalization is taken away from the user and handled by the package.. ), -there are few hundreds slides of tutorials, and full class references, - a lot of very useful features(e.g. mix of data and Toy MC..) -a lot of “tricks” vent into optimization (e.g. for PDF normalization analytical solutions are used for simple functions, for more complicated integrals, numerical integration is done and look-up tables are generated, “lazy” recalculations, ) Conclusion