Continuous simulation of Beyond-Standard-Model processes with multiple parameters Jiahang Zhong (University of Oxford * ) Shih-Chang Lee (Academia Sinica)

Presentation transcript:

Continuous simulation of Beyond-Standard-Model processes with multiple parameters
Jiahang Zhong (University of Oxford *), Shih-Chang Lee (Academia Sinica)
ACAT 2011, 5-9 September, London
* Was in Academia Sinica and Nanjing University

Motivation
- Many Beyond-Standard-Model (BSM) processes are defined by more than one free parameter:
  - Masses of hypothetical particles
  - Coupling constants
  - …
- Grid scan:
  - Scan the parameter space with grid points
  - Simulate a sample of events at each point
[Figure: grid points in the (Var1, Var2) plane]

Motivation: the difficulties of the grid-scan approach
- Curse of dimensionality
  - N_points ~ N^d (d: number of free parameters)
  - Hard to go beyond 2D
  - Costly for finer granularity
[Figure: grid points in the (Var1, Var2) plane]

Motivation: the difficulties of the grid-scan approach (continued)
- Large statistics required
  - Samples at different points are treated independently
  - Considerable statistics needed within each sample
[Figure: each grid point in the (Var1, Var2) plane holds ~10k events, split into pass and fail]

Motivation: the difficulties of the grid-scan approach (continued)
- Discreteness
  - Considerable space between points
  - Smoothing/interpolation needed
  - Consequent systematic uncertainties
[Figure: grid in the (Var1, Var2) plane, annotated with scales of ~TeV and ~100 GeV]

Motivation
- Grid scan:
  - Curse of dimensionality
  - Large statistics needed
  - Discreteness
- The aim of Continuous MC:
  - Suited to multivariate parameter spaces
  - Fewer events to be simulated
  - Continuous estimation of the signal yield over the parameter space

Motivation
- Multivariate BSM simulation is used to estimate signal yields over the parameter space.
- Yield: N(x) = L · σ(x) · ε(x)
  - L: luminosity; independent of x (the free parameters)
  - σ: cross section and branching ratio; easy to calculate with event generators
  - ε: detector acceptance and offline efficiency; requires a large amount of expensive detector simulation
- Therefore, our method focuses on easing the estimation of ε.
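
As a small illustration of the yield formula above, the sketch below evaluates N(x) at a single parameter point and propagates only the efficiency uncertainty. The function names, the units (fb^-1 and fb) and all numerical values are hypothetical and are not taken from the slides.

```python
def signal_yield(luminosity, cross_section, efficiency):
    """N(x) = L * sigma(x) * eps(x); luminosity in fb^-1, cross_section in fb."""
    return luminosity * cross_section * efficiency


def signal_yield_uncertainty(luminosity, cross_section, efficiency_uncertainty):
    """Propagate only the efficiency uncertainty: sigma_N = L * sigma(x) * sigma_eps."""
    return luminosity * cross_section * efficiency_uncertainty


# Hypothetical numbers for one parameter point x (not from the slides).
n = signal_yield(luminosity=1.0, cross_section=50.0, efficiency=0.4)
dn = signal_yield_uncertainty(1.0, 50.0, 0.02)
print(f"N = {n:.1f} +/- {dn:.1f} events")
```

Because L and σ(x) enter as plain factors, any relative uncertainty on ε carries over directly to the yield, which is why the efficiency is the quantity worth fitting carefully.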

The procedure: event generation

                 Grid scan     Continuous MC
Space points     O(10^d)       O(100k)
Events/point     O(10k)        O(1)

[Figure: sampled points in the (Var1, Var2) plane for the two approaches]
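
The orders of magnitude quoted on this slide can be turned into a rough event budget. The sketch below assumes exactly 10 grid points per axis, 10k events per grid point, and 100k single-event points for Continuous MC; these round numbers stand in for the O(...) estimates, and the function names are made up for the illustration.

```python
def grid_scan_events(points_per_axis, n_dims, events_per_point):
    """Total events for a regular grid: points_per_axis**n_dims points, each with a full sample."""
    return points_per_axis ** n_dims * events_per_point


def continuous_mc_events(n_points, events_per_point=1):
    """Total events when every event gets its own random parameter point."""
    return n_points * events_per_point


for d in (1, 2, 3, 4):
    grid = grid_scan_events(points_per_axis=10, n_dims=d, events_per_point=10_000)
    cont = continuous_mc_events(n_points=100_000)
    print(f"d={d}: grid scan {grid:,} events, continuous MC {cont:,} events")
```

The grid-scan total grows by a factor of 10 with each added parameter. Whether a fixed 100k-event budget stays adequate for larger d is not something this arithmetic can show.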

The procedure
- A Bayesian Neural Network (BNN) is used to fit the efficiency ε
- Desirable features of NN fitting:
  - Non-parametric modeling
  - Smooth over the parameter space
  - Unbinned fitting
  - Suffers less from dimensionality
  - Handles correlations between the variables
[Figure: unbinned fitting vs. binned histogram]

The procedure
- Bayesian implementations of NN further provide:
  - Automatic complexity control of the NN topology during training
  - Probabilistic output
  - Uncertainty estimation of the output
- The uncertainty of the output is estimated from the p.d.f. of the NN parameters. It reflects:
  - Statistical fluctuation of the training sample
  - Choice of NN topology
  - Impact of the goodness of fit at a given space point x

Demo
- Production of a right-handed W boson and a Majorana neutrino
- Di-lepton final state
  - 2 leptons (e, μ)
  - p_T > 20 GeV, |η| < 2.5
  - cone20/p_T < 0.1
- Two free parameters
  - W_R mass: [500 GeV, 1500 GeV]
  - N_R mass: [0, M(W_R)]
  - Both affect the cross section and the efficiency

Demo
- Continuous simulation
  - Generate 100k events, each with random {M(W_R), M(N_R)}
  - Put each event through the selection criteria, and assign target value 1 if it passes and 0 if it fails
  - Feed all events to a BNN, with {M(W_R), M(N_R)} as the input variables
  - Use the trained BNN as a function that provides ε ± σ_ε
- Reference grid scan
  - A grid with 100 GeV steps in M(W_R) and 50 GeV steps in M(N_R) (171 samples in total)
  - Sufficient statistics in each sample to achieve precise reference values
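
The continuous-simulation steps above can be sketched as a toy example using only the Python standard library. Several things here are assumptions made for illustration and are not the authors' setup: the "true" efficiency is an invented smooth function standing in for event generation, detector simulation and selection; a plain logistic regression replaces the BNN, so no uncertainty σ_ε is produced; and only 3000 events are used to keep the pure-Python fit fast.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def features(m_w, m_n):
    """Scale the two mass parameters to order-one inputs, plus a constant term."""
    return (1.0, (m_w - 1000.0) / 500.0, m_n / m_w - 0.5)


def toy_true_efficiency(m_w, m_n):
    """Hypothetical smooth efficiency, standing in for generator + detector + selection."""
    _, u, v = features(m_w, m_n)
    return sigmoid(0.5 + 1.0 * u - 1.5 * v)


def generate_events(n_events, rng):
    """One event per random parameter point, labelled 1 (pass) or 0 (fail)."""
    events = []
    for _ in range(n_events):
        m_w = rng.uniform(500.0, 1500.0)   # M(W_R) range from the slide
        m_n = rng.uniform(0.0, m_w)        # M(N_R) in [0, M(W_R)]
        passed = 1 if rng.random() < toy_true_efficiency(m_w, m_n) else 0
        events.append((features(m_w, m_n), passed))
    return events


def fit_efficiency(events, n_steps=200, learning_rate=2.0):
    """Unbinned maximum-likelihood fit: gradient descent on the mean Bernoulli NLL."""
    weights = [0.0, 0.0, 0.0]
    for _ in range(n_steps):
        grad = [0.0, 0.0, 0.0]
        for x, t in events:
            y = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
            for k in range(3):
                grad[k] += (y - t) * x[k]
        weights = [w - learning_rate * g / len(events) for w, g in zip(weights, grad)]
    return weights


def fitted_efficiency(weights, m_w, m_n):
    """Continuous efficiency estimate at any parameter point."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, features(m_w, m_n))))


rng = random.Random(0)
events = generate_events(3000, rng)
weights = fit_efficiency(events)
print("fitted:", fitted_efficiency(weights, 1000.0, 500.0))
print("toy truth:", toy_true_efficiency(1000.0, 500.0))
```

The point of the sketch is the data flow: no parameter point ever receives more than one event, yet the fit pools all events to return an efficiency at any {M(W_R), M(N_R)}. The toy truth lies exactly within the fitted model family, which a real efficiency surface would not, so the agreement seen here says nothing about the performance of the actual method.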

Demo
[Figures: the BNN-fitted efficiency; the reference efficiency from the grid scan]

Demo
[Figure: the difference between fitted values and reference values]

Demo
[Figure: uncertainty estimated by the BNN]

Demo
[Figure: the real deviations vs. the estimated uncertainties (N_σ)]
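
One plausible reading of N_σ is the deviation at each reference point divided by the estimated uncertainty there. The sketch below computes that ratio under this assumption; it neglects the statistical uncertainty of the reference values, and the input numbers are invented rather than taken from the plots.

```python
import math


def n_sigma(fitted, sigma, reference):
    """Per-point deviation in units of the estimated uncertainty: (fit - ref) / sigma."""
    return [(f - r) / s for f, s, r in zip(fitted, sigma, reference)]


def mean_and_rms(values):
    """Mean and root-mean-square of a list of numbers."""
    mean = sum(values) / len(values)
    rms = math.sqrt(sum(v * v for v in values) / len(values))
    return mean, rms


# Invented example values at four grid points (not from the slides).
fitted = [0.42, 0.55, 0.61, 0.30]
sigma = [0.02, 0.03, 0.02, 0.04]
reference = [0.40, 0.58, 0.60, 0.33]

pulls = n_sigma(fitted, sigma, reference)
print(pulls, mean_and_rms(pulls))
```

If the estimated uncertainties describe the real deviations well, these values should scatter around zero with an RMS close to one.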

Summary
- New approach to simulating multivariate BSM processes
  - More space points, fewer events
  - BNN fitting used to obtain a smooth yield estimate
- Performance tested by:
  - The deviation between BNN and reference values
  - This deviation vs. the BNN uncertainty
- Limitation: the assumption of a smooth distribution
  - Not sensitive to local abrupt changes
  - Reduced performance across physics boundaries

The end. Thank you!

Backup: links
- More detailed documentation of this method
- The Bayesian Neural Network in TMVA/ROOT

Backup: how does BNN fitting work?
- A black-box discriminator
- A white-box non-parametric fitting tool
  - A multivariate function y(x)
  - Generic function approximator (analogous to a polynomial in 1D)
- Training as unbinned maximum-likelihood (MLE) fitting
  - y: NN output, a probability in [0, 1]
  - t: target value, 1 = pass, 0 = fail

Backup: Bayesian implementation of NN (I)
- Probability fitting
- Unbinned fitting
  - Full usage of every event
  - Extrapolation/interpolation
- Fit y as a probability function
  - Bernoulli likelihood
[Figure: histogram vs. BNN]
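
The Bernoulli likelihood named on this slide can be written out for a set of events i with inputs x_i and pass/fail targets t_i. This is the standard textbook form, given here as a reading aid; the slides do not show the formula, and the explicit dependence on the network weights w is notation added for this sketch.

```latex
\mathcal{L}(w) = \prod_{i} y(x_i; w)^{\,t_i} \,\bigl[1 - y(x_i; w)\bigr]^{\,1 - t_i},
\qquad
-\ln \mathcal{L}(w) = -\sum_{i} \Bigl[ t_i \ln y(x_i; w) + (1 - t_i) \ln\bigl(1 - y(x_i; w)\bigr) \Bigr]
```

Every event contributes one factor, which is what makes the fit unbinned: no event is merged into a histogram bin before the likelihood is evaluated.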

Backup: Bayesian implementation of NN (II)
- Uncertainty estimation
- Training
  - Most probable value w_MP
  - P(w|D): probability of other w
- Prediction
  - Probability
  - Uncertainty of y
- Avoids excessive extrapolation (non-trivial for multivariate analysis)
[Figure: histogram vs. BNN]
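
The quantities on this slide can be related through the generic Bayesian expressions below, where D is the training sample and w the NN weights. These are general definitions rather than the specific approximations used in the authors' implementation, which the slides do not spell out; the symbols \bar{y} and \sigma_y are introduced here for illustration.

```latex
P(w \mid D) = \frac{P(D \mid w)\, P(w)}{P(D)},
\qquad
w_{\mathrm{MP}} = \arg\max_{w} P(w \mid D)

\bar{y}(x) = \int y(x; w)\, P(w \mid D)\, \mathrm{d}w,
\qquad
\sigma_y^2(x) = \int \bigl[ y(x; w) - \bar{y}(x) \bigr]^2 P(w \mid D)\, \mathrm{d}w
```

Where the training sample constrains the weights poorly, as in sparsely populated regions of the parameter space, the spread of y over P(w|D) becomes large and shows up as a larger σ_y.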

Backup: Bayesian implementation of NN (III)
- Regulator
- Overtraining is possible due to excessive complexity of the NN
- Early stop
  - Use half of the input sample as a monitor
  - Manual decision on when to stop excessive fitting
- Regulator
  - Prior knowledge that a "simpler" model is preferred
  - Adaptive during training
  - Saves the monitor sample: no events need to be set aside
[Figure: early stop vs. regulator]
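
A common way to encode the preference for a "simpler" model is a Gaussian prior on the weights, which adds a weight-decay term to the quantity minimized in training. The form below is this common choice, written under the assumption that the regulator works this way; the slides do not give its exact form, and the symbols M(w) and α are introduced here for illustration.

```latex
P(w) \propto \exp\Bigl( -\frac{\alpha}{2} \sum_{j} w_j^2 \Bigr),
\qquad
M(w) = -\ln \mathcal{L}(w) + \frac{\alpha}{2} \sum_{j} w_j^2
```

Here \mathcal{L}(w) is the likelihood of the training targets and α sets the strength of the penalty. Adjusting α during training is what would make such a regulator adaptive, without the need for a separate monitor sample.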