ACAT05 May , 2005 DESY, Zeuthen, Germany Search for the Higgs boson at LHC by using Genetic Algorithms Mostafa MJAHED Ecole Royale de l’Air, Mathematics and Systems Dept. Marrakech, Morocco
Introduction Introduction Genetic Algorithms Genetic Algorithms Search for the Higgs boson at LHC by using Genetic Algorithms Search for the Higgs boson at LHC by using Genetic Algorithms Optimization of discriminant functions Optimization of discriminant functions Optimization of Neural weights Optimization of Neural weights Hyperplans search Hyperplans search Hypersurfaces search Hypersurfaces search Conclusion Conclusion Search for the Higgs boson at LHC by using Genetic Algorithms M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/2005 2
Introduction Introduction Introduction Introduction
Introduction Higgs production at LHC Several mechanisms contribute to the production of SM Higgs boson in proton collisions The dominant mechanism is the gluon fusion process, gg H
Introduction Decay Modes decay into quarks: H bb and H cc leptonic decay: H + - gluonic decay: H g g decay into virtual W boson pair: H W + W -
Introduction Main discovery modes M H < 2M Z : H bb+X H W + W - l l H ZZ 4l H
The decay channel chosen: H W + W - e + -, + e -, e + e -, + -. Signature: Two charged oppositely leptons with large transverse momentum P T Two energetic jets in the forward detectors. Large missing transverse momentum P’ T Main background: tt production: with tt WbWb l j l j QCD W + W - +jets production Electroweak WWjj H W + W - ll (1) M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/2005 7
ll, ll : the pseudo-rapidity and the azimuthal angle differences between the two leptons jj, jj : the pseudo-rapidity and the azimuthal angle differences between the two jets M ll, M jj : the invariant mass of the two leptons and jets, M nm (n,m = 1,2,3) some rapidity weighted transverse momentum n, m = 1, 2,3, … i rapidity of the leptons or jets, p iT their transverse momentums. H W + W - ll (2) Main Variables M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/2005 8
Genetic Algorithms Genetic Algorithms Genetic Algorithms Genetic Algorithms
Pattern Recognition Measurement Feature Decision Feature Extraction/ Feature Selection Classification M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
Pattern Recognition Methods Pattern Recognition Methods Fuzzy Logic Neural Networks Neural Networks Genetic Algorithms Genetic Algorithms Pattern Recognition Methods Statistical Methods Statistical Methods Connectionist Methods Connectionist Methods Others Methods Others Methods Wavelets PCA Decision Trees Discriminant Analysis Discriminant Analysis Clustering
Genetic Algorithms Based on Darwin’s theory of ”survival of the fittest”: Living organisms reproduce, individuals evolve/ mutate, individuals survive or die based on fitness The input is an initial set of possible solutions The output of a genetic algorithm is the set of ”fittest solutions” that will survive in a particular environment The process Produce the next generation (by a cross-over function) Evolve solutions (by a mutation function) Discard weak solutions (based on a fitness function) M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
Genetic Algorithms Preparation: Define an encoding to represent solutions (i. e., use a character sequence to represent a classe) Create possible initial solutions (and encode them as strings) Perform the 3 genetic functions: Crossover, Mutate, Fitness Test Why Genetic Algorithms (GAs) ? Many real life problems cannot be solved in polynomial amount of time using deterministic algorithm Sometimes near optimal solutions that can be generated quickly are more desirable than optimal solutions which require huge amount of time Problems can be modeled as an optimization one M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
Crossover Mutation Spin Roulette Wheel Selection Genetic Functions
GA Process Initialize Population Terminate? Yes Output solution Output solution No Evaluate Fitness Perform selection, crossover and mutation Perform selection, crossover and mutation Evaluate Fitness M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
O O ptimization of discriminant functions O Optimization of Neural weights H Hyperplans search H Hypersurfaces search GAs for Pattern Classification GAs for Pattern Classification Efficiency and Purity of Classification Validation Test Events Classification C1C1 C2C2 C 1 : N 1 N 11 N 12 C 2 : N 2 N 21 N 22 TotalM1M1 M2M2 Efficiency for C i classification Purity for C i events Misclassification rate for C i or Error
Search for the Higgs boson at LHC by using Genetic Algorithms
pp HX W + W - X l + l - X Signal pp ttX WbWbX l j l jX pp qqX W + W - X Search for the Higgs boson at LHC by using Genetic Algorithms Events generated by the LUND MC PYTHIA 6.1 at s = 14 TeV M H = GeV/c 2 Higgs events and Background events are used Background Research of discriminating variables M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
Variables ll, ll : the pseudo-rapidity and the azimuthal angle differences between the two leptons jj, jj : the pseudo-rapidity and the azimuthal angle differences between the two jets M ll : the invariant mass of the two leptons Rapidity-impulsion weighted Moments M nm : (n=1, …,6) i rapidity: M jj : the invariant mass of the two jets ll, ll, jj, jj, M ll, M jj, M 11, M 21, M 31, M 41
C C Higgs C Back The most separating discriminant Function F Higgs / Back between the classes C Higgs and C Back is : F Higgs / Back = ll +0.4 jj M ll M jj M M 21 The classification of a test event x 0 is then obtained according to the condition: if F Higgs / Back (x o ) 0 then x o C Higgs else x o C Back Optimization of Discriminant Functions (1) Classification of test events Efficiency C Back C Higgs Test events Purity Classification F(x ) = (g signal - g back ) T V -1 x = i x i Discriminant Analysis
Optimization of Discriminant Functions ( 2) 5i5i 4i4i 2i2i 1i1i 0i0i 6i6i 3i3i II Discriminant Function GA Parameters Number of generations Selection, Crossover, Mutation Fitness GA Code Matlab GA Toolbox Fitness Fn: Misclassification rate M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/ Test Events Classification Eff.PurityEff C Higgs C Back
Optimization of Discriminant Functions (3) Chromosome 1 ……………… Chromosome N Generation of N solutions Fitness Fn: Initialize Population Terminate? Yes Output solution Output solution No Evaluate Fitness Perform selection, crossover and mutation Perform selection, crossover and mutation Evaluate Fitness Genetic Process 5151 4141 2121 1111 0101 6161 3131 II 5N5N 4N4N 2N2N 1N1N 0N0N 6N6N 3N3N NN Coefficients of F
Optimization of Discriminant Functions (4) Optimization Results Number of generations =10000 CPU Time: 120 s Optimal Disc. Fn Purity Eff. Classification 0.65 Eff C Back 0.35 C Higgs Test Events M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
Optimization of Neural Weights (1) Classification of test events Neural Analysis NN Architecture: (10, 10, 10, 1) if O 1 (x) 0.5 then x C Higgs else x C Back Test Evts Classification Eff.Pur.Eff C Higgs C Back ll ll jj jj M ll M 0 M 11 M 21 M 31 M 41 O1O1 h1h1 h2h2
Optimization of Neural Weights (2) Connection Weights + Thresholds GA Parameters Total number of parameters to be optimized : 230 Fitness: Misclassification rate, W ij xh1, i h1 II W ij h1h2, i h2 W ij h2o II 10 Optimization Results Number of generations =1000 CPU Time: 6 mn Test Evts Classification Eff.Pur.Eff C Higgs C Back
Hyperplan search (1) Hyperplan H j Genetic Process 0 j 1 j 2 j 3 j 4 j 5 j 6 j 7 j 8 j 9 j 10 j Generation of N hyperplans H j, j=1: N=20 11 parameters to optimize Number of generations =10000 CPU Time: 4 mn H(x) = 0 + i x i = 0, i=1, 10 Classification rule if H(x) 0 then x C Higgs else x C Back Initialize Population Terminate? Yes Output solution Output solution No Evaluate Fitness Perform selection, crossover and mutation Perform selection, crossover and mutation Evaluate Fitness
Hyperplan search (2) Hyperplan search Results Test Evts Classification Eff.Pur.Eff C Higgs C Back Classification of test events Same results than Discriminant functions optimization Pur Eff. Classification 0.65 Eff C Back 0.35 C Higgs Test Events
Hypersurface search (1) Hypersurface j Initialize Population Terminate? Yes Output solution No Evaluate Fitness Perform selection, crossover and mutation Evaluate Fitness Genetic Process Generation of N hyperplans S j, j=1: N=20 31 parameters to optimize Number of generations =10000 CPU Time: 6 mn Classification rule if S(x) 0 then x C Higgs else x C Back I j i=0:10 j j i j i=1:10 i j i=1:10
Hypersurface search Results Test Evts Classification Eff.Pur.Eff C Higgs C Back Classification of test events Same results as NN weights optimization Hypersurface search (2) M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/ Pur Eff. Classification Eff C Back C Higgs Test Evts
Conclusion Methods Importance of Pattern Recognition Methods The improvement of an any identification is subjected to the multiplication of multidimensional effect offered by PR methods and the discriminating power of the proposed variables. Genetic Algorithms Method allows to minimize the classification error and to improve efficiencies and purities of classifications. The performances are in average 3 to 5 % higher than those obtained with the other methods. Discriminant Functions Optimization : comparative to hyperplan search approach Neural Weights Optimization : comparative to hypersurface search approach C C C Higgs C Back M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/
VV ariables Conclusion (continued) C Characterisation of Higgs Boson events: Other variables should be examined PP hysics Processes Other processes should be considered Detector effects should be added to the simulated events M. Mjahed ACAT 05, DESY, Zeuthen, 26/05/