EMI is partially funded by the European Commission under Grant Agreement RI-261611 CREAM CE Tutorial Installation Configuration Troubleshooting Paolo Andreetto.

Slides:



Advertisements
Similar presentations
12th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATOR E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
Advertisements

Instalación y configuración de CE+WN Angelines Alberto CIEMAT Grid Tutorial, Sept
INFSO-RI Enabling Grids for E-sciencE Computing Element installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial.
1 Kolkata, Asia Joint CHAIN/EU-IndiaGrid2/EPIKH School for Grid Site Administrators, The EPIKH Project (Exchange Programme.
SEE-GRID-SCI Hands-On Session: Computing Element (CE) and site BDII Installation and Configuration Dusan Vudragovic Institute of Physics.
1 Worker Nodes Installation&Configuration Sara Bertocco INFN Padova 11 th International GridKa School 2013 – Big Data, Clouds and Grids.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) VOMS Installation and configuration Bouchra
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) CE+WN+siteBDII Installation and configuration Bouchra
E-science grid facility for Europe and Latin America Installation and configuration of a top BDII Gianni M. Ricciardi – Consorzio COMETA.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Workload Management System + Logging&Bookkeeping Installation.
E-science grid facility for Europe and Latin America LFC Server Installation and Configuration Antonio Calanducci INFN Catania.
E-science grid facility for Europe and Latin America gLite WMS Installation and configuration Riccardo Bruno – INFN.CT 30/06/2008 – 04/07/2008.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp - SWITCH EGI TF Prague.
EPIKH School for Grid Site Administrators, Amman, /32 Introductions BDII Installation and Configuration Miguel Angel Díaz Corchero
INFSO-RI Enabling Grids for E-sciencE WMS + LB Installation Emidio Giorgio Giuseppe La Rocca INFN EGEE Tutorial, Rome November.2005.
9th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
12th EELA Tutorial for Users and System Administrators E-infrastructure shared between Europe and Latin America User Interface installation.
4th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America BDII Server Installation and Configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Worker Node installation & configuration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
CREAM Installation&Configuration Sara Bertocco INFN Padova 11 th International GridKa School 2013 – Big Data, Clouds and Grids.
CE: compute element TP: CE & WN Compute Element Worker Node Installation configuration.
VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)
EMI INFSO-RI EMIR integration in BDII Maria Alandes Pradillo (CERN) Information System Product Team.
Site-BDII Installation&Configuration Sara Bertocco INFN Padova 11 th International GridKa School 2013 – Big Data, Clouds and Grids.
EGEE-II INFSO-RI Enabling Grids for E-sciencE YAIM Overview MiMOS Grid tutorial HungChe, ASGC OPS Team.
Third EELA Tutorial for Managers and Users E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
12th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin.
YAIM gLite configuration tool Sara Bertocco INFN Padova 11 th International GridKa School 2013 – Big Data, Clouds and Grids.
TP: Grid site installation BEINGRID site installation.
User Interface UI TP: UI User Interface installation & configuration.
Presentation of the results khiat abdelhamid
GLite WN Installation Giuseppe LA ROCCA INFN Catania ACGRID-II School 2-14 November 2009 Kuala Lumpur - Malaysia.
SEE-GRID-SCI CE and BDII Hands-on Session Miloš Ivanović Research and Development Center for Bioengineering, Kragujevac Serbia
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp (SWITCH) – Argus Product Team.
First South Africa Grid Training WORKER NODE Albert van Eck University of the Free State 25 July, 2008.
First South Africa Grid Training Installation and configuration of BDII Gianni M. Ricciardi Consorzio COMETA First South Africa Grid Training Catania,
Site BDII and CE Installation Muhammad Farhan Sjaugi, UPM 2009 November , UM Malaysia 1.
WORKER NODE Alfonso Pardo EPIKH School, System Admin Tutorial Beijing, 2010 August 30th – 2010 September 3th.
INFSO-RI Enabling Grids for E-sciencE Worker Node installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial Clermont-Ferrand,
EMI is partially funded by the European Commission under Grant Agreement RI DPM in EMI-II HTTP and NFS interfaces Oliver Keeble On behalf of DPM.
South African Grid Training WORKER NODE Albert van Eck UFS - ICTS 17 November 2009 Slides by GIUSEPPE PLATANIA.
User Interface (UI) Installation Bandung ITB Desember 2009.
16-26 June 2008, Catania (Italy) First South Africa Grid Training LFC Server Installation and Configuration Antonio Calanducci INFN Catania.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Worker Node & Torque Client Installation.
EMI INFSO-RI EMI 1 (Kebnekaise) Updates C. Aiftimiei (INFN) EMI Release Manager.
EGI-InSPIRE RI EGI Training for AEGIS Site Administrators EGI-InSPIRE N G I A E G I S EGI Training for AEGIS Site Administrators Institute.
YAIM Optimized Cristina Aiftimiei – Sergio Traldi
CREAM Installation&Configuration Sara Bertocco INFN Padova
Classic Storage Element
MyProxy Server Installation
Security aspects of the CREAM-CE
Andreas Unterkircher CERN Grid Deployment
Installation and configuration of a top BDII
WMS Installation and Configuration
(Exchange Programme to advance e-Infrastructure Know-How)
UI Installation and Configuration
Installation and configuration of a Computing Element
The Workload Management System
gLite User Interface Installation
EMI 1 (Kebnekaise) Updates
Berkley Database Information Index (BDII) Server Installation & Configuration Giuseppe La Rocca INFN – Catania gLite Tutorial Rome, April 2006.
EMI Componets Installation And Configuration
EMI: dal Produttore al Consumatore
Grid Management Challenge - M. Jouvin
gLite User Interface Installation and configuration
WMS+LB Server Installation and Configuration
Presentation transcript:

EMI is partially funded by the European Commission under Grant Agreement RI CREAM CE Tutorial Installation Configuration Troubleshooting Paolo Andreetto INFN Padova Prague, Sept 19 th 2012

● EMI INFSO-RI Goal of the tutorial ● Installation of a EMI-2 CREAM CE with Torque and a worker node ● 2 AMD Opteron (x86_64) machines with SL5 ● Installed components on the CE ● CREAM service (tomcat container) ● BLAH services (Bupdater, BNotifier) ● Resource BDII ● Torque services (main service, MAUI, munge) ● Other services (Mysql, LB locallogger, Globus GridFTP) ● Installed components on the WN ● Torque services (mom service, munge)

● EMI INFSO-RI Basic concepts ● Software repositories (yum, apt) ● Metapackages ● YAIM configuration

● EMI INFSO-RI Basic OS repositories ● Disable DAG repository if available on the system ● set enabled=0 in /etc/yum.repos.d/dag.repo ● Install EPEL repositories: ● rpm -Uvh noarch.rpm

● EMI INFSO-RI Certification Authorities ● Install EGI trustanchor repository ● wget trustanchors.repo -O /etc/yum.repos.d/EGI-trustanchors.repo ● Install the CA metapackage ● yum install ca-policy-egi-core ● Install CRL fetch daemon (optional) ● yum install fetch-crl ● service fetch-crl-cron start ● chkconfig fetch-crl-cron on ● Reference ●

● EMI INFSO-RI Components installation ● Install EMI-2 repositories: ● rpm --import ● rpm -Uvh ● /emi-release sl5.noarch.rpm ● Install CREAM metapackage ● yum install emi-cream-ce xml-commons-apis ● Install Torque metapackages ● On the CREAM node ● yum install emi-torque-server emi-torque-utils ● On the worker node ● yum install emi-wn emi-torque-client emi-torque-utils

● EMI INFSO-RI Batch system specific setup For Torque: munge configuration ● Create the key on the CREAM node ● /usr/sbin/create-munge-key ● Copy the key from the CE into the Wns ● Check the permissions for the key ● -r munge munge 1024 Jan 03 09:57 munge.key ● Startup the services on the CE and on the WN ● service munge start ● chkconfig munge on

● EMI INFSO-RI Security specific setup ● Host certificate and private key ● /etc/grid-security/hostcert.pem ● Check the permissions ● chmod 600 /etc/grid-security/hostcert.pem ● chmod 400 /etc/grid-security/hostkey.pem

● EMI INFSO-RI YAIM (1) ● YAIM configuration files ● site-info.def ● groups.conf ● users.conf ● wn-list.conf ● vo.d/* ● services/* ● Same files for CREAM and WN configuration ● Examples in /opt/glite/yaim/examples ( ● Deploy files in a local workarea (/root/yaim-config)

● EMI INFSO-RI YAIM (2) ● site-info.def ● is the main configuration file needed to execute the yaim command. It contains the list of variables common to multiple node types in form of key-value pairs. ● Comments and information available at: ● ● services/glite-creamce ● contains cream-ce node type specific variables

● EMI INFSO-RI YAIM (3) ● users.conf ● defines the pool account user mapping, i.e. the UNIX users to be created on the service nodes that need them (mainly CE and WNs) ● File format (fields must not have any white space): ● UID:LOGIN:GID1[,GID2,...]:GROUP1[,GROUP2,...]:VO:FLAG: ● groups.conf ● defines the VOMS group mapping, i.e. the user categories that must be accepted by the grid services provided by a site. It indicates for each category to which kind of local accounts the user should be mapped, where applicable. ● File format (fields must not have any white space): ● "VOMS_FQAN":GROUP:GID:FLAG:[VO]

● EMI INFSO-RI YAIM (4) ● wn-list.conf ● defines the list of WN hostnames (FQDN) in the site. ● Hostnames have to be Fully Qualified Domain Names (FQDN) ● vo.d ● contains a file name per VO whose name has to be the lower-cased version of the VO name. The matching file should contain the definitions for that VO. In case the VO is also defined in site-info.def, the vo.d file will overwrite the variables which are defined there.

● EMI INFSO-RI Component configuration ● On the CE node ● /opt/glite/yaim/bin/yaim -c -s yaim-config/site-info.def -n creamCE -n TORQUE_server -n TORQUE_utils ● On the WN ● /opt/glite/yaim/bin/yaim -c -s yaim-config/site-info.def -n WN -n TORQUE_utils -n TORQUE_client

● EMI INFSO-RI Troubleshooting - services Check the status of the services on the CREAM node ● (WS container) service tomcat5 status ● (BLAH parser) service glite-ce-blah-parser status ● (Resource BDII) service bdii status ● (DB) service mysqld status ● (Local logger) service glite-lb-locallogger status ● (Globus gridftp) service globus-gridftp status ● (Torque server) service pbs_server status ● (Maui scheduler) service maui status ● (Munge daemon) service munge status

● EMI INFSO-RI Troubleshooting – config. Files (1) ● CREAM service: /etc/glite-ce-cream/cream-config.xml ● BLAH: /etc/blah.config ● BLAH (old parser): /etc/blparser.conf ● Glexec+gJAF: ● /etc/glexec.conf ● /etc/lcas/lcas-glexec.db ● /etc/lcmaps/lcmaps-glexec.db

● EMI INFSO-RI Troubleshooting – config. Files (2) ● GridFTP + Glexec ● /etc/lcas/lcas.db ● /etc/lcmaps/lcmaps.db ● GridFTP + Argus: ● /etc/grid-security/gsi-pep-callout.conf

● EMI INFSO-RI Troubleshooting – Log files (1) ● Tomcat: /var/log/tomcat5/catalina.out ● Trustmanager: /var/log/tomcat5/trustmanager.log ● BLAH (new parser): ● /var/log/cream/glite-ce-bnotifier.log ● /var/log/cream/glite-ce-bupdater.log ● BLAH (old parser): ● /var/log/cream/glite- parser.log

● EMI INFSO-RI Troubleshooting – Log files (2) ● GridFTP: ● /var/log/globus-gridftp.log ● /var/log/gridftp-session.log ● Glexec (default solution): /var/log/messages ● CREAM: /var/log/cream/glite-ce-cream.log ● Log config. file: /etc/glite-ce-cream/log4j.properties ● ● Example: changing verbosity level ● replace log4j.logger.org.glite=info, fileout ● with log4j.logger.org.glite=debug, fileout

● EMI INFSO-RI Troubleshooting – auto updates STRONG RECOMMENDATION: Disable any automatic update procedure Several update requires YAIM to be run Automatic update can lead to unstable configuration of the services

● EMI INFSO-RI Troubleshooting – simple tests (1) ● Check nodes status with /usr/bin/pbsnodes ● Try to submit a simple job with qsub as a pool account user on the CE ● Delegate a proxy: ● glite-ce-delegate-proxy -e vm04.ct.infn.it myid

● EMI INFSO-RI Troubleshooting – simple tests (2) ● Submit a simple job: ● glite-ce-job-submit -D myid -r vm04.ct.infn.it:8443/cream-pbs-creamcert1 testsb-vm04.jdl ● where the jdl contains: [executable="/bin/cp";arguments="grid-mapfile out-`date +%s%N`-`whoami`.txt"; inputsandbox={"gsiftp://vm04.ct.infn.it/etc/grid-security/grid-mapfile"}; outputsandbox={ "out*.txt"}; outputsandboxbasedesturi="gsiftp://vm04.ct.infn.it/tmp"; epilogue="/bin/sleep"; epiloguearguments="30";]

● EMI INFSO-RI Troubleshooting – networking ● Check the firewall configuration ● Ports required: ● Check the connection between CE and WN ● try a simple /usr/bin/ssh from WN and CE as a pool account user

● EMI INFSO-RI References ● CREAM site ● ● Main reference guide ●