INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org Computing Element installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial.

Slides:



Advertisements
Similar presentations
12th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATOR E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
Advertisements

Instalación y configuración de CE+WN Angelines Alberto CIEMAT Grid Tutorial, Sept
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America DPM Server Installation Luciano Diaz ICN-UNAM.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America DPM Server Installation Claudio Cherubino INFN – Catania.
1 Kolkata, Asia Joint CHAIN/EU-IndiaGrid2/EPIKH School for Grid Site Administrators, The EPIKH Project (Exchange Programme.
Ninth EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America User Interface installation and configuration.
Ninth EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) CE+WN+siteBDII Installation and configuration Bouchra
E-science grid facility for Europe and Latin America Installation and configuration of a top BDII Gianni M. Ricciardi – Consorzio COMETA.
DPM Server Installation Claudio Cherubino INFN - Catania.
IST E-infrastructure shared between Europe and Latin America VOMS and MyProxy Server installation and configuration Pedro Henrique.
4th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America BDII Server Installation Vanessa.
1 Introduction to the tutorial for site managers Antonio Fuentes Red.es/RedIRIS EGEE/EUMedGrid/EELA Tutorial for Managers Sevilla,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Workload Management System + Logging&Bookkeeping Installation.
E-science grid facility for Europe and Latin America LFC Server Installation and Configuration Antonio Calanducci INFN Catania.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America MyProxy server installation Emidio Giorgio.
INFSO-RI Enabling Grids for E-sciencE Installation and configuration of gLite Resource Broker Emidio Giorgio INFN EGEE-EMBRACE tutorial,
INFSO-RI Enabling Grids for E-sciencE WMS + LB Installation Emidio Giorgio Giuseppe La Rocca INFN EGEE Tutorial, Rome November.2005.
9th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America BDII Server Installation and Configuration Antonio Juan.
INFSO-RI Enabling Grids for E-sciencE BDII installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial Clermont-Ferrand,
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
BDII Server Installation and Configuration Manuel Rubio del Solar Extremadura Advanced Research Center (CETA-CIEMAT) 11th EELA Tutorial for Users Sevilla,
4th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks User Interface: installation, configuration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Giuseppe La Rocca INFN – Catania
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America BDII Server Installation and Configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America User Interface (gLite 1.4) Installation.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Worker Node installation & configuration.
E-infrastructure shared between Europe and Latin America Introduction to the tutorial for site managers Vanessa Hamar Universidad de Los.
INFSO-RI Enabling Grids for E-sciencE Installing a gLite VOMS Server Giuseppe La Rocca INFN EGEE Tutorial Rome November 2005.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Introduction to the tutorial for site managers.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Enabling Grids for E-sciencE Workload Management System on gLite middleware - commands Matthieu Reichstadt CNRS/IN2P3 ACGRID School, Hanoi.
INFSO-RI Enabling Grids for E-sciencE GILDA and GENIUS Guy Warner NeSC Training Team An induction to EGEE for GOSC and the NGS NeSC,
CE: compute element TP: CE & WN Compute Element Worker Node Installation configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America WMS+LB Server Installation Tony Calanducci.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America DPM Server Installation Claudio Cherubino.
Ninth EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America BDII Server Installation Yubiryn Ramírez.
EGEE-II INFSO-RI Enabling Grids for E-sciencE YAIM Overview MiMOS Grid tutorial HungChe, ASGC OPS Team.
Third EELA Tutorial for Managers and Users E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
12th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin.
South African Grid Training COMPUTING ELEMENT Albert van Eck UFS - ICTS 18 November 2009 Slides by: GIUSEPPE PLATANIA.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America WMS+LB Server Installation Eduardo Murrieta.
E-science grid facility for Europe and Latin America COMPUTING ELEMENT GIUSEPPE PLATANIA INFN Catania 30 June - 4 July, 2008.
User Interface UI TP: UI User Interface installation & configuration.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America BDII Server Installation Claudio Cherubino.
12th EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
GLite WN Installation Giuseppe LA ROCCA INFN Catania ACGRID-II School 2-14 November 2009 Kuala Lumpur - Malaysia.
First South Africa Grid Training WORKER NODE Albert van Eck University of the Free State 25 July, 2008.
First South Africa Grid Training Installation and configuration of BDII Gianni M. Ricciardi Consorzio COMETA First South Africa Grid Training Catania,
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Giuseppe La Rocca INFN – Catania
Site BDII and CE Installation Muhammad Farhan Sjaugi, UPM 2009 November , UM Malaysia 1.
WORKER NODE Alfonso Pardo EPIKH School, System Admin Tutorial Beijing, 2010 August 30th – 2010 September 3th.
INFSO-RI Enabling Grids for E-sciencE Worker Node installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial Clermont-Ferrand,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
South African Grid Training WORKER NODE Albert van Eck UFS - ICTS 17 November 2009 Slides by GIUSEPPE PLATANIA.
16-26 June 2008, Catania (Italy) First South Africa Grid Training LFC Server Installation and Configuration Antonio Calanducci INFN Catania.
YAIM Giuseppe Platania INFN Catania EMBRACE Tutorial
LFC Server Installation & Configuration
Installation and configuration of a top BDII
(Exchange Programme to advance e-Infrastructure Know-How)
Installation and configuration of a Computing Element
Berkley Database Information Index (BDII) Server Installation & Configuration Giuseppe La Rocca INFN – Catania gLite Tutorial Rome, April 2006.
BDII Server Installation and Configuration
WMS+LB Server Installation and Configuration
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE Computing Element installation & configuration Giuseppe Platania INFN Catania EMBRACE Tutorial Clermont-Ferrand,

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, OUTLINE OVERVIEW INSTALLATION & CONFIGURATION TESTING FIREWALL SETUP TROUBLESHOOTING

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, OVERVIEW The Computing Element is the central service of a site. Its main functionally are: – manage the jobs (job submission, job control) – update to WMS the status of the jobs – publish all site informations (site location, queues, about the CPUs status, and so on) via ldap (site BDII service) It can run several kinds of batch system: – Torque + MAUI – LSF – Condor

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, TORQUE + MAUI The Torque server is composed by a: – pbs_server – pbs_server which provides the basic batch services such as receiving/creating a batch job. The Torque client is composed by a: – pbs_mom – pbs_mom which places the job into execution. It is also responsible for returning the job’s output to the user The MAUI system is composed by a: – job_scheduler – job_scheduler which contains the site's policy to decide which job must be executed.

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Computing Element installation & configuration using YAIM

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, how to obtain the CE host certificate Request host certificate for CE. – Install host certificate (hostcert.pem and hostkey.pem) in /etc/grid-security. – mkdir /etc/grid-security – chmod 644 hostcert.pem – chmod 400 hostkey.pem

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, INSTALLATION: JAVA SDK Because of SUN licence used for Java SDK, it is not possible to redistribute it with the middleware. You have to download Java SDK from Sun web site: Select ``Download J2SE SDK'', and download the ``RPM in self-extracting file''. Follow the instruction on the pages to extract the rpm.

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, INSTALLATION: glite/gilda yaim Download and install latest version of glite-yaim * on all your grid nodes: MS.Release3.0/ MS.Release3.0/ Download and install the latest version of gilda_ig-yaim * on all your grid nodes:

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, INSTALLATION: glite/gilda yaim Copy gilda_ig-site-info.def template file provided by gilda_ig_yaim in to the root dir and customize it cp /opt/glite/yaim/examples/gilda_ig-site-info.def \ /root/my-site-info.def Open /root/my-site-info.def file using a text editor and set the following values according to your grid environment: MY_DOMAIN= CE_HOST= NTP_HOSTS=“ ” TORQUE_SERVER=$CE_HOST

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Set the repositories: INSTALL_SERVER_HOST=training50d.$MY_DOMAIN OS_REPOSITORY="rpm slc306-i386 os updates extras localrpms" LCG_REPOSITORY="rpm glite_sl3- i386 3_0_0 3_0_0_externals 3_0_0_updates" IG_REPOSITORY="rpm ig_sl3-i386 3_0_0 utils“ GILDA_REPOSITORY="rpm gilda_app- i386 app 3_0_0" CA_REPOSITORY="rpm glite_sl3- i386 security" Customize gilda_ig-site-info.def

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, JAVA_LOCATION="/usr/java/j2sdk1.4.2_12“ MYSQL_PASSWORD=set_this_to_a_good_password APEL_DB_PASSWORD="APELDB_PWD" SITE_NAME= SITE_LOC="Clermont, France" SITE_LAT=45.7 SITE_LONG=3.08 SITE_WEB=" SITE_TIER="EMBRACE Testbed" Customize gilda_ig-site-info.def

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, JOB_MANAGER=lcgpbs CE_BATCH_SYS=pbs BATCH_BIN_DIR=/usr/bin BATCH_VERSION=torque-1.0.1b CE_CPU_MODEL=PIII CE_CPU_VENDOR=intel CE_CPU_SPEED=1400 CE_OS="Scientific Linux CERN“ CE_OS_RELEASE=3.0.6 CE_OS_VERSION="SLC“ CE_MINPHYSMEM=1024 CE_MINVIRTMEM=2048 CE_SMPSIZE=2 CE_SI00=1000 CE_SF00=1200 CE_OUTBOUNDIP=TRUE CE_INBOUNDIP=TRUE CE_RUNTIMEENV=“list of tags to publish” Customize gilda_ig-site-info.def

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, CLASSIC_HOST="classic_SE_hostname“ DPM_HOST=“dpm_hostname” SE_LIST="$DPM_HOST $CLASSIC_HOST“ BDII_REGIONS="CE SE“ BDII_CE_URL="ldap://$CE_HOST:2135/mds-vo-name=local,o=grid“ BDII_SE_URL="ldap://$CLASSIC_HOST:2135/mds-vo- name=local,o=grid" BDII_SE1_URL="ldap://$DPM_HOST:2135/mds-vo-name=local,o=grid“ VOS=“write here the VOs you want to support” ALL_VOMS=“write here the VOs supported that have a VOMS” QUEUES="short long infinite" Customize gilda_ig-site-info.def

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, WN_LIST=/opt/glite/yaim/examples/gilda_wn-list.conf The file specified in WN_LIST has to be set with the list of all your WNs’s hostname. WARNING: It’s important to setup it before to run the configure command Customize gilda_ig-site-info.def

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, There are several kind of metapackages to install: GILDA_ig_CE – LCG ComputingElement without batch system packages. GILDA_ig_CE_LSF – LCG ComputingElement with LSF. IMPORTANT: providedfor consistency, it does not install LSFbut it apply some fixes via GILDA_ig_configure_node. GILDA_ig_CE_torque – LCG ComputingElement with Torque+MAUI. GILDA_ig_glite_CE gLite – ComputingElement without batch system packages. GILDA_ig_glite_CE_LSF – gLite ComputingElement with LSF. IMPORTANT:provided for consistency, it does not install LSF GILDA_ig_glite_CE_torque – gLite ComputingElement with Torque+MAUI. WHAT KIND OF CE?

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, CE Torque Installation This command will download and install all the needed packages: /opt/glite/bin/gilda_ig_install_node /root/my-site-info.def GILDA_ig_CE_torque Now we can configure the node: /opt/glite/bin/gilda_ig_configure_node /root/my-site-info.def GILDA_ig_CE_torque

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Computing Element testing

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Testing Edit a file and write: #!/bin/sh sleep 10 #(it's useful to see the job status) hostname Save it and set the permission of execution: chmod 700 test.sh

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Testing gilda003]$ qsub -q short test.sh 3.ce-wn.localdomain gilda003]$ qstat -a ce.localdomain: Req'd Req'd Elap Job ID Username Queue Jobname SessID NDS TSK Memory Time S Time ce-wn.localdo gilda003 short test.sh :15 R --

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Testing gilda003]$ qstat -a gilda003]$ The job execution has finished and we have to list the output file: gilda003]$ ls test.sh.e3 test.sh.o3 And show them: gilda003]$ cat test.sh.e3 (error file) gilda003]$ gilda003]$ cat test.sh.o3 (output file) wn.localdomain

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Testing plt]$ voms-proxy-init –voms gilda plt]$ globus-job-run ce.localdomain:2119/jobmanager-lcgpbs -q short /bin/hostname wn.localdomain plt]$ edg-job-submit -r ce-wn.localdomain:2119/jobmanager-lcgpbs-short hostname.jdl Selected Virtual Organisation name (from proxy certificate extension): gilda Connecting to host ui-rb-bdii.localdomain, port 7772 Logging to host ui-rb-bdii.localdomain, port 9002 ********************************************************************************************* JOB SUBMIT OUTCOME The job has been successfully submitted to the Network Server. Use edg-job-status command to check job current status. Your job identifier (edg_jobId) is: - *********************************************************************************************

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Testing Check if the local GRIS and the site BDII are running on CE and are publishing the right informations (CPU, site name and so on) ldapsearch -x -h -p b mds-vo-name=local,o=grid ldapsearch -x -h -p b mds-vo-name=,o=grid

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, FIREWALL SETUP

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, /etc/sysconfig/iptables (1/2) *filter :INPUT ACCEPT [0:0] :FORWARD ACCEPT [0:0] :OUTPUT ACCEPT [0:0] :RH-Firewall-1-INPUT - [0:0] -A INPUT -j RH-Firewall-1-INPUT -A FORWARD -j RH-Firewall-1-INPUT -A RH-Firewall-1-INPUT -i lo -j ACCEPT -A RH-Firewall-1-INPUT -m state --state ESTABLISHED,RELATED -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport 22 -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport maui -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport pbs_mom -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport pbs_resmon -j ACCEPT

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, /etc/sysconfig/iptables (2/2) -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport pbs -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m udp -p udp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m udp -p udp --dport 1020:1023 -j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport 20000: j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport 32768: j ACCEPT -A RH-Firewall-1-INPUT -m state --state NEW -m udp -p udp --dport 32768: j ACCEPT -A RH-Firewall-1-INPUT -p tcp -m tcp --syn -j REJECT -A RH-Firewall-1-INPUT -j REJECT --reject-with icmp-host-prohibited COMMIT

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, iptables startup /sbin/chkconfig iptables on /etc/init.d/iptables start

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Troubleshooting

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Troubleshooting plt]$ globus-job-run ce-wn.localdomain:2119/jobmanager-lcgpbs -q short /bin/hostname GRAM Job submission failed because the connection to the server failed (check host and port) (error code 12) solution: check if the globus-gatekeeper daemon is up and running on CE plt]$ globus-job-run ce-wn.localdomain:2119/jobmanager-lcgpbs -q short /bin/hostname GRAM Job submission failed because authentication failed: GSS Major Status: Authentication Failed GSS Minor Status Error Chain: init.c:499: globus_gss_assist_init_sec_context_async: Error during context initialization init_sec_context.c:171: gss_init_sec_context: SSLv3 handshake problems globus_i_gsi_gss_utils.c:888: globus_i_gsi_gss_handshake: Unable to verify remote side's credentials globus_i_gsi_gss_utils.c:847: globus_i_gsi_gss_handshake: Unable to verify remote side's credentials: Couldn't verify the remote certificate OpenSSL Error: s3_pkt.c:1046: in library: SSL routines, function SSL3_READ_BYTES: sslv3 alert bad certificate (error code 7) solution: probably there is no GILDA CA rpm installed on CE

Enabling Grids for E-sciencE INFSO-RI Embrace Tutorial, Clermont-Ferrand, Troubleshooting plt]$ edg-gridftp-ls gsiftp://ce.localdomain/ error the server sent an error response: LCMAPS credential mapping NOT successful solution: check on CE the VO mapping in /opt/edg/etc/lcmaps/gridmapfile /opt/edg/etc/lcmaps/groupmapfile