Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

14
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006 Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR Hermann-von-Helmholtz-Platz 1 D-76344 Eggenstein-Leopoldshafen H. Marten http://www.gridka.de GridKa plans for SC4 and beyond ier 1/2 Meeting and 47 th Session of the GridKa TAB, 2.-3.3.20

description

GridKa plans for SC4 and beyond Tier 1/2 Meeting and 47 th Session of the GridKa TAB, 2.-3.3.2006. Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR Hermann-von-Helmholtz-Platz 1 D-76344 Eggenstein-Leopoldshafen H. Marten http://www.gridka.de. - PowerPoint PPT Presentation

Transcript of Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Page 1: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Forschungszentrum Karlsruhe GmbHInstitut für Wissenschaftliches Rechnen, IWR

Hermann-von-Helmholtz-Platz 1D-76344 Eggenstein-Leopoldshafen

H. Marten

http://www.gridka.de

GridKa plans for SC4 and beyond

Tier 1/2 Meeting and 47th Session of the GridKa TAB, 2.-3.3.2006

Page 2: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

LCG Service DeadlinesLCG Service Deadlines

full physicsrun

first physics

cosmics

2007

2008

2006 Pilot Services – stable service from 1 June 06

LHC Service in operation – 1 Oct 06 over following six months ramp up to full operational capacity & performance

LHC service commissioned – 1 Apr 07

Service Challenge 4

Page 3: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

WLCG MB defines a set of High Level MilestonesWLCG MB defines a set of High Level Milestoneshttps://uimon.cern.ch/twiki/pub/LCG/Planning/WLCG_High_Level_Phase2_Plan-20060112.xls

    2006

SC4-1 28.02.06 All required software for baseline services deployed and operational at all Tier-1s and at least 20 Tier-2 sites

OPN-2 31.03.06 Tier-0/1 high-performance network operational at CERN and 6 Tier-1s, at least 3 via GEANT.

SC4-2 28.02.06 Use cases and service level support defined for SC4

CAS-1 15.03.06 Castor2 Readiness Review

SC3-4 31.03.06 All services on all Tier-1 sites monitored

SC3-5 31.03.06 Proposal on availability levels specified in Annex 3 of the WLCG MoU

SC4-3 30.04.06 Service Challenge 4 Set-up: Set-up complete and basic service demonstrated, capable of running experiment-supplied packaged test jobs, data distribution tested.

DRC-3 30.04.06 1.0 GB/s data recording demonstration at CERN

SC4-4 31.05.06 Service Challenge 4: Start of stable service phase

SC4-5 30.09.06 Service Challenge 4: Successful completion of service phase

To be shifted by 1 month (see later)

Page 4: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

WLCG MB asks Tier-1s to provide site milestone plansWLCG MB asks Tier-1s to provide site milestone plans

https://uimon.cern.ch/twiki/bin/view/LCG/SitesPlanshttps://uimon.cern.ch/twiki/pub/LCG/SitesPlans/FZK_Plan-20060121.xls

Specifies hardware installation and configuration plans (see also

TAB#46). In detail:

03/ 2006: tape access and I/O optimization tests

03/ 2006: installation of 2nd 10 Gbps OPN from GridKa to CERN

03/ 2006: delivery and installation of CPUs

04/ 2006: dCache server & write pool upgrade (throughput)

04/ 2006: 2nd Tape I/O upgrade

05/ 2006: disk delivery installation, configuration, tests

06/ 2006: start of stable SC4 services

Page 5: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

experiment kSI2000 Disk / TB Tape / TB

Alice 363 59 106

Atlas 250 56 55

CMS 220 120 180

LHCb 194 49 52

BaBar 430 104 50

CDF 120 81 100

Dzero 430 135 300

Compass 80 33 95

GridKa resources after all upgradesGridKa resources after all upgrades

Σ 2087 636 938

Page 6: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

experiment kSI2000 share percentage

Alice 363 36 300 17.4

Atlas 250 25 000 12.0

CMS 220 22 000 10.5

LHCb 194 19 400 9.3

BaBar 430 43 000 20.6

CDF 120 12 000 5.8

Dzero 430 43 300 20.6

Compass 80 8 000 3.8

1-Apr-2006

The default (test) queue is not handled by the fair share.

These 20-30 CPUs are kept free for test jobs.

PBSPro fair share after delivery of CPUs in productionPBSPro fair share after delivery of CPUs in production

49 % LHC51 % nLHC

Page 7: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

WLCG MB asks the TCG to provideWLCG MB asks the TCG to providea middleware deployment plana middleware deployment plan

https://uimon.cern.ch/twiki/pub/LCG/Planning/SC4ServicesPlanning_06-01-30.xls

TCG = Technical Coordination Group

Combines user’s requirements and middleware development plans

(great work done by Flavia Donno).

Resulted in “SC4 Middleware (deployment) Plan”

Specifies, which middleware component and version will be

deployed for SC4 and when.

Important message:

LCG 2_7_0 released for production end of January 2006

gLite 3.0 released for pre-production end of February 2006

gLite 3.0 released for production end of April 2006

Page 8: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

• February• March• April• May• June• July• August• September• October

gLite-3.0 gLite-3.2

p-ps

p-ps

deploy

deploy

production

production

certification

certification

SC4LHC Pilot service

gLite-3.x

Middleware release schedule 2006Middleware release schedule 2006(acc. to Maite Borosso Lopez, EGEE ROC managers meeting, 21-feb-2006)

Likely not the final schema !

Page 9: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Middleware deployment plans per siteMiddleware deployment plans per site

https://uimon.cern.ch/twiki/pub/LCG/Planning/SC4SiteServicesPlan.xls

• have been prepared by all Tier-1s

• were discussed in Mumbai (SC4 workshop) together with

• mw development plans

• mw requirements by LHC VOs (applications)

• time scales for activities of LHC VOs

• A summary of the Mumbai workshop was

• prepared by J.Shiers, I.Bird, T.Cass, L.Robertson

• submitted to LCG MB for comments

• submitted to LCG GDB for comments and approval

Page 10: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Services & packages

Needed by Pre-production Production

gLite Service deployed

Installation

gLite 3.0

Tests by all VOs

LCG Service deployed

Installation

gLite 3.0

SC4

VOMS server COMPASS & others -- -- -- (VO server) May Jun-Sep

VOMS clients All VOs 1.3.-20.3. 21.3.-30.4. not tested May Jun-Sep

Myproxy All VOs -- -- X May Jun-Sep

Site BDII All VOs 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

Top level BDII All VOs -- -- X May Jun-Sep

FTS Alice, Atlas, LHCb 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

LFC Alice, Atlas, CMS 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

RB All VOs 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

CE All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

SE All VOs 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

SRM / dCache All VOs ? 21.3.-30.4. X May Jun-Sep

VOBox Alice, Atlas -- -- -- X May Jun-Sep

UI All VOs 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

WN packages All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

Lcg-utils All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

GFAL All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

R-GMA All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

Apel All VOs X 1.3.-20.3. 21.3.-30.4. X May Jun-Sep

3D DB services All VOs (incl. SQUID) -- -- not yet 27.2.-? / tests Jun-Sep

Middleware deployment (gLite 3.0) at GridKaMiddleware deployment (gLite 3.0) at GridKa

Page 11: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Month who Pre-production env. Production environment

March GridKa Deployment gLite 3.0

ALICE Testing gLite 3.0 Bulk production at T1/T2; data back to T0

ATLAS Testing gLite 3.0 3-4 weeks Mar / Apr T0 tests (not at GridKa)

CMS Testing gLite 3.0 PhEDEx integration with FTS (development; not at GridKa)

LHCb Testing gLite 3.0 Start generation of 100M B-physics + 100M bias events

April GridKa Support gLite 3.0 SC4 throughput tests

ALICE Testing gLite 3.0 First push out of sim. Data; reconstruction at T1s

ATLAS Testing gLite 3.0 See above (not at GridKa)

CMS Testing gLite 3.0 10 TB to Tape at T1s at 150 MB/s

LHCb Testing gLite 3.0 Generation of B-physics and bias events continues

May GridKa Deployment of gLite 3.0; Main hardware setup

ALICE --

ATLAS Test distributed operations (cont.) --

CMS --

LHCb --

June GridKa Deployment gLite 3.2 Support for SC4

ALICE Testing gLite 3.2

ATLAS Testing gLite 3.2 Tier-0 test (phase 1) with data distribution to Tier-1s (last 3 weeks)

CMS Testing gLite 3.2 2-week re-run of SC3 goals (beginning of month)

LHCb Testing gLite 3.2 Reconstruction/stripping: 2 TB/day out of CERN; 125 TB on MSS @ Tier-1s

LHC activities at GridKa March - JuneLHC activities at GridKa March - June

Page 12: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Month who Pre-production env. Production environment

July GridLa Support gLite 3.2 T0-T1 at full nominal rates (tape); via dTeam

ALICE Testing gLite 3.2 Reconstruction at CERN and remote centres

ATLAS Testing gLite 3.2 Distributed processing tests(part I; 3 weeks

CMS Testing gLite 3.2 Bulk simulation (2 months)

LHCb Testing gLite 3.2 Reconstruction/stripping: 2 TB/day out of CERN; 125 TB on MSS @ Tier-1s

August GridKa Deployment of gLite 3.2 ??

ALICE

ATLAS Distributed analysis tests part I (2 weeks in July - August)

CMS Bulk simulation continues

LHCb Analysis on data from June / July … until spring 07 or so…

Sept. GridKa

ALICE Scheduled + unscheduled (T2s?) analysis challenges

ATLAS Tier-0 test phase 2 with data to Tier-2s (3-4 weeks in September - October)

CMS Preparations for Computing Software Analysis Challenge 2006 (CSA06)

LHCb Analysis on data continues

October GridKa Prepare for re-installation with Scientific Linux 4.x ??

ALICE

ATLAS Distributed processing tests part 2 (3 weeks)

CMS Execute CSA06

LHCb Analysis on data continues

LHC activities at GridKa July - OctoberLHC activities at GridKa July - October

Page 13: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

WLCG medium term evolutionWLCG medium term evolution

3Ddistributeddatabaseservices

developmenttest

SC4

SRM 2test and

deployment

Plan beingelaborated

October?

Additional planned

Functionality

to be agreed& completedin the next

few months

then - testeddeployed

Subject to progress& experience

Newfunctionality

Evaluation&

developmentcycles

Possiblecomponents

for lateryears

??

Page 14: Forschungszentrum Karlsruhe GmbH Institut für Wissenschaftliches Rechnen, IWR

Forschungszentrum Karlsruhein der Helmholtz-Gemeinschaft

Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006

Two grid infrastructures are now in operation, on which we are able to build computing services for LHC

Reliability and performance have improved significantly over the past year

The focus of Service Challenge 4 is to demonstrate a basic but reliable service that can be scaled up - by April 2007 -to the capacity and performance needed for the first beams.

Development of new functionality and services must continue, but we must be careful that this does not interfere with the main priority for this year –

reliable operation of the baseline services

Summary (taken from Jamie Shiers / SC4 Mumbai)Summary (taken from Jamie Shiers / SC4 Mumbai)