<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4517963319103426980</id><updated>2011-08-03T10:18:59.514+01:00</updated><category term='Clustervision'/><category term='GridPP'/><category term='RHUL'/><category term='IC'/><category term='move'/><category term='cluster'/><category term='Alces'/><category term='newton'/><category term='Dell'/><title type='text'>LondonGrid</title><subtitle type='html'>LondonGrid is a regional Tier 2 of GridPP, distributed between the Universities of Queen Mary, Imperial College, Royal Holloway, Brunel and UCL.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Mona Aggarwal</name><uri>http://www.blogger.com/profile/05961397599414133572</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>33</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2468187611342108877</id><published>2011-03-11T12:41:00.003Z</published><updated>2011-03-11T12:44:07.181Z</updated><category scheme='http://www.blogger.com/atom/ns#' term='newton'/><category scheme='http://www.blogger.com/atom/ns#' term='RHUL'/><category scheme='http://www.blogger.com/atom/ns#' term='cluster'/><category scheme='http://www.blogger.com/atom/ns#' term='Dell'/><category scheme='http://www.blogger.com/atom/ns#' term='GridPP'/><category scheme='http://www.blogger.com/atom/ns#' term='Alces'/><title type='text'>RHUL cluster expands</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://farm6.static.flickr.com/5051/5516797353_5400b5b13c_m.jpg"&gt;&lt;img style="float: right; margin: 0pt 0pt 10px 10px; cursor: pointer; width: 180px; height: 240px;" src="http://farm6.static.flickr.com/5051/5516797353_5400b5b13c_m.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Yesterday, RHUL took delivery of new storage and compute nodes to beef up its Tier2 cluster.&lt;br /&gt;The GridPP and CIF funded kit was supplied by Dell and is being installed and configured by Alces.&lt;br /&gt;The extra 6.3 kHS06 and 420 TB will more than double the capacity of cluster.&lt;br /&gt;Once  the installation is complete and accepted, work to integrate it with  the existing cluster and bring up the gLite services will begin.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2468187611342108877?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2468187611342108877/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2468187611342108877' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2468187611342108877'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2468187611342108877'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2011/03/rhul-cluster-expands.html' title='RHUL cluster expands'/><author><name>Simon George</name><uri>http://www.blogger.com/profile/10363160113556218890</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://bp1.blogger.com/_y8YjKFE0xSE/R875lXBUjqI/AAAAAAAAAAM/BrqPTR4a6Xg/S220/Simon2_small.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://farm6.static.flickr.com/5051/5516797353_5400b5b13c_t.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-1936649762288487674</id><published>2010-02-19T17:29:00.002Z</published><updated>2010-02-19T17:40:34.034Z</updated><category scheme='http://www.blogger.com/atom/ns#' term='newton'/><category scheme='http://www.blogger.com/atom/ns#' term='Clustervision'/><category scheme='http://www.blogger.com/atom/ns#' term='RHUL'/><category scheme='http://www.blogger.com/atom/ns#' term='cluster'/><category scheme='http://www.blogger.com/atom/ns#' term='move'/><category scheme='http://www.blogger.com/atom/ns#' term='IC'/><title type='text'>RHUL 'Newton' cluster comes home</title><content type='html'>After two years hosted by Imperial College, our 'Newton' Grid computing cluster has finally been relocated to Royal Holloway's new state-of-the-art computer centre. The move was carried out by &lt;a href="http://www.clustervision.com/" target="_top"&gt;Clustervision&lt;/a&gt; and everything went smoothly.  Before the cluster goes back into production, analysing LHC data, a software upgrade to SL5 is planned.&lt;br /&gt;&lt;br /&gt;A small part of Newton remains at IC: the racks were donated to become part of the particle physics cluster.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-1936649762288487674?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/1936649762288487674/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=1936649762288487674' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1936649762288487674'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1936649762288487674'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2010/02/rhul-newton-cluster-comes-home.html' title='RHUL &apos;Newton&apos; cluster comes home'/><author><name>Simon George</name><uri>http://www.blogger.com/profile/10363160113556218890</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://bp1.blogger.com/_y8YjKFE0xSE/R875lXBUjqI/AAAAAAAAAAM/BrqPTR4a6Xg/S220/Simon2_small.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2138030839740954842</id><published>2009-07-31T12:14:00.028+01:00</published><updated>2009-08-03T16:08:54.719+01:00</updated><title type='text'>Comparing ATLAS analysis at RHUL using the file-staging and RFIO approaches</title><content type='html'>I have been looking at the performance of the Royal Holloway cluster during Hammercloud tests in which data was accessed directly from the DPM pool nodes using the RFIO protocol and comparing it to the recent UK-wide file-staging test (&lt;a href="http://gangarobot.cern.ch/hc/540/test/"&gt;540&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;For the RFIO  approach  two identical tests (&lt;a href="http://gangarobot.cern.ch/hc/537/test/"&gt;537&lt;/a&gt; and &lt;a href="http://gangarobot.cern.ch/hc/538/test/"&gt;538&lt;/a&gt;) were requested in order to ensure enough jobs arrived on site. The RFIO IOBUFSIZE was set to 4KB.  Job CPU efficiencies and cluster throughput (the product of number of running jobs and average job efficiency) were extracted using  Sam and Dug's script. The job throughput climbed steadily up to a peak at around 320 running jobs. At this point the throughput started to decline probably compounded by the fact that one of the disk servers lost a disk and became over-loaded.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_PqExmluvQ0s/SnbKANlMiDI/AAAAAAAAADw/QDKUDpnLkco/s1600-h/test537-538-thrpt-2.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 278px;" src="http://1.bp.blogspot.com/_PqExmluvQ0s/SnbKANlMiDI/AAAAAAAAADw/QDKUDpnLkco/s320/test537-538-thrpt-2.png" alt="" id="BLOGGER_PHOTO_ID_5365698111053006898" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The CPU efficiency  declined relatively consistently as the number of running jobs increased:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_PqExmluvQ0s/SnbGtY6QZXI/AAAAAAAAADo/26tICegHCZ4/s1600-h/test-537-538-eff-1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 312px;" src="http://2.bp.blogspot.com/_PqExmluvQ0s/SnbGtY6QZXI/AAAAAAAAADo/26tICegHCZ4/s320/test-537-538-eff-1.png" alt="" id="BLOGGER_PHOTO_ID_5365694489141732722" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Each job was reading data at about 1 MB/s so that at the peak the total bandwidth was around 350 MB/s - roughly 30 MB/s per disk server. The disk servers were working hard, however, the iostat %util values were around 100% with high cpu iowait values.&lt;br /&gt;&lt;br /&gt;So how do these results compare to those obtained when staging files to the worker node prior to analysis? This graph shows the same RFIO throughput  data together with results from the recently run file-staging test:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_PqExmluvQ0s/SnbFjKL3Z8I/AAAAAAAAADY/o4wKZpDN4qQ/s1600-h/rfio-filestage-thrpt-1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 223px;" src="http://4.bp.blogspot.com/_PqExmluvQ0s/SnbFjKL3Z8I/AAAAAAAAADY/o4wKZpDN4qQ/s320/rfio-filestage-thrpt-1.png" alt="" id="BLOGGER_PHOTO_ID_5365693213878740930" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The throughput during file-staging leveled off  earlier - at around 175 running jobs. Similarly the average job efficiency drops more steeply:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_PqExmluvQ0s/SnbFrRANEKI/AAAAAAAAADg/tSMKopx-SN4/s1600-h/rfio-filestage-eff-1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 207px;" src="http://3.bp.blogspot.com/_PqExmluvQ0s/SnbFrRANEKI/AAAAAAAAADg/tSMKopx-SN4/s320/rfio-filestage-eff-1.png" alt="" id="BLOGGER_PHOTO_ID_5365693353147830434" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The job failure rate for the RFIO tests was  4% compared to 17% for the file-staging test.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2138030839740954842?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2138030839740954842/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2138030839740954842' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2138030839740954842'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2138030839740954842'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2009/07/comparing-atlas-analysis-at-rhul-using.html' title='Comparing ATLAS analysis at RHUL using the file-staging and RFIO approaches'/><author><name>Duncan Rand</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_PqExmluvQ0s/SnbKANlMiDI/AAAAAAAAADw/QDKUDpnLkco/s72-c/test537-538-thrpt-2.png' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5787450502301880601</id><published>2009-05-13T19:55:00.004+01:00</published><updated>2009-05-13T20:06:24.212+01:00</updated><title type='text'>RHUL getting good rates into MCDISK from RAL</title><content type='html'>&lt;div style="text-align: justify;"&gt;RHUL has regularly got good rates and by that I mean 80-100 MB/s from Fermilab when downloading CMS data. It nice now to see similarly high rates downloading ATLAS data into the MCDISK space token from RAL.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_PqExmluvQ0s/SgsYWiWovCI/AAAAAAAAAAM/frf27FBSbC0/s1600-h/ral-rhul.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer; width: 320px; height: 124px;" src="http://4.bp.blogspot.com/_PqExmluvQ0s/SgsYWiWovCI/AAAAAAAAAAM/frf27FBSbC0/s320/ral-rhul.png" alt="" id="BLOGGER_PHOTO_ID_5335384959008422946" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5787450502301880601?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5787450502301880601/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5787450502301880601' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5787450502301880601'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5787450502301880601'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2009/05/rhul-getting-good-rates-into-mcdisk.html' title='RHUL getting good rates into MCDISK from RAL'/><author><name>Duncan Rand</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_PqExmluvQ0s/SgsYWiWovCI/AAAAAAAAAAM/frf27FBSbC0/s72-c/ral-rhul.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-1925143640107016877</id><published>2008-04-16T13:44:00.003+01:00</published><updated>2008-04-16T13:57:31.799+01:00</updated><title type='text'>Exercised space token creation at UCL-HEP</title><content type='html'>Thought it was neat to give it a try and created as a test a small reservation for dteam, following the instructions on the LCG Twiki. All went well and all the tweaks for SL3 / gLite 3.0 worked well. Only oddity was that: &lt;pre&gt;[root@pc55 root]# dpm-reservespace --gspace 10M --lifetime Inf --group lcgdteam --token_desc dteam_10M&lt;br /&gt;send2nsd: NS009 - fatal configuration error: Host unknown: UNUSED&lt;br /&gt;invalid group: lcgdteam&lt;/pre&gt;but: &lt;pre&gt;[root@pc55 root]# dpm-reservespace --gspace 10M --lifetime Inf --gid 2688 --token_desc dteam_10M&lt;/pre&gt;worked well. Perhaps due to the fact that the group id is not the same as the VO name?? (tried also with 'dteam' in place of 'lcgdteam', but had the same error.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-1925143640107016877?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/1925143640107016877/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=1925143640107016877' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1925143640107016877'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1925143640107016877'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2008/04/exercised-space-token-creation-at-ucl.html' title='Exercised space token creation at UCL-HEP'/><author><name>Gianfranco Sciacca</name><uri>http://www.blogger.com/profile/14231948620457508163</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5814181703450295735</id><published>2008-03-26T14:41:00.003Z</published><updated>2008-03-26T15:19:05.357Z</updated><title type='text'>RHUL aircon problems</title><content type='html'>Our machine room aircon system broke down last week and the temperatures have been all over the place.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_y8YjKFE0xSE/R-poxx1tacI/AAAAAAAAAAY/gIAQK2lep2I/s1600-h/last_month.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://1.bp.blogspot.com/_y8YjKFE0xSE/R-poxx1tacI/AAAAAAAAAAY/gIAQK2lep2I/s320/last_month.png" alt="" id="BLOGGER_PHOTO_ID_5182069525644667330" border="0" /&gt;&lt;/a&gt;After a few days of summer clothing and a few nights of temperature alarms, it was diagnosed to be a refrigerant gas leak  from  the chiller on the roof. The bad news is that this takes 2 weeks to fix. Luckily the estates engineer was very efficient and organised the delivery and connection of a backup chiller on the last day of  term, then personally looked  in over Easter to keep an eye on it.&lt;br /&gt;&lt;br /&gt;It has been stable the last few days so I've just brought the cluster back up. The site will come out of downtime this evening.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5814181703450295735?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5814181703450295735/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5814181703450295735' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5814181703450295735'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5814181703450295735'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2008/03/rhul-aircon-problems.html' title='RHUL aircon problems'/><author><name>Simon George</name><uri>http://www.blogger.com/profile/10363160113556218890</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://bp1.blogger.com/_y8YjKFE0xSE/R875lXBUjqI/AAAAAAAAAAM/BrqPTR4a6Xg/S220/Simon2_small.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_y8YjKFE0xSE/R-poxx1tacI/AAAAAAAAAAY/gIAQK2lep2I/s72-c/last_month.png' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5539324308011814857</id><published>2007-08-07T16:15:00.000+01:00</published><updated>2007-08-07T16:27:55.330+01:00</updated><title type='text'>UCL-HEP APEL accounting fixed</title><content type='html'>After upgrading to gLite r27 on the 4th of July, APEL stopped publishing to the central RGMA registry. The apel-publisher failed with a not handled&lt;br /&gt;&lt;pre&gt;RGMABufferFullException&lt;/pre&gt;To fix this, we had to update to the latest version of the APEL rpm's (2.0.5-1) on the MON and CE and re-run YAIM on both&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5539324308011814857?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5539324308011814857/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5539324308011814857' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5539324308011814857'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5539324308011814857'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/08/ucl-hep-apel-accounting-fixed.html' title='UCL-HEP APEL accounting fixed'/><author><name>Gianfranco Sciacca</name><uri>http://www.blogger.com/profile/14231948620457508163</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-3726395956612776373</id><published>2007-07-20T14:00:00.000+01:00</published><updated>2007-07-20T14:01:31.224+01:00</updated><title type='text'>Imperial SE - dCache removed ~30TB of CMS data</title><content type='html'>As requested by CMS users, this week we have cleaned up around ~30TB (orphaned files) of CMS data from IC dCache. We need to understand why so many orphaned files are generated in dCache.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-3726395956612776373?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/3726395956612776373/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=3726395956612776373' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3726395956612776373'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3726395956612776373'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/07/imperial-se-dcache-removed-30tb-of-cms_20.html' title='Imperial SE - dCache removed ~30TB of CMS data'/><author><name>Mona Aggarwal</name><uri>http://www.blogger.com/profile/05961397599414133572</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-387086785548007204</id><published>2007-07-20T13:19:00.001+01:00</published><updated>2007-07-20T13:24:09.492+01:00</updated><title type='text'>Brunel SE running DPM 1.6.5</title><content type='html'>We were having problems with the storage element at Brunel so I upgraded it to DPM version 1.6.5 (via 1.6.3) this week. The upgrade didn't go totally smoothly but now things seem a lot better. Thanks to Greig for his usual excellent support.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-387086785548007204?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/387086785548007204/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=387086785548007204' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/387086785548007204'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/387086785548007204'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/07/brunel-se-running-dpm-165.html' title='Brunel SE running DPM 1.6.5'/><author><name>Duncan Rand</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-8020664768387399936</id><published>2007-07-20T12:46:00.000+01:00</published><updated>2007-07-20T13:18:22.413+01:00</updated><title type='text'>Brunel running SL4 cluster</title><content type='html'>The worker nodes of dgc-grid-40 are now running the glite worker node release on SL4. It is passing the ops SAM tests and the VO tests that have run recently. There was a problem with LHCb production jobs trying to use edg-brokerinfo rather than glite-brokerinfo which I reported and they have now fixed. CMS production jobs have also completed successfully. Steve Lloyd's ATLAS tests pass apart from the 'New Package' part. Steve's comment was "My tests are still running release 12.0.6 for which the requirement is SL3 so they shouldn't really go into SL4 machines...this problem will go away when I switch to release 13.0.X as that's supposed to work on SL4". ATLAS production jobs seem to run OK but there seems to be a problem copying the output files back.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-8020664768387399936?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/8020664768387399936/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=8020664768387399936' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/8020664768387399936'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/8020664768387399936'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/07/brunel-running-sl4-cluster.html' title='Brunel running SL4 cluster'/><author><name>Duncan Rand</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-1899162859683472354</id><published>2007-07-20T12:32:00.000+01:00</published><updated>2007-07-20T12:45:45.295+01:00</updated><title type='text'>RHUL accounting problem</title><content type='html'>There was a problem with the apel accounting at RHUL this week:&lt;br /&gt;&lt;br /&gt;ZoneInfo: /usr/java/j2sdk1.4.2_12/jre/lib/zi/ZoneInfoMappings (Too&lt;br /&gt;many open files)&lt;br /&gt;Thu Jul 19 00:35:06 GMT 2007: apel-pbs-log-parser - WARNING -&lt;br /&gt;Exception opening file /var/spool/PBS/server_priv/accounting/20070713&lt;br /&gt;java.io.FileNotFoundException:&lt;br /&gt;/var/spool/PBS/server_priv/accounting/20070713 (Too many open files)&lt;br /&gt;&lt;br /&gt;we solved it by moving some of the files out of  /var/spool/PBS/server_priv&lt;wbr&gt;/accounting.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-1899162859683472354?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/1899162859683472354/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=1899162859683472354' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1899162859683472354'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1899162859683472354'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/07/rhul-accounting-problem.html' title='RHUL accounting problem'/><author><name>Duncan Rand</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5647096105150851679</id><published>2007-06-26T05:53:00.000+01:00</published><updated>2007-06-26T05:56:23.216+01:00</updated><title type='text'>bdii counts</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_W1ILuVlGUTk/RoCcZdJ9BII/AAAAAAAAABw/glEqjqHo620/s1600-h/bdii-counts.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_W1ILuVlGUTk/RoCcZdJ9BII/AAAAAAAAABw/glEqjqHo620/s400/bdii-counts.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5080232340810957954" /&gt;&lt;/a&gt;&lt;br /&gt;Promised to monitor the bdii. This is the plot of the bdii count a while ago. I'll have to redo it for a longer period. It seems clear that it is not the entire site bdii that disappear but only individual entries. Which is very probably correlated with load. We have seen it with the ce mds.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5647096105150851679?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5647096105150851679/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5647096105150851679' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5647096105150851679'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5647096105150851679'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/bdii-counts.html' title='bdii counts'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_W1ILuVlGUTk/RoCcZdJ9BII/AAAAAAAAABw/glEqjqHo620/s72-c/bdii-counts.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-3786284285836533011</id><published>2007-06-19T10:28:00.000+01:00</published><updated>2007-06-19T10:35:31.812+01:00</updated><title type='text'>RB very slow</title><content type='html'>Yesterday I have been wrestling with our RB. I takes several hours for a job to go from waiting to scheduled which means that the matchmaking process is overloaded. I think the reason was that the database was very big (4GB). Exacly 2^32. As suggested &lt;a href="http://www.gridpp.ac.uk/wiki/IC-HEP#RB_problems"&gt;here&lt;/a&gt; I cleaned the database and it seems better now. The problem is that I never got to the root of what was going wrongly...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-3786284285836533011?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/3786284285836533011/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=3786284285836533011' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3786284285836533011'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3786284285836533011'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/rb-very-slow.html' title='RB very slow'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2280893804332421492</id><published>2007-06-19T10:22:00.000+01:00</published><updated>2007-06-19T10:27:56.316+01:00</updated><title type='text'>dCache failures (dcache-server-1.7.0-36)</title><content type='html'>Again this morning we have pools going down with a memory allocation problem:&lt;br /&gt;--&lt;br /&gt;06/19 00:45:58 Cell(sedsk01_5@sedsk01Domain) : Thread : ping got : java.lang.OutOfMemoryError: Java heap space&lt;br /&gt;--&lt;br /&gt;I think the only way we will solve this will be to get hold on a dCache developer that can have a look. Clearly we did not have this problem when we where running the previous version (release 35).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2280893804332421492?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2280893804332421492/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2280893804332421492' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2280893804332421492'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2280893804332421492'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/dcache-failures-dcache-server-170-36.html' title='dCache failures (dcache-server-1.7.0-36)'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-4983196248231363165</id><published>2007-06-18T13:25:00.000+01:00</published><updated>2007-06-18T13:28:01.631+01:00</updated><title type='text'>dCache pools went down</title><content type='html'>From friday afternoon several dCache pools went down. It ran out of memory, and here is the content of the sedsk01Domain.log file.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:78%;"&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at java.lang.Thread.run(Thread.java:595)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) : Storing incomplete file : 0003000000000000006E0B80 with 2756018417&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) : Stacked Exception (Original) for : 0003000000000000006E0B80 &lt;-P---------(0)[0]&gt; 2756018417 si={cms:cms} : CacheException(rc=10006;msg=Pnfs request timed out)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) : Stacked Throwable (Resulting) for : 0003000000000000006E0B80 &lt;-P---------(0)[0]&gt; 2756018417 si={cms:cms} : CacheException(rc=33;msg=Illegal State Transition -P-------- -&gt; -P--------)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) : CacheException(rc=33;msg=Illegal State Transition -P-------- -&gt; -P--------)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at diskCacheV111.repository.CacheRepository2$CacheEntry.setPrimaryState(CacheRepository2.java:107)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at diskCacheV111.repository.CacheRepository2$CacheEntry.setPrecious(CacheRepository2.java:219)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at diskCacheV111.repository.CacheRepository2$CacheEntry.setPrecious(CacheRepository2.java:215)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at diskCacheV111.pools.MultiProtocolPool2$RepositoryIoHandler.run(MultiProtocolPool2.java:1538)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at diskCacheV111.util.SimpleJobScheduler$SJob.run(SimpleJobScheduler.java:64)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:32:13 Cell(sedsk01_1@sedsk01Domain) :  at java.lang.Thread.run(Thread.java:595)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:35:02 Cell(c-100@sedsk01Domain) : runIO : java.lang.OutOfMemoryError: Java heap space&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:35:02 Cell(c-100@sedsk01Domain) : java.lang.OutOfMemoryError: Java heap space&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:35:02 Cell(c-100@sedsk01Domain) : java.lang.OutOfMemoryError: Java heap space&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;06/15 16:38:25 Cell(c-100@sedsk01Domain) : runIO : java.lang.OutOfMemoryError: Java heap space&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;dCache is started with those parameters:&lt;br /&gt; -server -Xmx512m -XX:MaxDirectMemorySize=512m&lt;br /&gt;&lt;br /&gt;We don't know what happened.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-4983196248231363165?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/4983196248231363165/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=4983196248231363165' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/4983196248231363165'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/4983196248231363165'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/dcache-pools-went-down.html' title='dCache pools went down'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-7905908706407043898</id><published>2007-06-15T03:47:00.000+01:00</published><updated>2007-06-15T03:52:31.561+01:00</updated><title type='text'>Dataset access problem at IC-HEP</title><content type='html'>&lt;!---mandFontOffStart---&gt;&lt;!---mandFontOffEnd---&gt;Some users are experimenting datasets access problems at IC-HEP. The ticket in question is GGUS 22106. The problem is that our cms users don't have the problem for the same dataset.&lt;br /&gt;This raises the question on how to debug those problems when  you don't have users on hand. In this case the only solutions will be to do it interactively with the user.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-7905908706407043898?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/7905908706407043898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=7905908706407043898' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/7905908706407043898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/7905908706407043898'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/dataset-access-problem-at-ic-hep.html' title='Dataset access problem at IC-HEP'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-6332852403826164252</id><published>2007-06-15T03:28:00.000+01:00</published><updated>2007-06-15T03:47:15.891+01:00</updated><title type='text'>SAM Failures in London</title><content type='html'>Summary of SAM failures and solutions&lt;br /&gt;&lt;ul&gt;&lt;li&gt;mars-ce2: CA certificates updated but permissions where wrong for the lt2-lcg group and hence the certs where not readable. Fixed now&lt;/li&gt;&lt;li&gt;hep-ce:&lt;br /&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Update of the images. Missing ssl and uuid libraries caused the lcg-cp tools to fail. Matt solved this&lt;/li&gt;&lt;li&gt;updated the CA but unfortunatly the crl cronjob did not run  since it is being run by mona. Now fixed&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;gw-2 (UCL-CENTRAL): Investigated intermittent failures and discovered that the sam jobs are sometimes killed by sge which has a vmem limit of 2GB. The problem is that python when creating a new thread tries to use the max stack size of the parent process. Since sge set this with a very high value any new thread will thread will try to create a big stack and the vmem limit will be reached. The solution is to change the max stack size in the sge configuration. We tried a ulimit -s 10 in the jobmanager but since then gw-2 is failing the ops test consistently. William has been contacted the revert back this change and make the modification in the sge queue configuration.&lt;br /&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Note: this problem was seen on the ic-hep cluster (ce00) and fixed using the stack size limit.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;ce1.pp (RHUL): gatekeeper problem, it seems I cannot access with the ssh keys I am using at home. Have to check from IC.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;It's a black week for the availability in London...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-6332852403826164252?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/6332852403826164252/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=6332852403826164252' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/6332852403826164252'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/6332852403826164252'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/06/sam-failures-in-london.html' title='SAM Failures in London'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2231965384220007836</id><published>2007-05-02T12:03:00.000+01:00</published><updated>2007-05-02T12:05:22.427+01:00</updated><title type='text'>London Tier2 Workshop</title><content type='html'>The London Tier2 Workshop took place on the 16 of April. &lt;br /&gt;It was a good opportunity to see what are the non hep application running on the grid. &lt;br /&gt;The slides of the workshop can be found &lt;a href="http://www.gridpp.ac.uk/workshops/LT2April07.html "&gt;here&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2231965384220007836?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2231965384220007836/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2231965384220007836' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2231965384220007836'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2231965384220007836'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/05/london-tier2-workshop.html' title='London Tier2 Workshop'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2088586924389307073</id><published>2007-05-02T10:37:00.000+01:00</published><updated>2007-05-02T10:41:43.138+01:00</updated><title type='text'>New Grid Security Policy Document</title><content type='html'>The new Grid Security Policy Document can be found at &lt;a href="https://edms.cern.ch/document/428008/4"&gt;here&lt;/a&gt; . It is still a draft, and comments are welcome. See version 5.6&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2088586924389307073?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2088586924389307073/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2088586924389307073' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2088586924389307073'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2088586924389307073'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/05/new-grid-security-policy-document.html' title='New Grid Security Policy Document'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5389611624694582469</id><published>2007-02-20T10:55:00.000Z</published><updated>2007-02-20T11:02:10.677Z</updated><title type='text'>RB Wrestling the comeback</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_W1ILuVlGUTk/RdrVKiMkdyI/AAAAAAAAABM/2OjJgCOtlyw/s1600-h/backlog.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://3.bp.blogspot.com/_W1ILuVlGUTk/RdrVKiMkdyI/AAAAAAAAABM/2OjJgCOtlyw/s400/backlog.png" alt="" id="BLOGGER_PHOTO_ID_5033569910494885666" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;This morning looking at the monitoring our RB does not look happy. You can judge yourself on the plot below. It clearly seems that when the submission rate is too high the workload manager can just not eat the jobs fast enough to reduce the queue length. I have asked help from Maarten, we'll see what he come up with. I think I will have a look in the rb code to find out what is going on...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5389611624694582469?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5389611624694582469/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5389611624694582469' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5389611624694582469'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5389611624694582469'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/02/rb-wrestling-comeback.html' title='RB Wrestling the comeback'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_W1ILuVlGUTk/RdrVKiMkdyI/AAAAAAAAABM/2OjJgCOtlyw/s72-c/backlog.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2936784485834883794</id><published>2007-02-19T17:08:00.001Z</published><updated>2007-02-19T17:12:34.134Z</updated><title type='text'>Certificates and Mars</title><content type='html'>I was worried by the low number of jobs at LeSC. There was not much jobs there.&lt;br /&gt;It is very difficult to get an hold on the output files of failing jobs. Thanks to sge we can find out where it is located&lt;br /&gt;&lt;ul&gt;&lt;li&gt;qstat -j jobid will print out the std.err and std.out of the jobid given&lt;/li&gt;&lt;/ul&gt;The problem was:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;globus_i_gsi_gss_utils.c:2155: globus_i_gsi_gssapi_init_ssl_context: Error with openssl: Couldn't open bio for reading on file: /homes/lt2-lcg/grid-security/certificates/47d3d1a0.0&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;and that is because when untarring the files in the certificate directory one of the certificate&lt;br /&gt;was not readable by the lt2-[users]. This is now fixed and I will chase up lhcb to understand&lt;br /&gt;if they can run there without problem.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2936784485834883794?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2936784485834883794/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2936784485834883794' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2936784485834883794'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2936784485834883794'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/02/certificates-and-mars.html' title='Certificates and Mars'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-968651105267623700</id><published>2007-02-19T16:56:00.000Z</published><updated>2007-02-19T17:06:18.428Z</updated><title type='text'>Wrestling with our RB</title><content type='html'>We are still observing very long time (several minutes) to have a job going from the waiting state to the scheduled state. This means that the network server of the rb is accepting the job but the workload manager is running out of steam to process it and do the match making.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;I monitored the rb by looking at the number of entries in the input queue (/var/log/edgwl/workload_manager/input.fl). Checked the number of entries that matches the regular expression ("g$").&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Plotting the number of entries waiting to be accepted by the workload manager as a function of time. The result is here.&lt;/li&gt;&lt;/ul&gt;&lt;div style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_W1ILuVlGUTk/RdnXryMkduI/AAAAAAAAAAk/-6L96vWEFJQ/s1600-h/rbmon.jpg"&gt;&lt;img style="cursor: pointer;" src="http://3.bp.blogspot.com/_W1ILuVlGUTk/RdnXryMkduI/AAAAAAAAAAk/-6L96vWEFJQ/s320/rbmon.jpg" alt="" id="BLOGGER_PHOTO_ID_5033291205772080866" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;The left scale (blue dots) is the number of jobs waiting to be matched. The right scale (red dots) is the number of jobs submitted per unit of 10 minutes&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;You can see a clear drop at the end of the x range. I think this is because I have reduced the number of threads for the network server and increased that number for the workload manager. The file to look at is /opt/edg/etc/edg_wl.conf .&lt;/li&gt;&lt;ul&gt;&lt;li&gt;For the NetWorkServer:&lt;br /&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;MasterThreads = 4;&lt;/li&gt;&lt;li&gt;DispatcherThreads = 6;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;ul&gt;&lt;ul&gt;&lt;li&gt;For the WorkLoadManager&lt;/li&gt;&lt;ul&gt;&lt;li&gt;NumberOfWorkerThreads = 10;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;/ul&gt;I will continue to monitor it during the night because the drop is not fully understood. Maybe it is the cms production that has slowed down and is giving some air to the rb.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-968651105267623700?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/968651105267623700/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=968651105267623700' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/968651105267623700'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/968651105267623700'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/02/wrestling-with-our-rb.html' title='Wrestling with our RB'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_W1ILuVlGUTk/RdnXryMkduI/AAAAAAAAAAk/-6L96vWEFJQ/s72-c/rbmon.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2975298319713625263</id><published>2007-02-12T13:14:00.000Z</published><updated>2006-12-04T10:26:01.240Z</updated><title type='text'>Resurrection of the Blog</title><content type='html'>&lt;span style="font-weight: bold;"&gt;ICT the Grid&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Today we are back in  business with &lt;a href="http://www3.imperial.ac.uk/ict/services/teachingandresearchservices/highperformancecomputing"&gt;&lt;span style="text-decoration: underline;"&gt;ICT&lt;/span&gt;&lt;/a&gt; to get their cluster on the Grid.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;They will provide one machine and install RHEL3 i386 so that we don't have the RHEL4 problem.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;We have to find out how to modify the information system since they are running pbspro which does not have exactly the same commands as pbs.&lt;/li&gt;&lt;li&gt;They will create the pool accounts and we have yet to make sure that we can run prolog scripts to get the lcg environment correct&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;QMUL&lt;br /&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;Atlas cannot install the tags. They tried to install the new software but it is not published correctly.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Maybe this is because we are publishing another subcluster to publish the 64 bit queues. I'll make a wiki entry with explanations how this was done. The dynamic information does not seem to be correct though.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;Imperial Hep&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Mona has enabled camont and total on our rb. We now need a site to test it and we also need to enable it on the lesc and ic-hep ce.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2975298319713625263?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2975298319713625263/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2975298319713625263' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2975298319713625263'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2975298319713625263'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2007/02/resurrection-of-blog.html' title='Resurrection of the Blog'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-4798631888558698385</id><published>2006-12-04T10:24:00.000Z</published><updated>2006-12-04T10:22:35.337Z</updated><title type='text'>dzero </title><content type='html'>- Dzero was disabled at QMUL because the NAT box changed. This is now &lt;br&gt;solved and we see dzero jobs at QMUL again. Next will be &lt;br&gt;ce00.hep.ph.ic.ac.uk&lt;br&gt;- CMS installation should now be possible at QMUL. We are trying to get &lt;br&gt;all the London sites cms installable by the sgm.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-4798631888558698385?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/4798631888558698385/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=4798631888558698385' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/4798631888558698385'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/4798631888558698385'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/12/dzero.html' title='dzero '/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-7618901711329173902</id><published>2006-12-01T11:32:00.001Z</published><updated>2006-12-01T11:32:07.605Z</updated><title type='text'>mars will be offline</title><content type='html'>UKI-LT2-IC-LESC will be down from the 1/12 - 4/12 for maintenance.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-7618901711329173902?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/7618901711329173902/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=7618901711329173902' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/7618901711329173902'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/7618901711329173902'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/12/mars-will-be-offline.html' title='mars will be offline'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-1398285821358016189</id><published>2006-11-09T09:40:00.000Z</published><updated>2006-11-09T09:49:58.773Z</updated><title type='text'>SAM, NGS</title><content type='html'>&lt;ul&gt;&lt;li&gt;Yesterday we have found why we had Aborted SAM tests. The reason was simple, the SAM jobs seems to use a lot of virtual memory and they where killed by our lovely SGE. We where unlucky since all the tests could be published and where showing ok. Only the RB state was showing an Abort with a Maradona error.  The key to find why the jobs where failing was to use the sam tests ourself with the information found &lt;a href="http://goc.grid.sinica.edu.tw/gocwiki/SAM_Submission_Framework"&gt;here&lt;/a&gt;.&lt;/li&gt;&lt;li&gt;Concerning NGS the setup is done at LeSC. But we have a problem with the home directories that are not mounting properly on the worker nodes. Keith is looking at it.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Another usefull info is that we do not have to setup a specific UI for the NGS. They can submit from their UI and they use the stage in switch of globus-job-submit (globus-job-submit  mars-ce2.mars.lesc.doc.ic.ac.uk:2119/jobmanager-sge -q 10min  -s `pwd`/test.sh)&lt;/li&gt;&lt;li&gt;APEL accounting, Duncan and Giuseppe are chasing problems with the data not being published.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;LHCB installation at Imperial failed, Joel is trying another time&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Today we reached more than 1.8k jobs in London, Thanks everyone and welcome to the new clusters. Brunel is running full steam and Imperial is catching up.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-1398285821358016189?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/1398285821358016189/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=1398285821358016189' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1398285821358016189'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1398285821358016189'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/11/sam-ngs.html' title='SAM, NGS'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-3769675248835699266</id><published>2006-09-25T19:29:00.000+01:00</published><updated>2006-09-25T19:48:23.099+01:00</updated><title type='text'></title><content type='html'>&lt;span style="font-weight: bold;"&gt;UCL-CENTRAL:&lt;/span&gt; Today I chased up the  replication sft problem. I don't have any answer from my ggus ticket (#12970). William has mapped me to opssgm and I can sucessfully store files on their srm. When looking more in detail it seems that they are also openssl errors spitted by the server. They only appear when it is the "regular sft" that are running. If we use the polish portal we don't have the error. My bet it that the portal uses grid-proxy-init while the "regular sft" is using voms-proxy-init. It is probably because the vomses and vomsdir content is not completely correct. William is having a look at that.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;ATLAS: &lt;/span&gt;I have contacted Alessandro to understand why some of the London sites are not in shown on their production portal. He explained me that they have to populate a list of sites to which they submit production jobs. I will make sure with Kondo that the atlas sw is there on all london sites.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;CMS: &lt;/span&gt;The cms production is finished (50 millions events). CSA06 should start in the coming weeks to analyse the produced data that is fed back to CERN.&lt;span style="font-weight: bold;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Woodcrest:&lt;/span&gt; Today we (me, Mona, Bill) physically installed all the machines we currently have (22 wn and 8 disk servers).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;PPS:&lt;/span&gt; Barry is summarizing all the problems he went trough with the glite WMS. Hope to have this for the next operation meeting.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;dCache: &lt;/span&gt;Having problems with the number of connection in close_wait state. This is a known issue to the dCache team. We don't know when it will be fixed. This causes replication failures ...&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Transfer test:&lt;/span&gt; RALPPD--&gt;QMUL transfer test worked. Average bandwidth 106Mbits/s and 1146/1500 files transferred. Clearly we should be able to do better since they have a 1Gb link which is not much loaded.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;GridMon: &lt;/span&gt;Got an agreement with Mark to have the recepie to rebuild the GridMon box for Imperial and UCL. Kostas agreed to build it.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Brunel new cluster:&lt;/span&gt;Discusssions with Duncan on how to proceed with the new cluster.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-3769675248835699266?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/3769675248835699266/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=3769675248835699266' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3769675248835699266'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/3769675248835699266'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/ucl-central-today-i-chased-up.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5398575201645069798</id><published>2006-09-22T15:48:00.000+01:00</published><updated>2006-09-22T16:31:21.549+01:00</updated><title type='text'></title><content type='html'>Today is a horrible rainy day... I have been again busy with QMUL. For some weird reason lhcb has managed to schedule 2500 jobs there and none are running. I can even not see the jobs in the queue. I restarted the gatekeeper and I can see jobs being submitted by Ricardo. I still don't understand what was happening there. I have also tried to ressurect the ganglia of QMUL which is down. I could not power cycle it.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Transfer Tests:&lt;/span&gt; I have initiated a transfer of  100Gb from RALT2 to QMUL it seems to be fine for now.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Woodcrest:&lt;/span&gt; 20 new machines arrived today.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5398575201645069798?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5398575201645069798/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5398575201645069798' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5398575201645069798'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5398575201645069798'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/today-is-horrible-rainy-day.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-8371336016382431964</id><published>2006-09-21T16:02:00.000+01:00</published><updated>2006-09-21T16:17:27.276+01:00</updated><title type='text'></title><content type='html'>&lt;span style="font-weight: bold;"&gt;Woodcrest: &lt;/span&gt;Asked ICT to reserve a set of UID/GID to avoid clashes between hep and ict.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Biomed Challenge:&lt;/span&gt; Asked Yannick to see if we can find another solution then inbound connectivity to the nodes for the flex licence.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;QMUL:&lt;/span&gt; try to understand what causes such a low number of running jobs while there is a lot of jobs queued. see attached plots.&lt;br /&gt;&lt;ul&gt;   &lt;li&gt;DNS problem: very high load on the dns server. Process zombie when trying to kill it. All Grid services stuck. Could restart the dns and the situation seems to be stabilized.&lt;/li&gt;   &lt;li&gt;Maui conf: Reservation did not work if ops jobs where submitted on the long queue. This is because the reservation period was not set to infinity.&lt;br /&gt;&lt;/li&gt; &lt;/ul&gt; &lt;span style="font-weight: bold;"&gt;Dzero: &lt;/span&gt;&lt;ul&gt;   &lt;li&gt;dzero station sandbox full. All dzero sites affected. Frederic was waiting that QMUL is back to send jobs there for testing.&lt;/li&gt;    &lt;li&gt;I have investigated lesc to try to understand what causes such a low number of running jobs while there is a lot of jobs queued. see attached plots. On the left number of scheduled jobs and on the right running. &lt;span style="font-weight: bold;"&gt;    &lt;/span&gt;&lt;/li&gt;   &lt;/ul&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger2/4516/551509194226492/1600/lesc1.0.png"&gt;&lt;img style="cursor: pointer;" src="http://photos1.blogger.com/blogger2/4516/551509194226492/320/lesc1.0.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger2/4516/551509194226492/1600/lesc2.png"&gt;&lt;img style="cursor: pointer;" src="http://photos1.blogger.com/blogger2/4516/551509194226492/320/lesc2.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;Other Stuff:&lt;br /&gt;&lt;/span&gt; &lt;ul&gt;   &lt;li&gt;&lt;span style="font-weight: bold;"&gt;SAM Monitoring:&lt;/span&gt; Asked all sites to check their status in SAM&lt;/li&gt;   &lt;li&gt;Asked to be mapped to ops to test UCL srm. No answer yet to the ticket &lt;strong&gt;#12970&lt;/strong&gt;&lt;/li&gt;&lt;li&gt;Installed new apel accounting rpms at UCL-HEP. Will see tomorrow if it has solved the problem.&lt;br /&gt;  &lt;/li&gt;  &lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-8371336016382431964?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/8371336016382431964/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=8371336016382431964' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/8371336016382431964'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/8371336016382431964'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/woodcrest-asked-ict-to-reserve-set-of.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2373367993318780201</id><published>2006-09-20T09:38:00.000+01:00</published><updated>2006-09-20T09:39:17.782+01:00</updated><title type='text'></title><content type='html'>&lt;span style="font-weight: bold;"&gt;QMUL: &lt;/span&gt;&lt;br /&gt;&lt;ul&gt;   &lt;li&gt;Certificates have not been upgraded, the tests are now critical and Giuseppe is on Holiday up to the 27/09. I will upgrade them since Giuseppe is on holiday. Have made a new rpm version 8 for the certs. Have a look at our &lt;a href="http://www.gridpp.ac.uk/wiki/QMUL#CA_1.9"&gt;wiki&lt;/a&gt;&lt;/li&gt;   &lt;li&gt;lhcb yaim settings have been changed to match their use of groups rather than roles. It seems ok and I am populating the LT2 yaim file with the corrected entries&lt;/li&gt; &lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;Biomed challenge: &lt;/span&gt;&lt;span&gt;Have asked all London to update the biomed voms to be ready for the biomed challenge of 1/10. Will have to fill in the ressource pledge in the cic portal &lt;a href="https://cic.in2p3.fr/index.php?id=rc&amp;subid=rc_activity&amp;amp;dc=7#DC"&gt;here&lt;/a&gt;. Duncan has checked with Yannick that the &lt;a href="http://www.gridpp.ac.uk/wiki/Lt2-yaim-vo"&gt;voms settings&lt;/a&gt; is ok.&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;NetMon: &lt;/span&gt;Trying to find a solution for the net monitoring box proposed by Robin. Admins wants to rebuild the machine. I am afraid this will take time to be resolved.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Woodcrest purchase: &lt;/span&gt;Today we have received 8 new boxes. The disks one. We have also debugged a problem with the pxe boot. It seems that some of our switches are not behaving correctly since the dhcp requests do not come back. Kostas is investigating further. For the CE we will build an old machine. We are ordering additional 1Gb memory for the CE that will go to ICT and HEP.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Dzero:&lt;/span&gt; Frederic has tested dzero at QMUL, it seems that the setup we have made for the TMP directory is ok. He is having concerns with the number of slots he is having at LESC.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2373367993318780201?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2373367993318780201/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2373367993318780201' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2373367993318780201'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2373367993318780201'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/qmul-certificates-have-not-been.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-1139005257463264874</id><published>2006-09-19T09:09:00.000+01:00</published><updated>2006-09-19T09:10:33.987+01:00</updated><title type='text'></title><content type='html'>&lt;span style="font-weight: bold;"&gt;QMUL:&lt;/span&gt; poolfs is down again. It says no space left on device. Restarting poolfs does not help. I have decided to reboot the storage element. The reason there was a poolfs problem is that the / was full. I have setup the logrotate for the logs to be compressed which is not the case by default in the logrotate file created by yaim. The se01 is back online.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.allhands.org.uk/"&gt;All Hands 2006&lt;/a&gt;: Prepared an small talk "Towards Sustainability" to initiate discussions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;UCL-CENTRAL:&lt;/span&gt; still having problems with replication for the ops vo. Wiliam observed that if he runs the test using the &lt;a href="https://monitoring.egee.man.poznan.pl/admin2/"&gt;submission tool&lt;/a&gt; it does not give a replication error. Submitted a ticket (#12970) to understand the differences. &lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;IC-HEP: &lt;/span&gt;Viglen as delivered the racks and two machines. We are preparing to install them.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;RHUL:&lt;/span&gt; Going down for the week-end.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-1139005257463264874?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/1139005257463264874/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=1139005257463264874' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1139005257463264874'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/1139005257463264874'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/qmul-poolfs-is-down-again.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-2075912039250915649</id><published>2006-09-18T09:40:00.000+01:00</published><updated>2006-09-18T09:45:39.327+01:00</updated><title type='text'></title><content type='html'>&lt;span style="font-weight: bold;"&gt;SC4 CMS: &lt;/span&gt;Last friday Brunel was validated for CMS production. It is now running full with cms jobs. We had to play with the &lt;a href="http://www.gridpp.ac.uk/wiki/London_SC4_Activity#RHUL"&gt;dpm permissions&lt;/a&gt; to make this work and now doing the same at RHUL. I am trying to keep the &lt;a href="http://www.gridpp.ac.uk/wiki/London_SC4_Activity"&gt;London SC4&lt;/a&gt; activity page updated.  See below a plot taken on the 18/09/2006 showing the Brunel activity.&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger2/4516/551509194226492/1600/brunel.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://photos1.blogger.com/blogger2/4516/551509194226492/320/brunel.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;span style="font-weight: bold;"&gt;QMUL:&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;   &lt;li&gt;poolfs died, the index is not reachable. It seems that autofs is not behaving correctly. I did not dare to restart the machine remotely and I made a hard mount to the index under /tmnt/poolfs. dpm is back.&lt;/li&gt;   &lt;li&gt;Prepared qmul for cms sc4 . Waiting for the cmssw to be installed and will have to make the dpm hack.&lt;/li&gt; &lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;LeSC:&lt;/span&gt; &lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger2/4516/551509194226492/1600/dzero-running.png"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer;" src="http://photos1.blogger.com/blogger2/4516/551509194226492/320/dzero-running.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger2/4516/551509194226492/1600/dzero-sched.png"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer;" src="http://photos1.blogger.com/blogger2/4516/551509194226492/320/dzero-sched.png" alt="" border="0" /&gt;&lt;/a&gt;Frederic is having problems with dzero jobs. A lot of jobs are scheduled but they seem to dissapear. See the two plots below. I suspect that some of the jobs have a very short time because of a stager error and they are not seen as running since they last for less than five minutes. We will check in the root file to see what wall clock time distribution the job have.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;IC-HEP:&lt;/span&gt; Our rb is creating a lot of proxy renewal and filling up the disk., Mona is investigating. &lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;We have received two woodcrest from Viglen and I need to check with the order.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;UCL-CENTRAL:&lt;/span&gt; Alice left thanks for her work !&lt;br /&gt;We have problems with the ops vo, the replication does not work. William saw that it is due to the default se that is not set correctly. Probably a bug in yaim. It should be fixed now.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-2075912039250915649?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/2075912039250915649/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=2075912039250915649' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2075912039250915649'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/2075912039250915649'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/sc4-cms-last-friday-brunel-was.html' title=''/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4517963319103426980.post-5516562417158074214</id><published>2006-09-11T09:49:00.000+01:00</published><updated>2006-09-12T09:59:17.586+01:00</updated><title type='text'>Activity restarted after Holiday</title><content type='html'>I have tried to recover from my emails.&lt;br /&gt;&lt;ul&gt;   &lt;li&gt;I have started to gather the slides/data for the rtm talk of friday &lt;a href="http://indico.cern.ch/conferenceDisplay.py?confId=3849"&gt;Service Challenge Technical Meeting .&lt;/a&gt;&lt;/li&gt;   &lt;li&gt;dCache head node was down due to a full /var. Mona cleaned the directory and started to look at the nagios &lt;a href="http://www.ph.ed.ac.uk/%7Egcowan1/dcache-nagios.tar"&gt;dCache scripts&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;   &lt;li&gt;Duncan is off ill and I will have a look at the RHUL sft failure&lt;/li&gt;   &lt;li&gt;Started discussing with Matt Harvey (ICT) about the pbspro they are using. We will have to adapt the job manager to that version&lt;/li&gt;   &lt;li&gt;Gave contact details to Mark Leese for the Net box to have one at Imperial and asked William if he is happy with having such a box at UCL-CENTRAL&lt;/li&gt;   &lt;li&gt;Giuseppe wanted help to create 100*1Gb files in his dpm. suggested him to use rfcp from one of his cn nodes.&lt;br /&gt;&lt;/li&gt;   &lt;li&gt;Jamie told to hold since Alice was performing UCL-CENTRAL tests.&lt;br /&gt;&lt;/li&gt;   &lt;li&gt;Followed the castor/d-cache phone conf that was held at RAL.&lt;br /&gt;&lt;/li&gt; &lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4517963319103426980-5516562417158074214?l=londongrid.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://londongrid.blogspot.com/feeds/5516562417158074214/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4517963319103426980&amp;postID=5516562417158074214' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5516562417158074214'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4517963319103426980/posts/default/5516562417158074214'/><link rel='alternate' type='text/html' href='http://londongrid.blogspot.com/2006/09/activity-restarted-after-holiday.html' title='Activity restarted after Holiday'/><author><name>Olivier van der Aa</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
