ADCoS
  ADCoS Shift Logbook, Page 1 of 149  Not logged in ELOG logo
ID Date Author Category Status Affect Users Affect Cloud Action Subject Text Attachments
  2985   Mon Feb 16 08:04:08 2009 lanconProdsysOpenNoNoBug reportstdout file too bigHello,

We are seeing jobs on FR-cloud that are failing
  
  2984   Fri Feb 13 13:49:53 2009 Yuri SmirnovOtherOpen   switch over to CERN ElogDearr Shifters!

Please don't use this UTA ELOG interface
  
  2983   Wed Feb 11 17:41:37 2009 Fedor ProkoshinProdsysOpenNoNoBug reportFZK cloud: EXEPANDA_DQ2PUT_FILECOPYERRORLow efficiency on some sites

FZK-LCG2 67.9%
  
  2982   Wed Feb 11 14:16:29 2009 Barry SpurlockProdsysOpenNoYesInterventionMWT2 sites are about to go offlineThe sites MWT2_IU, MWT2_UC and IU_OSG will
be set offline in Panda so that they can
drain.  Tomorrow these sites will be completely
  
  2981   Wed Feb 11 11:52:34 2009 Barry SpurlockProdsysPending  InterventionSLACXRD and SWT2_CPB are now offlineSLACXRD and SWT2_CPB have both been set offline
until a problem with mismatched checksum
values is fixed.
  
  2980   Wed Feb 11 11:31:27 2009 Fedor ProkoshinProdsysOpenNoYesGGUSAll jobs failing on UNI-FREIBURG due to copy errorMultiple errors EXEPANDA_DQ2GET_INFILE reported
on UNI-FREIBURG-ce-atlas-pbs at least for
last 4 hours.
  
  2979   Mon Feb 9 16:51:40 2009 Fedor ProkoshinDDMOpenYesYesGGUSFTS erros for PIC_MCDISK ([FILE_EXISTS]~10000 errors reported for this site
GGUS ticket #[URL=https://gus.fzk.de/ws/ticket_info.php?ticket=46075]46075[/URL]
created.
  
  2978   Mon Feb 9 15:18:17 2009 Fedor ProkoshinCentral ServicesOpenNoNoBug reportNo DDM activity reported on CERN, TWDashboard report no activity of this clouds.
Maybe, the same as was reported earlier:
  
  2977   Mon Feb 9 13:56:21 2009 Fedor ProkoshinProdsysOpenNoNoBug reportTask 30244 failed on JINR-LCG2, RU-Protvino-IHEP, NIKHEF-ELPROD [quote="Fedor Prokoshin"]looks like task-related
problem
Savanna Bug report #[URL=https://savannah.cern.ch/bugs/index.php?46769]46769[/URL]
  
  2976   Mon Feb 9 13:38:16 2009 Fedor ProkoshinProdsysOpenNoYesGGUSAll jobs failing on UKI-NORTHGRID-LIV-HEP~900 failing jobs mostly with transfer errors.

[URL=http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?site=UKI-NORTHGRID-LIV-HEP&grouping=cluster&cloud=RAL&start-date=2009-02-09%2006:00:00&end-date=2009-02-09%2018:59:59&grouping=site]http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?site=UKI-NORTHGRID-LIV-HEP&grouping=cluster&cloud=RAL&start-date=2009-02-09%2006:00:00&end-date=2009-02-09%2018:59:59&grouping=site[/URL]
  
  2975   Mon Feb 9 11:57:36 2009 Fedor ProkoshinProdsysOpenNoNoBug reportTask 30244 failed on JINR-LCG2, RU-Protvino-IHEP, NIKHEF-ELPROD looks like task-related problem
Savanna Bug report #[URL=https://savannah.cern.ch/bugs/index.php?46769]46769[/URL]
submitted
  
  2974   Sat Feb 7 14:34:55 2009 Marco Aurelio DiazProdsys    New errors at SARA, NIKHEF-ELPRODThere are 167 recent errors associated to
task 30244 at NIKHEF-ELPROD, 105 of them
EXEPANDA_DQ2PUT_FILECOPYERROR.
  
  2973   Fri Feb 6 16:48:57 2009 Marco Aurelio DiazProdsys    small but growing number of errors at DESY-ZN42 errors:  EXEPANDA_GET_REPLICANOTFOUND at
DESY-ZN. 
taskfk: 39330
  
  2972   Fri Feb 6 01:49:28 2009 Boris Panes SaavedraProdsysOpenYesYesBug reportvalid task with 10% of failed jobs[quote="Boris Panes Saavedra"]savannah bug
#46667 open in ADCOS support[/quote]
  
  2971   Thu Feb 5 12:34:57 2009 Boris Panes SaavedraProdsysOpenYesYesBug reportvalid task with 10% of failed jobssavannah bug #46667 open in ADCOS support   
  2970   Wed Feb 4 17:32:32 2009 Fedor ProkoshinDDMOpenYesYesGGUSA lot of transfer errors for CERN cloud (~2600)A lot of transfer errors for CERN cloud (~2600)

http://dashb-atlas-data.cern.ch/dashboard/request.py/site
  
  2969   Wed Feb 4 12:12:18 2009 Fedor ProkoshinProdsysOpenYesNoGGUSProblem with task 39293 CE is down on UKI-SCOTGRID-ECDFGGUS ticket [URL=https://gus.fzk.de/ws/ticket_info.php?ticket=45848]#45848[/URL]
submitted.
    
  
  2968   Wed Feb 4 11:38:53 2009 Fedor ProkoshinProdsysOpenYesYesGGUSPSNC failing lcg-cp Opened GGUS ticket [URL=https://gus.fzk.de/ws/ticket_info.php?ticket=45833]#45833[/URL]   
  2967   Wed Feb 4 00:51:34 2009 Hiroshi SakamotoProdsysClosedYesYesGGUSFile transfer errors at FZK-LCG2, GGUS-Ticket 45748 openedMany jobs in the DE cloud started failing
due to
"taskBuffer: transfer timeout".
  
  2966   Wed Feb 4 00:03:11 2009 Hiroshi SakamotoProdsysClosedNoYesSCALEDlarge backlog (and not moving) of production file transfers in CA CloudMany jobs of Task ID=39672 in the CA cloud
(ALBERTA,TORONTO,SFU,VICTORIA) started failing
due to
  
ELOG V2.7.5-2130