| ID | Date | Author | Category | Status | Affect Users | Affect Cloud | Action | Subject | Text |
| 2985 | Mon Feb 16 08:04:08 2009 | lancon | Prodsys | Open | No | No | Bug report | stdout file too big | Hello, We are seeing jobs on FR-cloud that are failing |
| 2984 | Fri Feb 13 13:49:53 2009 | Yuri Smirnov | Other | Open | | | | switch over to CERN Elog | Dear Shifters! Please don't use this UTA ELOG interface |
| 2983 | Wed Feb 11 17:41:37 2009 | Fedor Prokoshin | Prodsys | Open | No | No | Bug report | FZK cloud: EXEPANDA_DQ2PUT_FILECOPYERROR | Low efficiency on some sites: FZK-LCG2 67.9% |
| 2982 | Wed Feb 11 14:16:29 2009 | Barry Spurlock | Prodsys | Open | No | Yes | Intervention | MWT2 sites are about to go offline | The sites MWT2_IU, MWT2_UC and IU_OSG will be set offline in Panda so that they can drain. Tomorrow these sites will be completely |
| 2981 | Wed Feb 11 11:52:34 2009 | Barry Spurlock | Prodsys | Pending | | | Intervention | SLACXRD and SWT2_CPB are now offline | SLACXRD and SWT2_CPB have both been set offline until a problem with mismatched checksum values is fixed. |
| 2980 | Wed Feb 11 11:31:27 2009 | Fedor Prokoshin | Prodsys | Open | No | Yes | GGUS | All jobs failing on UNI-FREIBURG due to copy error | Multiple EXEPANDA_DQ2GET_INFILE errors reported on UNI-FREIBURG-ce-atlas-pbs for at least the last 4 hours. |
| 2979 | Mon Feb 9 16:51:40 2009 | Fedor Prokoshin | DDM | Open | Yes | Yes | GGUS | FTS errors for PIC_MCDISK ([FILE_EXISTS] | ~10000 errors reported for this site. GGUS ticket #[URL=https://gus.fzk.de/ws/ticket_info.php?ticket=46075]46075[/URL] created. |
| 2978 | Mon Feb 9 15:18:17 2009 | Fedor Prokoshin | Central Services | Open | No | No | Bug report | No DDM activity reported on CERN, TW | Dashboard reports no activity for these clouds. Maybe the same as was reported earlier: |
| 2977 | Mon Feb 9 13:56:21 2009 | Fedor Prokoshin | Prodsys | Open | No | No | Bug report | Task 30244 failed on JINR-LCG2, RU-Protvino-IHEP, NIKHEF-ELPROD | [quote="Fedor Prokoshin"]looks like task-related problem Savannah bug report #[URL=https://savannah.cern.ch/bugs/index.php?46769]46769[/URL] |
| 2976 | Mon Feb 9 13:38:16 2009 | Fedor Prokoshin | Prodsys | Open | No | Yes | GGUS | All jobs failing on UKI-NORTHGRID-LIV-HEP | ~900 failing jobs mostly with transfer errors. [URL=http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?site=UKI-NORTHGRID-LIV-HEP&grouping=cluster&cloud=RAL&start-date=2009-02-09%2006:00:00&end-date=2009-02-09%2018:59:59&grouping=site]http://dashb-atlas-prodsys.cern.ch/dashboard/request.py/overview?site=UKI-NORTHGRID-LIV-HEP&grouping=cluster&cloud=RAL&start-date=2009-02-09%2006:00:00&end-date=2009-02-09%2018:59:59&grouping=site[/URL] |
| 2975 | Mon Feb 9 11:57:36 2009 | Fedor Prokoshin | Prodsys | Open | No | No | Bug report | Task 30244 failed on JINR-LCG2, RU-Protvino-IHEP, NIKHEF-ELPROD | looks like task-related problem Savannah bug report #[URL=https://savannah.cern.ch/bugs/index.php?46769]46769[/URL] submitted |
| 2974 | Sat Feb 7 14:34:55 2009 | Marco Aurelio Diaz | Prodsys | | | | | New errors at SARA, NIKHEF-ELPROD | There are 167 recent errors associated with task 30244 at NIKHEF-ELPROD, 105 of them EXEPANDA_DQ2PUT_FILECOPYERROR. |
| 2973 | Fri Feb 6 16:48:57 2009 | Marco Aurelio Diaz | Prodsys | | | | | small but growing number of errors at DESY-ZN | 42 errors: EXEPANDA_GET_REPLICANOTFOUND at DESY-ZN. taskfk: 39330 |
| 2972 | Fri Feb 6 01:49:28 2009 | Boris Panes Saavedra | Prodsys | Open | Yes | Yes | Bug report | valid task with 10% of failed jobs | [quote="Boris Panes Saavedra"]savannah bug #46667 open in ADCOS support[/quote] |
| 2971 | Thu Feb 5 12:34:57 2009 | Boris Panes Saavedra | Prodsys | Open | Yes | Yes | Bug report | valid task with 10% of failed jobs | savannah bug #46667 open in ADCOS support |
| 2970 | Wed Feb 4 17:32:32 2009 | Fedor Prokoshin | DDM | Open | Yes | Yes | GGUS | A lot of transfer errors for CERN cloud (~2600) | A lot of transfer errors for CERN cloud (~2600) http://dashb-atlas-data.cern.ch/dashboard/request.py/site |
| 2969 | Wed Feb 4 12:12:18 2009 | Fedor Prokoshin | Prodsys | Open | Yes | No | GGUS | Problem with task 39293 CE is down on UKI-SCOTGRID-ECDF | GGUS ticket [URL=https://gus.fzk.de/ws/ticket_info.php?ticket=45848]#45848[/URL] submitted. |
| 2968 | Wed Feb 4 11:38:53 2009 | Fedor Prokoshin | Prodsys | Open | Yes | Yes | GGUS | PSNC failing lcg-cp | Opened GGUS ticket [URL=https://gus.fzk.de/ws/ticket_info.php?ticket=45833]#45833[/URL] |
| 2967 | Wed Feb 4 00:51:34 2009 | Hiroshi Sakamoto | Prodsys | Closed | Yes | Yes | GGUS | File transfer errors at FZK-LCG2, GGUS-Ticket 45748 opened | Many jobs in the DE cloud started failing due to "taskBuffer: transfer timeout". |
| 2966 | Wed Feb 4 00:03:11 2009 | Hiroshi Sakamoto | Prodsys | Closed | No | Yes | SCALED | large backlog (and not moving) of production file transfers in CA Cloud | Many jobs of Task ID=39672 in the CA cloud (ALBERTA, TORONTO, SFU, VICTORIA) started failing due to |