Apology for recent bad batches of workunits

Author	Message
GaryG Send message Joined: 29 Aug 12 Posts: 31 Credit: 40,781,945 RAC: 0	Message 55981 - Posted: 27 Oct 2012, 17:53:44 UTC Travis, I believe something needs to be done soon about the current number of errored tasks. I am pretty sure my machine has crunched several hundreds of these by now as the number in my task list has gone from as low as 37 to over 100 several times over the last week or so. I now have over 120 errored tasks showing and have gone from being allowed 75 tasks, to 58 and now 32 at a time. At this rate in a couple of days I will not be able to get any tasks to run. Can the the tasks that have failed on 4 systems be removed quicker to allow us to receive more tasks to run? Is there a plan to deal with this or these jobs going to continue as is until they are completed? Thanks, Gary ID: 55981 · Rating: 0 · rate: / Reply Quote

Adrian Taylor Send message Joined: 6 Apr 08 Posts: 13 Credit: 139,088,163 RAC: 0	Message 56001 - Posted: 29 Oct 2012, 10:35:06 UTC still running at over 5% errors my machine stats show 400 at any one time now we are in week three of this situation perhaps its time to pull all the rogue WU and start again ? http://teamocuk.co.uk/cprojectcred.php?p=MWAH the graph of output on the link above shows what an effect is being caused is there anything any of us can do to help solve the situation ? ID: 56001 · Rating: 0 · rate: / Reply Quote

Adrian Taylor Send message Joined: 6 Apr 08 Posts: 13 Credit: 139,088,163 RAC: 0	Message 56002 - Posted: 29 Oct 2012, 10:42:36 UTC Last modified: 29 Oct 2012, 10:55:49 UTC i suggest at the least reduce the quorum required before declaring a bad WU even if its a temporary measure ID: 56002 · Rating: 0 · rate: / Reply Quote