Validate errors
Author | Message |
---|---|
Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
Yep, all free_ WUs have a validate error. Off to DNETC until this gets resolved... The only ones I saw were the 10_3s_free WUs; that's not to say there were no other _free variants. However, those alone were bad enough: 60-80% errored out on initial validation. I spent 14 hours babysitting them, zapping the errant ones and hoping someone would at least stop them at the server end pending review. Gave in last night and switched projects until it's resolved. Babysitting at weekends is practical enough for me, as I'm always sitting at a PC, but come weekdays there are other things to do. Yesterday was also my first time back on this project after being away for a while; looks like my personal demon struck again, rofl. If they errored out after a few seconds I wouldn't fuss about it, but they go through crunching and then fail initial validation, even before comparison with another cruncher's efforts. That's just a total waste of time and effort. It's life, these things happen, but there's only so many crosses I'll burn myself on before reaching for the fire hose :) Regards Zy |
Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
"If they error out after a few seconds I wouldn't fuss about it, but they go through crunching and fail initial validation even before comparison with another cruncher's efforts. That's just a total waste of time and effort. It's life, these things happen, but there's only so many crosses I'll burn myself on before reaching for the fire hose :)" i hear ya...i just want to be sure which of my WUs are going to end up with validate errors before i start aborting them. while my error rate is far lower than yours (mine is more like 20-25%), that's still 20-25% of my GPU cycles completely wasted. so i may switch back to SETI@Home MB and AP tasks for my HD 5870 GPU until MW@H tasks are back to normal... *EDIT* - i'm also relieved to know that so many users are getting these validate errors. you see, we had one heck of a rainstorm last night here in Sarasota, FL, with tons of lightning. i would have shut down my rigs had i known the storm was coming, but it hit in the middle of the night. when i awoke this morning, my home office rig was frozen. when i restarted the machine and noticed all the MW@H validate errors, i thought for sure that the storm had fried parts of my machine. fortunately i had enough common sense to research validate errors on the message boards this morning, and stumbled across this thread. i think i can rest assured that nothing is wrong with my GPU or other components, and that these validate errors are simply due to a server-side issue that hopefully gets fixed real soon. |
Joined: 6 May 09 Posts: 217 Credit: 6,856,375 RAC: 0 |
I've stopped the "de_separation_10_3s_free_1" until we can figure out what's wrong with them. Let me know if the '13_3s_free' WUs are also causing problems. (It may take a bit for the existing 10_3s_free WUs to filter out of the system) -Matthew |
Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
"I've stopped the "de_separation_10_3s_free_1" until we can figure out what's wrong with them. Let me know if the '13_3s_free' WUs are also causing problems." will do. of the 31 remaining MW@H tasks in my queue, 23 are "de_separation_10_3s_free_1" tasks, while the other 8 are "de_separation_13_3s_free_1" tasks. again, MW@H is currently suspended on my host, so i won't be able to confirm/verify anything until this evening. i'll post up as soon as i know more... |
Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
You're a star. Thanks Matthew. I'll run out the WUs on the other project and flip back tonight and yell if any pesky non-10-3s _free validate out. Regards Zy |
Joined: 8 Feb 08 Posts: 261 Credit: 104,050,322 RAC: 0 |
Just ran a couple of 13_3s_free_1 WUs to confirm what I remembered from yesterday: all of them validated. |
Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
Same here, just restarted and the 13_3s_free's appear fine. 10_3s_free's now very rare, so they appear to have all but worked their way through the system now. Regards Zy |
Joined: 1 Feb 11 Posts: 17 Credit: 16,245,184 RAC: 0 |
So, I had a few 10_3 errors, and now I've a quadrillion "Completed, validation inconclusive" as punishment. So the system, it seems, sends these flagged WUs to other folks, scores their results as "Completed, validation inconclusive" too, and then sends the WU off to another and so on and so on ... this one is on its 5th machine - http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=260891871 ... will it ever end? - Ed.T Please: WCG - Help Cure Muscular Dystrophy |
Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
ok, so of the 5 "de_separation_13_3s_free_1" tasks i had left, 3 were valid and 2 are still pending. of the 6 "de_separation_10_3s_free_1" tasks i had left, 5 were valid and 1 is still pending. |
Joined: 4 Oct 08 Posts: 1734 Credit: 64,228,409 RAC: 0 |
Just swapped here with my ATI GPUs, as DNETC is having WU validation issues on all crunching. I've lost over 500K in credit because of it. My Milkyway WUs are reporting and waiting in pending. But, one WU on each of my quads has validated, so I hope the rest follow suit. Go away, I was asleep |
Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
The suspect WUs were only one type, "_10_3s_free_1", and they have been taken out of the system now. All other WUs were validating as normal prior to the saga, and all appear ok now. Should be fine, so you haven't jumped out of the frying pan into the fire :) Regards Zy |
Joined: 13 Mar 08 Posts: 804 Credit: 26,380,161 RAC: 0 |
Sunny - the de_separation_10_3s_free_1 WUs are failing to validate because the run is over. As Matt stated, it'll take a little bit of time for the whole run to work its way through. |
Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
yes i know. that's why it is of interest that 4 of my 5 remaining de_separation_10_3s_free_1's still validated and gave me credit for the work after completing, uploading, and reporting to the server. and this is after the de_separation_10_3s_free_1 run was disabled. nevertheless, it's of little consequence. everything else i've crunched since Matthew ended the run of bad tasks has validated and earned credit. |
Joined: 12 Sep 07 Posts: 17 Credit: 6,578,049 RAC: 4,231 |
>>>>>>Same here, just restarted and the 13_3s_free's appear fine. 10_3s_free's now very rare, so they appear to have all but worked their way through the system now. Regards Zy<<<<<<< I disagree...
03/27/2011 10:53:26 PM|Milkyway@home|Started download of de_separation_13_3s_free_1_1278565_1301291624_search_parameters
03/27/2011 10:53:27 PM|Milkyway@home|Finished download of de_separation_13_3s_free_1_1278565_1301291624_search_parameters
03/27/2011 10:53:28 PM|Milkyway@home|Starting de_separation_13_3s_free_1_1278565_1301291624_0
03/27/2011 10:53:28 PM|Milkyway@home|[error] Process creation failed:
03/27/2011 10:53:29 PM|Milkyway@home|[error] Process creation failed:
03/27/2011 10:53:30 PM|Milkyway@home|[error] Process creation failed:
03/27/2011 10:53:30 PM|Milkyway@home|[error] Process creation failed:
03/27/2011 10:53:30 PM|Milkyway@home|[error] Process creation failed:
03/27/2011 10:53:30 PM|Milkyway@home|Computation for task de_separation_13_3s_free_1_1278565_1301291624_0 finished
03/27/2011 10:54:31 PM|Milkyway@home|Sending scheduler request: To fetch work. Requesting 21601 seconds of work, reporting 1 completed tasks
03/27/2011 10:54:36 PM|Milkyway@home|Scheduler request succeeded: got 0 new tasks
03/27/2011 10:54:36 PM|Milkyway@home|Message from server: No work sent
03/27/2011 10:54:36 PM|Milkyway@home|Message from server: (reached daily quota of 97 tasks)
on my host http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=38865 Hey, I'm trying to do work here ! |
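For anyone wanting to check their own client messages the same way, here is a rough sketch that counts `[error]` lines per project. It assumes every message has the `timestamp|project|message` layout seen in the log above; the function names `parse_line` and `count_errors` are made up for this example and are not part of BOINC.

```python
from collections import Counter
from datetime import datetime


def parse_line(line):
    """Split one 'timestamp|project|message' line into its three fields.

    The layout is assumed from the log pasted in this thread.
    """
    stamp, project, message = line.split("|", 2)
    when = datetime.strptime(stamp.strip(), "%m/%d/%Y %I:%M:%S %p")
    return when, project.strip(), message.strip()


def count_errors(lines):
    """Count messages that start with '[error]', grouped by project."""
    counts = Counter()
    for line in lines:
        if line.count("|") < 2:
            continue  # skip anything that does not match the assumed layout
        _, project, message = parse_line(line)
        if message.startswith("[error]"):
            counts[project] += 1
    return counts


sample = [
    "03/27/2011 10:53:28 PM|Milkyway@home|Starting de_separation_13_3s_free_1_1278565_1301291624_0",
    "03/27/2011 10:53:28 PM|Milkyway@home|[error] Process creation failed:",
    "03/27/2011 10:53:29 PM|Milkyway@home|[error] Process creation failed:",
    "03/27/2011 10:54:36 PM|Milkyway@home|Message from server: No work sent",
]
print(count_errors(sample))
```

Note that "Process creation failed" is raised by the client before any crunching starts, so a count like this tells you about local compute errors, not server-side validate errors.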
Joined: 4 Feb 11 Posts: 86 Credit: 60,913,150 RAC: 0 |
Here is one that had a validate error, and it was not a member of the de_separation_10_3s_free_1 run: de_separation_13_3s_free_1_65998_1301362073 also known as work unit 261694197. |
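To sort a list of work unit names by run without eyeballing each one, something like the sketch below might help. It assumes a work unit name is the run name followed by an underscore and further fields, as in the names quoted in this thread; `KNOWN_RUNS`, `run_of`, and `tally_by_run` are hypothetical names for this example only.

```python
from collections import Counter

# The two run names mentioned in this thread; extend as needed.
KNOWN_RUNS = (
    "de_separation_10_3s_free_1",
    "de_separation_13_3s_free_1",
)


def run_of(name):
    """Return the known run a work unit or task name belongs to, else None."""
    for run in KNOWN_RUNS:
        if name.startswith(run + "_"):
            return run
    return None


def tally_by_run(names):
    """Count how many names fall into each known run (None = unrecognised)."""
    return Counter(run_of(name) for name in names)


names = [
    "de_separation_13_3s_free_1_65998_1301362073",
    "de_separation_13_3s_free_1_1278565_1301291624_0",
]
print(tally_by_run(names))
```

Matching on the full run name plus the trailing underscore avoids confusing runs whose names share a prefix.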
Joined: 12 Aug 09 Posts: 262 Credit: 92,631,041 RAC: 0 |
Well, I got new 10_3s units to crunch, and when they finished some were already validated. Greetings from, TJ |
Joined: 4 Feb 11 Posts: 86 Credit: 60,913,150 RAC: 0 |
Seeing that you have plenty of compute errors, I think that it might be your computer at fault. I think you are looking in the wrong area. |
©2024 Astroinformatics Group