Message boards :
News :
testing work generation with 'ps_separation_14_2s_null_3'
Message board moderation
Author | Message |
---|---|
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I'm testing work generation right now, there should be workunits available to download now. Let me know how these workunits are crunching! --Travis |
Send message Joined: 29 Aug 10 Posts: 25 Credit: 2,172,252,217 RAC: 0 |
I am getting computation errors on all WUs |
Send message Joined: 8 Apr 09 Posts: 70 Credit: 11,027,167,827 RAC: 0 |
Same here. They run like normal, but when they reach 100% they end with a Computation Error. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I am getting computation errors on all WUs Think I fixed the problem, I generated a new batch of workunits, let me know how these are crunching. |
Send message Joined: 9 May 12 Posts: 12 Credit: 10,339,447 RAC: 0 |
Still getting computation errors (Not had any errors before today.) Incorrect function. (0x1) - exit code 1 (0x1) Failed to read number of star points from file (2): No such file or directory |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Still getting computation errors Looks like Matt N. gave me a bad star file. Started up 'ps_separation_14_2s_null_3_v2', hopefully that will fix it. |
Send message Joined: 28 Sep 11 Posts: 60 Credit: 22,764,173 RAC: 0 |
...nope, nothing comin' out of the hose...you sure the water's turned on?... |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
...nope, nothing comin' out of the hose...you sure the water's turned on?... Just made 500 more workunits from the new search. |
Send message Joined: 9 May 12 Posts: 12 Credit: 10,339,447 RAC: 0 |
_V2 still the same error right at the end of procesing. http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=225454907 |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
_V2 still the same error right at the end of procesing. I'm looking into this, seems like something weird is going on with the star files Matt N. gave me. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Looks like _v2 might have been using the old wrong star file. I'm hoping v3 fixes that. |
Send message Joined: 5 Nov 10 Posts: 69 Credit: 15,064,831 RAC: 0 |
Same for me, Computer 427419. Will credits be given for completed work that fail this way? Thanks. |
Send message Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
thank god others are having the same problem LOL. i've been pulling my hair out for the last hour trying to figure out why tasks are essentially running to completion and then erroring out at the last second...i feel much better now that i know its a server-side issue. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
thank god others are having the same problem LOL. i've been pulling my hair out for the last hour trying to figure out why tasks are essentially running to completion and then erroring out at the last second...i feel much better now that i know its a server-side issue. From what I can tell, it looks like the newly generating 'ps_separation_14_2s_null_3_v3' workunits are crunching and validating, so I think we're in the clear from here on out. |
Send message Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
From what I can tell, it looks like the newly generating 'ps_separation_14_2s_null_3_v3' workunits are crunching and validating, so I think we're in the clear from here on out. thanks for the update Travis. i wouldn't know yet, as i immediately suspended all MW@H work as soon as i saw WU's erroring out. now that i've just discovered the nature of the problem, i can resume crunching the remaining MW@H tasks in my queue (even though i know they'll error out). once those tasks have cleared my host, i can test the ps_separation_14_2s_null_3_v3 WU's and confirm whether or not the errors are gone...that is, if someone doesn't beat me to it. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
From what I can tell, it looks like the newly generating 'ps_separation_14_2s_null_3_v3' workunits are crunching and validating, so I think we're in the clear from here on out. We'll i've gotten back a bunch of successful ps_separation_14_2s_null_3_v3 results, so it's looking like here on out things will be good unless I screw something else up. I've actually been surprised at how smooth things have been going so far (considering it was a total reimplementation). I did a lot of offline testing but there's always kinks to work out when something like that goes live. Of course, I'm probably shooting myself in the foot by saying that, so expect incoming catastrophic errors. :P |
Send message Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
ok, the ps_separation_14_2s_null_3_v3 are crunching to completion without errors...so it seems all is well for the time being. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I've also started a DE search: 'de_separation_14_2s_05_3'. It's using a different star file (but correctly formatted as far as I can tell), so let me know if those are crunching correctly as well. |
Send message Joined: 9 May 12 Posts: 12 Credit: 10,339,447 RAC: 0 |
So far so good ;-) mostly... null_3_v4 --- Completed, validation inconclusive (all good so far) 05_3 --- Completed, validation inconclusive (all good so far) sample_1 --- Completed and validated (all good so far) null_3_v2 --- Computation error |
Send message Joined: 5 Nov 10 Posts: 69 Credit: 15,064,831 RAC: 0 |
GPU-only tasks tested ... PC #A Just completed a ps_separation_09 task, OK All else fails with computation eror at 100% completion:- ps_separation_14_2s_null_3_v2, v3, v4, ps_separation_14_2s_05_03 I have restarted, cold booted and detached/reattached. Same problem as above. Result:- Will suspend project and abort existing ps_separation_14 tasks until a fix is in place. PC #B (only now switched it on after a couple of days, so no recent tasks have been loaded yet) Completing ps_separation_09 tasks (7 of them so far), OK Result:- I've switched to "No new tasks" for now. For those with headless/unattended servers, you're going to either be busy for a while or else waste a lot of electricity doing nothing until a fix is found. |
©2024 Astroinformatics Group