Welcome to MilkyWay@home

Posts by Robert Meckley

1) Message boards : News : New Separation Modfit Version 1.36 (Message 62505)
Posted 7 Oct 2014 by Robert Meckley
Post:
Jake,

I have been running (modified-fit) off and on for the past 30 days or so stopping because of high error rates, and restarting each time new versions appeared. Unfortunately, v1.36 doesn't seem any better than the previous versions (v1.32 & v1.34). Currently I'm running ~6% VALIDATE ERROR rate over the past 20 hours. I checked the top 100 Hosts to see if anyone else was having a problem. Most of the top 100 hosts use the HD 7970 card with the driver identified by BOINC as 1.4.1848 as do I. I found that the only ones that were not realizing a significant VALIDATE ERROR % rate were those not running MOD-FIT. Without exception it seems that everyone is having a problem with VALIDATE ERRORs. I also made another observation that may or may not be related to another recently identified problem. The top Hosts running MOD-FIT also have a large number of COMPLETED, CAN'T VALIDATE tasks. A lot of these will ultimately validate, but some of these probably won't. Here we find that a number of these tasks have been downloaded to other Hosts only to be ABORTED BY USER. Further research of these hosts shows that in some cases, literally thousands of tasks have been manually aborted by the user. And its not just one or two Hosts that show such behavior - with very little effort I've identified at least a dozen. Since I can't for the life of me posit a sane motive for such behavior, I'm guessing it may be frustration over the errors volunteers are currently experiencing. (O.K., maybe not.) At any rate, could you please fix this. Last month at this time I had an RAC rating of ~325000. Now its barely 200,000 and falling. At 325000 I felt I was contributing to the effort. Now I'm starting to feel like an outsider.
2) Message boards : Number crunching : M@H v1.02(opencl_amd_ati) errors: "error while computing" (Message 62477)
Posted 5 Oct 2014 by Robert Meckley
Post:
Did you switch to a newer graphics driver with the new build?


Yes, of course. The Catalyst proprietary driver in the Ubuntu 14.04 repository was the same one that I had been using with Windows. The research I did after posting this problem suggested that the problem might actually be with Ubuntu 14.04. I found that two other users using the Linux kernel image of Ubuntu 14.04 and catalyst driver identified by BOINC as 1.4.1848 had the same problem. I will never be certain that there is some incompatibility between Ubuntu 14.04 and the M@H application, but in any event, I went back to running windows 7 and am now running the M@H application without any problem. I thank you very kindly for taking an interest in my post.
3) Message boards : Number crunching : M@H v1.02(opencl_amd_ati) errors: "error while computing" (Message 62403)
Posted 25 Sep 2014 by Robert Meckley
Post:
This past week I have noticed that many of my M@H v1.02(opencl_amd_ati) tasks have run and ended with the status "error while computing". Specifically the WUs are labeled "ps_84_DR8_rev_8_4_00001_141150441_ ". I suppose all of us have been getting a lot of errors lately if we have tried to run ModifiedFit v.1.32 due to the bugs in the recent revisions. But there have been no recent revisions to the M@H (opencl_amd_ati)application, at least none of which I'm aware. To clarify my concern, most of these WUs are completing without incident. Presently I am showing 2950 valid and 56 errors in recent completions. Perhaps this is normal, but I have never, in the past, noticed an error count of more than 1 or 2 in any application that I've run with M@H, that is until the recent revisions to ModifiedFit. (Currently I've opted out of running ModifiedFit.) Moreover, when I click on the details of these WUs, I see that other computers seem to run these WUs without a problem. I've searched this form for threads discussing M@H(opencl) errors, but these discussions tend to driver problems or legacy GPU problems. I'm using an HD7970 with driver 1.4.1848 OpenCL:1.2 as many others are, so I don't think I have a driver or hardware issue. I have recently changed from using an intel quad to an AMD quad CPU, and also changed OS from Windows7 to Ubuntu 14.04, but I've made such changes in the past and have not noticed an advanced error count. Anyway, I can not see how these changes could impact the functioning of my GPU. If anyone can make sense of this, I would be very grateful. In fact, If you can assure me that this is not a problem, that's O.K. too - I just want to understand what's going on.

(Do we ever really understand what's going on?!)
4) Message boards : News : New Version of Separation Modified Fit (1.32) (Message 62320)
Posted 12 Sep 2014 by Robert Meckley
Post:
For what its worth, I'm still experiencing errors on the WUs de_modfit_16TestStars_1s_132_etc. When I first reported this on Sept. 9th, I thought you were fixing this.
5) Message boards : News : New Version of Separation Modified Fit (1.32) (Message 62299)
Posted 9 Sep 2014 by Robert Meckley
Post:
114 errors so far and counting. All the errors I am seeing concern "ps_modfit_16TestStars_1s_132_wrap_1" WUs. On the other hand, "de_modfit_15_3s_132_wrap_1" WUs are executing without any problem. I don't suppose there's anything I could do at my end to make the scheduler assign only the "de_modfit" and not the "ps_modfit" WUs to my machine? This problem is not good for the RACs. OK, OK, just asking.




©2019 Astroinformatics Group