New Modfit Runs
Author | Message |
---|---|
Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0 |
Hey Everyone, Just a heads up: over the next couple of days I will be starting new runs that use a different step size in a computationally intensive section of our code. Through some work we did, we found that this step size could be optimized and reduced to improve the speed at which our code runs without sacrificing accuracy. We expect these runs to complete in about 1/5th of the time, depending on the platform. Credits will hopefully still be calculated correctly, but if you think they are not for these runs, please post here. If these runs cause you any trouble, or you notice you stop receiving work units, please post that here too. These runs will all include the tag _fast_ in the work unit name, so they should be easy to track. For now, these optimizations will only be used on the Modfit code, but in the near future we plan on using them on the original separation code, too. If you have any questions about this optimization, or any problems, please let me know. Jake |
Joined: 8 Apr 13 Posts: 89 Credit: 517,085,245 RAC: 0 |
Any chance of a look at the high error rate of the current Modfit units? I have no issues with the default MW application, only a few errors over several days. Today I gave Modfit another chance, but after ~1 hour I disabled it again: 10 invalids in such a short period, and that's probably not the end. |
Joined: 30 Mar 09 Posts: 63 Credit: 621,582,726 RAC: 0 |
My first "fast" unit: http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=903753428 took 12 secs for 26.74 credits. A normal modified unit takes 52 secs for 213.76 credits. Sooooooooo.............. |
Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0 |
Looks like the credits aren't scaled exactly right. Pretty sure we neglect some constant overheads when we are scaling things so I will talk to Travis about fixing this. Thanks, Jake |
Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0 |
After some checking, I think the credits might actually be scaled properly based on the number of flops required to complete the work unit. The problem is that now the runs might not fully utilize your GPU. It might be a good idea to try configuring your GPU to run 2 of these at the same time and see if that makes a difference. Jake |
Joined: 30 Mar 09 Posts: 63 Credit: 621,582,726 RAC: 0 |
"It might be a good idea to try configuring your GPU to run 2 of these at the same time and see if that makes a difference." That might be a problem, because the normal modfit takes 98% of the GPU (with a CPU core to feed it). If I could choose to do only 'fast' modfits, it might work with 3 at the same time, because a fast one only takes about 33% of the GPU (sometimes less). |
Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0 |
Modfit will be switching over to using only fast runs as these slow ones finish up. You can expect them to be all the faster runs by the end of the week. Thank you for the feedback, Jake |
Joined: 28 Nov 14 Posts: 51 Credit: 86,696,721 RAC: 0 |
"We will be switching over to using only fast runs as these slow ones finish up. You can expect them to be all the faster runs by the end of the week." On a slightly different topic: since yesterday, nearly every single unmodded task I've processed has ended up as a computational error, some 30-odd before I realised what was going on. Nothing has changed on my rig... Has there been an alteration at the MW@H end? I've stopped processing the affected WUs and am now only doing modified, seemingly without problems other than delayed validation. Regards, Cliff |
Joined: 4 Oct 11 Posts: 38 Credit: 309,729,457 RAC: 0 |
Cliff, all your failed tasks also failed for the other two users who ran them. In other words: bad tasks. "May Have Bug" |
Joined: 28 Nov 14 Posts: 51 Credit: 86,696,721 RAC: 0 |
Hi Tom, Yup, trouble is, 'every' task of that type fails the same way. I cannot see any point in processing them if they are all going to fail. All they do is utilise my GPU and block other WUs that can be processed. I'll continue to process the modfits until the others are sorted out. Thanks for your assistance. Regards, Cliff |
Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0 |
Hey Guys, Jeff put up some new runs with a bad parameter file. He said they are fixed now, so it should be good to run them. There may still be a couple in the crunch queue, but that should clear itself soon. Sorry for the trouble. Jake [Edit] The "Slow" runs are still running because it seems I am still getting good results from them. I would hate to kill them before they were done. Sorry for the delay, but these runs are still useful and will be run a little while longer. Enjoy your holidays [/Edit] |
Joined: 6 Apr 13 Posts: 8 Credit: 215,367,305 RAC: 0 |
I frequently run out of work for short periods of time when there's a high percentage of "fast" WUs. My GPU goes through 2 of them in about 20 seconds. Would it be possible to increase the amount of work done in the fast WUs, so that the required amount of communication with the server drops? |
Joined: 31 Aug 11 Posts: 20 Credit: 529,335,116 RAC: 0 |
My 7970 graphics cards can each tear through a "fast" workunit in 8-12 seconds, but then there is a 2-6 second pause while BOINC does whatever it does between workunits before the next one starts. This is highly inefficient from a processing point of view. Even my poor 6 year old Radeon HD4650 can crunch these units quickly (70 seconds). WE NEED BIGGER WORKUNITS FOR GPUs. WE NEED BIGGER WORKUNITS FOR GPUs. WE NEED BIGGER WORKUNITS FOR GPUs. That being said, de_nbody_08_05 has tied up 7 CPU cores for 37 hours. 8( |
Joined: 31 Aug 11 Posts: 20 Credit: 529,335,116 RAC: 0 |
"I frequently run out of work for short periods of time when there's a high percentage of "fast" WUs." You may need to update/increase the "Minimum Work Buffer" under the Tools/Computing Preferences/Network settings tab. Add 0.5 to 1 day to whatever it is currently set at. |
Joined: 17 Apr 10 Posts: 14 Credit: 5,122,519 RAC: 0 |
The problem with adding buffer is that it adds buffer for both CPU and GPU, and you cannot set them separately. Since I don't want my CPU to buffer the work it is doing, the short units are causing a lot of non-work time because of the time it takes to finish one unit and download the next. |
Joined: 31 Aug 11 Posts: 20 Credit: 529,335,116 RAC: 0 |
"....the runs might not fully utilize your GPU. ...try configuring your GPU to run 2 of these at the same time..." Uhm, is there a tutorial for that? |
Joined: 18 Jul 09 Posts: 300 Credit: 303,693,713 RAC: 947 |
"....the runs might not fully utilize your GPU. ...try configuring your GPU to run 2 of these at the same time..." Copy this into Notepad (not Word): `<app_config> <app> <name>milkyway</name> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.05</cpu_usage> </gpu_versions> </app> <app> <name>milkyway_separation__modified_fit</name> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.05</cpu_usage> </gpu_versions> </app> </app_config>` Save the file as "app_config.xml" (with an underscore, which is the name BOINC looks for) and place it here (unhide your folders if you haven't already): Program data\Boinc\Projects\MilkyWay. Restart BOINC and you'll be running two at a time. |
Joined: 23 Nov 09 Posts: 29 Credit: 17,119,258 RAC: 0 |
"....the runs might not fully utilize your GPU. ...try configuring your GPU to run 2 of these at the same time..." Is this app not available on Linux? I simply get an error that it cannot find an app by that name, so it only discovers the normal milkyway app. |
Joined: 14 Feb 09 Posts: 999 Credit: 74,932,619 RAC: 0 |
"....the runs might not fully utilize your GPU. ...try configuring your GPU to run 2 of these at the same time..." Make certain that you have mod fit checked in your preferences. CPU: http://milkyway.cs.rpi.edu/milkyway/download/milkyway_separation__modified_fit_1.36_x86_64-pc-linux-gnu AMD: http://milkyway.cs.rpi.edu/milkyway/download/milkyway_separation__modified_fit_1.36_x86_64-pc-linux-gnu__opencl_ati_101 |
Joined: 31 Aug 11 Posts: 20 Credit: 529,335,116 RAC: 0 |
"....the runs might not fully utilize your GPU. ...try configuring your GPU to run 2 of these at the same time..." I'm afraid that did nothing on either my dual GPU machine or my single GPU box. :( Not even after triple checking that I'd followed the instructions. So, having successfully (but unintentionally) 'hijacked' a thread, I'd like to try and return the focus to these fast work units. These fast units, due to the standard GUI issues of the BOINC client, don't keep the GPUs busy enough. I've dropped from almost a million credits per 24 hours to 650k a day. I don't give a crap about the credits; the point is that these efficient fast WUs have apparently resulted in almost a third less processing power. Again, even my lowly 6 year old ATI HD4850 can crunch these WUs very quickly. And if it's possible from a computer science perspective, I think the project would benefit more from a denser, more prolonged set of WUs that helps to minimize/offset the 2-6 seconds of 'downtime' that occurs between WUs. |
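
The cost of the pause between work units can be estimated with some rough arithmetic. The sketch below is only a back-of-the-envelope model, not anything taken from the project or from BOINC: it assumes the GPU is fully busy while a unit runs and completely idle during the pause, and the function name `gpu_utilization` and the 5x scaling factor are made up for illustration. The 8-12 second run times and 2-6 second pauses are the figures reported in this thread for a 7970.

```python
def gpu_utilization(compute_s: float, gap_s: float) -> float:
    """Fraction of wall-clock time spent computing when each work unit
    runs for compute_s seconds and is followed by gap_s idle seconds."""
    return compute_s / (compute_s + gap_s)


# Figures reported in the thread: 8-12 s per fast unit, 2-6 s pause.
best = gpu_utilization(12, 2)   # longest unit, shortest pause
worst = gpu_utilization(8, 6)   # shortest unit, longest pause

# Hypothetical: the same worst case if each unit did 5x as much work
# and the pause between units stayed the same.
scaled_worst = gpu_utilization(8 * 5, 6)

print(f"best case:  {best:.0%}")                              # 86%
print(f"worst case: {worst:.0%}")                             # 57%
print(f"worst case with 5x longer units: {scaled_worst:.0%}") # 87%
```

Under these assumptions the GPU would be busy somewhere between roughly 57% and 86% of the time, which is at least in the same range as the reported drop from almost a million credits a day to 650k. It also suggests why longer work units would help: the pause is a fixed cost per unit, so it shrinks as a share of the total when each unit does more work.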