Message boards :
News :
ATI application updated to 0.60
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next
Author | Message |
---|---|
Send message Joined: 1 Sep 08 Posts: 204 Credit: 219,354,537 RAC: 0 |
Update to my strange ~30s screen update lag: it disappeared after a reboot.. *doh* MrS Scanning for our furry friends since Jan 2002 |
Send message Joined: 22 Jan 11 Posts: 375 Credit: 64,681,937 RAC: 2,011 |
So how do I set the polling mode &/or target frequency then?? (couldn't find an FAQ about it). My rig specs:- Q6600 @3.25 GHz (running F@H SMP mode) HD4830 (GPU @600MHz) Win XP SP3 32bit Was Cat 10.12 now on 11.3 Since recent MW updates I'm getting severe GUI lag making the machine un-useable :( (whereas before it was moderately laggy but useable if I didn't have to scroll much). Driver change made no difference. Hopefully the options mentioned above can sort this out :). |
Send message Joined: 8 Feb 08 Posts: 261 Credit: 104,050,322 RAC: 0 |
target frequency - into how many packets per second the data is split polling mode - how many milliseconds the cpu 'sleeps' before it is checking and waiting for data back from the gpu defaults are target frequency 30 and polling mode 1 There are still some problems with the settings and Matt said he will look into it coming week. If you want to reduce the gui lag you should increase the frequency, to reduce a lag of your system you need to increase the polling. In theory you can calculate 1000 / frequency = polling Take it as a rough calculation and add a little to frequency or polling, the higher you go in frequency the more you have to add. |
Send message Joined: 30 Dec 07 Posts: 311 Credit: 149,490,184 RAC: 0 |
Still getting "Completed, can't validate" invalids. This is due to wingmen exceeding the maximum number of errors of 3. Some of these errors are due to ATI GPUs using older Catalyst drivers that are not compatible with the new application. Others are from CPUs which either do not work with the new application or are using old optimised applications that will no longer work without a parameter file. These computers are trashing a lot of tasks and some of the owners are just letting them run. Here are 4 such computers from the last "Completed, can't validate" I had before it was rapidly cleared from the database: hostid=264221 hostid=200293 hostid=102667 hostid=211500 Hostid 102667 in particular currently has over 7,000 tasks listed. It is still using v0.18 speedimic_sse3_64 As I requested a week ago, could the maximum number of errors be increased from 3 please until the number of these computers producing errors decreases or they are blocked from receiving work if they return many invalid results. |
Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
The problem can be vertually killed off at source by implementing the Server-Choke to stop runaways Regards Zy |
Send message Joined: 1 Sep 08 Posts: 204 Credit: 219,354,537 RAC: 0 |
@Assimilator1: it should get better if you: - report all finished MW tasks (assuming you're running 0.62 already) - stop your BOINC - place a file called "app_info.xml" in your MW folder with the following content: <app_info> <app> <name>milkyway</name> </app> <file_info> <name>milkyway_0.62_windows_intelx86__ati14.exe</name> <executable/> </file_info> <app_version> <app_name>milkyway</app_name> <version_num>62</version_num> <plan_class>ati14</plan_class> <flops>1.0e11</flops> <avg_ncpus>0.05</avg_ncpus> <max_ncpus>1</max_ncpus> <coproc> <type>ATI</type> <count>0.5</count> </coproc> <cmdline>--gpu-target-frequency 60</cmdline> <file_ref> <file_name>milkyway_0.62_windows_intelx86__ati14.exe</file_name> <main_program/> </file_ref> </app_version> </app_info> This will increase the target screen refresh time from 30 Hz to 60 Hz, which in my case is enough to make it almost smooth again. You can also adjust "<count>0.5</count>" to "<count>1</count>" if you want to run 1 WU at a time rather than 2 WUs at a time (as I'm currently doing to avoid the idle GPU time between WUs). I don't expect you'd need to adjust polling with this. MrS Scanning for our furry friends since Jan 2002 |
Send message Joined: 15 Jul 08 Posts: 383 Credit: 729,293,740 RAC: 0 |
Still getting "Completed, can't validate" invalids. This is due to wingmen exceeding the maximum number of errors of 3. I started a thread a couple days ago about this problem: http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=2361 People have posted that their RAC has gone down about 20-25%. Mines down about the same. The client speed loss accounts for only about 5% of this. The rest I believe is (as you point out) due to good results being marked invalid because of the multitude of clients spewing out bad results at about 1 per second, causing valid results to be discarded as the WUs are marked as bad due to too many invalids. Sure would be nice (and should be easy) to resolve this. |
Send message Joined: 19 Feb 09 Posts: 32 Credit: 32,843,308 RAC: 0 |
The latest version (0.62) is running very smooth for me (Win7_64 Catalyst 11.3) - thanks for getting it to work this fast :) Just noticed another one: 7320292 <core_client_version>6.12.22</core_client_version> <![CDATA[ <stderr_txt> </stderr_txt> ]]>[/url] Starfire |
Send message Joined: 1 Sep 08 Posts: 204 Credit: 219,354,537 RAC: 0 |
Update to my strange ~30s screen update lag: it disappeared after a reboot.. *doh* And re-appeared after approximately 1 day of run time. Switching to Win 7 classic lets me move the mouse during the break but everything else is still on hold. Switching to 1 WU at a time solves the problem. Waiting for that CPU time optimization ;) MrS Scanning for our furry friends since Jan 2002 |
Send message Joined: 10 Aug 10 Posts: 4 Credit: 57,345,700 RAC: 0 |
I just recieved a bunch of 0.62 WU's for Linux, graphics are very smooth now and 1WU finishes in 2 minutes flat. Thanks! OpenSUSE 11.4 AMD64 with 5850 |
Send message Joined: 8 Feb 08 Posts: 261 Credit: 104,050,322 RAC: 0 |
It looks like I might need to do some work on the time estimates for preventing lag on different GPUs Tried myself on an analysis, based on different WUs and settings. How it should work: frequency 30 -> 1000ms / 30 blocks => max 33.3333ms per block. gpu flops per second / 30 blocks => max gpu flops per block. Round those max values down a little and you can calculate the size of the parts to send to the gpu and how long the cpu should sleep (a positive polling value can be used to add to this sleep time). So what is not working here? After testing with different params on different WUs several times ... There seem to be 2 pattern: 1) Based on estimated iteration time the number of chunks is calculated 1 too low. Think I found it in the code of deviceChunkEstimate. Looks like a simple mistake, type conversion to integer cutting the fragment part, adding 1.0 should fix this (and saves the test for zero) since it is very unlikely that the division (estimated time / time allowed per chunk) will have no fragment. 2) The estimated iteration time is always too low. The difference to the average iteration time is increasing with higher number of chunks. Expecting that high numbers of small chunks will kill the average by many data transfers. WUs with 152.839606 ms estimated iteration time 1 chunk(s): 6.0 - 6.4 ms too low 2 chunk(s): 7.3 - 7.5 ms too low 4 chunk(s): 8.6 - 9.1 ms too low 8 chunk(s): 11.9 - 12.8 ms too low This looks like 1 fix time offset + 1 per chunk (along the lines of 5.5ms + 0.9ms * chunks, might be more complicated for an exact calculation for different gpus and clocks). The shorter ones with 109.923835 ms estimated iteration time are off even more: 1 chunk: 9.62 - 10.1 ms (got too few WUs for enough tests on higher number of chunks) (The difference to the average iteration time seems to increase with lower gpu clock too but haven't tested this one enough to proof it.) Polling mode The way the polling mode works right now makes it impossible to find a good value. Setting it to work best for WUs with high estimated time leads to low gpu utilization for WUs with low estimated time. Setting it for the faster WUs will lead to high polling and more overhead for the slower WUs. Keep polling mode < 0 like it is. Polling mode >= 0 should add it's value to a calculated sleep time slightly below the time per chunk. This way the fine tuning for a machine is far easier and independent of the WU size. |
Send message Joined: 19 Sep 09 Posts: 16 Credit: 218,390,676 RAC: 0 |
@Assimilator1: Cheers, ETA. I tried this app_info, but in my case it did not really help. I found some odd values for tolerable lag: 4770 2 WUs at a time 5970 3-4 WUs (does not make a huge difference) 5870 4 WUs (2 and 3 WUs can show massive lag). |
Send message Joined: 22 Jan 11 Posts: 375 Credit: 64,681,937 RAC: 2,011 |
target frequency - into how many packets per second the data is split Err thanks Len but I still don't know where to change those options as I said earlier ;). ExtraTerrestrial Apes Thanks, that should sort things out :) |
Send message Joined: 22 Jan 11 Posts: 375 Credit: 64,681,937 RAC: 2,011 |
Wot no editing?? ETA >>>You can also adjust "<count>0.5</count>" to "<count>1</count>" if you want to run 1 WU at a time rather than 2 WUs at a time (as I'm currently doing to avoid the idle GPU time between WUs). I don't expect you'd need to adjust polling with this.<<< My count setting was already on 1, why would you want to run 2 at a time? It switches from 1 WU to the next nearly instantly anytime I've watched it ...... |
Send message Joined: 25 Jan 11 Posts: 271 Credit: 346,072,284 RAC: 0 |
actually, for many of us, there appears to be some dead time between tasks when running one at a time (its not much, only on the order of seconds). but some folks feel that the extra few percentage points increase in production/crunching efficiency is worth running 2 tasks at a time, as it does seem to minimize that small (but still significant to some) pause between tasks. i know its a bit pf a pain at this point b/c this thread is so long, but the benefits of running 2 tasks at a time are discussed somewhere above. i know the app_info file i posted for you on the AT forums contains <count>1</count>, but i actually run 2 tasks at a time as well. |
Send message Joined: 19 Sep 09 Posts: 16 Credit: 218,390,676 RAC: 0 |
Not sure what was happening now, but my output has gone down the drain, most likely because of inconclusive validations and also a good number of invalid validations. Is this linked to optimized apps settings or are there stock users concerned as well? |
Send message Joined: 15 Jul 08 Posts: 383 Credit: 729,293,740 RAC: 0 |
Not sure what was happening now, but my output has gone down the drain, most likely because of inconclusive validations and also a good number of invalid validations. Is this linked to optimized apps settings or are there stock users concerned as well? Yes, many. See this thread: http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=2361 |
Send message Joined: 22 Jan 11 Posts: 375 Credit: 64,681,937 RAC: 2,011 |
I've had enough of the massive lag that MW causes me, & since I also found out that about the same time (ish,11/4) as the MW update my F@H output has nose dived I've stopped running MW, at least until I can find settings that counter those problems. There's not really a clear explanation of what either the GPU target freq does nor the GPU polling mode. Like wth does 'busy waiting' refer to??? Nevertheless I did try increasing the target freq to 40, which stopped it crunching!, 60,90 & 120 all of which had no significant impact, still huge GUI lag, although 120 helped very slightly. I'll try increasing the polling mode, but again their is no guidance on how much I should increase it, in whole numbers or tenths?? Sunny Got ya :), probably my HD 4830 isn't quick enough to worry about that ;), plus I get enough lag running just 1 WU. |
Send message Joined: 9 Sep 08 Posts: 96 Credit: 336,443,946 RAC: 0 |
Not sure what was happening now, but my output has gone down the drain, most likely because of inconclusive validations and also a good number of invalid validations. Is this linked to optimized apps settings or are there stock users concerned as well? My theoretical daily RAC for a 5870 on W7_64 here is now ~202,243cr, which is near as makes no difference to it's RAC at PG ~202,663(coincidence?), and you can actually buffer an entire day of work there, so I am easing back to PG for now... |
Send message Joined: 8 May 10 Posts: 576 Credit: 15,979,383 RAC: 0 |
I've had enough of the massive lag that MW causes me, & since I also found out that about the same time (ish,11/4) as the MW update my F@H output has nose dived I've stopped running MW, at least until I can find settings that counter those problems. There's not really a clear explanation of what either the GPU target freq does nor the GPU polling mode. Like wth does 'busy waiting' refer to???The target frequency influences how large the work sent to the GPU at once is. Higher reduces it, which decreases lag. The frequency number controls how long it attempts to make the GPU tasks. The default of 30 tries to reach (1000 (milliseconds / second) / 30) = 33.3ms for the maximum time the GPU will take which should allow a chance for the screen to redraw at ~30fps. The flag is meant as a fallback if the time estimates aren't good for your GPU. It looks like the time estimates you're getting are a bit low for what you actually get. I'm going to do work on the time estimates (particular for non-5xxx GPUs) before the next release. Ideally you shouldn't need to set anything. Doubling it should theoretically mean half the lag. In another thread, someone set it to 300 to reduce their GPU usage to ~10%. Busy waiting is constantly checking if the GPU is done, which causes 100% CPU usage but is least likely to waste extra time waiting after a packet is done one the GPU. Nevertheless I did try increasing the target freq to 40, which stopped it crunching!, 60,90 & 120 all of which had no significant impact, still huge GUI lag, although 120 helped very slightly. |
©2024 Astroinformatics Group