AMD Radeon R9 Fury X - app_info.xml and apps - optimizations
Author | Message |
---|---|
Joined: 30 Apr 09 Posts: 99 Credit: 29,791,015 RAC: 422 |
Because of my Message 64935... I have, inter alia, four AMD Radeon R9 Fury X VGA cards, and it's still not possible to get work if the project is 'stock'. I need to use an app_info.xml file. Are the following entries and apps correct?

[Windows 64-bit, N-Body Sim. non-multithreaded (single-thread) CPU app]

<app_info>
<app> <name>milkyway_nbody</name> <user_friendly_name>Milkyway N-Body Sim.</user_friendly_name> </app>
<file_info> <name>milkyway_nbody_1.62_windows_x86_64.exe</name> <executable/> </file_info>
<app_version> <app_name>milkyway_nbody</app_name> <version_num>162</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>milkyway_nbody_1.62_windows_x86_64.exe</file_name> <main_program/> </file_ref> </app_version>
<app> <name>milkyway</name> <user_friendly_name>Milkyway</user_friendly_name> </app>
<file_info> <name>milkyway_1.36_windows_x86_64.exe</name> <executable/> </file_info>
<app_version> <app_name>milkyway</app_name> <version_num>136</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>milkyway_1.36_windows_x86_64.exe</file_name> <main_program/> </file_ref> </app_version>
<app> <name>milkyway</name> </app>
<file_info> <name>milkyway_1.36_windows_x86_64__opencl_ati_101.exe</name> <executable/> </file_info>
<app_version> <app_name>milkyway</app_name> <version_num>136</version_num> <platform>windows_x86_64</platform> <avg_ncpus>1</avg_ncpus> <max_ncpus>1</max_ncpus> <plan_class>opencl_ati_101</plan_class> <cmdline></cmdline> <coproc> <type>ATI</type> <count>1</count> </coproc> <file_ref> <file_name>milkyway_1.36_windows_x86_64__opencl_ati_101.exe</file_name> <main_program/> </file_ref> </app_version>
<app> <name>milkyway_separation__modified_fit</name> <user_friendly_name>Milkyway Sep. (Mod. Fit)</user_friendly_name> </app>
<file_info> <name>milkyway_separation__modified_fit_1.36_windows_x86_64.exe</name> <executable/> </file_info>
<app_version> <app_name>milkyway_separation__modified_fit</app_name> <version_num>136</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>milkyway_separation__modified_fit_1.36_windows_x86_64.exe</file_name> <main_program/> </file_ref> </app_version>
<app> <name>milkyway_separation__modified_fit</name> </app>
<file_info> <name>milkyway_separation__modified_fit_1.36_windows_x86_64__opencl_ati_101.exe</name> <executable/> </file_info>
<app_version> <app_name>milkyway_separation__modified_fit</app_name> <version_num>136</version_num> <platform>windows_x86_64</platform> <avg_ncpus>1</avg_ncpus> <max_ncpus>1</max_ncpus> <plan_class>opencl_ati_101</plan_class> <cmdline></cmdline> <coproc> <type>ATI</type> <count>1</count> </coproc> <file_ref> <file_name>milkyway_separation__modified_fit_1.36_windows_x86_64__opencl_ati_101.exe</file_name> <main_program/> </file_ref> </app_version>
</app_info>

http://milkyway.cs.rpi.edu/milkyway/download/milkyway_nbody_1.62_windows_x86_64.exe
http://milkyway.cs.rpi.edu/milkyway/download/milkyway_1.36_windows_x86_64.exe
http://milkyway.cs.rpi.edu/milkyway/download/milkyway_1.36_windows_x86_64__opencl_ati_101.exe
http://milkyway.cs.rpi.edu/milkyway/download/milkyway_separation__modified_fit_1.36_windows_x86_64.exe
http://milkyway.cs.rpi.edu/milkyway/download/milkyway_separation__modified_fit_1.36_windows_x86_64__opencl_ati_101.exe

BTW, is the milkyway_separation__modified_fit part superfluous now, so that I could delete this part in red (and these two apps)?

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

I did a forum search because of this project prefs item:

Frequency (in Hz) that should try to complete individual work chunks. Higher numbers may run slower but will provide a more responsive system. Lower may be faster but more laggy. default 60 (corresponds to 60 fps)

It looks like only the outdated Milkyway 1.20 ATI app used this setting. The current Milkyway 1.36 ATI app doesn't use it. I set '1' (one), but the app still shows: Using a target frequency of 60.0

Is this OK, or a bug?

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

I'm a bit disappointed that a project task lasts ~16 seconds with the Milkyway 1.36 ATI app on one Fury X VGA card (1 WU/GPU). I looked at other PCs, e.g. hostid=590597 with 'R9 200 Series - Hawaii' VGA cards. That VGA card has just 44 compute units (CUs), and a task lasts ~15 seconds. My Fury X has 64 CUs, but a task lasts ~16 seconds. Is something wrong, or are there possibilities to optimize/fine-tune?

Thanks. |
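Regarding the "are these entries correct?" question, here is a minimal Python sketch of a consistency check that can be run on an app_info.xml before restarting the client. It only tests two things that can be read off the file itself: every `<app_version>` names a declared `<app>`, and every `<file_ref>` has a matching `<file_info>`. The function name `check_app_info` and the shortened `SAMPLE` are made up for illustration, and passing this check does not guarantee that the BOINC client or the project accepts the file.

```python
import xml.etree.ElementTree as ET

# Shortened sample taken from the post above (one GPU app only).
SAMPLE = """<app_info>
  <app><name>milkyway</name></app>
  <file_info>
    <name>milkyway_1.36_windows_x86_64__opencl_ati_101.exe</name>
    <executable/>
  </file_info>
  <app_version>
    <app_name>milkyway</app_name>
    <version_num>136</version_num>
    <platform>windows_x86_64</platform>
    <plan_class>opencl_ati_101</plan_class>
    <coproc><type>ATI</type><count>1</count></coproc>
    <file_ref>
      <file_name>milkyway_1.36_windows_x86_64__opencl_ati_101.exe</file_name>
      <main_program/>
    </file_ref>
  </app_version>
</app_info>"""


def check_app_info(xml_text):
    """Return a list of internal inconsistencies (empty list = none found)."""
    root = ET.fromstring(xml_text)
    app_names = {a.findtext("name") for a in root.findall("app")}
    file_names = {f.findtext("name") for f in root.findall("file_info")}
    problems = []
    for version in root.findall("app_version"):
        app_name = version.findtext("app_name")
        if app_name not in app_names:
            problems.append(f"app_version refers to undeclared app: {app_name}")
        for ref in version.findall("file_ref"):
            file_name = ref.findtext("file_name")
            if file_name not in file_names:
                problems.append(f"file_ref without matching file_info: {file_name}")
    return problems


print(check_app_info(SAMPLE))
```

To check a real file, read it with `open(path).read()` and pass the text to `check_app_info`. A misspelled executable name in one of the two places is the kind of error this would catch.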
Joined: 8 May 09 Posts: 3221 Credit: 518,624,873 RAC: 5 |
Take a look at memory speed and whether the R9 280 has a 128-bit, 256-bit or 384-bit memory bus. The faster the memory and the more bits transferred at one time, the faster the card crunches. It's not JUST the CUs anymore. |
Kylinblue Joined: 23 Aug 11 Posts: 7 Credit: 498,188 RAC: 0 |
Don't you know that the Fury X beats the R9 290X in memory bandwidth by a factor of 1.6 (512 GB/s vs. 320 GB/s)? |
Joined: 30 Apr 09 Posts: 99 Credit: 29,791,015 RAC: 422 |
I read a lot in the forum and found that the GPU apps can be fine-tuned. I found: |
JHMarshall Joined: 24 Jul 12 Posts: 40 Credit: 7,123,301,054 RAC: 50 |
It's not just memory speed here. Since MW uses double precision (DP) calculations, the card's DP compute capability is the real driver. R9 280 series: DP = 1/4 SP. R9 290 series: DP = 1/8 SP. R9 Fury X series: DP = 1/16 SP. The R9 280 series may not be the fastest single precision performer, but the 1/4 DP to SP ratio makes it the leader in double precision for the $. Joe |
Joined: 4 Mar 10 Posts: 65 Credit: 639,958,626 RAC: 120 |
https://www.primegrid.com/forum_thread.php?id=6113

If everything in this thread is right:

GPU: FP32 GFLOPS, FP64 GFLOPS, ratio
Radeon R9 295X2: 11264, 1408, FP64 = 1/8 FP32
Radeon HD 7990: 7782, 1946, FP64 = 1/4 FP32
GeForce GTX Titan Black: 5645, 1881, FP64 = 1/3 FP32
GeForce GTX 690: 5622, 234, FP64 = 1/24 FP32
Radeon R9 290X: 5632, 704, FP64 = 1/8 FP32
GeForce GTX 780 Ti: 5345, 223, FP64 = 1/24 FP32
Radeon HD 6990: 5099, 1276, FP64 = 1/4 FP32
GeForce GTX 980: 4981, 156, FP64 = 1/32 FP32
Radeon R9 290: 4849, 606, FP64 = 1/8 FP32
GeForce GTX Titan: 4709, 1523, FP64 = 1/3 FP32
Radeon HD 7970 GHz: 4301, 1075, FP64 = 1/4 FP32
...

"Niceee ..7970 rocks :-))" Right now I get 15-16 seconds per task. |
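As a rough cross-check of the ratios quoted above, the following Python sketch recomputes the FP64 fraction from the two GFLOPS columns and ranks the cards by FP64 throughput, which is the figure that matters for this project according to the posts in this thread. The numbers are copied from the post; the helper name `fp64_denominator` is made up for illustration.

```python
# (FP32 GFLOPS, FP64 GFLOPS) as quoted in the post above.
CARDS = {
    "Radeon R9 295X2": (11264, 1408),
    "Radeon HD 7990": (7782, 1946),
    "GeForce GTX Titan Black": (5645, 1881),
    "GeForce GTX 690": (5622, 234),
    "Radeon R9 290X": (5632, 704),
    "GeForce GTX 780 Ti": (5345, 223),
    "Radeon HD 6990": (5099, 1276),
    "GeForce GTX 980": (4981, 156),
    "Radeon R9 290": (4849, 606),
    "GeForce GTX Titan": (4709, 1523),
    "Radeon HD 7970 GHz": (4301, 1075),
}


def fp64_denominator(fp32_gflops, fp64_gflops):
    """Return N such that FP64 throughput is roughly 1/N of FP32."""
    return round(fp32_gflops / fp64_gflops)


# Rank by FP64 GFLOPS, highest first.
ranked = sorted(CARDS.items(), key=lambda item: item[1][1], reverse=True)
for name, (fp32, fp64) in ranked:
    print(f"{name:24s} FP64 {fp64:5d} GFLOPS (1/{fp64_denominator(fp32, fp64)} of FP32)")
```

Sorted this way, the HD 7990 comes out on top even though the R9 295X2 has the highest FP32 figure. These are quoted peak numbers, so actual task times can differ.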
Joined: 4 Mar 10 Posts: 65 Credit: 639,958,626 RAC: 120 |
It runs in 13 sec now, no errors. I wonder if I can run more tasks per 7970 graphics card without affecting the results. Where is the thread with the app_info? |
Joined: 5 Jun 08 Posts: 21 Credit: 245,803,013 RAC: 0 |
Hi Jojez J. Have a look here: https://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3987&postid=64975 Cheers |
Joined: 30 Apr 09 Posts: 99 Credit: 29,791,015 RAC: 422 |
I don't know if these ratios are correct... If I look online, the 1.39 app says in the <stderr_txt>: (...) Estimated AMD GPU GFLOP/s: 672 SP GFLOP/s, 134 DP FLOP/s (...) This would be a 5:1 ratio for the R9 Fury X. |
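The 5:1 figure can be reproduced from the quoted stderr line with a small Python sketch. The regular expression is an assumption based only on the single line quoted above (including its "DP FLOP/s" spelling), so other app versions may print something different, and the app's own estimate is not necessarily the card's peak rate.

```python
import re

# The line as quoted from <stderr_txt> in the post above.
LINE = "Estimated AMD GPU GFLOP/s: 672 SP GFLOP/s, 134 DP FLOP/s"

# Assumed format; accepts both "DP FLOP/s" and "DP GFLOP/s".
PATTERN = re.compile(
    r"Estimated AMD GPU GFLOP/s:\s*([\d.]+) SP GFLOP/s,\s*([\d.]+) DP G?FLOP/s"
)


def sp_dp_ratio(text):
    """Return SP/DP from the app's estimate line, or None if it is not found."""
    match = PATTERN.search(text)
    if match is None:
        return None
    sp, dp = float(match.group(1)), float(match.group(2))
    return sp / dp


print(f"SP:DP is about {sp_dp_ratio(LINE):.2f}:1")
```

For the quoted line this gives 672 / 134, which is roughly 5.01, matching the 5:1 ratio mentioned in the post.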
rtennill Joined: 22 Mar 09 Posts: 6 Credit: 778,016,514 RAC: 0 |
Unfortunately, this is expected with modern consumer cards. This project is all about double precision compute capability. The trend for both Nvidia and AMD over the last several years has been to significantly reduce the double precision compute capability of consumer/gaming cards in favor of single precision and other features. Wikipedia has good tables for comparing the stats across generations and models. My 6970 has more double precision compute than the Fury X. My new card, the 7970, will have almost twice the double precision capability. https://en.wikipedia.org/wiki/AMD_Radeon_Rx_300_series https://en.wikipedia.org/wiki/Radeon_HD_7000_Series |
©2023 Astroinformatics Group