Posts by VietOZ

1) Message boards : Number crunching : Finally getting new tasks only seconds after running out. May not be worth the hassle. (Message 69287) Posted 22 Nov 2019 by VietOZ Post: Great work @JStateson! From what I understand, the Seti guys only made the custom BOINC for Linux. If we're running Windows then we'd have to recompile our own, like you did. I'm running Milky on W10, so my cap is 900. But again, like you said, we can set up multiple instances to grab work if we anticipate a long downtime.
2) Message boards : Number crunching : Finally getting new tasks only seconds after running out. May not be worth the hassle. (Message 69283) Posted 22 Nov 2019 by VietOZ Post: I simply alter the coproc file so I can get the max of 900, then run an update command every 92 seconds.
For Linux:
watch -n 92 boinccmd --project http://milkyway.cs.rpi.edu/milkyway/ update
For Windows:
:top
"C:\Program Files\BOINC\boinccmd" --passwd PASSWORD --project http://milkyway.cs.rpi.edu/milkyway/ update
TIMEOUT /T 92
goto top
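For anyone who would rather keep one script for both Linux and Windows, here is a rough Python sketch of the same tickler idea. It only wraps the boinccmd call shown in the post; the function names (`build_update_command`, `tickle`) and the injectable `run`/`sleep` parameters are made up for illustration, and it assumes boinccmd is on the PATH (pass the full path otherwise).

```python
import subprocess
import time

PROJECT_URL = "http://milkyway.cs.rpi.edu/milkyway/"


def build_update_command(project_url, passwd=None, boinccmd="boinccmd"):
    """Build the argument list for 'boinccmd --project URL update'."""
    cmd = [boinccmd]
    if passwd is not None:
        # Same --passwd option the Windows batch loop uses.
        cmd += ["--passwd", passwd]
    cmd += ["--project", project_url, "update"]
    return cmd


def tickle(project_url, interval=92, passwd=None, boinccmd="boinccmd",
           max_rounds=None, run=subprocess.run, sleep=time.sleep):
    """Request a project update every `interval` seconds.

    max_rounds=None loops until interrupted, like 'watch' or 'goto top'.
    run/sleep are hypothetical hooks so the loop can be tried without BOINC.
    """
    rounds = 0
    while max_rounds is None or rounds < max_rounds:
        run(build_update_command(project_url, passwd, boinccmd))
        rounds += 1
        sleep(interval)
    return rounds
```

Calling `tickle(PROJECT_URL)` would keep poking the project every 92 seconds until you stop it with Ctrl+C; on Windows you would likely pass `boinccmd=r"C:\Program Files\BOINC\boinccmd"` and `passwd="PASSWORD"`.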
3) Message boards : News : New Separation Runs [UPDATE] (Message 68912) Posted 25 Jul 2019 by VietOZ Post: Hi Tom, I enabled 2 of my machines just now. Still got some errors, I guess from the old runs. The bgset_2 output looks like this:
<core_client_version>7.8.3</core_client_version> <![CDATA[ <stderr_txt> <search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application> BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 4 </number_WUs> <number_params_per_WU> 26 </number_params_per_WU> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 2.0 AMD-APP (2442.12) Vendor: Advanced Micro Devices, Inc. Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_intel_dx9_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp64 cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 1 on platform 0 Found 3 CL devices Device 'Tahiti' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Board: AMD Radeon R9 200 Series Driver version: 2442.12 Version: OpenCL 1.2 AMD-APP (2442.12) Compute capability: 0.0 Max compute units: 32 Clock frequency: 1100 Mhz Global mem size: 3221225472 Local mem size: 32768 Max const buf size: 65536 Double extension: cl_khr_fp64 Estimated AMD GPU GFLOP/s: 4506 SP GFLOP/s, 1126 DP FLOP/s 
Using a target frequency of 60.0 Using a block size of 8192 with 68 blocks/chunk Using clWaitForEvents() for polling (mode -1) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 1 Num chunks: 2 Chunk size: 557056 Added area: 554112 Effective area: 1114112 Initial wait: 16 ms Integration time: 13.201212 s. Average time per iteration = 41.253788 ms Integral 0 time = 13.681410 s Running likelihood with 38095 stars Likelihood time = 0.920602 s <background_integral> 0.000155305800785 </background_integral> <stream_integral> 4.181364298317329 1.584709335463284 130.381645952612530 48.033929790624995 </stream_integral> <background_likelihood> -10.909361772835503 </background_likelihood> <stream_only_likelihood> -8.764722699084267 -153.382575093746400 -3.053178863333347 -7.281001906390212 </stream_only_likelihood> <search_likelihood> -3.045379847005349 </search_likelihood> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 2.0 AMD-APP (2442.12) Vendor: Advanced Micro Devices, Inc. 
Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_intel_dx9_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp64 cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 1 on platform 0 Found 3 CL devices Device 'Tahiti' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Board: AMD Radeon R9 200 Series Driver version: 2442.12 Version: OpenCL 1.2 AMD-APP (2442.12) Compute capability: 0.0 Max compute units: 32 Clock frequency: 1100 Mhz Global mem size: 3221225472 Local mem size: 32768 Max const buf size: 65536 Double extension: cl_khr_fp64 Estimated AMD GPU GFLOP/s: 4506 SP GFLOP/s, 1126 DP FLOP/s Using a target frequency of 60.0 Using a block size of 8192 with 68 blocks/chunk Using clWaitForEvents() for polling (mode -1) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 1 Num chunks: 2 Chunk size: 557056 Added area: 554112 Effective area: 1114112 Initial wait: 16 ms Integration time: 13.226758 s. 
Average time per iteration = 41.333619 ms Integral 0 time = 13.749233 s Running likelihood with 38095 stars Likelihood time = 0.873341 s <background_integral1> 0.000136224675998 </background_integral1> <stream_integral1> 152.671899805996900 145.555865287139110 183.784669581012310 4.906343751930823 </stream_integral1> <background_likelihood1> -9.784610545126764 </background_likelihood1> <stream_only_likelihood1> -17.027740007600620 -2.898074963499768 -13.385826642313273 -136.031850343614370 </stream_only_likelihood1> <search_likelihood1> -2.898073595535414 </search_likelihood1> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 2.0 AMD-APP (2442.12) Vendor: Advanced Micro Devices, Inc. Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_intel_dx9_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp64 cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 1 on platform 0 Found 3 CL devices Device 'Tahiti' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Board: AMD Radeon R9 200 Series Driver version: 2442.12 Version: OpenCL 1.2 AMD-APP (2442.12) Compute capability: 0.0 Max compute units: 32 Clock frequency: 1100 Mhz Global mem size: 3221225472 Local mem size: 32768 Max const buf size: 65536 Double extension: cl_khr_fp64 Estimated AMD GPU GFLOP/s: 4506 SP GFLOP/s, 1126 DP FLOP/s Using a target frequency of 60.0 Using a block size of 8192 with 68 blocks/chunk Using clWaitForEvents() for polling (mode -1) Range: 
{ nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 1 Num chunks: 2 Chunk size: 557056 Added area: 554112 Effective area: 1114112 Initial wait: 16 ms Integration time: 13.202470 s. Average time per iteration = 41.257717 ms Integral 0 time = 13.680797 s Running likelihood with 38095 stars Likelihood time = 0.857028 s <background_integral2> 0.000165617672821 </background_integral2> <stream_integral2> 160.847266587313300 3.968876879409631 176.308152947632440 129.096313328206120 </stream_integral2> <background_likelihood2> -8.366224710913647 </background_likelihood2> <stream_only_likelihood2> -12.668971218100111 -4.873254839594230 -6.613812086197211 -9.336805624681906 </stream_only_likelihood2> <search_likelihood2> -4.346214889695199 </search_likelihood2> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 2.0 AMD-APP (2442.12) Vendor: Advanced Micro Devices, Inc. Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_intel_dx9_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp64 cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 1 on platform 0 Found 3 CL devices Device 'Tahiti' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Board: AMD Radeon R9 200 Series Driver version: 2442.12 Version: OpenCL 1.2 AMD-APP (2442.12) Compute capability: 0.0 Max compute units: 32 Clock frequency: 1100 Mhz Global mem size: 3221225472 Local mem size: 32768 Max const buf size: 65536 Double 
extension: cl_khr_fp64 Estimated AMD GPU GFLOP/s: 4506 SP GFLOP/s, 1126 DP FLOP/s Using a target frequency of 60.0 Using a block size of 8192 with 68 blocks/chunk Using clWaitForEvents() for polling (mode -1) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 1 Num chunks: 2 Chunk size: 557056 Added area: 554112 Effective area: 1114112 Initial wait: 16 ms Integration time: 11.259764 s. Average time per iteration = 35.186763 ms Integral 0 time = 11.575826 s Running likelihood with 38095 stars Likelihood time = 0.967706 s <background_integral3> 0.000086545095616 </background_integral3> <stream_integral3> 0.135624606604846 66.422776529197293 173.534953341131880 113.367127592535130 </stream_integral3> <background_likelihood3> -8.873801050948440 </background_likelihood3> <stream_only_likelihood3> -214.679271661823320 -6.563516836358728 -2.915561601617464 -16.132769365806681 </stream_only_likelihood3> <search_likelihood3> -2.915368518622770 </search_likelihood3> 00:00:22 (8948): called boinc_finish(0) </stderr_txt> ]]>
I'll monitor to see if there are any errors with the bgset_2 ... but it looks good so far. Thanks for the quick fix! Edit: confirmed, the errors were from the other runs de_modfit_83_bundle4_4s_south4s_bgset_
4) Message boards : News : New Separation Runs [UPDATE] (Message 68905) Posted 24 Jul 2019 by VietOZ Post: Hi Tom, I just got a bunch of errors with these new runs. This happened on all 3 of my machines, which were fine before this:
<core_client_version>7.8.3</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1)</message> <stderr_txt> <search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application> BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 4 </number_WUs> <number_params_per_WU> 25 </number_params_per_WU> Number of streams does not match 16:49:44 (8604): called boinc_finish(1) </stderr_txt> ]]>
Edit: I'll stop getting work for now because my errors went up faster than my valids :(
5) Message boards : Number crunching : Monitoring of Invalid results on separation run de_modfit_84_bundle4_4s_south4s_1 (Message 68853) Posted 12 Jun 2019 by VietOZ Post: I got 4 of these out of 15k valid WUs:
max # of error/total/success tasks: 2, 9, 6
errors: Too many errors (may have bug)
6) Message boards : News : 30 Workunit Limit Per Request - Fix Implemented (Message 68468) Posted 2 Apr 2019 by VietOZ Post: 2 GPUs should give you 400 tasks. Run 1 task per GPU and set up 6 instances. That still gives you 12 tasks at a time. Add another GPU to your coproc.xml and lock it. Set cache to 10/10 and you'll get 600 tasks per instance. Make a tickler that runs every 5 minutes, so that when the 600 tasks run out, the machine still gets work after a few minutes of idle time. I know it's not a long-term solution, but it helps Jake out. Give him time to pinpoint the problem. There are many ways to get around this. My VII may be losing about 80k points per day with this issue ... not really a big deal.
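The numbers in the post above can be checked with a quick sketch. The 200-tasks-per-reported-GPU figure is not stated anywhere; it is only inferred from "2 GPUs should give you 400" and "600 per instance" after adding a third GPU to coproc.xml, so treat it as an assumption. The function names are made up for illustration.

```python
# Assumption: the server hands out 200 tasks per GPU the client reports,
# inferred from the post (2 GPUs -> 400 tasks, 3 GPUs -> 600 tasks).
TASKS_PER_REPORTED_GPU = 200


def cache_per_instance(reported_gpus, per_gpu=TASKS_PER_REPORTED_GPU):
    """Task cap for one BOINC instance, based on the GPUs listed in coproc.xml."""
    return reported_gpus * per_gpu


def concurrent_tasks(real_gpus, tasks_per_gpu, instances):
    """Tasks actually crunching at once across all instances."""
    return real_gpus * tasks_per_gpu * instances


print(cache_per_instance(2))        # 400
print(cache_per_instance(3))        # 600
print(concurrent_tasks(2, 1, 6))    # 12
```

So the extra entry in coproc.xml only raises the cache per instance; what runs at once still depends on the real GPUs, the tasks per GPU, and the number of instances.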
7) Message boards : News : Planned Server Outage Tuesday March 19th (Message 68260) Posted 15 Mar 2019 by VietOZ Post: Thanks Jake for the response on the other thread! Good luck with the transition to the new server. I'm sure I'll be back full force after you're done :) Cheers!
8) Message boards : News : Planned Server Outage Tuesday March 19th (Message 68254) Posted 15 Mar 2019 by VietOZ Post: Hi Jake, Great news about the new server. Please take your time to solve the problem once and for all. I think most of the users will understand and wait patiently. Also, I'm not sure if you're aware of this issue: http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=4399 If you can take a look at it and let us know what to do, that would be awesome. The majority of my teammates have pulled away from this project because of that issue. I myself have cut my resources by over 50% because of it as well. Hope the transition to the new server goes well and everything comes out great for you and the project. Thank you kindly! Z
9) Message boards : Number crunching : inconclusive work units (Message 68210) Posted 6 Mar 2019 by VietOZ Post: Inconclusives are just like pendings. Invalids are what you should worry about.
10) Message boards : Number crunching : Invalids Exit status 0 (0x0) after server came back (Message 68207) Posted 6 Mar 2019 by VietOZ Post: Admin, can you please investigate what's going on? Did something change? I just spent almost half a day trying to find the "right" voltage for my cards, and that turned out not to be the problem.
11) Message boards : Number crunching : Invalids Exit status 0 (0x0) after server came back (Message 68204) Posted 6 Mar 2019 by VietOZ Post: Has anyone else noticed that the invalids went way up after the server came back from the outage? Exit status 0 (0x0). This happened across all of my machines, not just one, so I don't think it's a stability issue unless all of my cards decided to fail at once.
12) Message boards : Number crunching : AMD FirePro S9150 (Message 68186) Posted 24 Feb 2019 by VietOZ Post: @BeemerBiker: There is a thread somewhere about an Nvidia Titan V running MH. In that thread, the author mentions that each WU uses about 1.5GB of VRAM. So you can run about 6 WUs on an S9100 (12GB) or 8-10 WUs on an S9150 (16GB). As for compiling, here is roughly how I do it in Linux: https://gist.github.com/neofob/8a73e2f44787541c11c0445763953950
That's an NV card though, which ends up using much more VRAM than the same task on an AMD card. 4x tasks in Win10 on my 280x is only using ~250 VRAM according to GPU-Z. Chalk it up to NV and OpenCL. The TV can't be fully utilized because of it, but the Vega 7 or S9100 should be able to, esp. with 16 and 12 GB of VRAM.
Confirmed, my Radeon 7 used about 1.1GB of VRAM running 7 tasks. Looks like the Radeon is having the same issue.
<number_WUs> 5 </number_WUs> <number_params_per_WU> 20 </number_params_per_WU> Using SSE4.1 path Found 1 platform Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 2.1 AMD-APP (2766.5) Vendor: Advanced Micro Devices, Inc. Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'gfx906' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Board: AMD Radeon VII Driver version: 2766.5 (PAL,HSAIL) Version: OpenCL 1.2 AMD-APP (2766.5) Compute capability: 0.0 Max compute units: 60 Clock frequency: 1802 Mhz Global mem size: 3221225472 Local mem size: 32768 Max const buf size: 3221225472 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- Estimated AMD GPU GFLOP/s: 1081 SP GFLOP/s, 216 DP FLOP/s Using a target frequency of 60.0 Using a block size of 15360 with 7 blocks/chunk
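A rough back-of-the-envelope check of the VRAM figures in the post above. This is only a memory ceiling, assuming VRAM is the sole limit and 1GB = 1024MB; it ignores driver overhead and headroom, which is presumably why the "about 6 WUs" estimate for the S9100 is more conservative. The function name is made up for illustration.

```python
def max_tasks_by_vram(card_vram_mb, per_task_mb):
    """Upper bound on simultaneous tasks if VRAM were the only limit."""
    return card_vram_mb // per_task_mb


# ~1.5GB per WU is the Titan V (Nvidia) figure mentioned in the post.
PER_TASK_NV_MB = 1536

print(max_tasks_by_vram(12 * 1024, PER_TASK_NV_MB))  # S9100, 12GB -> 8
print(max_tasks_by_vram(16 * 1024, PER_TASK_NV_MB))  # S9150, 16GB -> 10

# Radeon VII observation from the post: about 1.1GB in total for 7 tasks.
per_task_radeon_mb = round(1.1 * 1024 / 7)
print(per_task_radeon_mb)  # 161 (MB per task, roughly)
```

On those numbers, VRAM is nowhere near the bottleneck on the AMD cards, which matches the point made in the post.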
13) Message boards : Number crunching : AMD recently announced Radeon VII with 6.9TFLOPS of FP64 (1:2) for only $699???!!! (Message 68171) Posted 18 Feb 2019 by VietOZ Post: How come the count shows (8) Radeon VII's VietOZ? Man.... I need your job! The gear you have is worth squillions! It's what I'd do if I won lotto lol. (just saying what we're all thinking ;)
Only one card, not 8 :) Just a modification to get the max cache.
14) Message boards : Number crunching : AMD recently announced Radeon VII with 6.9TFLOPS of FP64 (1:2) for only $699???!!! (Message 68164) Posted 16 Feb 2019 by VietOZ Post: Looks like this card needs some serious cooling if you want to run 24/7. Maybe a Kraken G12 adapter and a Thermaltake Water 3.0 would be sufficient.
Runs fine on stock cooling here at an ambient of 73F. Actually it's much cooler than I thought. Currently running Moo AND Milky at the same time. Fan noise can be an issue for some folks at this configuration, but not me. Use Wattman Auto undervolt if you want it silent and want to save A LOT of power. Here's my host running 3x on 1 instance and the other instance is running Moo. http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=797896