Name | de_modfit_85_bundle4_4s_south4s_bgset_4_1603804501_67020712_1 |
Workunit | 2137243620 |
Created | 19 Jan 2021, 4:41:15 UTC |
Sent | 19 Jan 2021, 4:49:25 UTC |
Report deadline | 31 Jan 2021, 4:49:25 UTC |
Received | 24 Jan 2021, 3:56:30 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 840645 |
Run time | 2 min 19 sec |
CPU time | 1 min 21 sec |
Validate state | Valid |
Credit | 0.00 |
Device peak FLOPS | 2,235.38 GFLOPS |
Application version | Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64 |
Peak working set size | 370.36 MB |
Peak swap size | 390.93 MB |
Peak disk usage | 0.01 MB |
<core_client_version>7.16.5</core_client_version> <![CDATA[ <stderr_txt> <search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application> BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 4 </number_WUs> <number_params_per_WU> 26 </number_params_per_WU> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 11.0.208 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce RTX 2080 SUPER' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 451.67 Version: OpenCL 1.2 CUDA Compute capability: 7.5 Max compute units: 48 Clock frequency: 1815 Mhz Global mem size: 8589934592 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- ptxas info : 0 bytes gmem ptxas info : Compiling entry function 'probabilities' for 'sm_75' ptxas info : Function properties for probabilities ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads ptxas info : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2] -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 22 blocks/chunk Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 2 Num chunks: 3 Chunk size: 270336 Added area: 251008 Effective area: 811008 Initial wait: 13 ms Integration time: 29.641853 s. Average time per iteration = 92.630791 ms Integral 0 time = 29.976010 s Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 3 blocks/chunk Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0) Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 } Iteration area: 40600 Chunk estimate: 1 Num chunks: 2 Chunk size: 36864 Added area: 33128 Effective area: 73728 Initial wait: 0 ms Integration time: 2.329392 s. Average time per iteration = 7.279350 ms Integral 1 time = 2.391772 s Running likelihood with 31964 stars Likelihood time = 0.694145 s <background_integral> 0.000046166119547 </background_integral> <stream_integral> 119.941510260386560 26.508003764440865 0.000000000000000 22.203020386944011 </stream_integral> <background_likelihood> -3.223601449926322 </background_likelihood> <stream_only_likelihood> -3.799915405999677 -3.403230879610288 -220.541310172789220 -11.969708750932613 </stream_only_likelihood> <search_likelihood> -2.682188042025581 </search_likelihood> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 11.0.208 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce RTX 2080 SUPER' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 451.67 Version: OpenCL 1.2 CUDA Compute capability: 7.5 Max compute units: 48 Clock frequency: 1815 Mhz Global mem size: 8589934592 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- ptxas info : 0 bytes gmem ptxas info : Compiling entry function 'probabilities' for 'sm_75' ptxas info : Function properties for probabilities ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads ptxas info : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2] -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 22 blocks/chunk Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 2 Num chunks: 3 Chunk size: 270336 Added area: 251008 Effective area: 811008 Initial wait: 13 ms Integration time: 29.663477 s. Average time per iteration = 92.698364 ms Integral 0 time = 30.007549 s Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 3 blocks/chunk Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0) Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 } Iteration area: 40600 Chunk estimate: 1 Num chunks: 2 Chunk size: 36864 Added area: 33128 Effective area: 73728 Initial wait: 0 ms Integration time: 2.285473 s. Average time per iteration = 7.142104 ms Integral 1 time = 2.340901 s Running likelihood with 31964 stars Likelihood time = 0.689265 s <background_integral1> 0.000045883225116 </background_integral1> <stream_integral1> 125.176160475480400 17.573766470227106 0.000000000000000 1.165322708456263 </stream_integral1> <background_likelihood1> -3.182589954642396 </background_likelihood1> <stream_only_likelihood1> -3.537256319940795 -3.811927272493068 -220.836949551239800 -134.732465155337310 </stream_only_likelihood1> <search_likelihood1> -2.683835922892564 </search_likelihood1> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 11.0.208 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce RTX 2080 SUPER' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 451.67 Version: OpenCL 1.2 CUDA Compute capability: 7.5 Max compute units: 48 Clock frequency: 1815 Mhz Global mem size: 8589934592 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- ptxas info : 0 bytes gmem ptxas info : Compiling entry function 'probabilities' for 'sm_75' ptxas info : Function properties for probabilities ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads ptxas info : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2] -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 22 blocks/chunk Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 2 Num chunks: 3 Chunk size: 270336 Added area: 251008 Effective area: 811008 Initial wait: 13 ms Integration time: 29.648670 s. Average time per iteration = 92.652095 ms Integral 0 time = 29.974204 s Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 3 blocks/chunk Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0) Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 } Iteration area: 40600 Chunk estimate: 1 Num chunks: 2 Chunk size: 36864 Added area: 33128 Effective area: 73728 Initial wait: 0 ms Integration time: 2.390833 s. Average time per iteration = 7.471352 ms Integral 1 time = 2.468797 s Running likelihood with 31964 stars Likelihood time = 0.701607 s <background_integral2> 0.000046484958127 </background_integral2> <stream_integral2> 120.072191377420750 26.755103789230589 0.000000000000000 2.669301390224146 </stream_integral2> <background_likelihood2> -3.232070244298225 </background_likelihood2> <stream_only_likelihood2> -3.716455135218023 -3.444551340904686 -220.714317003324540 -139.029032152088970 </stream_only_likelihood2> <search_likelihood2> -2.682019840715076 </search_likelihood2> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 11.0.208 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 2.1 Vendor: Intel(R) Corporation Extensions: cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce RTX 2080 SUPER' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 451.67 Version: OpenCL 1.2 CUDA Compute capability: 7.5 Max compute units: 48 Clock frequency: 1815 Mhz Global mem size: 8589934592 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- ptxas info : 0 bytes gmem ptxas info : Compiling entry function 'probabilities' for 'sm_75' ptxas info : Function properties for probabilities ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads ptxas info : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2] -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 22 blocks/chunk Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 2 Num chunks: 3 Chunk size: 270336 Added area: 251008 Effective area: 811008 Initial wait: 13 ms Integration time: 29.595810 s. Average time per iteration = 92.486908 ms Integral 0 time = 29.947780 s Estimated Nvidia GPU GFLOP/s: 5576 SP GFLOP/s, 697 DP FLOP/s Using a target frequency of 60.0 Using a block size of 12288 with 3 blocks/chunk Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0) Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 } Iteration area: 40600 Chunk estimate: 1 Num chunks: 2 Chunk size: 36864 Added area: 33128 Effective area: 73728 Initial wait: 0 ms Integration time: 2.354891 s. Average time per iteration = 7.359033 ms Integral 1 time = 2.412989 s Running likelihood with 31964 stars Likelihood time = 0.680166 s <background_integral3> 0.000047061781034 </background_integral3> <stream_integral3> 124.118575833239050 21.202549625089571 0.000000000000000 74.953300707386902 </stream_integral3> <background_likelihood3> -3.246263512751939 </background_likelihood3> <stream_only_likelihood3> -3.609175211526673 -3.603090439002938 -220.766729214313470 -7.798790078112769 </stream_only_likelihood3> <search_likelihood3> -2.682993731418269 </search_likelihood3> 23:56:14 (22748): called boinc_finish(0) </stderr_txt> ]]>
©2024 Astroinformatics Group