| Name | de_modfit_85_bundle4_4s_south4s_bgset_4_1603804501_68348927_1 |
| Workunit | 2138722919 |
| Created | 1 Feb 2021, 19:26:25 UTC |
| Sent | 1 Feb 2021, 19:26:26 UTC |
| Report deadline | 13 Feb 2021, 19:26:26 UTC |
| Received | 1 Feb 2021, 20:55:32 UTC |
| Server state | Over |
| Outcome | Success |
| Client state | Done |
| Exit status | 0 (0x00000000) |
| Computer ID | 879005 |
| Run time | 10 min 5 sec |
| CPU time | 4 min 9 sec |
| Validate state | Valid |
| Credit | 0.00 |
| Device peak FLOPS | 562.64 GFLOPS |
| Application version | Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64 |
| Peak working set size | 287.80 MB |
| Peak swap size | 285.30 MB |
| Peak disk usage | 0.02 MB |
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.2 CUDA 11.0.140
Vendor: NVIDIA Corporation
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 960' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board:
Driver version: 445.87
Version: OpenCL 1.2 CUDA
Compute capability: 5.2
Max compute units: 8
Clock frequency: 1367 Mhz
Global mem size: 2147483648
Local mem size: 49152
Max const buf size: 65536
Double extension: cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
ptxas info : 0 bytes gmem
ptxas info : Compiling entry function 'probabilities' for 'sm_52'
ptxas info : Function properties for probabilities
ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 16
Num chunks: 17
Chunk size: 34816
Added area: 31872
Effective area: 591872
Initial wait: 12 ms
Integration time: 137.321694 s. Average time per iteration = 429.130293 ms
Integral 0 time = 138.154410 s
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 19 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks: 2
Chunk size: 38912
Added area: 37224
Effective area: 77824
Initial wait: 15 ms
Integration time: 14.786691 s. Average time per iteration = 46.208410 ms
Integral 1 time = 14.862496 s
Running likelihood with 31964 stars
Likelihood time = 1.078111 s
<background_integral> 0.000046179146618 </background_integral>
<stream_integral> 115.520989842072140 30.097146010719939 0.000000000000000 0.262900807425154 </stream_integral>
<background_likelihood> -3.221084585729053 </background_likelihood>
<stream_only_likelihood> -3.753871789979520 -3.331220679301900 -220.840870377434130 -177.459895108388050 </stream_only_likelihood>
<search_likelihood> -2.681854007586447 </search_likelihood>
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.2 CUDA 11.0.140
Vendor: NVIDIA Corporation
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 960' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board:
Driver version: 445.87
Version: OpenCL 1.2 CUDA
Compute capability: 5.2
Max compute units: 8
Clock frequency: 1367 Mhz
Global mem size: 2147483648
Local mem size: 49152
Max const buf size: 65536
Double extension: cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
ptxas info : 0 bytes gmem
ptxas info : Compiling entry function 'probabilities' for 'sm_52'
ptxas info : Function properties for probabilities
ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 16
Num chunks: 17
Chunk size: 34816
Added area: 31872
Effective area: 591872
Initial wait: 12 ms
Integration time: 130.644708 s. Average time per iteration = 408.264711 ms
Integral 0 time = 131.303046 s
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 19 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks: 2
Chunk size: 38912
Added area: 37224
Effective area: 77824
Initial wait: 15 ms
Integration time: 14.797924 s. Average time per iteration = 46.243512 ms
Integral 1 time = 14.869349 s
Running likelihood with 31964 stars
Likelihood time = 1.020817 s
<background_integral1> 0.000046087391033 </background_integral1>
<stream_integral1> 125.450599985914370 28.233523234221813 0.000000000000000 10.884307313404308 </stream_integral1>
<background_likelihood1> -3.235194472777808 </background_likelihood1>
<stream_only_likelihood1> -3.683456809386089 -3.376085363329360 -220.639489201428890 -5.151143981174979 </stream_only_likelihood1>
<search_likelihood1> -2.681935643716488 </search_likelihood1>
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.2 CUDA 11.0.140
Vendor: NVIDIA Corporation
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 960' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board:
Driver version: 445.87
Version: OpenCL 1.2 CUDA
Compute capability: 5.2
Max compute units: 8
Clock frequency: 1367 Mhz
Global mem size: 2147483648
Local mem size: 49152
Max const buf size: 65536
Double extension: cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
ptxas info : 0 bytes gmem
ptxas info : Compiling entry function 'probabilities' for 'sm_52'
ptxas info : Function properties for probabilities
ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 16
Num chunks: 17
Chunk size: 34816
Added area: 31872
Effective area: 591872
Initial wait: 12 ms
Integration time: 131.756841 s. Average time per iteration = 411.740129 ms
Integral 0 time = 132.438708 s
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 19 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks: 2
Chunk size: 38912
Added area: 37224
Effective area: 77824
Initial wait: 15 ms
Integration time: 14.785059 s. Average time per iteration = 46.203310 ms
Integral 1 time = 14.920526 s
Running likelihood with 31964 stars
Likelihood time = 1.033774 s
<background_integral2> 0.000046235789492 </background_integral2>
<stream_integral2> 70.745688034937999 26.171615189440214 0.000000000000000 51.385403413781994 </stream_integral2>
<background_likelihood2> -3.214649249570045 </background_likelihood2>
<stream_only_likelihood2> -3.793445686196958 -3.402814575221790 -220.742163610412750 -10.639362115856823 </stream_only_likelihood2>
<search_likelihood2> -2.683038230683721 </search_likelihood2>
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.2 CUDA 11.0.140
Vendor: NVIDIA Corporation
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 960' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board:
Driver version: 445.87
Version: OpenCL 1.2 CUDA
Compute capability: 5.2
Max compute units: 8
Clock frequency: 1367 Mhz
Global mem size: 2147483648
Local mem size: 49152
Max const buf size: 65536
Double extension: cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
ptxas info : 0 bytes gmem
ptxas info : Compiling entry function 'probabilities' for 'sm_52'
ptxas info : Function properties for probabilities
ptxas . 0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------
--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 16
Num chunks: 17
Chunk size: 34816
Added area: 31872
Effective area: 591872
Initial wait: 12 ms
Integration time: 130.166533 s. Average time per iteration = 406.770414 ms
Integral 0 time = 130.883377 s
Estimated Nvidia GPU GFLOP/s: 700 SP GFLOP/s, 87 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 19 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range: { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks: 2
Chunk size: 38912
Added area: 37224
Effective area: 77824
Initial wait: 15 ms
Integration time: 14.890878 s. Average time per iteration = 46.533995 ms
Integral 1 time = 14.967701 s
Running likelihood with 31964 stars
Likelihood time = 1.035843 s
<background_integral3> 0.000046731173148 </background_integral3>
<stream_integral3> 124.292559289868380 29.651724288698720 0.000000000000000 2.341322230023364 </stream_integral3>
<background_likelihood3> -3.236114864428058 </background_likelihood3>
<stream_only_likelihood3> -3.738100787725315 -3.331034420717507 -220.818070519857370 -7.857234696287774 </stream_only_likelihood3>
<search_likelihood3> -2.681846800660083 </search_likelihood3>
13:55:27 (4256): called boinc_finish(0)
</stderr_txt>
]]>
©2025 Astroinformatics Group