Task 810342

Name	de_modfit_80_bundle4_4s_south4s_bgset_4_1603804501_75079407_0
Workunit	2146169175
Created	29 Jan 2021, 4:28:51 UTC
Sent	29 Jan 2021, 4:35:19 UTC
Report deadline	10 Feb 2021, 4:35:19 UTC
Received	29 Jan 2021, 7:41:09 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	838327
Run time	3 min 22 sec
CPU time	55 sec
Validate state	Valid
Credit	227.51
Device peak FLOPS	1,386.07 GFLOPS
Application version	Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size	294.57 MB
Peak swap size	293.57 MB
Peak disk usage	0.01 MB
Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 47.456218 s. Average time per iteration = 148.300680 ms
Integral 0 time = 47.966849 s
Running likelihood with 17260 stars
Likelihood time = 1.097105 s
<background_integral> 0.000034923559060 </background_integral>
<stream_integral>  6.429115866221100  17.693085961315223  29.304780664555860  0.021073629167205 </stream_integral>
<background_likelihood> -2.985614770718934 </background_likelihood>
<stream_only_likelihood>  -9.032725847371754  -39.546826371704235  -3.283906868771460  -230.416443425726240 </stream_only_likelihood>
<search_likelihood> -2.556268322820613 </search_likelihood>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 47.935435 s. Average time per iteration = 149.798235 ms
Integral 0 time = 48.524206 s
Running likelihood with 17260 stars
Likelihood time = 1.097118 s
<background_integral1> 0.000034944344138 </background_integral1>
<stream_integral1>  6.096382988735901  15.612274057232513  29.261840207994886  0.021592172475375 </stream_integral1>
<background_likelihood1> -2.986493748047153 </background_likelihood1>
<stream_only_likelihood1>  -8.640121459397037  -44.132910774769876  -3.286437870299211  -230.360650462342330 </stream_only_likelihood1>
<search_likelihood1> -2.556242254584864 </search_likelihood1>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 47.597607 s. Average time per iteration = 148.742523 ms
Integral 0 time = 48.151554 s
Running likelihood with 17260 stars
Likelihood time = 1.125183 s
<background_integral2> 0.000034904155652 </background_integral2>
<stream_integral2>  6.354832796415771  14.857708454990767  30.359291416879579  0.021254402638430 </stream_integral2>
<background_likelihood2> -2.986644899003008 </background_likelihood2>
<stream_only_likelihood2>  -8.407433383555677  -46.760111187557946  -3.269312140612864  -230.454648187006740 </stream_only_likelihood2>
<search_likelihood2> -2.556248692808724 </search_likelihood2>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 47.168406 s. Average time per iteration = 147.401267 ms
Integral 0 time = 47.710097 s
Running likelihood with 17260 stars
Likelihood time = 1.089638 s
<background_integral3> 0.000034923591456 </background_integral3>
<stream_integral3>  6.577535367681469  15.817787752491927  30.212711814619404  0.022811458087309 </stream_integral3>
<background_likelihood3> -2.989229267709521 </background_likelihood3>
<stream_only_likelihood3>  -7.865337013780761  -41.650624236415929  -3.264677735088633  -230.006703876752650 </stream_only_likelihood3>
<search_likelihood3> -2.556246863031484 </search_likelihood3>
00:11:12 (12968): called boinc_finish(0)

</stderr_txt>
]]>
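
The chunking figures printed for each of the four bundled work units appear to follow from simple arithmetic on the logged values. The sketch below is not taken from the Milkyway@home source. It assumes that the iteration area is mu_steps * r_steps, that the chunk size is block size * blocks/chunk, that the number of chunks is the iteration area divided by the chunk size and rounded up, and that the average time per iteration is the integration time divided by nu_steps. The function name chunk_layout and its dictionary keys are hypothetical, chosen only for illustration.

```python
import math


def chunk_layout(mu_steps, r_steps, block_size, blocks_per_chunk):
    """Recompute the chunking figures from the logged inputs.

    Assumption: these formulas are inferred from the numbers in the log,
    not taken from the application's source code.
    """
    iteration_area = mu_steps * r_steps          # work items per nu step
    chunk_size = block_size * blocks_per_chunk   # work items per chunk
    num_chunks = math.ceil(iteration_area / chunk_size)
    effective_area = num_chunks * chunk_size     # area padded to whole chunks
    return {
        "iteration_area": iteration_area,
        "chunk_size": chunk_size,
        "num_chunks": num_chunks,
        "effective_area": effective_area,
        "added_area": effective_area - iteration_area,
    }


# Values copied from the log: mu_steps = 800, r_steps = 700,
# block size 5632, 14 blocks/chunk.
layout = chunk_layout(800, 700, 5632, 14)

# First work unit: integration time 47.456218 s over nu_steps = 320.
avg_ms = 47.456218 / 320 * 1000

if __name__ == "__main__":
    for key, value in layout.items():
        print(f"{key}: {value}")
    print(f"average time per iteration: {avg_ms:.6f} ms")
```

Under these assumptions the recomputed values agree with the logged lines Iteration area, Chunk size, Num chunks, Effective area and Added area, and with the average time per iteration reported for the first work unit.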