Task 1334265

Task 1334265

Name	de_modfit_81_bundle4_4s_south4s_bgset_4_1603804501_76119272_0
Workunit	2147326890
Created	30 Jan 2021, 9:41:56 UTC
Sent	30 Jan 2021, 9:50:16 UTC
Report deadline	11 Feb 2021, 9:50:16 UTC
Received	30 Jan 2021, 11:55:23 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	838327
Run time	3 min 42 sec
CPU time	1 min 15 sec
Validate state	Valid
Credit	0.00
Device peak FLOPS	1,386.07 GFLOPS
Application version	Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size	291.89 MB
Peak swap size	296.34 MB
Peak disk usage	0.02 MB
Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 50.995103 s. Average time per iteration = 159.359698 ms
Integral 0 time = 51.497589 s
Running likelihood with 34614 stars
Likelihood time = 2.256977 s
<background_integral> 0.000059439702920 </background_integral>
<stream_integral>  161.482527700369760  17.387015853519863  0.449276626575925  19.417466556117674 </stream_integral>
<background_likelihood> -3.370026546621824 </background_likelihood>
<stream_only_likelihood>  -3.415447869510596  -4.477945421783077  -63.482997711441989  -4.679574339819787 </stream_only_likelihood>
<search_likelihood> -2.788954905464006 </search_likelihood>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 51.571328 s. Average time per iteration = 161.160401 ms
Integral 0 time = 52.159547 s
Running likelihood with 34614 stars
Likelihood time = 2.216658 s
<background_integral1> 0.000059459666457 </background_integral1>
<stream_integral1>  161.286249400165100  17.319509282247186  0.457890105553453  19.400957351014732 </stream_integral1>
<background_likelihood1> -3.371533612167747 </background_likelihood1>
<stream_only_likelihood1>  -3.414419559647291  -4.478425175033314  -62.565627871056520  -4.669334146518541 </stream_only_likelihood1>
<search_likelihood1> -2.788954726493921 </search_likelihood1>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 51.472881 s. Average time per iteration = 160.852753 ms
Integral 0 time = 52.030980 s
Running likelihood with 34614 stars
Likelihood time = 2.252588 s
<background_integral2> 0.000059474252209 </background_integral2>
<stream_integral2>  160.085352088143940  17.343053672467985  0.457505551061532  19.442920985071524 </stream_integral2>
<background_likelihood2> -3.370991394035229 </background_likelihood2>
<stream_only_likelihood2>  -3.413410940828392  -4.482666426154625  -61.838838444171664  -4.669564143027585 </stream_only_likelihood2>
<search_likelihood2> -2.788955264358704 </search_likelihood2>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.1.96
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 3 CL devices
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      456.71
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   22
Clock frequency:     1076 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1515 SP GFLOP/s, 189 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 5632 with 14 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 7
Num chunks:     8
Chunk size:     78848
Added area:     70784
Effective area: 630784
Initial wait:   13 ms
Integration time: 51.482796 s. Average time per iteration = 160.883739 ms
Integral 0 time = 52.037218 s
Running likelihood with 34614 stars
Likelihood time = 2.360414 s
<background_integral3> 0.000059465476181 </background_integral3>
<stream_integral3>  161.209776349116080  17.613237582963983  0.476380335395537  19.126617359888840 </stream_integral3>
<background_likelihood3> -3.371855783980996 </background_likelihood3>
<stream_only_likelihood3>  -3.412361282079298  -4.455044234804165  -58.696383821039511  -4.699513633905811 </stream_only_likelihood3>
<search_likelihood3> -2.788957783022686 </search_likelihood3>
04:22:02 (900): called boinc_finish(0)

</stderr_txt>
]]>