Task 885304

Name	de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_65685232_2
Workunit	2135751749
Created	29 Jan 2021, 11:50:21 UTC
Sent	29 Jan 2021, 11:58:08 UTC
Report deadline	10 Feb 2021, 11:58:08 UTC
Received	2 Feb 2021, 5:45:55 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	541559
Run time	19 min 48 sec
CPU time	25 sec
Validate state	Valid
Credit	0.00
Device peak FLOPS	250.72 GFLOPS
Application version	Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size	100.72 MB
Peak swap size	95.37 MB
Peak disk usage	0.02 MB
Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using AVX path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.1 CUDA 6.0.1
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Quadro K4000' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      331.82
Version:             OpenCL 1.1 CUDA
Compute capability:  3.0
Max compute units:   4
Clock frequency:     810 Mhz
Global mem size:     3221225472
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas : info : 0 bytes gmem
ptxas : info : Compiling entry function 'probabilities' for 'sm_30'
ptxas : info : Function properties for probabilities
    240 bytes stack frame, 240 bytes spill stores, 240 bytes spill loads
ptxas : info : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 56
Num chunks:     69
Chunk size:     8192
Added area:     5248
Effective area: 565248
Initial wait:   12 ms
Integration time: 347.417288 s. Average time per iteration = 1085.679024 ms
Integral 0 time = 348.669137 s
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 4
Num chunks:     5
Chunk size:     8192
Added area:     360
Effective area: 40960
Initial wait:   12 ms
Integration time: 20.001418 s. Average time per iteration = 62.504431 ms
Integral 1 time = 20.130519 s
Running likelihood with 31815 stars
Likelihood time = 1.650075 s
<background_integral> 0.000047009113699 </background_integral>
<stream_integral>  10.407210697692614  36.322341519396701  123.360339130856190  100.596788668025250 </stream_integral>
<background_likelihood> -3.278935986051737 </background_likelihood>
<stream_only_likelihood>  -39.981021642174575  -3.311656584099222  -3.824428624979828  -9.178624762272193 </stream_only_likelihood>
<search_likelihood> -2.704104964043562 </search_likelihood>
Using AVX path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.1 CUDA 6.0.1
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Quadro K4000' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      331.82
Version:             OpenCL 1.1 CUDA
Compute capability:  3.0
Max compute units:   4
Clock frequency:     810 Mhz
Global mem size:     3221225472
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas : info : 0 bytes gmem
ptxas : info : Compiling entry function 'probabilities' for 'sm_30'
ptxas : info : Function properties for probabilities
    240 bytes stack frame, 240 bytes spill stores, 240 bytes spill loads
ptxas : info : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 56
Num chunks:     69
Chunk size:     8192
Added area:     5248
Effective area: 565248
Initial wait:   12 ms
Integration time: 267.422372 s. Average time per iteration = 835.694911 ms
Integral 0 time = 268.502166 s
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 4
Num chunks:     5
Chunk size:     8192
Added area:     360
Effective area: 40960
Initial wait:   12 ms
Integration time: 19.404984 s. Average time per iteration = 60.640576 ms
Integral 1 time = 19.582163 s
Running likelihood with 31815 stars
Likelihood time = 1.616289 s
<background_integral1> 0.000046868474183 </background_integral1>
<stream_integral1>  15.224963448992661  37.685595917973828  96.838430044956041  60.208194386095442 </stream_integral1>
<background_likelihood1> -3.269830658393962 </background_likelihood1>
<stream_only_likelihood1>  -5.787726487180480  -3.320139773830849  -3.983646879880472  -8.369217803808269 </stream_only_likelihood1>
<search_likelihood1> -2.703822311888850 </search_likelihood1>
Using AVX path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.1 CUDA 6.0.1
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Quadro K4000' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      331.82
Version:             OpenCL 1.1 CUDA
Compute capability:  3.0
Max compute units:   4
Clock frequency:     810 Mhz
Global mem size:     3221225472
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas : info : 0 bytes gmem
ptxas : info : Compiling entry function 'probabilities' for 'sm_30'
ptxas : info : Function properties for probabilities
    240 bytes stack frame, 240 bytes spill stores, 240 bytes spill loads
ptxas : info : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 56
Num chunks:     69
Chunk size:     8192
Added area:     5248
Effective area: 565248
Initial wait:   12 ms
Integration time: 269.056421 s. Average time per iteration = 840.801315 ms
Integral 0 time = 270.158848 s
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 4
Num chunks:     5
Chunk size:     8192
Added area:     360
Effective area: 40960
Initial wait:   12 ms
Integration time: 20.865557 s. Average time per iteration = 65.204866 ms
Integral 1 time = 21.065534 s
Running likelihood with 31815 stars
Likelihood time = 1.756172 s
<background_integral2> 0.000047469450615 </background_integral2>
<stream_integral2>  6.050803020085847  39.320916649715556  104.282432339666670  109.591780343040430 </stream_integral2>
<background_likelihood2> -3.296165004047287 </background_likelihood2>
<stream_only_likelihood2>  -73.744207818550549  -3.241315190854866  -3.877767018456112  -8.034929540287365 </stream_only_likelihood2>
<search_likelihood2> -2.703463811419482 </search_likelihood2>
Using AVX path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.1 CUDA 6.0.1
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll 
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Quadro K4000' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      331.82
Version:             OpenCL 1.1 CUDA
Compute capability:  3.0
Max compute units:   4
Clock frequency:     810 Mhz
Global mem size:     3221225472
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas : info : 0 bytes gmem
ptxas : info : Compiling entry function 'probabilities' for 'sm_30'
ptxas : info : Function properties for probabilities
    240 bytes stack frame, 240 bytes spill stores, 240 bytes spill loads
ptxas : info : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 56
Num chunks:     69
Chunk size:     8192
Added area:     5248
Effective area: 565248
Initial wait:   12 ms
Integration time: 273.260846 s. Average time per iteration = 853.940142 ms
Integral 0 time = 274.372603 s
Estimated Nvidia GPU GFLOP/s: 207 SP GFLOP/s, 26 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 4096 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 4
Num chunks:     5
Chunk size:     8192
Added area:     360
Effective area: 40960
Initial wait:   12 ms
Integration time: 19.723614 s. Average time per iteration = 61.636294 ms
Integral 1 time = 19.887723 s
Running likelihood with 31815 stars
Likelihood time = 1.686615 s
<background_integral3> 0.000047090369883 </background_integral3>
<stream_integral3>  4.864872271351997  36.173091426933823  86.727704683894132  75.779440892955563 </stream_integral3>
<background_likelihood3> -3.272128945800101 </background_likelihood3>
<stream_only_likelihood3>  -92.024183543887219  -3.300489803417233  -3.893016700892159  -6.248969257506015 </stream_only_likelihood3>
<search_likelihood3> -2.703509731813352 </search_likelihood3>
18:00:27 (46256): called boinc_finish(0)

</stderr_txt>
]]>
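
The chunking and timing figures in the log above appear to follow simple arithmetic. The sketch below reproduces them from the logged values. It is a reading of the numbers only, not the application's source: the function and field names are hypothetical, and the formulas are assumptions that happen to match every integral in this log (iteration area = mu_steps * r_steps, chunk size = block size * blocks per chunk, average time per iteration = integration time / nu_steps). The "Chunk estimate" line is not reproduced because the log does not show how it is derived.

```python
def chunk_layout(mu_steps, r_steps, block_size=4096, blocks_per_chunk=2):
    """Recompute the logged chunk figures (assumed formulas, see lead-in)."""
    area = mu_steps * r_steps                  # "Iteration area"
    chunk_size = block_size * blocks_per_chunk  # "Chunk size"
    num_chunks = -(-area // chunk_size)         # ceiling division, "Num chunks"
    effective_area = num_chunks * chunk_size    # "Effective area"
    return {
        "iteration_area": area,
        "chunk_size": chunk_size,
        "num_chunks": num_chunks,
        "added_area": effective_area - area,    # "Added area" (padding)
        "effective_area": effective_area,
    }


def avg_ms_per_iteration(integration_time_s, nu_steps):
    """Assumed: one iteration per nu step, reported in milliseconds."""
    return integration_time_s / nu_steps * 1000.0


if __name__ == "__main__":
    # Integral 0 and integral 1 ranges taken from the log.
    print(chunk_layout(mu_steps=800, r_steps=700))
    print(chunk_layout(mu_steps=58, r_steps=700))
    # First work unit, integral 0: 347.417288 s over nu_steps = 320.
    print(avg_ms_per_iteration(347.417288, 320))
```

With the logged ranges, this yields 69 chunks and 5248 added cells for integral 0, and 5 chunks and 360 added cells for integral 1, matching the log.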