Welcome to MilkyWay@home

Task 253410

Name de_modfit_85_bundle4_4s_south4s_bgset_4_1603804501_69841569_1
Workunit 2140375839
Created 23 Jan 2021, 2:22:27 UTC
Sent 23 Jan 2021, 2:30:16 UTC
Report deadline 4 Feb 2021, 2:30:16 UTC
Received 23 Jan 2021, 3:34:25 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 847721
Run time 4 min 3 sec
CPU time 51 sec
Validate state Valid
Credit 0.00
Device peak FLOPS 897.71 GFLOPS
Application version Milkyway@home Separation v1.46 (opencl_nvidia_101)
windows_x86_64
Peak working set size 306.66 MB
Peak swap size 305.95 MB
Peak disk usage 0.02 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 970' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      432.00
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   13
Clock frequency:     1342 Mhz
Global mem size:     4294967296
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 8 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53248
Added area:     25728
Effective area: 585728
Initial wait:   13 ms
Integration time: 55.032738 s. Average time per iteration = 171.977306 ms
Integral 0 time = 55.234422 s
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 6 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39936
Added area:     39272
Effective area: 79872
Initial wait:   0 ms
Integration time: 4.060225 s. Average time per iteration = 12.688203 ms
Integral 1 time = 4.093852 s
Running likelihood with 31964 stars
Likelihood time = 0.656749 s
<background_integral> 0.000046372487462 </background_integral>
<stream_integral>  119.936305580913920  25.071427386319002  0.000000000000000  67.923445166016052 </stream_integral>
<background_likelihood> -3.234521982151058 </background_likelihood>
<stream_only_likelihood>  -3.654840130240934  -3.444732196151885  -221.618688030173500  -10.479088762875248 </stream_only_likelihood>
<search_likelihood> -2.686300001507901 </search_likelihood>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 970' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      432.00
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   13
Clock frequency:     1342 Mhz
Global mem size:     4294967296
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 8 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53248
Added area:     25728
Effective area: 585728
Initial wait:   13 ms
Integration time: 55.051365 s. Average time per iteration = 172.035515 ms
Integral 0 time = 55.292657 s
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 6 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39936
Added area:     39272
Effective area: 79872
Initial wait:   0 ms
Integration time: 4.063409 s. Average time per iteration = 12.698152 ms
Integral 1 time = 4.097251 s
Running likelihood with 31964 stars
Likelihood time = 0.646885 s
<background_integral1> 0.000046166431354 </background_integral1>
<stream_integral1>  139.998348625667400  26.608611973813034  0.000000000000000  113.421022316445020 </stream_integral1>
<background_likelihood1> -3.224124472273243 </background_likelihood1>
<stream_only_likelihood1>  -3.692044042964223  -3.395649389753521  -221.282834511797720  -10.431110881563804 </stream_only_likelihood1>
<search_likelihood1> -2.685006744209154 </search_likelihood1>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 970' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      432.00
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   13
Clock frequency:     1342 Mhz
Global mem size:     4294967296
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 8 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53248
Added area:     25728
Effective area: 585728
Initial wait:   13 ms
Integration time: 55.046162 s. Average time per iteration = 172.019257 ms
Integral 0 time = 55.294227 s
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 6 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39936
Added area:     39272
Effective area: 79872
Initial wait:   0 ms
Integration time: 4.061069 s. Average time per iteration = 12.690840 ms
Integral 1 time = 4.098128 s
Running likelihood with 31964 stars
Likelihood time = 0.672537 s
<background_integral2> 0.000046325341179 </background_integral2>
<stream_integral2>  131.826613283833550  27.014473191109488  0.000000000000000  70.921530210888633 </stream_integral2>
<background_likelihood2> -3.217574228077268 </background_likelihood2>
<stream_only_likelihood2>  -3.688440010309463  -3.394345710088481  -221.739303459880570  -11.531978876196039 </stream_only_likelihood2>
<search_likelihood2> -2.687529564722013 </search_likelihood2>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 970' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      432.00
Version:             OpenCL 1.2 CUDA
Compute capability:  5.2
Max compute units:   13
Clock frequency:     1342 Mhz
Global mem size:     4294967296
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_52'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 8 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53248
Added area:     25728
Effective area: 585728
Initial wait:   13 ms
Integration time: 55.049030 s. Average time per iteration = 172.028219 ms
Integral 0 time = 55.298201 s
Estimated Nvidia GPU GFLOP/s: 1117 SP GFLOP/s, 140 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 6656 with 6 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39936
Added area:     39272
Effective area: 79872
Initial wait:   0 ms
Integration time: 4.061611 s. Average time per iteration = 12.692534 ms
Integral 1 time = 4.096924 s
Running likelihood with 31964 stars
Likelihood time = 0.666982 s
<background_integral3> 0.000046152449635 </background_integral3>
<stream_integral3>  127.019875116378400  27.085433117927110  0.000000000000000  112.208369209001700 </stream_integral3>
<background_likelihood3> -3.240132169507541 </background_likelihood3>
<stream_only_likelihood3>  -3.565720082061197  -3.400597041824356  -221.329465862942730  -4.937961538374448 </stream_only_likelihood3>
<search_likelihood3> -2.685014307274806 </search_likelihood3>
22:27:14 (48548): called boinc_finish(0)

</stderr_txt>
]]>


©2024 Astroinformatics Group