Welcome to MilkyWay@home

Task 306808

Name de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_70950256_0
Workunit 2141603844
Created 24 Jan 2021, 1:00:46 UTC
Sent 24 Jan 2021, 1:08:26 UTC
Report deadline 5 Feb 2021, 1:08:26 UTC
Received 24 Jan 2021, 5:50:59 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 764330
Run time 25 min 7 sec
CPU time 10 min 9 sec
Validate state Valid
Credit 0.00
Device peak FLOPS 193.37 GFLOPS
Application version Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size 293.21 MB
Peak swap size 314.55 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 327.950862 s. Average time per iteration = 1024.846445 ms
Integral 0 time = 328.446589 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 28.111035 s. Average time per iteration = 87.846985 ms
Integral 1 time = 28.190224 s
Running likelihood with 31815 stars
Likelihood time = 1.659059 s
<background_integral> 0.000047119884908 </background_integral>
<stream_integral>  2.932645594033070  28.452285954117571  87.668026773783680  0.000000000000000 </stream_integral>
<background_likelihood> -3.267627920895356 </background_likelihood>
<stream_only_likelihood>  -45.081192520663365  -3.473438083249162  -3.858538448702024  -232.519715405666180 </stream_only_likelihood>
<search_likelihood> -2.705262815747091 </search_likelihood>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 360.738475 s. Average time per iteration = 1127.307736 ms
Integral 0 time = 361.276507 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 29.163962 s. Average time per iteration = 91.137382 ms
Integral 1 time = 29.239372 s
Running likelihood with 31815 stars
Likelihood time = 1.583406 s
<background_integral1> 0.000046955493019 </background_integral1>
<stream_integral1>  8.560525445113807  26.692604060935398  88.398260972290345  0.000000000022404 </stream_integral1>
<background_likelihood1> -3.288463664254308 </background_likelihood1>
<stream_only_likelihood1>  -15.696282318978708  -3.558010530793432  -3.830449900939965  -229.100830669582140 </stream_only_likelihood1>
<search_likelihood1> -2.706723502858184 </search_likelihood1>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 328.840091 s. Average time per iteration = 1027.625286 ms
Integral 0 time = 329.348208 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 28.386709 s. Average time per iteration = 88.708466 ms
Integral 1 time = 28.467016 s
Running likelihood with 31815 stars
Likelihood time = 1.675726 s
<background_integral2> 0.000047612930800 </background_integral2>
<stream_integral2>  1.901689157398686  24.994705467025629  4.626260305522910  0.000000000000000 </stream_integral2>
<background_likelihood2> -3.319076889042418 </background_likelihood2>
<stream_only_likelihood2>  -96.024743396281650  -3.401518047103496  -42.112911508409510  -228.893274476332460 </stream_only_likelihood2>
<search_likelihood2> -2.718175712753964 </search_likelihood2>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 357.992864 s. Average time per iteration = 1118.727700 ms
Integral 0 time = 358.509303 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 29.990989 s. Average time per iteration = 93.721841 ms
Integral 1 time = 30.065515 s
Running likelihood with 31815 stars
Likelihood time = 1.587015 s
<background_integral3> 0.000047159048314 </background_integral3>
<stream_integral3>  3.244276548965142  30.734300266357536  81.894842417818907  0.000000000000003 </stream_integral3>
<background_likelihood3> -3.301406500534017 </background_likelihood3>
<stream_only_likelihood3>  -57.438713311582383  -3.473530618605273  -3.724813338347662  -232.040202113513690 </stream_only_likelihood3>
<search_likelihood3> -2.705528128843730 </search_likelihood3>
06:40:21 (48736): called boinc_finish(0)

</stderr_txt>
]]>
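
The chunking figures logged for each integral above appear to follow from simple arithmetic on the logged step counts and block sizes. The following is a minimal sketch that reproduces them, assuming the iteration area is mu_steps * r_steps, the chunk size is block size * blocks/chunk, and the average time per iteration is the integration time divided by nu_steps. The function names (`chunk_layout`, `avg_iteration_ms`) are hypothetical and are not part of milkyway_separation; this is a consistency check against the log, not the application's actual code.

```python
def chunk_layout(mu_steps, r_steps, block_size, blocks_per_chunk):
    """Recompute the chunking figures from the logged inputs (assumed relations)."""
    area = mu_steps * r_steps                 # "Iteration area"
    chunk_size = block_size * blocks_per_chunk  # "Chunk size"
    chunk_estimate = area // chunk_size       # "Chunk estimate" (rounded down)
    num_chunks = -(-area // chunk_size)       # "Num chunks" (rounded up)
    effective_area = num_chunks * chunk_size  # "Effective area"
    return {
        "iteration_area": area,
        "chunk_size": chunk_size,
        "chunk_estimate": chunk_estimate,
        "num_chunks": num_chunks,
        "effective_area": effective_area,
        "added_area": effective_area - area,  # "Added area" (padding)
    }


def avg_iteration_ms(integration_time_s, nu_steps):
    """Average time per nu step in milliseconds (assumed relation)."""
    return integration_time_s / nu_steps * 1000.0


# Inputs taken from the log: integral 0 and integral 1 of the first workunit.
integral0 = chunk_layout(mu_steps=800, r_steps=700, block_size=768, blocks_per_chunk=15)
integral1 = chunk_layout(mu_steps=58, r_steps=700, block_size=768, blocks_per_chunk=17)
print(integral0)
print(integral1)
print(avg_iteration_ms(327.950862, 320))
```

With the logged inputs, the sketch gives 49 chunks of 11520 with 4480 added area for integral 0, and 4 chunks of 13056 with 11624 added area for integral 1, matching the log lines.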

