Welcome to MilkyWay@home

Task 267164

Name de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_70414293_0
Workunit 2141010487
Created 23 Jan 2021, 8:31:48 UTC
Sent 23 Jan 2021, 8:41:36 UTC
Report deadline 4 Feb 2021, 8:41:36 UTC
Received 23 Jan 2021, 12:01:26 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 764330
Run time 25 min 52 sec
CPU time 10 min 48 sec
Validate state Valid
Credit 244.01
Device peak FLOPS 193.37 GFLOPS
Application version Milkyway@home Separation v1.46 (opencl_nvidia_101)
windows_x86_64
Peak working set size 291.53 MB
Peak swap size 314.44 MB
Peak disk usage 0.02 MB

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 355.300335 s. Average time per iteration = 1110.313548 ms
Integral 0 time = 355.825579 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 29.376231 s. Average time per iteration = 91.800723 ms
Integral 1 time = 29.458461 s
Running likelihood with 31815 stars
Likelihood time = 1.551113 s
<background_integral> 0.000047406954416 </background_integral>
<stream_integral>  3.443211450105733  29.165759813615278  126.762569799508950  0.000000000000000 </stream_integral>
<background_likelihood> -3.307205729384855 </background_likelihood>
<stream_only_likelihood>  -54.021245682147267  -3.409222277409119  -3.751600811860559  -233.277019742581470 </stream_only_likelihood>
<search_likelihood> -2.703882037087419 </search_likelihood>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 354.061632 s. Average time per iteration = 1106.442601 ms
Integral 0 time = 354.554389 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 29.520569 s. Average time per iteration = 92.251779 ms
Integral 1 time = 29.601735 s
Running likelihood with 31815 stars
Likelihood time = 1.712474 s
<background_integral1> 0.000047189299843 </background_integral1>
<stream_integral1>  4.201961773942758  27.884013580813068  60.148330753124149  0.452727382327193 </stream_integral1>
<background_likelihood1> -3.296445394728480 </background_likelihood1>
<stream_only_likelihood1>  -29.850462355434203  -3.397385861275601  -4.081704914705806  -167.968775019736480 </stream_only_likelihood1>
<search_likelihood1> -2.704153434665565 </search_likelihood1>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 359.313409 s. Average time per iteration = 1122.854402 ms
Integral 0 time = 359.844989 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 29.744909 s. Average time per iteration = 92.952841 ms
Integral 1 time = 29.824299 s
Running likelihood with 31815 stars
Likelihood time = 1.701734 s
<background_integral2> 0.000047198421995 </background_integral2>
<stream_integral2>  5.151949365766488  28.598431799609930  69.310880935685447  0.303925411082876 </stream_integral2>
<background_likelihood2> -3.262838126118576 </background_likelihood2>
<stream_only_likelihood2>  -29.237449268511945  -3.440124676914214  -3.863174740012123  -88.853968066585111 </stream_only_likelihood2>
<search_likelihood2> -2.704158886069345 </search_likelihood2>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       Intel(R) OpenCL HD Graphics
  Version:    OpenCL 2.1 
  Vendor:     Intel(R) Corporation
  Extensions: 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 1
Found 1 CL device
Device 'GeForce 940MX' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      461.09
Version:             OpenCL 1.2 CUDA
Compute capability:  5.0
Max compute units:   3
Clock frequency:     1241 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_50'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 388 bytes cmem[0], 184 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 15 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 48
Num chunks:     49
Chunk size:     11520
Added area:     4480
Effective area: 564480
Initial wait:   12 ms
Integration time: 350.907528 s. Average time per iteration = 1096.586025 ms
Integral 0 time = 351.442348 s
Estimated Nvidia GPU GFLOP/s: 238 SP GFLOP/s, 30 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 768 with 17 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 14 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 3
Num chunks:     4
Chunk size:     13056
Added area:     11624
Effective area: 52224
Initial wait:   14 ms
Integration time: 28.995823 s. Average time per iteration = 90.611946 ms
Integral 1 time = 29.078687 s
Running likelihood with 31815 stars
Likelihood time = 1.789727 s
<background_integral3> 0.000047100340806 </background_integral3>
<stream_integral3>  4.161802276776464  31.398603452930150  36.580927251885974  0.343772882004513 </stream_integral3>
<background_likelihood3> -3.288169141377718 </background_likelihood3>
<stream_only_likelihood3>  -53.304832570662036  -3.368763402135397  -4.639514765082314  -188.725824596361320 </stream_only_likelihood3>
<search_likelihood3> -2.706921730408591 </search_likelihood3>
12:57:30 (79308): called boinc_finish(0)

</stderr_txt>
]]>


©2024 Astroinformatics Group