Welcome to MilkyWay@home

Task 1783846

Name: de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_67455663_2
Workunit: 2137728997
Created: 31 Jan 2021, 17:51:41 UTC
Sent: 31 Jan 2021, 17:51:42 UTC
Report deadline: 12 Feb 2021, 17:51:42 UTC
Received: 1 Feb 2021, 2:46:45 UTC
Server state: Over
Outcome: Success
Client state: Done
Exit status: 0 (0x00000000)
Computer ID: 616064
Run time: 40 min 3 sec
CPU time: 12 sec
Validate state: Valid
Credit: 244.01
Device peak FLOPS: 141.25 GFLOPS
Application version: Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size: 163.13 MB
Peak swap size: 164.15 MB
Peak disk usage: 0.02 MB
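
The timing fields above can be combined into a turnaround time, a deadline margin, and a credit rate. The following is a minimal Python sketch with the values copied by hand from the task summary. The helper name `parse_utc` and the derived metrics are illustrative assumptions; none of them is reported by the BOINC server itself.

```python
from datetime import datetime, timezone

# Timestamp layout used in the task summary, e.g. "1 Feb 2021, 2:46:45 UTC".
# Note: %b matches English month abbreviations under the default C locale.
FMT = "%d %b %Y, %H:%M:%S"

def parse_utc(stamp: str) -> datetime:
    """Parse a summary timestamp (hypothetical helper) into an aware datetime."""
    naive = datetime.strptime(stamp.replace(" UTC", ""), FMT)
    return naive.replace(tzinfo=timezone.utc)

# Values copied from the task summary.
sent = parse_utc("31 Jan 2021, 17:51:42 UTC")
received = parse_utc("1 Feb 2021, 2:46:45 UTC")
deadline = parse_utc("12 Feb 2021, 17:51:42 UTC")
run_time_s = 40 * 60 + 3   # "Run time 40 min 3 sec"
cpu_time_s = 12            # "CPU time 12 sec"
credit = 244.01

turnaround = received - sent          # wall-clock time between send and report
margin = deadline - received          # time left before the report deadline
credit_per_hour = credit * 3600 / run_time_s
cpu_fraction = cpu_time_s / run_time_s  # small here because the GPU does the work

print("Turnaround:     ", turnaround)
print("Deadline margin:", margin)
print("Credit per hour: %.2f" % credit_per_hour)
print("CPU fraction:    %.4f" % cpu_fraction)
```

The large gap between run time and CPU time is consistent with an OpenCL task that spends most of its time waiting on the GPU rather than computing on the host.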

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 9.1.84
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce GT 640' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      391.35
Version:             OpenCL 1.2 CUDA
Compute capability:  3.0
Max compute units:   2
Clock frequency:     901 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_30'
ptxas info    : Function properties for probabilities
ptxas         .     88 bytes stack frame, 88 bytes spill stores, 88 bytes spill loads
ptxas info    : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 100
Num chunks:     137
Chunk size:     4096
Added area:     1152
Effective area: 561152
Initial wait:   12 ms
Integration time: 555.240905 s. Average time per iteration = 1735.127828 ms
Integral 0 time = 555.769703 s
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 7
Num chunks:     10
Chunk size:     4096
Added area:     360
Effective area: 40960
Initial wait:   13 ms
Integration time: 43.918577 s. Average time per iteration = 137.245555 ms
Integral 1 time = 43.979775 s
Running likelihood with 31815 stars
Likelihood time = 1.141410 s
<background_integral> 0.000047124292990 </background_integral>
<stream_integral>  3.709985215194907  28.902719104558276  84.551037164494602  90.301580964891059 </stream_integral>
<background_likelihood> -3.283623667261649 </background_likelihood>
<stream_only_likelihood>  -65.076594315500358  -3.433619685850789  -3.824771098513979  -4.301801020263833 </stream_only_likelihood>
<search_likelihood> -2.703308869973528 </search_likelihood>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 9.1.84
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce GT 640' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      391.35
Version:             OpenCL 1.2 CUDA
Compute capability:  3.0
Max compute units:   2
Clock frequency:     901 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_30'
ptxas info    : Function properties for probabilities
ptxas         .     88 bytes stack frame, 88 bytes spill stores, 88 bytes spill loads
ptxas info    : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 100
Num chunks:     137
Chunk size:     4096
Added area:     1152
Effective area: 561152
Initial wait:   12 ms
Integration time: 554.122828 s. Average time per iteration = 1731.633836 ms
Integral 0 time = 554.661353 s
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 7
Num chunks:     10
Chunk size:     4096
Added area:     360
Effective area: 40960
Initial wait:   13 ms
Integration time: 43.558358 s. Average time per iteration = 136.119869 ms
Integral 1 time = 43.615985 s
Running likelihood with 31815 stars
Likelihood time = 1.065237 s
<background_integral1> 0.000046702418536 </background_integral1>
<stream_integral1>  3.301018892850659  30.125144882469264  80.708625915455883  3.548785974909140 </stream_integral1>
<background_likelihood1> -3.252605798371450 </background_likelihood1>
<stream_only_likelihood1>  -60.525375316274285  -3.429513892692786  -3.879415274191225  -114.551300993263200 </stream_only_likelihood1>
<search_likelihood1> -2.703281151286559 </search_likelihood1>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 9.1.84
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce GT 640' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      391.35
Version:             OpenCL 1.2 CUDA
Compute capability:  3.0
Max compute units:   2
Clock frequency:     901 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_30'
ptxas info    : Function properties for probabilities
ptxas         .     88 bytes stack frame, 88 bytes spill stores, 88 bytes spill loads
ptxas info    : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 100
Num chunks:     137
Chunk size:     4096
Added area:     1152
Effective area: 561152
Initial wait:   12 ms
Integration time: 553.966867 s. Average time per iteration = 1731.146458 ms
Integral 0 time = 554.515435 s
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 7
Num chunks:     10
Chunk size:     4096
Added area:     360
Effective area: 40960
Initial wait:   13 ms
Integration time: 44.320438 s. Average time per iteration = 138.501370 ms
Integral 1 time = 44.377406 s
Running likelihood with 31815 stars
Likelihood time = 1.227481 s
<background_integral2> 0.000047230653080 </background_integral2>
<stream_integral2>  4.005039032707455  36.510568659895718  81.417448885983120  0.978363754731990 </stream_integral2>
<background_likelihood2> -3.286896502889191 </background_likelihood2>
<stream_only_likelihood2>  -70.989098512907347  -3.322102739191019  -3.997359188671971  -204.169661648230800 </stream_only_likelihood2>
<search_likelihood2> -2.703423403013118 </search_likelihood2>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 9.1.84
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce GT 640' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      391.35
Version:             OpenCL 1.2 CUDA
Compute capability:  3.0
Max compute units:   2
Clock frequency:     901 Mhz
Global mem size:     2147483648
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_30'
ptxas info    : Function properties for probabilities
ptxas         .     88 bytes stack frame, 88 bytes spill stores, 88 bytes spill loads
ptxas info    : Used 63 registers, 388 bytes cmem[0], 192 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 100
Num chunks:     137
Chunk size:     4096
Added area:     1152
Effective area: 561152
Initial wait:   12 ms
Integration time: 552.099790 s. Average time per iteration = 1725.311843 ms
Integral 0 time = 552.648539 s
Estimated Nvidia GPU GFLOP/s: 115 SP GFLOP/s, 14 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2048 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 13 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 7
Num chunks:     10
Chunk size:     4096
Added area:     360
Effective area: 40960
Initial wait:   13 ms
Integration time: 43.677669 s. Average time per iteration = 136.492716 ms
Integral 1 time = 43.732540 s
Running likelihood with 31815 stars
Likelihood time = 1.058833 s
<background_integral3> 0.000047063663680 </background_integral3>
<stream_integral3>  6.933174219206183  30.256307840543705  80.707559918228810  120.547241225501490 </stream_integral3>
<background_likelihood3> -3.277887554795213 </background_likelihood3>
<stream_only_likelihood3>  -35.166476195396726  -3.420094890151222  -3.817150700803425  -11.362068268474536 </stream_only_likelihood3>
<search_likelihood3> -2.703336705746023 </search_likelihood3>
03:46:36 (5696): called boinc_finish(0)

</stderr_txt>
]]>
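
The area and timing figures that the log prints for each integral appear to follow simple arithmetic. The sketch below, in Python, reproduces them from the reported step counts and block settings. The relationships are inferred from the numbers in this log alone, not taken from the application source, and the function names `chunk_plan` and `avg_ms_per_iteration` are hypothetical. The "Chunk estimate" values (100 and 7) are not reproduced because the log does not show how they are derived.

```python
def chunk_plan(mu_steps, r_steps, block_size, blocks_per_chunk):
    """Reproduce the per-integral area figures printed in the log.

    Assumed relationships (inferred from the log values):
      iteration area = mu_steps * r_steps
      chunk size     = block size * blocks per chunk
      num chunks     = iteration area / chunk size, rounded up
      effective area = num chunks * chunk size
      added area     = effective area - iteration area
    """
    iteration_area = mu_steps * r_steps
    chunk_size = block_size * blocks_per_chunk
    num_chunks = -(-iteration_area // chunk_size)  # integer ceiling division
    effective_area = num_chunks * chunk_size
    return {
        "iteration_area": iteration_area,
        "chunk_size": chunk_size,
        "num_chunks": num_chunks,
        "effective_area": effective_area,
        "added_area": effective_area - iteration_area,
    }

def avg_ms_per_iteration(integration_time_s, nu_steps):
    """Average time per nu step in milliseconds (assumed definition)."""
    return integration_time_s / nu_steps * 1000.0

# Integral 0: mu_steps = 800, r_steps = 700, block size 2048, 2 blocks/chunk.
integral0 = chunk_plan(800, 700, 2048, 2)
# Integral 1: mu_steps = 58, r_steps = 700, same block settings.
integral1 = chunk_plan(58, 700, 2048, 2)
# First work unit, integral 0: 555.240905 s over nu_steps = 320.
avg0 = avg_ms_per_iteration(555.240905, 320)

print(integral0)
print(integral1)
print(round(avg0, 3))
```

Under these assumptions the computed values agree with the "Iteration area", "Chunk size", "Num chunks", "Effective area", and "Added area" lines for both integrals, and with the reported average time per iteration for integral 0 to within rounding.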

