Welcome to MilkyWay@home

Posts by Cat22

1) Message boards : Number crunching : trouble re-upping to milkyqay@home (Message 76592)
Posted 7 Nov 2023 by Profile Cat22
Post:
I removed my milkyway project and tried to log in as an existing user, but I could not remember my password.
So I entered my email address in the "Forgot your password" dialog, but it will not take my email address;
it always says
EMAIL failed Can't send email to  ...<my email> 

Yeah, I double-checked the spelling, as that's a globally common error.
I wish the error message said why it can't send me email.
Also, I am using the
https://milkyway-new.cs.rpi.edu/milkyway/

project address.
I also, just for grins, tried logging in as a new user, but then it says my email is already in use.
TIA
2) Message boards : News : Migrating MilkyWay@home to a New Server (Message 76557)
Posted 4 Nov 2023 by Profile Cat22
Post:
One thing I forgot in my last post: here is the relevant part of all_projects_list.xml
<project>
    <name>Milkyway@home</name>
    <id>14</id>
    <url>https://milkyway-new.cs.rpi.edu/milkyway/</url>
    <web_url>https://milkyway-new.cs.rpi.edu/milkyway/</web_url>
    <general_area>Physical Science</general_area>
    <specific_area>Astronomy</specific_area>
    <description><![CDATA[The goal of Milkyway@Home is to create a highly accurate three dimensional model of the Milky Way galaxy using data gathered by the Sloan Digital Sky Survey.]]></description>
    <home>Rensselaer Polytechnic Institute</home>
    <platforms>
        <name>windows_x86_64[mt]</name>
        <name>x86_64-pc-linux-gnu[mt]</name>
    </platforms>
    <image>https://boinc.berkeley.edu/images/mw.png</image>
    <summary>Study the structure of the Milky Way galaxy</summary>
    <keywords>1 23 24 58</keywords>
</project>
3) Message boards : News : Migrating MilkyWay@home to a New Server (Message 76556)
Posted 4 Nov 2023 by Profile Cat22
Post:
openSUSE Linux Tumbleweed
boinc 7.24.1 x64
I deleted the milkyway project and changed the URL to add -new. I also ran update-ca-certificates,
but I still cannot add the project. In each case I shut down boinc and restarted it; nothing fixed the issue for me.
Someone mentioned adding 2 certs, but where do they go and what name do I give them?
Is that going to fix this?

[---] Fetching configuration file from https://milkyway-new.cs.rpi.edu/milkyway/get_project_config.php
[---] Project communication failed: attempting access to reference site
[---] Internet access OK - project servers may be temporarily down.
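
One way to see why the fetch fails, outside of BOINC, is to request the same URL directly and look at the error. This is a minimal Python sketch, assuming Python 3 is installed on the host. The helper names `project_config_url` and `probe` are made up for illustration, and the exact error text depends on the local OpenSSL build and CA store.

```python
import ssl
import urllib.error
import urllib.request


def project_config_url(master_url: str) -> str:
    """Build the configuration URL shown in the log from the project URL."""
    return master_url.rstrip("/") + "/get_project_config.php"


def probe(url: str, timeout: float = 10.0) -> str:
    """Try to fetch url and return a short description of what happened."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return f"OK: HTTP {resp.status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status.
        return f"HTTP error: {exc.code}"
    except urllib.error.URLError as exc:
        # Certificate problems arrive here, wrapped in exc.reason.
        if isinstance(exc.reason, ssl.SSLError):
            return f"TLS/certificate error: {exc.reason}"
        return f"connection error: {exc.reason}"
    except OSError as exc:
        # Timeouts during the read are not always wrapped in URLError.
        return f"connection error: {exc}"


# Usage (performs a real network request):
#   print(probe(project_config_url("https://milkyway-new.cs.rpi.edu/milkyway/")))
```

If `probe` reports a TLS/certificate error, the problem is likely in the system's CA store rather than in BOINC itself. Any other result points elsewhere.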
4) Message boards : Number crunching : No more GPU tasks? (Message 76192)
Posted 6 Jul 2023 by Profile Cat22
Post:
Did MWH stop handing out tasks for GPUs? I haven't seen one in weeks on 5 machines (since around 6/22 or so).
5) Message boards : Application Code Discussion : Requesting but not getting new tasks for NVIDIA GPU (Message 75062)
Posted 18 Feb 2023 by Profile Cat22
Post:
Yep, libOpenCL.so (the OpenCL client driver bindings) was missing. Installing it fixed things on both systems, thanks.
Apparently some update removed the lib. Why, I don't know, but now it's fixed.
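
For anyone hitting the same thing, here is a quick check for whether the OpenCL loader library is visible to the dynamic linker. It is a minimal Python sketch using only the standard library; the function name `opencl_loader_status` is made up for illustration. It only shows that the library can be found and loaded, not that BOINC will pick it up under the same file name.

```python
import ctypes
import ctypes.util


def opencl_loader_status() -> str:
    """Report whether the OpenCL loader library can be found and loaded."""
    name = ctypes.util.find_library("OpenCL")
    if name is None:
        return "not found"
    try:
        ctypes.CDLL(name)
    except OSError as exc:
        return f"found {name} but failed to load: {exc}"
    return f"loaded {name}"


print(opencl_loader_status())
```

A result of "not found" would match the symptom described here: the GPUs are detected, but no OpenCL tasks are sent.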
6) Message boards : Application Code Discussion : Requesting but not getting new tasks for NVIDIA GPU (Message 75050)
Posted 12 Feb 2023 by Profile Cat22
Post:
Linux x86_64
For a couple of weeks now I have had two PCs that can't get any GPU tasks; 3 other PCs get them just fine.
Here is the startup log. You can see it's requesting NVIDIA tasks but it never gets any.
Until a couple of weeks ago both PCs were executing NVIDIA tasks just fine.

11-Feb-2023 20:39:36 [---] Starting BOINC client version 7.22.0 for x86_64-suse-linux-gnu
11-Feb-2023 20:39:36 [---] log flags: file_xfer, sched_ops, task
11-Feb-2023 20:39:36 [---] Libraries: libcurl/7.87.0 OpenSSL/3.0.7 zlib/1.2.13 brotli/1.0.9 zstd/1.5.2 libidn2/2.3.4 libpsl/0.21.2 (+libidn2/2.3.4) libssh/0.10.4/openssl/zlib nghttp2/1.51.0
11-Feb-2023 20:39:36 [---] Data directory: /home/erbenton/boinc
11-Feb-2023 20:39:36 [---] CUDA: NVIDIA GPU 0: NVIDIA GeForce RTX 2060 (driver version 525.85, CUDA version 12.0, compute capability 7.5, 5919MB, 5919MB available, 6739 GFLOPS peak)
11-Feb-2023 20:39:36 [---] CUDA: NVIDIA GPU 1: NVIDIA GeForce GTX 1660 Ti (driver version 525.85, CUDA version 12.0, compute capability 7.5, 5937MB, 5937MB available, 5668 GFLOPS peak)
11-Feb-2023 20:39:36 [---] libc:  version 2.37
11-Feb-2023 20:39:36 [---] Host name: erb1
11-Feb-2023 20:39:36 [---] Processor: 12 GenuineIntel Intel(R) Core(TM) i7-3960X CPU @ 3.30GHz [Family 6 Model 45 Stepping 7]
11-Feb-2023 20:39:36 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx lahf_lm epb ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid xsaveopt dtherm ida arat pln pts md_clear flush_l1d
11-Feb-2023 20:39:36 [---] OS: Linux openSUSE: openSUSE Tumbleweed [6.1.10-1-default|libc 2.37]
11-Feb-2023 20:39:36 [---] Memory: 31.27 GB physical, 16.00 GB virtual
11-Feb-2023 20:39:36 [---] Disk: 309.64 GB total, 188.64 GB free
11-Feb-2023 20:39:36 [---] Local time is UTC -8 hours
11-Feb-2023 20:39:36 [---] Config: use all coprocessors
11-Feb-2023 20:39:36 [Milkyway@Home] General prefs: from Milkyway@Home (last modified 20-Jun-2020 17:48:32)
11-Feb-2023 20:39:36 [Milkyway@Home] Computer location: home
11-Feb-2023 20:39:36 [Milkyway@Home] General prefs: no separate prefs for home; using your defaults
11-Feb-2023 20:39:36 [---] Reading preferences override file
11-Feb-2023 20:39:36 [---] Preferences:
11-Feb-2023 20:39:36 [---] -  When computer is in use
11-Feb-2023 20:39:36 [---] -     'In use' means mouse/keyboard input in last 3.0 minutes
11-Feb-2023 20:39:36 [---] -     max CPUs used: 12
11-Feb-2023 20:39:36 [---] -     Use at most 100% of the CPU time
11-Feb-2023 20:39:36 [---] -     suspend if non-BOINC CPU load exceeds 90%
11-Feb-2023 20:39:36 [---] -     max memory usage: 21.89 GB
11-Feb-2023 20:39:36 [---] -  When computer is not in use
11-Feb-2023 20:39:36 [---] -     max CPUs used: 12
11-Feb-2023 20:39:36 [---] -     Use at most 100% of the CPU time
11-Feb-2023 20:39:36 [---] -     suspend if non-BOINC CPU load exceeds 90%
11-Feb-2023 20:39:36 [---] -     max memory usage: 23.45 GB
11-Feb-2023 20:39:36 [---] -  Leave apps in memory if not running
11-Feb-2023 20:39:36 [---] -  Store at least 2.00 days of work
11-Feb-2023 20:39:36 [---] -  Store up to an additional 3.00 days of work
11-Feb-2023 20:39:36 [---] -  max disk usage: 6.00 GB
11-Feb-2023 20:39:36 [---] -  (to change preferences, visit a project web site or select Preferences in the Manager)
11-Feb-2023 20:39:36 [---] Setting up project and slot directories
11-Feb-2023 20:39:36 [---] Checking active tasks
11-Feb-2023 20:39:36 [Einstein@Home] URL http://einstein.phys.uwm.edu/; Computer ID 12885175; resource share 100
11-Feb-2023 20:39:36 [Milkyway@Home] URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 966838; resource share 100
11-Feb-2023 20:39:36 [---] Setting up GUI RPC socket
11-Feb-2023 20:39:36 [---] Checking presence of 39 project files
11-Feb-2023 20:39:36 Initialization completed
11-Feb-2023 20:39:36 [Milkyway@Home] Sending scheduler request: To fetch work.
11-Feb-2023 20:39:36 [Milkyway@Home] Requesting new tasks for NVIDIA GPU
11-Feb-2023 20:39:38 [Milkyway@Home] Scheduler request completed: got 0 new tasks
11-Feb-2023 20:39:38 [Milkyway@Home] Project requested delay of 91 seconds
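
When comparing several machines, it can help to pull just the scheduler replies out of a long log. This is a minimal Python sketch, assuming the log has been saved as text. The regular expression is based only on the line format visible in the log above, and `tasks_received` is a made-up helper name.

```python
import re

# Sample lines copied from the log above.
SAMPLE = """\
11-Feb-2023 20:39:36 [Milkyway@Home] Sending scheduler request: To fetch work.
11-Feb-2023 20:39:36 [Milkyway@Home] Requesting new tasks for NVIDIA GPU
11-Feb-2023 20:39:38 [Milkyway@Home] Scheduler request completed: got 0 new tasks
11-Feb-2023 20:39:38 [Milkyway@Home] Project requested delay of 91 seconds
"""

REPLY = re.compile(
    r"\[(?P<project>[^\]]+)\] Scheduler request completed: got (?P<n>\d+) new tasks"
)


def tasks_received(log_text: str) -> dict[str, list[int]]:
    """Map each project name to the task counts from its scheduler replies."""
    counts: dict[str, list[int]] = {}
    for line in log_text.splitlines():
        match = REPLY.search(line)
        if match:
            counts.setdefault(match.group("project"), []).append(int(match.group("n")))
    return counts


print(tasks_received(SAMPLE))
```

A project whose list contains only zeros over many requests is asking for work and never getting any, which is the situation described in this post.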
7) Questions and Answers : Unix/Linux : boic sees 2 GPU's but only uses 1 (Message 74381)
Posted 8 Oct 2022 by Profile Cat22
Post:
Well, I installed boinc version 7.18.1 and now it only sees 1 GPU.
Is there some conflict between an RTX 3070 and an RTX 2060 that is causing this?
Do I need to build my own copy from source?
I did try building from source, but I can't get configure to stop complaining that there are no widgets (I'm on Linux),
and I sure wish I knew which widgets it lacked.
Seems like configure should tell us specifically what is missing, not just "oh you don't have any widgets".
What widgets? What version?
So I'm pretty well busted on building from source.
I have 2 other systems that are running dual GPUs and they work fine:
one has an RTX 2060 and a GTX 1660 Ti, and the other system is running 2 GTX 1660 Ti's.
Neither one has a problem with that config.
I sure would appreciate some guidance.
TIA
8) Questions and Answers : Unix/Linux : boic sees 2 GPU's but only uses 1 (Message 74238)
Posted 22 Sep 2022 by Profile Cat22
Post:
I am no xml expert, but I think that gets you 2 instances running on one GPU, but does not address the second GPU at all.

So, how should it be done?
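
The reasoning about `<ngpus>` can be spelled out as arithmetic: a task that claims half a GPU leaves room for a second task on the same card. This is a minimal Python sketch that reads the values from the app_config.xml quoted in this thread. The helper name `tasks_per_gpu` is made up, and the code only does arithmetic on the file's values; it does not model how the BOINC client actually schedules work.

```python
import math
import xml.etree.ElementTree as ET

# The app_config.xml from this thread (comment line omitted).
APP_CONFIG = """\
<app_config>
<app_version>
<app_name>milkyway</app_name>
<plan_class>opencl_nvidia_101</plan_class>
<avg_ncpus>0.866</avg_ncpus>
<ngpus>0.5</ngpus>
</app_version>
</app_config>
"""


def tasks_per_gpu(app_config_xml: str) -> dict[str, int]:
    """For each plan class, how many tasks fit on one GPU given its <ngpus> share."""
    root = ET.fromstring(app_config_xml)
    result: dict[str, int] = {}
    for app_version in root.findall("app_version"):
        plan_class = app_version.findtext("plan_class", default="")
        ngpus = float(app_version.findtext("ngpus", default="1"))
        # Small epsilon guards against floating-point noise in 1 / ngpus.
        result[plan_class] = math.floor(1.0 / ngpus + 1e-9)
    return result


print(tasks_per_gpu(APP_CONFIG))  # {'opencl_nvidia_101': 2}
```

So 0.5 doubles up tasks on a card, while a value of 1 would mean one task per card. Neither setting by itself tells the client to use a second card.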
9) Questions and Answers : Unix/Linux : boic sees 2 GPU's but only uses 1 (Message 74231)
Posted 22 Sep 2022 by Profile Cat22
Post:
Just want to confirm: you are on Linux, correct? Not Windows or Mac.
10) Questions and Answers : Unix/Linux : boic sees 2 GPU's but only uses 1 (Message 74230)
Posted 22 Sep 2022 by Profile Cat22
Post:
This look about right?
<app_config>
<app_version>
<app_name>milkyway</app_name>
<plan_class>opencl_nvidia_101</plan_class>
<avg_ncpus>0.866</avg_ncpus>
<ngpus>0.5</ngpus>
</app_version>
<!--Your comment-->
</app_config>
11) Questions and Answers : Unix/Linux : boic sees 2 GPU's but only uses 1 (Message 74217)
Posted 21 Sep 2022 by Profile Cat22
Post:
I just added a 2nd GPU, and at boinc startup it sees both GPUs but only uses one.
# lspci|grep VGA
01:00.0 VGA compatible controller: NVIDIA Corporation Device 2488 (rev a1)
02:00.0 VGA compatible controller: NVIDIA Corporation TU104 [GeForce RTX 2060] (rev a1)

21-Sep-2022 09:49:35 [---] CUDA: NVIDIA GPU 0: NVIDIA GeForce RTX 3070 (driver version 515.65, CUDA version 11.7, compute capability 8.6, 4096MB, 3958MB available, 20314 GFLOPS peak)
21-Sep-2022 09:49:35 [---] CUDA: NVIDIA GPU 1: NVIDIA GeForce RTX 2060 (driver version 515.65, CUDA version 11.7, compute capability 7.5, 4096MB, 3970MB available, 12902 GFLOPS peak)
21-Sep-2022 09:49:35 [---] OpenCL: NVIDIA GPU 0: NVIDIA GeForce RTX 3070 (driver version 515.65.01, device version OpenCL 3.0 CUDA, 7979MB, 3958MB available, 20314 GFLOPS peak)
21-Sep-2022 09:49:35 [---] OpenCL: NVIDIA GPU 1: NVIDIA GeForce RTX 2060 (driver version 515.65.01, device version OpenCL 3.0 CUDA, 5935MB, 3970MB available, 12902 GFLOPS peak)
21-Sep-2022 09:49:35 [---] Processor: 16 GenuineIntel Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz [Family 6 Model 158 Stepping 13]
21-Sep-2022 09:49:35 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single ssbd ibrs ibpb stibp ibrs_enhanced tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l1d arch_capabilities
21-Sep-2022 09:49:35 [---] OS: Linux openSUSE: openSUSE Leap 15.3 [5.3.18-59.27-preempt]
21-Sep-2022 09:49:35 [---] Memory: 15.54 GB physical, 4.00 GB virtual
21-Sep-2022 09:49:35 [---] Config: use all coprocessors

I tried doing a project reset but it didn't help.
Then I went to milkyway "Your computers" and it shows this:
7.8.3 	GenuineIntel
Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz [Family 6 Model 158 Stepping 13]  (16 processors) 	
[2] NVIDIA NVIDIA GeForce RTX 3070 (4095MB) driver: 515.65 OpenCL: 3.0 	
Linux openSUSE openSUSE Leap 15.3 [5.3.18-59.27-preempt] 	21 Sep 2022, 18:29:20 UTC

I have a cc_config:
<cc_config>
    <options>
       <use_all_gpus>1</use_all_gpus>
    </options>
<!--    <log_flags>
       <coproc_debug>1</coproc_debug>
       <task_debug>1</task_debug>
    </log_flags>
-->
</cc_config>

Here is my coproc_info.xml. I wanted to add it as an attachment but I don't see how to do it.
 # cat coproc_info.xml
    <coprocs>
    <have_cuda>1</have_cuda>
    <cuda_version>11070</cuda_version>
<coproc_cuda>
   <count>1</count>
   <name>NVIDIA GeForce RTX 3070</name>
   <available_ram>4150263808.000000</available_ram>
   <have_cuda>1</have_cuda>
   <have_opencl>0</have_opencl>
   <peak_flops>20313600000000.000000</peak_flops>
   <cudaVersion>11070</cudaVersion>
   <drvVersion>51565</drvVersion>
   <totalGlobalMem>4294967295.000000</totalGlobalMem>
   <sharedMemPerBlock>49152.000000</sharedMemPerBlock>
   <regsPerBlock>65536</regsPerBlock>
   <warpSize>32</warpSize>
   <memPitch>2147483647.000000</memPitch>
   <maxThreadsPerBlock>1024</maxThreadsPerBlock>
   <maxThreadsDim>1024 1024 64</maxThreadsDim>
   <maxGridSize>2147483647 65535 65535</maxGridSize>
   <clockRate>1725000</clockRate>
   <totalConstMem>65536.000000</totalConstMem>
   <major>8</major>
   <minor>6</minor>
   <textureAlignment>512.000000</textureAlignment>
   <deviceOverlap>1</deviceOverlap>
   <multiProcessorCount>46</multiProcessorCount>
<pci_info>
   <bus_id>1</bus_id>
   <device_id>0</device_id>
   <domain_id>0</domain_id>
</pci_info>
</coproc_cuda>
<coproc_cuda>
   <count>1</count>
   <name>NVIDIA GeForce RTX 2060</name>
   <available_ram>4162846720.000000</available_ram>
   <have_cuda>1</have_cuda>
   <have_opencl>0</have_opencl>
   <peak_flops>12902400000000.000000</peak_flops>
   <cudaVersion>11070</cudaVersion>
   <drvVersion>51565</drvVersion>
   <totalGlobalMem>4294967295.000000</totalGlobalMem>
   <sharedMemPerBlock>49152.000000</sharedMemPerBlock>
   <regsPerBlock>65536</regsPerBlock>
   <warpSize>32</warpSize>
   <memPitch>2147483647.000000</memPitch>
   <maxThreadsPerBlock>1024</maxThreadsPerBlock>
   <maxThreadsDim>1024 1024 64</maxThreadsDim>
   <maxGridSize>2147483647 65535 65535</maxGridSize>
   <clockRate>1680000</clockRate>
   <totalConstMem>65536.000000</totalConstMem>
   <major>7</major>
   <minor>5</minor>
   <textureAlignment>512.000000</textureAlignment>
   <deviceOverlap>1</deviceOverlap>
   <multiProcessorCount>30</multiProcessorCount>
<pci_info>
   <bus_id>2</bus_id>
   <device_id>0</device_id>
   <domain_id>0</domain_id>
</pci_info>
</coproc_cuda>
   <nvidia_opencl>
      <name>NVIDIA GeForce RTX 3070</name>
      <vendor>NVIDIA Corporation</vendor>
      <vendor_id>4318</vendor_id>
      <available>1</available>
      <half_fp_config>0</half_fp_config>
      <single_fp_config>191</single_fp_config>
      <double_fp_config>63</double_fp_config>
      <endian_little>1</endian_little>
      <execution_capabilities>1</execution_capabilities>
      <extensions>cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid cl_khr_pci_bus_info cl_khr_external_semaphore cl_khr_external_memory cl_khr_external_semaphore_opaque_fd cl_khr_external_memory_opaque_fd</extensions>
      <global_mem_size>8366784512</global_mem_size>
      <local_mem_size>49152</local_mem_size>
      <max_clock_frequency>1725</max_clock_frequency>
      <max_compute_units>46</max_compute_units>
      <nv_compute_capability_major>8</nv_compute_capability_major>
      <nv_compute_capability_minor>6</nv_compute_capability_minor>
      <amd_simd_per_compute_unit>0</amd_simd_per_compute_unit>
      <amd_simd_width>0</amd_simd_width>
      <amd_simd_instruction_width>0</amd_simd_instruction_width>
      <opencl_platform_version>OpenCL 3.0 CUDA 11.7.101</opencl_platform_version>
      <opencl_device_version>OpenCL 3.0 CUDA</opencl_device_version>
      <opencl_driver_version>515.65.01</opencl_driver_version>
      <device_num>0</device_num>
      <peak_flops>20313600000000.000000</peak_flops>
      <opencl_available_ram>4150263808.000000</opencl_available_ram>
      <opencl_device_index>0</opencl_device_index>
      <warn_bad_cuda>0</warn_bad_cuda>
   </nvidia_opencl>
   <nvidia_opencl>
      <name>NVIDIA GeForce RTX 2060</name>
      <vendor>NVIDIA Corporation</vendor>
      <vendor_id>4318</vendor_id>
      <available>1</available>
      <half_fp_config>0</half_fp_config>
      <single_fp_config>191</single_fp_config>
      <double_fp_config>63</double_fp_config>
      <endian_little>1</endian_little>
      <execution_capabilities>1</execution_capabilities>
      <extensions>cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid cl_khr_pci_bus_info cl_khr_external_semaphore cl_khr_external_memory cl_khr_external_semaphore_opaque_fd cl_khr_external_memory_opaque_fd</extensions>
      <global_mem_size>6222970880</global_mem_size>
      <local_mem_size>49152</local_mem_size>
      <max_clock_frequency>1680</max_clock_frequency>
      <max_compute_units>30</max_compute_units>
      <nv_compute_capability_major>7</nv_compute_capability_major>
      <nv_compute_capability_minor>5</nv_compute_capability_minor>
      <amd_simd_per_compute_unit>0</amd_simd_per_compute_unit>
      <amd_simd_width>0</amd_simd_width>
      <amd_simd_instruction_width>0</amd_simd_instruction_width>
      <opencl_platform_version>OpenCL 3.0 CUDA 11.7.101</opencl_platform_version>
      <opencl_device_version>OpenCL 3.0 CUDA</opencl_device_version>
      <opencl_driver_version>515.65.01</opencl_driver_version>
      <device_num>1</device_num>
      <peak_flops>12902400000000.000000</peak_flops>
      <opencl_available_ram>4162846720.000000</opencl_available_ram>
      <opencl_device_index>1</opencl_device_index>
      <warn_bad_cuda>0</warn_bad_cuda>
   </nvidia_opencl>
<warning>NVIDIA library reports 2 GPUs</warning>
<warning>ATI: libaticalrt.so: cannot open shared object file: No such file or directory</warning>
    </coprocs>

I have plenty of GPU tasks, so that shouldn't be the issue.
I tried swapping slots on the motherboard and it just does the same thing: whatever is in the first slot is what gets used, and the GPU in the next slot is ignored even though boinc sees it on startup.
What am I missing?
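
Since coproc_info.xml is long, a short script can confirm what the client detected. This is a minimal Python sketch using the standard library XML parser. The sample is a trimmed-down stand-in for the file above with most fields omitted, and `summarize` is a made-up helper name.

```python
import xml.etree.ElementTree as ET

# Trimmed-down stand-in for the coproc_info.xml above (most fields omitted).
SAMPLE = """\
<coprocs>
  <coproc_cuda>
    <name>NVIDIA GeForce RTX 3070</name>
    <pci_info><bus_id>1</bus_id></pci_info>
  </coproc_cuda>
  <coproc_cuda>
    <name>NVIDIA GeForce RTX 2060</name>
    <pci_info><bus_id>2</bus_id></pci_info>
  </coproc_cuda>
  <nvidia_opencl>
    <name>NVIDIA GeForce RTX 3070</name>
    <device_num>0</device_num>
  </nvidia_opencl>
  <nvidia_opencl>
    <name>NVIDIA GeForce RTX 2060</name>
    <device_num>1</device_num>
  </nvidia_opencl>
  <warning>NVIDIA library reports 2 GPUs</warning>
</coprocs>
"""


def summarize(coproc_xml: str) -> dict[str, list[str]]:
    """List the GPU names detected through CUDA and through OpenCL, plus warnings."""
    root = ET.fromstring(coproc_xml)
    return {
        "cuda": [e.findtext("name", "") for e in root.findall("coproc_cuda")],
        "opencl": [e.findtext("name", "") for e in root.findall("nvidia_opencl")],
        "warnings": [w.text or "" for w in root.findall("warning")],
    }


info = summarize(SAMPLE)
print(info["cuda"])
print(info["opencl"])
```

Both lists having two entries would mean detection is fine on the client side and that the problem lies in what gets scheduled or reported afterwards.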
12) Questions and Answers : Unix/Linux : MW stopped using my nvidia GPU (Message 69889)
Posted 4 Jun 2020 by Profile Cat22
Post:
Well, I gave up and did a 'project reset' and lo and behold the nvidia apps are running now :-) yaaaa
13) Questions and Answers : Unix/Linux : MW stopped using my nvidia GPU (Message 69888)
Posted 4 Jun 2020 by Profile Cat22
Post:
Hi,
Thanks for the info. I checked, and the last nvidia task was sent in on June 1; all the others are waiting.
I have plenty of tasks, some nvidia, some CPU. But why is boinc ignoring the nvidia tasks?
All my nvidia tasks are in state "waiting to run". I have 1 nbody simulation 1.76 task running (12 CPUs)
and that's it. In fact, as I write this it finished and started another similar nbody task.
14) Questions and Answers : Unix/Linux : Disk Usage (Message 69885)
Posted 3 Jun 2020 by Profile Cat22
Post:
I assume you have your data storage disk (the one you wish to use) mounted somewhere on your primary disk, right?
Since you're on Linux, can't you just link to the desired location?
e.g.
if boinc is in /home/xray_6/boinc
then
 ln -s <where you want your data dir> /home/xray_6/boinc/projects
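
The same idea as a Python sketch that is safe to run, because it works entirely inside a throwaway temporary directory. The names `data_disk` and `boinc` are stand-ins for the real mount point and BOINC directory, not paths from this thread.

```python
import os
import tempfile

# Everything happens inside a throwaway directory so the sketch is safe to run.
with tempfile.TemporaryDirectory() as tmp:
    target = os.path.join(tmp, "data_disk", "projects")  # where the data should live
    link = os.path.join(tmp, "boinc", "projects")        # where BOINC looks for it
    os.makedirs(target)
    os.makedirs(os.path.dirname(link))

    # Same argument order as `ln -s TARGET LINK_NAME`.
    os.symlink(target, link)

    is_link = os.path.islink(link)
    points_to_target = os.path.realpath(link) == os.path.realpath(target)
    print(is_link, points_to_target)
```

One thing to watch on a real system: if the projects directory already exists as a real directory, `ln -s` creates the link inside it instead of replacing it. Stop BOINC and move the existing directory to the new location first.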
15) Questions and Answers : Unix/Linux : MW stopped using my nvidia GPU (Message 69884)
Posted 3 Jun 2020 by Profile Cat22
Post:
openSUSE Linux (Tumbleweed) x86_64, nvidia driver 440.59

This was going along fine, but at least a day ago all the nvidia tasks went to "waiting to run" status and have not run since. The activity menu options are all set to "Always" and nothing is suspended. There is no app_info.xml, app_config.xml, cc_config.xml, etc., just raw MW and BOINC. Computing preferences allow for 1 CPU per GPU (I have 2 nvidia cards), so that should be OK. The nvidia driver hasn't changed. I just double-checked that I am still part of the video group, so that's not it.
How can I determine what is preventing the GPU from running?
Also, as an aside, how does one view a specific computer on the website? I can't seem to sort by computer ID, so searching for a particular computer in my set of computers is like searching for a needle in a haystack.

Example:
Application Milkyway@home Separation 1.46 (opencl_nvidia_101)
Name de_modfit_86_bundle4_4s_south4s_bgset_2_1588605902_20177642
State Waiting to run
Received Sun 31 May 2020 09:17:52 PM PDT
Report deadline Fri 12 Jun 2020 09:17:51 PM PDT
Resources 0.965 CPUs + 1 NVIDIA GPU
Estimated computation size 42,135 GFLOPs
CPU time ---
CPU time since checkpoint ---
Elapsed time ---
Estimated time remaining 00:03:05
Fraction done 0.000%
Virtual memory size 9.97 GB
Working set size 499.16 MB
Directory slots/2
Executable milkyway_1.46_x86_64-pc-linux-gnu__opencl_nvidia_101

Here is a boinc restart I just did to see if that would help (it didn't):
[---] Starting BOINC client version 7.17.0 for x86_64-pc-linux-gnu
[---] This a development version of BOINC and may not function properly
[---] log flags: file_xfer, sched_ops, task
[---] Libraries: libcurl/7.70.0 OpenSSL/1.1.1g-fips zlib/1.2.11 libidn2/2.3.0 libpsl/0.21.0 (+libidn2/2.3.0) libssh/0.9.3/openssl/zlib nghttp2/1.40.0
[---] Data directory: /home/erbenton/boinc
[---] CUDA: NVIDIA GPU 0: GeForce RTX 2060 (driver version 440.59, CUDA version 10.2, compute capability 7.5, 4096MB, 3970MB available, 6739 GFLOPS peak)
[---] CUDA: NVIDIA GPU 1: GeForce GTX 1660 Ti (driver version 440.59, CUDA version 10.2, compute capability 7.5, 4096MB, 3972MB available, 5668 GFLOPS peak)
[---] OpenCL: NVIDIA GPU 0: GeForce RTX 2060 (driver version 440.59, device version OpenCL 1.2 CUDA, 5932MB, 3970MB available, 6739 GFLOPS peak)
[---] OpenCL: NVIDIA GPU 1: GeForce GTX 1660 Ti (driver version 440.59, device version OpenCL 1.2 CUDA, 5945MB, 3972MB available, 5668 GFLOPS peak)
[---] OpenCL CPU: pthread-Intel(R) Core(TM) i7-3960X CPU @ 3.30GHz (OpenCL driver vendor: The pocl project, driver version 1.4, device version OpenCL 1.2 pocl HSTR: pthread-x86_64-unknown-linux-gnu-sandybridge)
[SETI@home] Found app_info.xml; using anonymous platform
[---] libc: GNU libc version 2.31
[---] Host name: erb1
[---] Processor: 12 GenuineIntel Intel(R) Core(TM) i7-3960X CPU @ 3.30GHz [Family 6 Model 45 Stepping 7]
[---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx lahf_lm epb ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid xsaveopt dtherm ida arat pln pts md_clear flush_l1d
[---] OS: Linux openSUSE: openSUSE Tumbleweed [5.5.4-cstm|libc 2.31 (GNU libc)]
[---] Memory: 31.29 GB physical, 2.00 GB virtual
[---] Disk: 80.10 GB total, 6.77 GB free
[---] Local time is UTC -7 hours
[---] VirtualBox version: 6.1.2r135662
[---] Config: use all coprocessors
[Milkyway@Home] General prefs: from Milkyway@Home (last modified 02-Jun-2020 19:14:24)
[Milkyway@Home] Computer location: home
[Milkyway@Home] General prefs: no separate prefs for home; using your defaults
[---] Reading preferences override file
[---] Preferences:
[---]    max memory usage when active: 32041.71 MB
[---]    max memory usage when idle: 32041.71 MB
[---]    max disk usage: 4.40 GB
[---]    max CPUs used: 10
[---]    (to change preferences, visit a project web site or select Preferences in the Manager)
[---] Setting up project and slot directories
[---] Checking active tasks
[Milkyway@Home] URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 852607; resource share 100
[SETI@home] URL http://setiathome.berkeley.edu/; Computer ID 8730567; resource share 100
[---] Setting up GUI RPC socket
[---] Checking presence of 31 project files
Initialization completed
[SETI@home] Sending scheduler request: To fetch work.
[SETI@home] Requesting new tasks for NVIDIA GPU
[SETI@home] Scheduler request completed: got 0 new tasks
[SETI@home] Project has no tasks available
[SETI@home] Project requested delay of 87264 seconds
[Milkyway@Home] Sending scheduler request: To fetch work.
[Milkyway@Home] Requesting new tasks for NVIDIA GPU
[Milkyway@Home] Scheduler request completed: got 0 new tasks
[Milkyway@Home] Not sending work - last request too recent: 77 sec
[Milkyway@Home] Project requested delay of 91 seconds
16) Message boards : Number crunching : Need help with linux and app_info (Message 69879)
Posted 31 May 2020 by Profile Cat22
Post:
Hi,
The whole system is dedicated to boinc 24/7/365, so for the i9-9900K the answer is 16 CPU threads.
I only want to commit whatever number of CPU threads are required by each GPU application.
My goal is to run 2 concurrent GPU tasks, 1 task per card (which it seems to be doing now that I set the global CPU % down to 90%),
and have the remaining CPU resources crunching CPU tasks.
So if each nvidia app actually requires a full CPU thread to keep it fed, then the remaining 14 threads should be crunching CPU tasks.
What bothers me is that by using the global "Use at most xx CPU percentage" option I am affecting other projects, whereas a decent app_info.xml
or app_config.xml (whichever I need) would apply only to Milkyway and leave the other (presently idle) projects alone. I would only be running a single project,
not more than that concurrently. E.g., I switched to MW only because SETI isn't handing out work while they manage an overwhelming amount of returned results.
TIA
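
The thread budget described in this post works out as follows. This is a minimal Python sketch of the arithmetic only. The function name `cpu_budget` is made up, and the BOINC client's own scheduling of CPU and GPU tasks is more involved than this.

```python
import math


def cpu_budget(total_threads: int, gpu_tasks: int, cpu_per_gpu_task: float) -> tuple[int, float]:
    """Return (threads left for CPU tasks, that count as a percentage of all threads)."""
    reserved = math.ceil(gpu_tasks * cpu_per_gpu_task)
    free = max(total_threads - reserved, 0)
    return free, 100.0 * free / total_threads


# 16 threads, one GPU task per card on two cards, one full thread per GPU task.
free, pct = cpu_budget(total_threads=16, gpu_tasks=2, cpu_per_gpu_task=1.0)
print(free, pct)  # 14 87.5
```

With these numbers, 14 threads is 87.5% of the CPU, which is close to the 90% global setting mentioned above.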
17) Message boards : Number crunching : Need help with linux and app_info (Message 69874)
Posted 30 May 2020 by Profile Cat22
Post:
Hi again,
I just set both systems to a percentage of CPU so that I would have 2 cores free, and after a while it did finally start using both GPUs.
When I was running SETI we did this via an app_info.xml (see below). The SETI app_info.xml allocated however much of a CPU you wanted (.45 in this case) to tend to a single GPU's needs. So far I have not found a working Linux app_info.xml for Milkyway that handles all the apps:
    milkyway_nbody_1.76_x86_64-pc-linux-gnu__mt
    milkyway_1.46_x86_64-pc-linux-gnu__opencl_nvidia_101
    milkyway_1.46_x86_64-pc-linux-gnu


<app_info>
  <app>
     <name>setiathome_v8</name>
  </app>
    <file_info>
      <name>setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101</name>
      <executable/>
    </file_info>
    <app_version>
      <app_name>setiathome_v8</app_name>
      <platform>x86_64-pc-linux-gnu</platform>
      <version_num>801</version_num>
      <plan_class>cuda10.1</plan_class>
      <cmdline>-nobs</cmdline>
      <coproc>
        <type>NVIDIA</type>
        <count>1</count>
      </coproc>
      <avg_ncpus>.45</avg_ncpus>
      <ngpus>1</ngpus>
      <file_ref>
         <file_name>setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101</file_name>
          <main_program/>
      </file_ref>
    </app_version>
  <app>
      <name>setiathome_v8</name>
    </app>
    <file_info>
      <name>MBv8_8.05r3345_avx_linux64</name>
      <executable/>
    </file_info>
    <app_version>
      <app_name>setiathome_v8</app_name>
      <version_num>805</version_num>
      <platform>x86_64-pc-linux-gnu</platform>
      <plan_class>avx</plan_class>
      <cmdline></cmdline>
      <file_ref>
        <file_name>MBv8_8.05r3345_avx_linux64</file_name>
        <main_program/>
      </file_ref>
    </app_version>
</app_info>
18) Message boards : Number crunching : Need help with linux and app_info (Message 69873)
Posted 30 May 2020 by Profile Cat22
Post:
Hi Keith,
I want to run 1 task per GPU. Right now I am running 1 task on 1 GPU and nothing on the 2nd GPU.
19) Message boards : Number crunching : Need help with linux and app_info (Message 69870)
Posted 29 May 2020 by Profile Cat22
Post:
How do I set the number of CPUs that boinc will use, as you mentioned? I want to try cutting it back and see if that gets both cards in use. I think we can tell right away if this is the issue: if I set boinc to use, say, only 8 threads out of the 16 (temporarily, as an experiment), that leaves 4 threads per GPU, so I should see both GPUs in use right away.
Sound right?
20) Message boards : Number crunching : Need help with linux and app_info (Message 69864)
Posted 28 May 2020 by Profile Cat22
Post:
On startup boinc reports:
"[---] Config: use all coprocessors"
so we are set there.



©2024 Astroinformatics Group