Scheduled Maintenance Concluded

Author	Message
Peciak Send message Joined: 27 Jun 09 Posts: 12 Credit: 148,038,330 RAC: 0	Message 65676 - Posted: 11 Nov 2016, 22:00:53 UTC <core_client_version>7.6.22</core_client_version> <![CDATA[ <message> aborted by user </message> <stderr_txt> <search_application> milkyway_separation 1.42 Windows x86_64 double </search_application> Reading preferences ended prematurely BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 5 </number_WUs> <number_params_per_WU> 20 </number_params_per_WU> Using AVX path Integral 0 time = 641.348502 s Running likelihood with 84046 stars Likelihood time = 1.657994 s <background_integral> 0.000069192102838 </background_integral> <stream_integral> 87.919837480121885 1231.763016660424500 218.680946939705140 </stream_integral> <background_likelihood> -3.558790208380722 </background_likelihood> <stream_only_likelihood> -19.405544659459434 -5.167057809848690 -3.549714346561023 </stream_only_likelihood> <search_likelihood> -3.135608747542042 </search_likelihood> Using AVX path ID: 65676 · Rating: 0 · rate: / Reply Quote

Jake Weiss Volunteer moderator Project developer Project tester Project scientist Send message Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0	Message 65677 - Posted: 11 Nov 2016, 22:03:43 UTC Peciak, That's weird. The version info doesn't say its an OpenCL app. Let me see if I forgot to set the flag when I recompiled it. Sorry about this, I haven't slept much over the last couple days trying to get this update done. Jake ID: 65677 · Rating: 0 · rate: / Reply Quote

Rich Send message Joined: 14 Nov 14 Posts: 9 Credit: 214,644,261 RAC: 0	Message 65678 - Posted: 11 Nov 2016, 22:03:46 UTC Looks like the new version isn't using the GPU. My 750Ti show no GPU usage and the CPU task is being used fully. The first two WU's validated but used the CPU. Here is from the standard error file if this will help: <core_client_version>7.6.33</core_client_version> <![CDATA[ <stderr_txt> <search_application> milkyway_separation 1.42 Windows x86_64 double </search_application> Reading preferences ended prematurely BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 1 </number_WUs> <number_params_per_WU> 20 </number_params_per_WU> Using AVX path Integral 0 time = 836.284111 s Running likelihood with 84046 stars Likelihood time = 1.842669 s <background_integral> 0.000183211346476 </background_integral> <stream_integral> 35.170988580602945 220.771463270831330 56.153686511532385 </stream_integral> <background_likelihood> -3.782092719257110 </background_likelihood> <stream_only_likelihood> -50.039987133739942 -4.114360911786672 -3.380701603567321 </stream_only_likelihood> <search_likelihood> -2.964758398505063 </search_likelihood> 16:52:25 (20784): called boinc_finish(0) </stderr_txt> ID: 65678 · Rating: 0 · rate: / Reply Quote

Jake Weiss Volunteer moderator Project developer Project tester Project scientist Send message Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0	Message 65679 - Posted: 11 Nov 2016, 22:09:53 UTC Rich, I see it, it was my fault during building of the program. I have a fix made, just double and triple checking to make sure I don't make another silly mistake this time. Will be released soon. Jake ID: 65679 · Rating: 0 · rate: / Reply Quote

Rich Send message Joined: 14 Nov 14 Posts: 9 Credit: 214,644,261 RAC: 0	Message 65680 - Posted: 11 Nov 2016, 22:13:37 UTC Thanks, lol, I'll let the 12 WU's finish up with the CPU and then let more units start loading. I've done many mistakes in my time, fully understand. Keep up the good work. ID: 65680 · Rating: 0 · rate: / Reply Quote

Rymorea Send message Joined: 6 Oct 14 Posts: 46 Credit: 20,017,425 RAC: 0	Message 65681 - Posted: 11 Nov 2016, 22:27:05 UTC Same for me too. Both AMD R9 270x and Nvidia 750TI GPU use %0 CPU full core I have to abort those task cause Screen lag too much and other CPU projects hang. ID: 65681 · Rating: 0 · rate: / Reply Quote

LesCap Send message Joined: 24 Oct 16 Posts: 12 Credit: 56,127,036 RAC: 0	Message 65682 - Posted: 11 Nov 2016, 22:36:11 UTC - in response to Message 65679. Hey Jake The GPU units are crunching again after downloading ver. 1.42 :) 11/11/2016 3:28:56 PM \| Milkyway@Home \| Started download of milkyway_1.42_windows_x86_64__opencl_ati_101.exe 11/11/2016 3:29:33 PM \| Milkyway@Home \| Finished download of milkyway_1.42_windows_x86_64__opencl_ati_101.exe 11/11/2016 3:29:33 PM \| Milkyway@Home \| Starting task de_modfit_fast_19_3s_136_bundle5_ModfitConstraints3_4_1478900148_13142_0 Thanks again Les Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7] (8 processors) AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00) ID: 65682 · Rating: 0 · rate: / Reply Quote

Jake Weiss Volunteer moderator Project developer Project tester Project scientist Send message Joined: 25 Feb 13 Posts: 580 Credit: 94,200,158 RAC: 0	Message 65683 - Posted: 11 Nov 2016, 22:37:00 UTC Hey Everyone, I'm still working on a fix. Having issues with other the Windows OpenCL version but not Linux OpenCL. Need to see if my drivers are back on the machine I'm testing with or if there is actually an issue with the client. Might take a couple hours for a fix. Sorry, Jake ID: 65683 · Rating: 0 · rate: / Reply Quote

Werkstatt Send message Joined: 19 Feb 08 Posts: 350 Credit: 141,284,369 RAC: 0	Message 65684 - Posted: 11 Nov 2016, 22:39:19 UTC Well, yes, they start crunching, but at ~70% they restart with 0%. In 45 minutes not a single gpu wu finished. ID: 65684 · Rating: 0 · rate: / Reply Quote

mmonnin Send message Joined: 2 Oct 16 Posts: 167 Credit: 1,014,814,760 RAC: 7	Message 65685 - Posted: 11 Nov 2016, 22:45:53 UTC My 280x in Linux is completing WUs in about 150-155 seconds at 4x. Same time per task as before. Still lots of validation inconclusive but I had a lot of those before. It still has app v1.4. ID: 65685 · Rating: 0 · rate: / Reply Quote

Bri Send message Joined: 14 May 14 Posts: 2 Credit: 3,945,126 RAC: 0	Message 65686 - Posted: 11 Nov 2016, 22:47:07 UTC - in response to Message 65684. Yes, On both my machines CPU wu's are also resetting at about 60% ID: 65686 · Rating: 0 · rate: / Reply Quote

LesCap Send message Joined: 24 Oct 16 Posts: 12 Credit: 56,127,036 RAC: 0	Message 65687 - Posted: 11 Nov 2016, 22:49:35 UTC - in response to Message 65684. Last modified: 11 Nov 2016, 22:52:33 UTC OOOPs.....similar here Werkstatt. 1st GPU bundle5 unit jumped back to 0% when it reached ~40% complete. CPU units are still crunching through to completion. Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7] (8 processors) AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00) ID: 65687 · Rating: 0 · rate: / Reply Quote

Arivald Ha'gel Send message Joined: 30 Apr 14 Posts: 67 Credit: 160,674,488 RAC: 0	Message 65688 - Posted: 11 Nov 2016, 22:55:25 UTC Last modified: 11 Nov 2016, 23:05:17 UTC Looks like Milkyway@Home 1.42 (opencl_ati_101) app is CPU only app. No GPU usage (at all). 4 WUs have had gone back to 0% around 77-78%. (Copied from slots of one of WU after it went back to 0%) <search_application> milkyway_separation 1.42 Windows x86_64 double </search_application> Reading preferences ended prematurely BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 5 </number_WUs> <number_params_per_WU> 20 </number_params_per_WU> Using AVX path Integral 0 time = 1033.276421 s Running likelihood with 84046 stars Likelihood time = 3.220463 s <background_integral> 0.000132467393354 </background_integral> <stream_integral> 114.693583961640020 1101.118918906797000 12.229260004040517 </stream_integral> <background_likelihood> -3.804522759975287 </background_likelihood> <stream_only_likelihood> -29.103213665631166 -4.472110400350248 -181.009708341877600 </stream_only_likelihood> <search_likelihood> -3.416589456232599 </search_likelihood> Using AVX path ID: 65688 · Rating: 0 · rate: / Reply Quote

Keith Myers Send message Joined: 24 Jan 11 Posts: 739 Credit: 571,407,420 RAC: 61,465	Message 65689 - Posted: 11 Nov 2016, 23:07:54 UTC My client doesn't like something about the downloaded 1.42 file. Had the wrong size error when it automatically downloaded it. Had the same error when I manually downloaded it from the directory. 11/11/2016 3:03:52 PM \| Milkyway@Home \| Resetting file projects/milkyway.cs.rpi.edu_milkyway/milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe: wrong size 11/11/2016 3:03:52 PM \| Milkyway@Home \| Fetching scheduler list 11/11/2016 3:03:54 PM \| Milkyway@Home \| Started download of milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe 11/11/2016 3:03:55 PM \| Milkyway@Home \| Master file download succeeded 11/11/2016 3:03:59 PM \| Milkyway@Home \| Finished download of milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe 11/11/2016 3:03:59 PM \| Milkyway@Home \| [error] File milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe has wrong size: expected 1292288, got 1316864 11/11/2016 3:03:59 PM \| Milkyway@Home \| [error] Checksum or signature error for milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe 11/11/2016 3:04:00 PM \| Milkyway@Home \| Sending scheduler request: To report completed tasks. 11/11/2016 3:04:00 PM \| Milkyway@Home \| Reporting 102 completed tasks Anybody else having issues trying to run the new work units with the supplied 1.42 Nvidia app? ID: 65689 · Rating: 0 · rate: / Reply Quote

Michael H.W. Weber Send message Joined: 22 Jan 08 Posts: 29 Credit: 242,764,221 RAC: 9	Message 65690 - Posted: 11 Nov 2016, 23:56:58 UTC Last modified: 12 Nov 2016, 0:03:02 UTC All tasks with version <= 1.41 produce errors, only. Engagig on v1.42 tasks results in immediate GPU core clock reduction to 300 Mhz and GPU RAM clock to 150 MHz on both 280X and 290X AMD graphics boards. When reaching 100% of estimated run time, the bundle tasks on my 290X reset to zero progress and appear to restart although the total runtime duration is not reset (I guess from the file name that it might be a bundle of 5, so this will hopefully repeat 5-fold and then upload). By contrast, the constraints tasks do complete and upload in the expected manner on my 280X. Runtime with reduced core and RAM clock is 741.63 secs as opposed to 9 secs with standard clocks. The result file first ends up in the "inconclusive" bunch - as is usual. So, there is still something wrong with both of these tasks types with respect to clock resetting. I checked with Einstein: Once a new Einstein WU starts after having finished a MW one, the core and RAM clocks go up to regular speed. So, the down clocking is conducted by MW client. Please inform us once you have solved the clocking issue. Michael. President of Rechenkraft.net e.V. - This planet's first and largest distributed computing organization. ID: 65690 · Rating: 0 · rate: / Reply Quote

greg_be Send message Joined: 18 Aug 09 Posts: 133 Credit: 23,472,957 RAC: 10,313	Message 65691 - Posted: 12 Nov 2016, 0:03:21 UTC Just cleared out 66 V1.40 tasks all with failures. Also just download a series of V1.42 tasks now, see how they hold up. ID: 65691 · Rating: 0 · rate: / Reply Quote

Jesse Viviano Send message Joined: 4 Feb 11 Posts: 86 Credit: 60,913,150 RAC: 0	Message 65692 - Posted: 12 Nov 2016, 0:18:05 UTC - in response to Message 65650. Work units that are generated with the old version of MilkyWay@home that need additional processing are causing errors. See https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1386764939 for an example. It was generated for the old version of MilkyWay@home, and it needs additional runs for validation. Either the new version needs to be able to deal with results from old work units, or the new version should be listed as a new application so that old work units continue to be processed with the old version. ID: 65692 · Rating: 0 · rate: / Reply Quote

LesCap Send message Joined: 24 Oct 16 Posts: 12 Credit: 56,127,036 RAC: 0	Message 65693 - Posted: 12 Nov 2016, 0:19:11 UTC Last modified: 12 Nov 2016, 0:42:28 UTC 1 GPU bundle5 WU made the trip. The 1st GPU bundle5 unit was "ready to report" and reported after 1:17:09 and several progress resets to 0% at ~40%. The 2nd GPU bundle5 progress reset to 0% @ 38% with 15 min. elapsed. Will see if it finishes. ~15 minutes X 5 = ~1 hour and 15 minutes.....is the progress simply resetting to 0% after each unit in the bundle of 5 units is completed? Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7] (8 processors) AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00) ID: 65693 · Rating: 0 · rate: / Reply Quote

rcthardcore Send message Joined: 30 Dec 08 Posts: 30 Credit: 6,999,702 RAC: 0	Message 65694 - Posted: 12 Nov 2016, 0:27:09 UTC It definitely is a downclocking issue. Work units reset at appx 50%. Aborted 76 work units and sent back. Will wait until issue is resolved before getting anymore. ID: 65694 · Rating: 0 · rate: / Reply Quote

LesCap Send message Joined: 24 Oct 16 Posts: 12 Credit: 56,127,036 RAC: 0	Message 65695 - Posted: 12 Nov 2016, 1:25:21 UTC Last modified: 12 Nov 2016, 1:26:27 UTC My 2nd GPU bundle5 unit "ready to report" and reported....no error messages for both. Like the 1st GPU bundle5 unit, the total elapsed time was 1 hour 17 minutes. The unit progress returned to 0% @ 40% after ~15 minutes 4 times (the % before returning to 0% increased each time up to 58%). After 4 resets (elapsed time 1hr 2 minutes), the progress % continued up to 100%...."ready to report" & reported. Looks like a reset of the progress indicator takes place after completion of each of the 1st 4 units in a bundle....then the indicator shows a % for the entire bundle5 unit. Prior to the update and bundle5 units, each GPU unit would complete in 19 to 33 seconds....on this 'puter.... S/B ~ 2 min 30 sec for 5 old units. Are the individual units in the new bundle of 5 units bigger than the old units? Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7] (8 processors) AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2 Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00) ID: 65695 · Rating: 0 · rate: / Reply Quote