Welcome to MilkyWay@home

Scheduled Maintenance Concluded

Message boards : News : Scheduled Maintenance Concluded
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 13 · Next

AuthorMessage
Profile Peciak

Send message
Joined: 27 Jun 09
Posts: 12
Credit: 148,038,330
RAC: 0
Message 65676 - Posted: 11 Nov 2016, 22:00:53 UTC

<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
<search_application> milkyway_separation 1.42 Windows x86_64 double </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 5 </number_WUs>
<number_params_per_WU> 20 </number_params_per_WU>
Using AVX path
Integral 0 time = 641.348502 s
Running likelihood with 84046 stars
Likelihood time = 1.657994 s
<background_integral> 0.000069192102838 </background_integral>
<stream_integral> 87.919837480121885 1231.763016660424500 218.680946939705140 </stream_integral>
<background_likelihood> -3.558790208380722 </background_likelihood>
<stream_only_likelihood> -19.405544659459434 -5.167057809848690 -3.549714346561023 </stream_only_likelihood>
<search_likelihood> -3.135608747542042 </search_likelihood>
Using AVX path
ID: 65676 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jake Weiss
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 25 Feb 13
Posts: 580
Credit: 94,200,158
RAC: 0
Message 65677 - Posted: 11 Nov 2016, 22:03:43 UTC

Peciak,

That's weird. The version info doesn't say its an OpenCL app. Let me see if I forgot to set the flag when I recompiled it.

Sorry about this, I haven't slept much over the last couple days trying to get this update done.

Jake
ID: 65677 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rich

Send message
Joined: 14 Nov 14
Posts: 9
Credit: 214,644,261
RAC: 0
Message 65678 - Posted: 11 Nov 2016, 22:03:46 UTC

Looks like the new version isn't using the GPU. My 750Ti show no GPU usage and the CPU task is being used fully. The first two WU's validated but used the CPU.

Here is from the standard error file if this will help:

<core_client_version>7.6.33</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.42 Windows x86_64 double </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 1 </number_WUs>
<number_params_per_WU> 20 </number_params_per_WU>
Using AVX path
Integral 0 time = 836.284111 s
Running likelihood with 84046 stars
Likelihood time = 1.842669 s
<background_integral> 0.000183211346476 </background_integral>
<stream_integral> 35.170988580602945 220.771463270831330 56.153686511532385 </stream_integral>
<background_likelihood> -3.782092719257110 </background_likelihood>
<stream_only_likelihood> -50.039987133739942 -4.114360911786672 -3.380701603567321 </stream_only_likelihood>
<search_likelihood> -2.964758398505063 </search_likelihood>
16:52:25 (20784): called boinc_finish(0)

</stderr_txt>
ID: 65678 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jake Weiss
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 25 Feb 13
Posts: 580
Credit: 94,200,158
RAC: 0
Message 65679 - Posted: 11 Nov 2016, 22:09:53 UTC

Rich,

I see it, it was my fault during building of the program. I have a fix made, just double and triple checking to make sure I don't make another silly mistake this time. Will be released soon.

Jake
ID: 65679 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rich

Send message
Joined: 14 Nov 14
Posts: 9
Credit: 214,644,261
RAC: 0
Message 65680 - Posted: 11 Nov 2016, 22:13:37 UTC

Thanks,

lol, I'll let the 12 WU's finish up with the CPU and then let more units start loading. I've done many mistakes in my time, fully understand. Keep up the good work.
ID: 65680 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rymorea

Send message
Joined: 6 Oct 14
Posts: 46
Credit: 20,017,425
RAC: 0
Message 65681 - Posted: 11 Nov 2016, 22:27:05 UTC

Same for me too. Both AMD R9 270x and Nvidia 750TI GPU use %0 CPU full core
I have to abort those task cause Screen lag too much and other CPU projects hang.
ID: 65681 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile LesCap
Avatar

Send message
Joined: 24 Oct 16
Posts: 12
Credit: 56,127,036
RAC: 0
Message 65682 - Posted: 11 Nov 2016, 22:36:11 UTC - in response to Message 65679.  

Hey Jake

The GPU units are crunching again after downloading ver. 1.42 :)


11/11/2016 3:28:56 PM | Milkyway@Home | Started download of milkyway_1.42_windows_x86_64__opencl_ati_101.exe
11/11/2016 3:29:33 PM | Milkyway@Home | Finished download of milkyway_1.42_windows_x86_64__opencl_ati_101.exe
11/11/2016 3:29:33 PM | Milkyway@Home | Starting task de_modfit_fast_19_3s_136_bundle5_ModfitConstraints3_4_1478900148_13142_0

Thanks again
Les
Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
(8 processors)
AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2
Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00)
ID: 65682 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jake Weiss
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 25 Feb 13
Posts: 580
Credit: 94,200,158
RAC: 0
Message 65683 - Posted: 11 Nov 2016, 22:37:00 UTC

Hey Everyone,

I'm still working on a fix. Having issues with other the Windows OpenCL version but not Linux OpenCL. Need to see if my drivers are back on the machine I'm testing with or if there is actually an issue with the client.

Might take a couple hours for a fix.


Sorry,

Jake
ID: 65683 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Werkstatt

Send message
Joined: 19 Feb 08
Posts: 350
Credit: 141,284,369
RAC: 0
Message 65684 - Posted: 11 Nov 2016, 22:39:19 UTC

Well, yes, they start crunching, but at ~70% they restart with 0%. In 45 minutes not a single gpu wu finished.
ID: 65684 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Oct 16
Posts: 167
Credit: 1,008,062,758
RAC: 2,245
Message 65685 - Posted: 11 Nov 2016, 22:45:53 UTC

My 280x in Linux is completing WUs in about 150-155 seconds at 4x. Same time per task as before. Still lots of validation inconclusive but I had a lot of those before. It still has app v1.4.
ID: 65685 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bri
Avatar

Send message
Joined: 14 May 14
Posts: 2
Credit: 3,945,126
RAC: 0
Message 65686 - Posted: 11 Nov 2016, 22:47:07 UTC - in response to Message 65684.  

Yes, On both my machines CPU wu's are also resetting at about 60%
ID: 65686 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile LesCap
Avatar

Send message
Joined: 24 Oct 16
Posts: 12
Credit: 56,127,036
RAC: 0
Message 65687 - Posted: 11 Nov 2016, 22:49:35 UTC - in response to Message 65684.  
Last modified: 11 Nov 2016, 22:52:33 UTC

OOOPs.....similar here Werkstatt.

1st GPU bundle5 unit jumped back to 0% when it reached ~40% complete.

CPU units are still crunching through to completion.
Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
(8 processors)
AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2
Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00)
ID: 65687 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Arivald Ha'gel

Send message
Joined: 30 Apr 14
Posts: 67
Credit: 160,674,488
RAC: 0
Message 65688 - Posted: 11 Nov 2016, 22:55:25 UTC
Last modified: 11 Nov 2016, 23:05:17 UTC

Looks like Milkyway@Home 1.42 (opencl_ati_101) app is CPU only app.

No GPU usage (at all).

4 WUs have had gone back to 0% around 77-78%.

(Copied from slots of one of WU after it went back to 0%)

<search_application> milkyway_separation 1.42 Windows x86_64 double </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 5 </number_WUs>
<number_params_per_WU> 20 </number_params_per_WU>
Using AVX path
Integral 0 time = 1033.276421 s
Running likelihood with 84046 stars
Likelihood time = 3.220463 s
<background_integral> 0.000132467393354 </background_integral>
<stream_integral> 114.693583961640020 1101.118918906797000 12.229260004040517 </stream_integral>
<background_likelihood> -3.804522759975287 </background_likelihood>
<stream_only_likelihood> -29.103213665631166 -4.472110400350248 -181.009708341877600 </stream_only_likelihood>
<search_likelihood> -3.416589456232599 </search_likelihood>
Using AVX path
ID: 65688 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Keith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 715
Credit: 555,467,982
RAC: 38,618
Message 65689 - Posted: 11 Nov 2016, 23:07:54 UTC

My client doesn't like something about the downloaded 1.42 file. Had the wrong size error when it automatically downloaded it. Had the same error when I manually downloaded it from the directory.

11/11/2016 3:03:52 PM | Milkyway@Home | Resetting file projects/milkyway.cs.rpi.edu_milkyway/milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe: wrong size
11/11/2016 3:03:52 PM | Milkyway@Home | Fetching scheduler list
11/11/2016 3:03:54 PM | Milkyway@Home | Started download of milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe
11/11/2016 3:03:55 PM | Milkyway@Home | Master file download succeeded
11/11/2016 3:03:59 PM | Milkyway@Home | Finished download of milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe
11/11/2016 3:03:59 PM | Milkyway@Home | [error] File milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe has wrong size: expected 1292288, got 1316864
11/11/2016 3:03:59 PM | Milkyway@Home | [error] Checksum or signature error for milkyway_1.42_windows_x86_64__opencl_nvidia_101.exe
11/11/2016 3:04:00 PM | Milkyway@Home | Sending scheduler request: To report completed tasks.
11/11/2016 3:04:00 PM | Milkyway@Home | Reporting 102 completed tasks



Anybody else having issues trying to run the new work units with the supplied 1.42 Nvidia app?
ID: 65689 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Michael H.W. Weber

Send message
Joined: 22 Jan 08
Posts: 29
Credit: 242,730,423
RAC: 0
Message 65690 - Posted: 11 Nov 2016, 23:56:58 UTC
Last modified: 12 Nov 2016, 0:03:02 UTC

All tasks with version <= 1.41 produce errors, only.

Engagig on v1.42 tasks results in immediate GPU core clock reduction to 300 Mhz and GPU RAM clock to 150 MHz on both 280X and 290X AMD graphics boards.

When reaching 100% of estimated run time, the bundle tasks on my 290X reset to zero progress and appear to restart although the total runtime duration is not reset (I guess from the file name that it might be a bundle of 5, so this will hopefully repeat 5-fold and then upload).
By contrast, the constraints tasks do complete and upload in the expected manner on my 280X. Runtime with reduced core and RAM clock is 741.63 secs as opposed to 9 secs with standard clocks. The result file first ends up in the "inconclusive" bunch - as is usual.

So, there is still something wrong with both of these tasks types with respect to clock resetting.

I checked with Einstein: Once a new Einstein WU starts after having finished a MW one, the core and RAM clocks go up to regular speed. So, the down clocking is conducted by MW client.

Please inform us once you have solved the clocking issue.

Michael.
President of Rechenkraft.net e.V. - This planet's first and largest distributed computing organization.

ID: 65690 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
greg_be

Send message
Joined: 18 Aug 09
Posts: 123
Credit: 21,130,007
RAC: 1,811
Message 65691 - Posted: 12 Nov 2016, 0:03:21 UTC

Just cleared out 66 V1.40 tasks all with failures.
Also just download a series of V1.42 tasks now, see how they hold up.
ID: 65691 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 4 Feb 11
Posts: 86
Credit: 60,913,150
RAC: 0
Message 65692 - Posted: 12 Nov 2016, 0:18:05 UTC - in response to Message 65650.  

Work units that are generated with the old version of MilkyWay@home that need additional processing are causing errors. See https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1386764939 for an example. It was generated for the old version of MilkyWay@home, and it needs additional runs for validation. Either the new version needs to be able to deal with results from old work units, or the new version should be listed as a new application so that old work units continue to be processed with the old version.
ID: 65692 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile LesCap
Avatar

Send message
Joined: 24 Oct 16
Posts: 12
Credit: 56,127,036
RAC: 0
Message 65693 - Posted: 12 Nov 2016, 0:19:11 UTC
Last modified: 12 Nov 2016, 0:42:28 UTC

1 GPU bundle5 WU made the trip.

The 1st GPU bundle5 unit was "ready to report" and reported after 1:17:09 and several progress resets to 0% at ~40%.

The 2nd GPU bundle5 progress reset to 0% @ 38% with 15 min. elapsed. Will see if it finishes.

~15 minutes X 5 = ~1 hour and 15 minutes.....is the progress simply resetting to 0% after each unit in the bundle of 5 units is completed?
Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
(8 processors)
AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2
Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00)
ID: 65693 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rcthardcore

Send message
Joined: 30 Dec 08
Posts: 30
Credit: 6,999,702
RAC: 0
Message 65694 - Posted: 12 Nov 2016, 0:27:09 UTC

It definitely is a downclocking issue. Work units reset at appx 50%. Aborted 76 work units and sent back. Will wait until issue is resolved before getting anymore.
ID: 65694 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile LesCap
Avatar

Send message
Joined: 24 Oct 16
Posts: 12
Credit: 56,127,036
RAC: 0
Message 65695 - Posted: 12 Nov 2016, 1:25:21 UTC
Last modified: 12 Nov 2016, 1:26:27 UTC

My 2nd GPU bundle5 unit "ready to report" and reported....no error messages for both.

Like the 1st GPU bundle5 unit, the total elapsed time was 1 hour 17 minutes.

The unit progress returned to 0% @ 40% after ~15 minutes 4 times (the % before returning to 0% increased each time up to 58%). After 4 resets (elapsed time 1hr 2 minutes), the progress % continued up to 100%...."ready to report" & reported.

Looks like a reset of the progress indicator takes place after completion of each of the 1st 4 units in a bundle....then the indicator shows a % for the entire bundle5 unit.

Prior to the update and bundle5 units, each GPU unit would complete in 19 to 33 seconds....on this 'puter.... S/B ~ 2 min 30 sec for 5 old units.

Are the individual units in the new bundle of 5 units bigger than the old units?
Intel(R) Core(TM) i7-2700K CPU @ 3.50GHz [Family 6 Model 42 Stepping 7]
(8 processors)
AMD Radeon HD 6900 series (Cayman) (2048MB) driver: 1.4.1848 OpenCL: 1.2
Microsoft Windows 7 Pro x64 Service Pack 1, (06.01.7601.00)
ID: 65695 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 13 · Next

Message boards : News : Scheduled Maintenance Concluded

©2024 Astroinformatics Group