Welcome to MilkyWay@home

Errors, invalid, and validation inconclusive. Anything to worry about?


Advanced search

Message boards : Number crunching : Errors, invalid, and validation inconclusive. Anything to worry about?
Message board moderation

To post messages, you must log in.

AuthorMessage
wolfman1360

Send message
Joined: 17 Feb 17
Posts: 18
Credit: 968,286
RAC: 1
500 thousand credit badge2 year member badge
Message 68663 - Posted: 3 May 2019, 15:11:06 UTC

This computer seems to be getting a lot of validation inconclusive errors...and 3 or 4 random invalid / errored tasks.
https://milkyway.cs.rpi.edu/milkyway/results.php?hostid=803731

No overclock on the GPU or CPU so is this anything to worry about?
ID: 68663 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileJStateson
Avatar

Send message
Joined: 18 Nov 08
Posts: 135
Credit: 895,372,225
RAC: 78,502
500 million credit badge10 year member badge
Message 68664 - Posted: 3 May 2019, 15:36:50 UTC - in response to Message 68663.  

Well, for whatever it is worth, the ATI "invalid" error also occurred on another system.

However, the rest of the invalids (cpu ones) as well as all of your "errors" are just yours alone.

What are temps (cpu & gpu)?
ID: 68664 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
marmot
Avatar

Send message
Joined: 12 Dec 15
Posts: 43
Credit: 7,026,236
RAC: 38,360
5 million credit badge3 year member badge
Message 68667 - Posted: 3 May 2019, 17:50:36 UTC - in response to Message 68663.  

This computer seems to be getting a lot of validation inconclusive errors...


Those are probably not errors. Those are the work units that most other projects call 'validation pending' or 'pending' (not sure why it's different here).

Those WU's are waiting for the wing-computer to report a second result and they may wait for up to a full deadline period if the computer got shut off, project detached without aborting the WU's, etc...


You have 6 actual errors, and like Beemer pointed out, could be your computer temps.

Two of the GPU were error: 3x "C:\Users\tcwoo\AppData\Local\Temp\\OCL8928T1.cl:186:67: warning: unknown attribute 'max_constant_size' ignored".
Not sure what that one is.

The 3 invalid on the CPU were only your computer's problem so check for cooling issues, running out of RAM (bad app, leaking memory pointers), hopefully you're not looking at an actual hardware problem.
ID: 68667 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
wolfman1360

Send message
Joined: 17 Feb 17
Posts: 18
Credit: 968,286
RAC: 1
500 thousand credit badge2 year member badge
Message 68669 - Posted: 3 May 2019, 20:27:06 UTC - in response to Message 68667.  

Those are probably not errors. Those are the work units that most other projects call 'validation pending' or 'pending' (not sure why it's different here).

Those WU's are waiting for the wing-computer to report a second result and they may wait for up to a full deadline period if the computer got shut off, project detached without aborting the WU's, etc...


You have 6 actual errors, and like Beemer pointed out, could be your computer temps.

Two of the GPU were error: 3x "C:\Users\tcwoo\AppData\Local\Temp\\OCL8928T1.cl:186:67: warning: unknown attribute 'max_constant_size' ignored".
Not sure what that one is.

The 3 invalid on the CPU were only your computer's problem so check for cooling issues, running out of RAM (bad app, leaking memory pointers), hopefully you're not looking at an actual hardware problem.

The processor is stable at 3.7 GHZ. The GPU clocks remain steady at 1244 MHZ. Similarly the memory clock remains steady. I have no overclocks or undervolts on anything. The CPU temp, as I mentioned in another thread, is staying lower than usual while 100% cpu is utilized. I don't know why. It doesn't appear to be hitting ram limits. If I run prime 95 the cpu temp climbs up past 60 as is the norm for the machine. There are a few tasks in progress that are going to take well over a day and appear to be only 15000 gflops and a bit. Meanwhile a core i5-3317U can complete a task of 60000 plus in less than half this time. So something is definitely going on, I just don't know what. Maybe I am hitting ram limits? But each task only takes around 14 mb and this machine has 16 gb total.

Maybe I just need to format? Not sure what else to look for.
ID: 68669 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
marmot
Avatar

Send message
Joined: 12 Dec 15
Posts: 43
Credit: 7,026,236
RAC: 38,360
5 million credit badge3 year member badge
Message 68678 - Posted: 4 May 2019, 3:41:25 UTC - in response to Message 68669.  

There are a few tasks in progress that are going to take well over a day and appear to be only 15000 gflops and a bit. Meanwhile a core i5-3317U can complete a task of 60000 plus in less than half this time. So something is definitely going on,


I discovered after making a purchase for a used GPU, that Milkyway requires double precision floating point calculations and the 1060 3gb was 1 FP64 to every 32 FP32 calculation units. (Ended up buying a used 280x for Milkyway)
The benchmarks shown in details here, I think, reflect FP32 measurements, not FP64.
It's possible the performance difference on Milkyway WU between the i5-3317U and your Ryzen could be related to double precision abilities.

Do you have a BM tool that measures FP32 and FP64 you can test on the two CPU's to compare?
Or at least to see if your Ryzen is performing at expectations to reference BMs.
Speaking of reference BM's; here's a site taking a Ryzen 1700x through it's arithmetic paces using the Sandra Lite application.
The Ryzen 1700x seems to have good FP64 , and if the 1800x (not finding their 1800x review) isn't much different in architecture, it should be outperforming the i5-3317U.



Maybe I just need to format? Not sure what else to look for.

Before you do that, to eliminate the OS and app install, you can run a testing OS from a USB thumb drive which is a barebones, OS built only to do BOINC and see how well the hardware does on that reference OS. I made one with Tiny7 but there is PenDriveLinux and others. You can use YUMI to build a pendrive Linux from many different distros.

Anyway, you had 6 errors out of over 600 WU's, it could just have been a power spike. But the performance difference between those two machines would bother me, especially since the 1800x is a workhorse.
ID: 68678 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
wolfman1360

Send message
Joined: 17 Feb 17
Posts: 18
Credit: 968,286
RAC: 1
500 thousand credit badge2 year member badge
Message 68680 - Posted: 4 May 2019, 4:36:39 UTC - in response to Message 68678.  

There are a few tasks in progress that are going to take well over a day and appear to be only 15000 gflops and a bit. Meanwhile a core i5-3317U can complete a task of 60000 plus in less than half this time. So something is definitely going on,


I discovered after making a purchase for a used GPU, that Milkyway requires double precision floating point calculations and the 1060 3gb was 1 FP64 to every 32 FP32 calculation units. (Ended up buying a used 280x for Milkyway)
The benchmarks shown in details here, I think, reflect FP32 measurements, not FP64.
It's possible the performance difference on Milkyway WU between the i5-3317U and your Ryzen could be related to double precision abilities.

Do you have a BM tool that measures FP32 and FP64 you can test on the two CPU's to compare?
Or at least to see if your Ryzen is performing at expectations to reference BMs.
Speaking of reference BM's; here's a site taking a Ryzen 1700x through it's arithmetic paces using the Sandra Lite application.
The Ryzen 1700x seems to have good FP64 , and if the 1800x (not finding their 1800x review) isn't much different in architecture, it should be outperforming the i5-3317U.



Maybe I just need to format? Not sure what else to look for.

Before you do that, to eliminate the OS and app install, you can run a testing OS from a USB thumb drive which is a barebones, OS built only to do BOINC and see how well the hardware does on that reference OS. I made one with Tiny7 but there is PenDriveLinux and others. You can use YUMI to build a pendrive Linux from many different distros.

Anyway, you had 6 errors out of over 600 WU's, it could just have been a power spike. But the performance difference between those two machines would bother me, especially since the 1800x is a workhorse.


I'm not sure what I'd use for that benchmark at this point.

I don't think this supports running 2 WUs at once on the graphics card - I'm seeing 100% utilization at around 78 Celsius. It's very tempting to go for the rx580 or even 590, or maybe wait for AMD to drop the ball on the new 7 NM, but I'd mainly be getting that for crunching rather than my own personal needs.
It's bothering me too. Lots.

I'm going to get rid of the page file to see if I'm actually running into memory issues before heading to bed tonight. If I wake up to a bunch of memory 000 errors I'll know I need to either a) upgrade to 32 GB of ram or b) wipe this thing clean and start with a clean slate.
Right now there are a few more WUs coming in that look like they're going to take around 12 hours on this Ryzen, so maybe that was just a fluke.

I don't mind wiping and reinstalling Windows. It's been about two years so I'm sure there's bound to be a memory leak somewhere or at least something effecting performance that I've done over the years. Reinstall Windows, reinstall boinc (after latest GPU drivers) and see what happens.
ID: 68680 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
marmot
Avatar

Send message
Joined: 12 Dec 15
Posts: 43
Credit: 7,026,236
RAC: 38,360
5 million credit badge3 year member badge
Message 68697 - Posted: 6 May 2019, 4:18:15 UTC - in response to Message 68680.  


I don't think this supports running 2 WUs at once on the graphics card - I'm seeing 100% utilization at around 78 Celsius. It's very tempting to go for the rx580 or even 590, or maybe wait for AMD to drop the ball on the new 7 NM, but I'd mainly be getting that for crunching rather tha


You should be able to run 8 on that card. My RX 550 4GB did and stayed around 61C but it's also only a 32W card.
ID: 68697 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Errors, invalid, and validation inconclusive. Anything to worry about?

©2019 Astroinformatics Group