Welcome to MilkyWay@home

Strange things happen (credit)

Message boards : Number crunching : Strange things happen (credit)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile valterc

Send message
Joined: 28 Aug 09
Posts: 23
Credit: 1,264,553,550
RAC: 79,700
Message 32970 - Posted: 3 Nov 2009, 10:29:24 UTC
Last modified: 3 Nov 2009, 10:38:41 UTC

All the work done yesterday, I mean all the credits gained yesterday, have beeen lost....
ID: 32970 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile prairie69

Send message
Joined: 2 Nov 08
Posts: 11
Credit: 169,476
RAC: 0
Message 32972 - Posted: 3 Nov 2009, 11:27:42 UTC

Sisyphus lives!
ID: 32972 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Seejay
Avatar

Send message
Joined: 22 Dec 07
Posts: 51
Credit: 2,405,016
RAC: 0
Message 32973 - Posted: 3 Nov 2009, 11:50:13 UTC
Last modified: 3 Nov 2009, 12:05:51 UTC

Just looked - got a sack of wus, which were validated & credited yesterday, now showing:

156701912 	152986469  	1 Nov 2009 5:07:58 UTC  	1 Nov 2009 8:11:20 UTC  	Completed, marked as invalid 	1,394.95  	6.13  	0.00


These are obvious errors caused by the apparent DB crash ( NO ONE ACTUALLY KNOWS IF THERE WAS A DB CRASH, OR IF SOMEONE WAS MAYBE PLAYING FOOTBALL WITH THE SERVER, SINCE THERE HAS BEEN NO NEWS FROM THE ADMINISTRATION !! )

Yo, TRAVIS!! You there? Why no communication about these problems from your crowd?

P.S. Most of my WUs for 1/11 are tagged invalid, and ALL my WUs for 2/11 are missing!!
Seejay **Proud Member and Founder of BOINC Team Allprojectstats.com**
ID: 32973 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
XB-STX

Send message
Joined: 9 Aug 08
Posts: 18
Credit: 56,863,533
RAC: 0
Message 32974 - Posted: 3 Nov 2009, 11:59:29 UTC - in response to Message 32973.  

Agreed! I have evidence that my account lost no less than 41,000 credits in the overnight crash. I believe this deserves at least some acknowledgement of a problem by Travis or the other admins.
ID: 32974 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bruce
Avatar

Send message
Joined: 28 Apr 08
Posts: 1415
Credit: 2,716,428
RAC: 0
Message 32975 - Posted: 3 Nov 2009, 12:10:43 UTC - in response to Message 32974.  

Agreed! I have evidence that my account lost no less than 41,000 credits in the overnight crash. I believe this deserves at least some acknowledgement of a problem by Travis or the other admins.


If this is the way they are going to run things here I will have to vote with my feet!! If the Admins here dont want to communicate with us lowly crunchers anymore they should BUY THEIR OWN COMPUTERS N" CRUNCH THE WU's THEMSELVES!!!
I am no longer a Happy Cruncher!!!
ID: 32975 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile banditwolf
Avatar

Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 524,164
RAC: 0
Message 32976 - Posted: 3 Nov 2009, 12:32:43 UTC

I notice a post I had made before Mw went out is gone. So it seems the data was restored.
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.
ID: 32976 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
E B

Send message
Joined: 15 Aug 09
Posts: 7
Credit: 218,896
RAC: 0
Message 32978 - Posted: 3 Nov 2009, 12:54:55 UTC - in response to Message 32970.  

uploaded 24 wu today.and downloaded 24 wu,now it say 48 inprogress.guess it lost 24.
ID: 32978 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
SkyeHunter

Send message
Joined: 6 Mar 09
Posts: 41
Credit: 38,856,291
RAC: 0
Message 32979 - Posted: 3 Nov 2009, 12:55:13 UTC

Funny situation on my account page :

Total credit 5,674,688
Recent average credit 65,650.10

....

Projects in which you are participating

Project Total credit Average credit Since
MilkyWay@home 5,689,208 71,647 6 Mar 2009
...

ID: 32979 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile valterc

Send message
Joined: 28 Aug 09
Posts: 23
Credit: 1,264,553,550
RAC: 79,700
Message 32980 - Posted: 3 Nov 2009, 13:58:06 UTC - in response to Message 32979.  

well, look at this:

ID: 32980 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile uBronan
Avatar

Send message
Joined: 9 Feb 09
Posts: 166
Credit: 27,520,813
RAC: 0
Message 32981 - Posted: 3 Nov 2009, 14:16:07 UTC
Last modified: 3 Nov 2009, 14:25:15 UTC

uhuh here same lost about 120,000 minimal.
looks like admin had to make a restore and the done units after the database problems deleted.
resulting in a -3200 on boincstats from my done work because i was over 15kk and now back to 14.9 kk
Its new, its relative fast... my new bicycle
ID: 32981 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Farscape
Avatar

Send message
Joined: 6 Apr 09
Posts: 26
Credit: 1,021,301,443
RAC: 0
Message 32982 - Posted: 3 Nov 2009, 14:34:59 UTC

Yep - first thing that I looked at this morning was BOINCstats and saw that I was -10,683.56 in MW. I hate it when that happens.........

Anyhow, I would guess that the no connection to mysql message that we all received yesterday was an indication that a total database failure has occurred - it is going to take some time (for the staff) to restore the database and even longer to figure out what each of us are due credit wise.

I for one will wait - but I DO agree that some info from a staff member would be MOST APPROPRIATE!!!

Travis - I know that you have a thankless job (I do database and network jobs in my spare time) and I appreciate all your efforts, but please give us some info dude!
ID: 32982 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Brian Silvers

Send message
Joined: 21 Aug 08
Posts: 625
Credit: 558,425
RAC: 0
Message 32984 - Posted: 3 Nov 2009, 15:14:09 UTC - in response to Message 32982.  


I for one will wait - but I DO agree that some info from a staff member would be MOST APPROPRIATE!!!

Travis - I know that you have a thankless job (I do database and network jobs in my spare time) and I appreciate all your efforts, but please give us some info dude!


Be interesting if the disk(s) failed due to high sustained i/o pressure...

Also of interest is that once this project went down, Collatz' web pages also got to the point where things normally are here...

Things simply need to be made more complex for GPUs, both here and at Collatz. Not talking about any credit reductions... Credit for unit time can remain the same for the time being. It's just obvious that the i/o load on the servers need to be reduced...
ID: 32984 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 1 Sep 08
Posts: 520
Credit: 302,524,931
RAC: 41
Message 32987 - Posted: 3 Nov 2009, 15:55:07 UTC - in response to Message 32984.  

Communication here has become Predictor like -- not a good thing.

Regarding more complex work units (particularly here for GPU work -- not quite so much for Collatz), that certainly would be one approach. But the root cause (in my view) is the lack of alternative projects for GPU processing. Frankly, until decent alternatives exist (and ideally those alternatives would exist not only for ATI *or* CUDA but rather for ATI *and* CUDA *** and (for me this is key) for lower end GPU processors. At the moment, the self funded essentially one person effort by Slicker over at Collatz is the ONLY game in town for single precision ATI GPU and lower end CUDA cards.

I have a great deal of sympathy and respect for Slicker over at Collatz with what he's trying to do AND with his efforts at *communicating* what is going on. Many, many months back, Travis made a real effort at communicating as bumps occurred over here, but sadly, that era for this project appears to have passed. So my empathy and respect for this project has dwindled as my frustration with the performance of the project along with the information vacuum have simply moved my feelings toward ire.

I only run CPU for Milkyway, so in that regard, I DO have alternative projects, but the thing is, when MilkyWay does a Predictor, I suspect folks with high power GPU's run over to Collatz and swamp Slickers environment. Unlike MilkyWay, his project is an out of pocket effort without the resources that MilkyWay has to work with. I realize MilkyWay is not exactly SETI like in its resource pool, but compared to Collatz's resources, MilkyWay is wealthy beyond measure.

That we are (and have been) in an information vacuum is simply WRONG.


I for one will wait - but I DO agree that some info from a staff member would be MOST APPROPRIATE!!!

Travis - I know that you have a thankless job (I do database and network jobs in my spare time) and I appreciate all your efforts, but please give us some info dude!


Be interesting if the disk(s) failed due to high sustained i/o pressure...

Also of interest is that once this project went down, Collatz' web pages also got to the point where things normally are here...

Things simply need to be made more complex for GPUs, both here and at Collatz. Not talking about any credit reductions... Credit for unit time can remain the same for the time being. It's just obvious that the i/o load on the servers need to be reduced...


ID: 32987 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile banditwolf
Avatar

Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 524,164
RAC: 0
Message 32988 - Posted: 3 Nov 2009, 16:25:06 UTC

With no response yet I think it will be a long shot if everyone(anyone) gets credit back. At least it didn't loose more than a days work/credits. Some explaination/excuse should be given.

If one is given; I predit it will be some students fault as a result of bad search parameters.
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.
ID: 32988 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile James Sotherden
Avatar

Send message
Joined: 3 Jan 09
Posts: 139
Credit: 50,066,562
RAC: 0
Message 32989 - Posted: 3 Nov 2009, 16:45:39 UTC
Last modified: 3 Nov 2009, 17:29:52 UTC

I lost 5 work units this morning when milkyway said they were aborted. They timed out, I take it that what ever crashed they could not get home. And here Seti has had no work so I was hoping to up my rac a little here. O well. stuff happens.
ID: 32989 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matthew
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 6 May 09
Posts: 217
Credit: 6,856,375
RAC: 0
Message 32992 - Posted: 3 Nov 2009, 20:38:01 UTC

Hello! Sorry for the difficulties! We had a massive Database crash last night, and we are almost done picking up the pieces. The database was only restored this morning at 5am.

It appears as though the database corrupted itself, and then failed in a spectacular fashion. We'll see what we can do about the missing credit.

Cheers,
Matthew Newby
ID: 32992 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Jet

Send message
Joined: 26 Jun 09
Posts: 11
Credit: 3,741,663
RAC: 0
Message 32993 - Posted: 3 Nov 2009, 21:10:13 UTC - in response to Message 32992.  

My loss: -8.389 credits...unbelievable.
Probably, MW project was somehow invented a time machine & it's ticking, but in opposite side ???
ID: 32993 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Brian Silvers

Send message
Joined: 21 Aug 08
Posts: 625
Credit: 558,425
RAC: 0
Message 32997 - Posted: 3 Nov 2009, 22:30:03 UTC - in response to Message 32987.  

Communication here has become Predictor like -- not a good thing.


I notice you do not have Cosmology as one of your projects. "Communication" that happens there pretty much only occurs between computer systems of volunteer and project. All of the admins have bungled things and then left. The project scientist disappeared for months, then showed back up saying that he had a bad injury and his wife had a kid, then has disappeared for another month now... I can offer sympathy up to a point, but the project appears to be slipping back into the same behavior. They currently have a job posting for a project admin, but are only wanting to pay $15/hr and asking for a lot of skills. If the project suffers another crash, it's doubtful things would be working again for 3-6 months, considering it took them nearly a month to get out of the one in February...


Regarding more complex work units (particularly here for GPU work -- not quite so much for Collatz), that certainly would be one approach. But the root cause (in my view) is the lack of alternative projects for GPU processing.


The "root cause" might be that, but a "contributing factor" is that the credit per unit time is higher here than at Collatz, so that shifts the attraction towards this project.

Many, many months back, Travis made a real effort at communicating as bumps occurred over here, but sadly, that era for this project appears to have passed. So my empathy and respect for this project has dwindled as my frustration with the performance of the project along with the information vacuum have simply moved my feelings toward ire.


If he indeed had the flu, then he may have needed to spend extra time on his studies. Additionally he was selected to go to Barcelona for the BOINC Workshop, so he can't exactly do as much remotely as he could on-site. Additionally, the user base here has been extremely demanding. I'm not sure it is appropriate to heap everything upon him.

Like Paul Buck said in a post that has disappeared with the crash, some people seem to think that the project owes them work. The performance problems are exacerbated by people trying to get that work that they think they're owed, especially by utilizing scripts to hit the server more often than their systems normally would otherwise. An unwillingness to accept that the tasks being processed are far too easy for their hardware and a strong resistence to alternatives (longer tasks / separate project) mean that until the project spends money on hardware upgrades, the performance problems here will continue to happen. At some point, it could become "more hassle than it is worth" from the project's perspective, much like how LHC got tired of being barked at about the replication, having to spend a lot of time and energy on babysitting the forum, thus have appear to have decided to process anything they need on an internal cluster.

ID: 32997 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Seejay
Avatar

Send message
Joined: 22 Dec 07
Posts: 51
Credit: 2,405,016
RAC: 0
Message 33003 - Posted: 3 Nov 2009, 23:04:00 UTC - in response to Message 32992.  

Hello! Sorry for the difficulties! We had a massive Database crash last night, and we are almost done picking up the pieces. The database was only restored this morning at 5am.

It appears as though the database corrupted itself, and then failed in a spectacular fashion. We'll see what we can do about the missing credit.

Cheers,
Matthew Newby



Thank God!! You're all alive and well - we were getting worried here for a minute!!

;^).
Seejay **Proud Member and Founder of BOINC Team Allprojectstats.com**
ID: 33003 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Dan T. Morris
Avatar

Send message
Joined: 17 Mar 08
Posts: 165
Credit: 410,228,216
RAC: 0
Message 33005 - Posted: 4 Nov 2009, 0:43:10 UTC
Last modified: 4 Nov 2009, 0:44:32 UTC

As of 18:31 hours cst Ia. shows my account -212,571 free dc and boinc stats.

ouch!!!... To be missing some is bad but to go into the hole -212,571 well that's nuts. But hey I am sure that they will work it out.

DD,
ID: 33005 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Strange things happen (credit)

©2024 Astroinformatics Group