New application! |
Message boards : Beta Testing : New application!
| Author | Message |
|---|---|
|
Hi all, | |
| ID: 10041 | Rating: 1 | rate:
| |
|
I'll start off with the first bug report myself: | |
| ID: 10042 | Rating: 0 | rate:
| |
|
Great news! :-) | |
| ID: 10043 | Rating: 0 | rate:
| |
|
Ready to test... first WUs are already running. :-) | |
| ID: 10044 | Rating: 0 | rate:
| |
|
When shutting down BOINC & restarting it again on my Ubuntu Linux Box the Progress & To Completion showed nothing but the CPU Time remained the same as it was before shutting down. | |
| ID: 10046 | Rating: 0 | rate:
| |
When shutting down BOINC & restarting it again on my Ubuntu Linux Box the Progress & To Completion showed nothing but the CPU Time remained the same as it was before shutting down. Almost the same on my Vista x64 host. Only difference: After restarting BOINC the CPU time picked up where it left off but the progress bar jumped to 100%. ;-) The WUs are crunching along happily so far though. ____________ Lovely greetings from Cori | |
| ID: 10047 | Rating: 0 | rate:
| |
|
I suspended a task after 34 minutes runtime and 13.549% progress. | |
| ID: 10048 | Rating: 0 | rate:
| |
|
After attaching to the beta project, had numerous wu's d/l'ed. At this moment, have 4 showing 100% completion, but they are still crunching. Original completion time showed 00:48:22 & currently at 4hrs:30+. Is this to be expected? | |
| ID: 10049 | Rating: 0 | rate:
| |
|
I am also seeing apps that show as 100%, but still running. | |
| ID: 10050 | Rating: 0 | rate:
| |
I am also seeing apps that show as 100%, but still running. Same here on a Windows XP Pro 64 Bit Box, have 2 @ 100% but still running. I did have 2 64 Bit Linux finish though in around 4 hours running time. | |
| ID: 10051 | Rating: 0 | rate:
| |
|
On my XP x64 lappy I have one WU running @56.5% after 4 hours... | |
| ID: 10052 | Rating: 0 | rate:
| |
On my XP x64 lappy I have one WU running @56.5% after 4 hours... I'd fire that Lappy ... ;P ... the 2 that finished in 4 hr's were on a Lappy of mine ... :) | |
| ID: 10053 | Rating: 0 | rate:
| |
|
It sounds likely those are symptoms of the resume bug. I found the problem, and it was something that occurred only after it printed the (correct) numbers to stderr.txt, which was why I missed it before. New builds with a fix should be up soon. | |
| ID: 10054 | Rating: 0 | rate:
| |
|
Should we abort the tasks in progress, then? | |
| ID: 10055 | Rating: 0 | rate:
| |
|
New linux i686 and x86_64 clients (version 1.01) are up that fix the resume problem. If you have a running task that has been suspended and resumed with 1.00, that has unfortunately been corrupted, so it is best to abort it. | |
| ID: 10056 | Rating: 0 | rate:
| |
|
getting through the task but there is no end date/time. | |
| ID: 10057 | Rating: 0 | rate:
| |
|
The task mentioned in my former post ended successfully as far as I can see after 6.5 hours runtime. This is including the 74 minutes double running. | |
| ID: 10058 | Rating: 0 | rate:
| |
The task mentioned in my former post ended successfully as far as I can see after 6.5 hours runtime. This is including the 74 minutes double running. Yes, you're right; everything seems OK on that run. In the stderr.txt you can see that after the second resume the code detected that the state file was inconsistent (because of the 1.00 bug), and therefore it restarted from scratch. -Willem Jan | |
| ID: 10059 | Rating: 0 | rate:
| |
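For readers wondering how a resume check like the one Willem Jan describes might look, here is a minimal sketch. It is an illustration under assumptions, not the abc_sieve code: the JSON layout, the SHA-256 checksum, and the names `save_state` and `load_state` are all hypothetical.

```python
import hashlib
import json
import os


def save_state(path, state):
    """Write the checkpoint together with a checksum of its contents."""
    payload = json.dumps(state, sort_keys=True)
    digest = hashlib.sha256(payload.encode()).hexdigest()
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"payload": payload, "sha256": digest}, f)
    # Replace the old checkpoint in one step so a crash cannot leave half a file.
    os.replace(tmp, path)


def load_state(path, fresh):
    """Return the saved state, or a copy of `fresh` if the file is missing or inconsistent."""
    try:
        with open(path) as f:
            wrapper = json.load(f)
        payload = wrapper["payload"]
        if hashlib.sha256(payload.encode()).hexdigest() != wrapper["sha256"]:
            raise ValueError("checksum mismatch")
        return json.loads(payload)
    except (OSError, ValueError, KeyError, TypeError):
        # Inconsistent or unreadable checkpoint: restart from scratch.
        return dict(fresh)
```

Restarting from scratch costs runtime, as in the 6.5 hour run above, but it avoids returning a result built on a corrupted checkpoint.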
|
Hummmmmmm, I think I'll be moving on now if .001 Credits is all you're giving for 5 Hours of Processing. Every WU I have Validated has got this amount so far ... | |
| ID: 10060 | Rating: 0 | rate:
| |
On my XP x64 lappy I have one WU running @56.5% after 4 hours... No, please... not my Dell! ... the 2 that finished in 4 hr's were on a Lappy of mine ... :) My WU on the lappy is now @89.425% after 6:10 hrs! PS. Damn, I have forgotten if that WU is a restarted one or if it's really that long. I think this one has been running through from the beginning... I want at least one 'good' WU. ____________ Lovely greetings from Cori | |
| ID: 10061 | Rating: 0 | rate:
| |
I want at least one 'good' WU You'll need 100 good ones to get on the Board though with the .001 Credits being Granted ... hahaha ;) | |
| ID: 10062 | Rating: 0 | rate:
| |
|
Oops, 0.001 credits? I'm looking into that now. | |
| ID: 10063 | Rating: 0 | rate:
| |
|
The 4 on my workstation are close to 9hrs & still crunching. | |
| ID: 10064 | Rating: 0 | rate:
| |
... P.S. Version 1.01 for Win32 is now online, fixing the resume problem. Can I finish the two old (1.0) WUs I still have crunching? They seem to have run fine so far. ;-) PS. But lappy has to test the new ones as well... or maybe I'd better wait until the Win 64-bit app is ready. ;-))) ____________ Lovely greetings from Cori | |
| ID: 10065 | Rating: 0 | rate:
| |
... P.S. Version 1.01 for Win32 is now online, fixing the resume problem. Yes, if they never suspended/resumed, everything is fine. The only change between 1.00 and 1.01 was the resume fix. | |
| ID: 10066 | Rating: 0 | rate:
| |
... P.S. Version 1.01 for Win32 is now online, fixing the resume problem. Cool. My lappy will finish the WU in about 15 minutes. After 7 hrs, wow... :-) The quad has started later and should finish in 35 minutes with a run time of a bit over 4 hrs. EDIT: Lappy's result :-) ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10067 | Rating: 0 | rate:
| |
|
1.01 binaries for new platforms are up: Win64, OS X i686, OS X PPC. | |
| ID: 10069 | Rating: 0 | rate:
| |
|
What about the Credit Issue of only Granting .001 Credits ??? | |
| ID: 10070 | Rating: 0 | rate:
| |
What about the Credit Issue of only Granting .001 Credits ??? I've made a change to the server just now which may fix it. We'll see if it worked when the next workunit is done. -Willem Jan | |
| ID: 10072 | Rating: 0 | rate:
| |
|
Thanks for the help so far! On the client side, the results that have been returned all look good, except for the resumed tasks broken by v1.00. | |
| ID: 10073 | Rating: 0 | rate:
| |
|
I am testing now with: | |
| ID: 10074 | Rating: 0 | rate:
| |
|
| |
| ID: 10075 | Rating: 0 | rate:
| |
With which client is this? If it is with the 1.00 binary, I'm afraid it's best to abort the 100%-and-running ones. It seems the 0.001 credit issue is fixed now, by the way. I also changed the project config settings so now it should no longer be possible to be your own wingman. I had forgotten to change this from when I was the only user on the project for early testing. -Willem Jan | |
| ID: 10076 | Rating: 0 | rate:
| |
It seems the 0.001 credit issue is fixed now Yes it has, Thanks ... :) | |
| ID: 10077 | Rating: 0 | rate:
| |
Just got back from london & found that several have reported in, so will leave them running, unless you think it best to abort them? I was thinking of letting them run until their deadline of 07:36 8/6/09 & aborting those left as I don't think that all will complete, even though they are running on 3 quads only. ____________ ![]() ![]() | |
| ID: 10079 | Rating: 0 | rate:
| |
It seems the 0.001 credit issue is fixed now Yup, I can confirm my very 1st granted result, too! :-) (The rest is all pending, so come on wing-men...! *LOL*) PS. I noticed the quorum of two uses the lower value for both WUs to grant right now, but I guess the credits adjustment will come later so this isn't a complaint. *grin* ____________ Lovely greetings from Cori | |
| ID: 10080 | Rating: 0 | rate:
| |
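The behaviour Cori noticed (a quorum of two granting the lower claimed value to both results) can be sketched in a few lines. This only illustrates the observation in the post; it is not the project's validator code, and `grant_credit` is a made-up name.

```python
from typing import List, Sequence


def grant_credit(claimed: Sequence[float]) -> List[float]:
    """Grant every result in the quorum the lowest claimed credit (assumed policy)."""
    granted = min(claimed)
    return [granted for _ in claimed]


# Two wingmen claim different amounts; both are granted the lower one.
print(grant_credit([41.5, 37.2]))
```

That would also explain reports in this thread of granted credit being lower than claimed whenever the wingman's claim is the smaller one.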
As you can see from the screenshot, it looks like I'm my own wingman on numerous units. Also, will these complete or timeout? I had the same problem, but only after a reboot of my PC during the time of their calculation. Then, I aborted these WU "finished" but always "in progress". ____________ | |
| ID: 10081 | Rating: 0 | rate:
| |
|
The WU's being sent are messing up the ability to get work from other Projects because they are way over estimated on how long they're going to take to run. Some say 60+ Hours & then BOINC thinks you have all this work on your Computer & won't send you any work from the other Projects. | |
| ID: 10082 | Rating: 0 | rate:
| |
|
PoorBoy: I understand completely. Thanks for the computed workunits and the reports. They'll be very useful for solving this problem. Do you have the names (abc_sieve_wu_XXXXXXXX) of specific workunits that are wrong? That would help with determining the cause. | |
| ID: 10083 | Rating: 0 | rate:
| |
... Does the progress counter itself work properly? I.e., does the completed percentage seem to increase smoothly? From what I have seen it went pretty smooth (under all my Windows x64 OSes: XP, Vista and Win7). :-) My plan for the credits is to link that to the number of triples scanned in a work unit, but I need to work out a fair formula for that. Now that sounds cool! :-) But it will only be "fair" if a WU containing less triples is running much shorter than one with many triples to be scanned... I hope I did get this right? ;-) ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10085 | Rating: 0 | rate:
| |
... Does the progress counter itself work properly? I.e., does the completed percentage seem to increase smoothly? That's good news. Hopefully there's just a miscomputation of the initial runtime guess on the server, then.
Yes, that's correct. And theoretically this is the case. I'll analyze the timing results from computed workunits tomorrow to see if reality matches theory :-) I'm very happy to see that so far there hasn't been a single crash or inconsistent output (other than those caused by the resume problem). -Willem Jan P.S. There was a serious performance problem in the Mac OS X clients. Version 1.02 for OS X fixes this. (i386/ppc are up now; x86_64 will hopefully follow soon.) | |
| ID: 10086 | Rating: 0 | rate:
| |
My plan for the credits is to link that to the number of triples scanned in a work unit, but I need to work out a fair formula for that. I am happily waiting for an updated x64 app then. It will have more optimization included I guess so I am looking forward to a little speed test... ;-) And hopefully the "credit theory" will work. *smiles* ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10087 | Rating: 0 | rate:
| |
PoorBoy: I understand completely. Thanks for the computed workunits and the reports. They'll be very useful for solving this problem. Do you have the names (abc_sieve_wu_XXXXXXXX) of specific workunits that are wrong? That would help with determining the cause. Sorry but I just Aborted them before reading the Thread again so I have no Idea which ones they were. I have to leave for awhile but this afternoon I'll download some more & see which ones they were/are. They seem to Progress okay but then I haven't been watching that closely either, some would be at 9 hr's though & only show 25% done, I didn't think these things ran that long ??? Might have something to do with the AQUA CUDA WU's I'm running too, but it shouldn't, as I've given them their own Core to run on so they shouldn't interfere with the ABC WU's, or at least I wouldn't think they would. They come all different though, 1 will say 14 hr's another 25 hr's & some as high as 60+ hr's. All my Systems are 64-Bit Windows XP Pro if that makes any difference ... | |
| ID: 10088 | Rating: 0 | rate:
| |
I am testing now with: Seems to have worked just fine. Waiting for my wingman to return his. http://abctest.math.leidenuniv.nl/abcbeta/result.php?resultid=5255 32 bit leopard: Works, but credits are much less than claimed: http://abctest.math.leidenuniv.nl/abcbeta/result.php?resultid=5298 PPC/tiger: It appears to be working. But it will not complete in time. After 26 hours, it is only 6% complete, and still will not complete by the deadline tomorrow. So I have aborted it. With longer due dates, I think it would work. http://abctest.math.leidenuniv.nl/abcbeta/result.php?resultid=5384 ____________ Dublin, CA Team SETI.USA | |
| ID: 10089 | Rating: 0 | rate:
| |
I am testing now with: Thanks for the report. There was a rather serious performance problem with the OS X clients that has been solved now. The new clients (1.02) are likely 5 times as fast as the old ones. | |
| ID: 10090 | Rating: 0 | rate:
| |
|
Looked like all was ok with the client on Ubuntu x86_64, BOINC v 6.6.20, until this: "Mon 08 Jun 2009 22:09:17 EST abcbeta Message from server: Server error: can't attach shared memory". So I couldn't return anything. ____________ | |
| ID: 10092 | Rating: 0 | rate:
| |
Thanks for the report. There was a rather serious performance problem with the OS X clients that has been solved now. The new clients (1.02) are likely 5 times as fast as the old ones. Installed on my G5, one work unit downloaded and processing. Suspended everything else to get ABCbeta to run, now it's downloaded a ton of WUs. I think I'm over committed. ____________ | |
| ID: 10093 | Rating: 0 | rate:
| |
|
New versions 1.03 are up for linux i686, linux x86_64, win32, windows x86_64. | |
| ID: 10095 | Rating: 0 | rate:
| |
|
Cool! I've downloaded fresh WUs on my Vista and XP hosts (both x64). :-) | |
| ID: 10096 | Rating: 0 | rate:
| |
Yes, you're right. The WUs are the same, and the runtimes will be shorter on Win64. In a sample run I did with a (fake) tiny workunit the runtime was reduced from 225s to 200s. | |
| ID: 10097 | Rating: 0 | rate:
| |
Only little problem: BOINC was downloading way too much work because the estimated time to completion was a bit too 'ambitious' (~30 minutes at the beginning)... I suspect that I'm going to have to do the same thing; in three hours of computation the first WU is 14% complete and I have about 40 more WUs due in 2½ days. {edit} 10.4.11, 2.0 GHz DP G5, 8 GB RAM. {/edit} ____________ | |
| ID: 10098 | Rating: 0 | rate:
| |
|
Wow, the new 64-bit app for Windoze is really fast!! | |
| ID: 10099 | Rating: 0 | rate:
| |
|
That's nice to hear. Been getting no work from ABC for over 36 hrs & run out of wu's, so switched rigs over to beta. XP X64 rig got plenty, but the rest keeps getting "shared memory error" | |
| ID: 10103 | Rating: 0 | rate:
| |
|
Ooops, the "shared memory" error keeps me from reporting finished WUs! *sniff* ;-))) | |
| ID: 10105 | Rating: 0 | rate:
| |
|
Anybody running these things, I have close to 24,000 Pending Credits piled up so far ... Getting the Shared memory message too when trying to connect ... | |
| ID: 10106 | Rating: 0 | rate:
| |
|
Same here: "shared memory"... Waiting for reporting the WUs! | |
| ID: 10107 | Rating: 0 | rate:
| |
|
Woo-hoo, servers are back! Just reported all my WUs! :-)) | |
| ID: 10108 | Rating: 0 | rate:
| |
PPC/tiger I ran a new one, with the *much* faster app 1.02. It validated. http://abctest.math.leidenuniv.nl/abcbeta/result.php?resultid=6715 Credits are still much less than claimed. ____________ Dublin, CA Team SETI.USA | |
| ID: 10109 | Rating: 0 | rate:
| |
|
abc_sieve_1.02_powerpc-apple-darwin is 80% through the first WU after 30 hours of computation, I've aborted all but the two WUs in progress. | |
| ID: 10110 | Rating: 0 | rate:
| |
|
Server down again. | |
| ID: 10111 | Rating: 0 | rate:
| |
I ran a new one, with the *much* faster app 1.02. It validated. It looks to me like you finished the wu in 19 hours, is that correct? If so, I wonder why mine are running slower on a faster G5. ____________ | |
| ID: 10113 | Rating: 0 | rate:
| |
Server down again. The feeder seems to lose its connection to the mysql server from time to time, which is strange since it's an unmodified BOINC one. I'll try to add a keep-alive to hopefully prevent the mysql connection from dropping. -Willem Jan | |
| ID: 10114 | Rating: 0 | rate:
| |
Server down again. Thanks Willem. It's not a problem though, as eventually, they'll all report in when ready. ____________ ![]() ![]() | |
| ID: 10115 | Rating: 0 | rate:
| |
I ran a new one, with the *much* faster app 1.02. It validated. I don't know if I'm reading it right but I think maybe your G5 is only running BOINC whereas mine is multi-tasking (this is my primary GP Mac)? The first wu finally finished, slow and like zombie 67's wu less credit than claimed: http://abctest.math.leidenuniv.nl/abcbeta/result.php?resultid=6675 Second wu still in progress. ____________ | |
| ID: 10116 | Rating: 0 | rate:
| |
|
Your task finished faster than mine. Compare the times. | |
| ID: 10122 | Rating: 0 | rate:
| |
Your task finished faster than mine. Compare the times. I guess I've been reading the times wrong. BOINCManager said I had 40 hours on each of my first two work units, but the two results I've finished were about 15.5 and 17.75 hours according to the results. Is BOINCManager really that far off in its elapsed time calculations? ____________ | |
| ID: 10129 | Rating: 0 | rate:
| |
Your task finished faster than mine. Compare the times. Elapsed time in BOINC manager should be correct but estimated time to completion can be way off. ;-) Only other scenario I can think of is that the wall-clock time needed for the WUs was for some reason much higher than the CPU time. Hm... did you run some AQUA mt-WUs on that host, too? ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10130 | Rating: 0 | rate:
| |
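The difference between wall-clock (elapsed) time and CPU time that Cori mentions is easy to see in a small experiment. The sketch below uses Python's standard `time` module; `measure` is a helper name invented here, and the sleep only stands in for a task that is waiting while other programs use the processor.

```python
import time


def measure(fn):
    """Run fn once and return (wall_clock_seconds, cpu_seconds) for this process."""
    wall_start = time.perf_counter()
    cpu_start = time.process_time()
    fn()
    wall = time.perf_counter() - wall_start
    cpu = time.process_time() - cpu_start
    return wall, cpu


# Sleeping uses almost no CPU, so elapsed time ends up far ahead of CPU time.
wall, cpu = measure(lambda: time.sleep(0.2))
print(f"wall-clock: {wall:.3f} s, CPU: {cpu:.3f} s")
```

On a machine that is also used for other things, a task can show 40 hours elapsed in the manager while the result page reports far fewer CPU hours.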
Elapsed time in BOINC manager should be correct but estimated time to completion can be way off. ;-) No, I'm not familiar with AQUA mt, do they have a PPC Mac application? ____________ | |
| ID: 10131 | Rating: 0 | rate:
| |
Elapsed time in BOINC manager should be correct but estimated time to completion can be way off. ;-) Hm... they have an app for "Mac OS 10.4 or later running on Intel". But that's different from PPC Mac I guess. :-( ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10132 | Rating: 0 | rate:
| |
Elapsed time in BOINC manager should be correct but estimated time to completion can be way off. ;-) No, I'm not familiar with AQUA mt, do they have a PPC Mac application? ____________ | |
| ID: 10133 | Rating: 0 | rate:
| |
|
Hi | |
| ID: 10138 | Rating: 0 | rate:
| |
|
Can we get an Update on how/where the ABCbeta Project is doing/going ??? I'm sitting on over 50,000 Pending Credits for the ABC BETA Project & if the Project's Dead I'll divert the Resources to some place else. Not Complaining but just asking how it's going ??? | |
| ID: 10140 | Rating: 0 | rate:
| |
|
A status update: | |
| ID: 10143 | Rating: 0 | rate:
| |
|
I am having to abort lots of WUs for the beta app because the actual runtime is so much longer than the estimated runtime. Is there any way either the deadlines can be extended or the estimated runtime be made more accurate? | |
| ID: 10144 | Rating: 0 | rate:
| |
A status update: Yes that sounds good, a lot of the WU's were running 8-10 hr's on my Quads, so shortening them by a factor of 3 or 4 would put them in the 2-3 hr range. | |
| ID: 10146 | Rating: 0 | rate:
| |
A status update: Yup, have to agree. :-))) ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10147 | Rating: 0 | rate:
| |
A status update: Will doing this cause BOINC to simply DL more WUs causing effectively the same situation? ____________ | |
| ID: 10148 | Rating: 0 | rate:
| |
A status update: Yes it will. If you attach to a project for the first time it's always a good idea to reduce your additional work buffer and crunch a few WUs. That way the BOINC client can calculate the Duration Correction Factor, taking into account the speed of your computer and the (mis-)estimation of the calculation duration that comes with the WU. For the next work request BOINC then knows how long the tasks will take (and so how much work should be requested) and also calculates the debts of your new project against your other projects. | |
| ID: 10149 | Rating: 0 | rate:
| |
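A rough sketch of the Duration Correction Factor idea from the post above, for anyone who wants to see the arithmetic. This is a simplified model and not the actual BOINC client code: the function names are invented, and the update rule (jump up immediately, drift down slowly) is an assumption about the general behaviour.

```python
def estimated_runtime_s(fpops_est: float, host_flops: float, dcf: float = 1.0) -> float:
    """Server estimate (operations / host speed) scaled by the correction factor."""
    return fpops_est / host_flops * dcf


def updated_dcf(dcf: float, uncorrected_estimate_s: float, actual_s: float) -> float:
    """Assumed rule: raise the factor at once, lower it only gradually."""
    ratio = actual_s / uncorrected_estimate_s
    if ratio > dcf:
        return ratio
    return 0.9 * dcf + 0.1 * ratio


# A WU estimated at 3000 s that really takes 12000 s pushes the factor to 4.0,
# so the next WU of the same size is estimated at 12000 s instead of 3000 s.
dcf = updated_dcf(1.0, 3000.0, 12000.0)
print(dcf, estimated_runtime_s(3.0e12, 1.0e9, dcf))
```

This is why crunching a few WUs with a small buffer first helps: until the factor has caught up, every work request is sized using the too-short estimate.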
A status update: If you go into your default settings you can set it to only download say 0.25 or even 0.10 days of work by its own calculations. That should solve your immediate problems of having to abort units. Your Account, General Preferences and then under Network Usage. If you set the first line to 0.00 I think it only downloads a unit or so at a time because it thinks you have an always connected computer. If you then adjust the additional work line to say 0.10 or 0.25 that tries to add enough work for 1/10 of a day or 1/4 of a day additionally. This is NOT perfect but it sorta works. When you start, in your case, leave the additional work line blank too and Boinc should only get a unit or so at a time. ____________ | |
| ID: 10152 | Rating: 0 | rate:
| |
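To put numbers on the buffer advice above, here is a back-of-the-envelope sketch of why a too-short estimate overfills the queue. It is a deliberate simplification (one project, all cores busy, no resource shares), and the function names are made up for this example; the 30 minute estimate and 4 hour runtime are the rough figures reported earlier in the thread.

```python
import math

SECONDS_PER_DAY = 86400


def tasks_to_fill_buffer(buffer_days: float, ncpus: int, est_task_s: float) -> int:
    """Number of tasks needed to fill the work buffer, going by the estimate."""
    return math.ceil(buffer_days * SECONDS_PER_DAY * ncpus / est_task_s)


def real_days_of_work(tasks: int, ncpus: int, actual_task_s: float) -> float:
    """How long those tasks really keep the machine busy."""
    return tasks * actual_task_s / ncpus / SECONDS_PER_DAY


# Quad core, 0.25 day buffer, tasks estimated at 30 minutes but really taking 4 hours.
n = tasks_to_fill_buffer(0.25, 4, 1800.0)
print(n, real_days_of_work(n, 4, 4 * 3600.0))
```

In this example a quarter-day buffer turns into two full days of crunching, which is how hosts ended up with more WUs than they could finish before the deadline.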
|
Server error: | |
| ID: 10194 | Rating: 0 | rate:
| |
|
Status update: after analyzing a number of workunits, it seems the speed heuristic actually does quite well, but the code that builds workunits makes many workunits much longer than intended. | |
| ID: 10196 | Rating: 0 | rate:
| |
|
This work unit stopped progress at about 12%, I paused and continued it in hopes it would finish but at 59:41 (elapsed, not CPU) I aborted it. It was about 8 hours past due at that time and still at 12% completed. | |
| ID: 10197 | Rating: 0 | rate:
| |
|
It looks like this work unit had similar issues. No mention of permission errors but it failed to detect a heartbeat several times before erroring out. Both are on my 06/2005 G5 2.0 DP. | |
| ID: 10198 | Rating: 0 | rate:
| |
|
(deleted) | |
| ID: 10199 | Rating: 0 | rate:
| |
|
The only thing I am seeing, both Windows and Linux machines, is that the time to completion starts out at wildly different numbers. I have several that think they will take 70 hours to finish and some that think they will only take 15 hours to finish. I don't know where to go to see how long they actually took though. My ABC account stats only shows the non beta units. | |
| ID: 10203 | Rating: 0 | rate:
| |
... I don't know where to go to see how long they actually took though. My ABC account stats only shows the non beta units. You can see them at your ABC Beta account: http://abctest.math.leidenuniv.nl/abcbeta/home.php ;-) ____________ Lovely greetings from Cori ![]() ![]() | |
| ID: 10204 | Rating: 0 | rate:
| |
|
I have another work unit that appears to be doing the same thing. It ran up to 15% in a few hours, now it's seemingly stuck at 15% after 12 hours. I am going to abort it and take my G5 off the beta program until there is a fix. | |
| ID: 10205 | Rating: 0 | rate:
| |
... I don't know where to go to see how long they actually took though. My ABC account stats only shows the non beta units. thanks ____________ ![]() | |
| ID: 10207 | Rating: 0 | rate:
| |
|
I found several WUs which were successfully completed but didn't get granted - only my wingmen got credits. | |
| ID: 10212 | Rating: 0 | rate:
| |
|
I have a question...when will be new wu's available? | |
| ID: 10234 | Rating: 0 | rate:
| |
|
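Why did it show 6 hours remaining at the start, when I have now been running it for two days and the task still isn't finished? It is only 25% complete, and the remaining time still shows 7 hours. | |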
| ID: 11243 | Rating: 0 | rate:
| |
|
I have a question: if I finish my program, what will happen? | |
| ID: 11263 | Rating: 0 | rate:
| |
I have a question , if I fisihed my program, what will happen? Finished what program? Do you mean your workunits you got from this, or any other, project? If so they just get returned and you get new ones to work on. The Project itself decides what we work on and how many workunits they need to prove their theory right or wrong. | |
| ID: 11268 | Rating: 0 | rate:
| |