Message boards : Projects : Anything and Everything to do with (WCG) World Community Grid
Message board moderation
Previous · 1 . . . 30 · 31 · 32 · 33 · 34 · Next
| Author | Message |
|---|---|
|
Send message Joined: 27 Aug 22 Posts: 38 |
And now there are some MCM WUs on my Wondows machine, too. The first one was ready in 2 Minutes 16 seconds. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
I've been getting so far 75 of the short test tasks. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
In reply to bill's message of 21 Oct 2025: It's a miracle! I got tasks!Yes, they are special short test tasks, as previously mentioned by the WCG team on the Operational Status page, on October 18. Quote from the relevant part: "We are sending small batches of workunits out starting tonight with batch IDs in the range 9999900+ for MCM1 to test the new distributed partition-aware batch upserting app-specific create_work daemons. The few volunteers who get these workunits before we start releasing larger batches as we gain confidence that the new system is working as expected may notice these workunits have a much smaller number of signatures and run much faster than normal. These are still meaningful workunits, but key parameters such as number of signatures to test per workunit were reduced so we could get feedback quckly." |
|
Send message Joined: 30 Mar 20 Posts: 584
|
And a new update from the team was just posted to the Operational Status page: October 21, 2025 Finally stress testing rather than correctness testing. Sent a batch of 100,000 workunits (fast running, not full size in case something crashed. Thank you for your patience and continued support. |
|
Send message Joined: 11 Sep 15 Posts: 8
|
Thanks for the info. I've been out of the country so I'm pretty much in the dark about things (as if this isn't my normal state). |
|
Send message Joined: 30 Mar 20 Posts: 584
|
Another batch with a bit longer running tasks, are being sent out now. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
New test batch coming in now. The tasks seems to have more normal crunch times. |
jay_eSend message Joined: 8 Mar 07 Posts: 123
|
Greetings, Sharing good news. Earlier today, I rcvd some MCM WU. Just now, I requested enough to fill 0.5 day of work. The new MCM wu have an estimated time of 1 day and 18 minutes. I am running Linux on an older laptop -1.9Ghz Intel i-7, and the actual times have been running an hour or two less. Cheers! Jay |
|
Send message Joined: 25 May 09 Posts: 1383
|
The most recent batch of MCM tasks have very large guesstimate run times to start with, but in reality are taking about an hour on my computer (which is about twice as fast as yours. This is, in the main, down to these being a "new" type of task, and by design BOINC's initial runtime guesses are way too long, so don't worry, eventually BOINC will catch up with reality and give better guesses to start with. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
More test batches (MCM1_9999XXX) are being sent out now. Don't expect finished tasks to validate though. The validators still seems to be a work in progress. Out of many hundreds of test tasks that I have crunched, only 3 have been validated. The tasks crunched before or during the migration, and those uploaded and reported when WCG came back, are all still "pending validation" But, who's in a hurry really? Life goes on, with or without WCG validating :-) |
|
Send message Joined: 30 Mar 20 Posts: 584
|
I think the WCG scheduler is a bit busy now. Most of the time there is no problem, but from time to time the response to a request is: Scheduler request failed: Error 408 |
DaveSend message Joined: 28 Jun 10 Posts: 3029
|
The validators still seems to be a work in progress.Yes, nowhere near one hundred never mind hundreds here but only one task validated since the resumption between desktop and Android. |
|
Send message Joined: 11 Sep 15 Posts: 8
|
In reply to Grumpy Swede's message of 23 Oct 2025: Out of many hundreds of test tasks that I have crunched, only 3 have been validated. Hehehe...at one point I had something like 2000 waiting for them to come back on-line. They've since transitioned to never-never-land, hopefully not the great-bit-bucket-in-the-sky. |
jay_eSend message Joined: 8 Mar 07 Posts: 123
|
In reply to Grumpy Bill's message of 23 Oct 2025: The Jurisca status said: October 21, 2025 Finally stress testing rather than correctness testing. Sent a batch of 100,000 workunits (fast running, not full size in case something crashed. Thank you for your patience and continued support. My assumption is/was that a stress test is run on a fully operational system. Doing a stress test without processing of completed results to both WU partners seems like an incomplete test - to me. ---- Also, I had several WU pending results that were sent to me on 8/29 and WUs returned on 9/24. Indeed, if you were my partner of any of these WU - and if they went to the Great Bit Bucket in The Sky, I would think that Jurisca/Krembil should tell us. With that in mind, Looking at the Operational Status, I see that it is server-centric. There is no status focused on the users/volunteers; For example. 1) Were WU sent to users lost? (whether or not they were completed? 2) If not lost, is the process the accredit returned WU existing? Repaired? In process? 3) If that process is restored, when will users/volunteers see their results 4) When is the ETA of Boinc Manager's graph of Host/User statistics going to work? (What happened to our work?) An explanation of events that are user-centric is ovedue. Have our expectations been dulled by a lack of communication? -- begin metaphor -- I understand the value of giving praise to the efforts of those working to save the ship. I am not pleased by the lack of lifeboat drills and the explanation to passengers about the size of the hole in the hull. Yes, thank you for telling us that the ship has bilge pumps. You have not explicitly told us the ship will be made sea-worthy again, only that a speed trial is in progress. -- end metaphor -- Thanks to Bill, Bryn, and Grumpy for their comments and questions, Jay |
jay_eSend message Joined: 8 Mar 07 Posts: 123
|
oops. something left out. https://www.worldcommunitygrid.org/ms/device/viewDevices.do?installedSince=0&lastResult=0 does not see any work done since 8/15/2025. Jay |
|
Send message Joined: 30 Mar 20 Posts: 584
|
A new short update from Dylan, the WCG tech on the WCG forum: "blocked uploads to freeze Kafka topics while I try to fix the validation throughput" https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47520_offset,260#706749 |
|
Send message Joined: 30 Mar 20 Posts: 584
|
And a new update on the WCG Operational Status page October 24, 2025 We have paused uploads and release of test batches while we work on the validation throughput issue. We think we have identified the root cause of the low validation rate for the MCM1 test batches, and we will send a few more test batches to confirm the fix does works. If this is the final fix and we see the expected validation rate for pairs of MCM1 uploads for new batches, we will replay the Kafka consumer on the upload events fired for test batches received earlier in the week, and this should idempotently allow the new batch assimilator to process those validations and assign credit. If the above goes well, we will schedule regular MCM1 batches to resume instead of the test batches. As volulnteers have noted, we have not yet reconciled uploads of regular MCM1 results submitted before we began sending test batches, and before the migration, but we have those files and will be able to do this in a batch update once the path for new workunits is working as described above. Naturally, we will resume ARP and MAM only after these issues are fully resolved. |
|
Send message Joined: 25 May 09 Posts: 1383
|
My assumption is/was that a stress test is run on a fully operational system. That entirely depends on what is being stress tested. If the test is of the ability to deliver and receive lots of tasks and results quickly then using small (incomplete) tasks is far better than complete tasks. Small tasks can also be used to make sure that other bits of BOINC's server-side activities are working correctly. Indeed this approach appears to have provided some data that's of use in resolving the current validation process bottleneck - getting back loads of tasks (albeit small ones) allows some human validation to take place and compare this with the computed validation (which is much harder with large results?). On your other questions about the lack of user related data - I agree, I think this is all part of the system they inherited from IBM, but is available in "BOINC-BOINC" as used by so many projects. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
Something is happening now. Previous test batches are beginning to validate, and they have sent out at least one new test batch. I did not catch any of those though, because I had my computer turned off. As of now, no new task are available. Edit, added: I think the validation of previous test tasks stopped, well before all test task had been validated. |
|
Send message Joined: 30 Mar 20 Posts: 584
|
Two new messages from Dylan, the WCG tech: https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47541_offset,220#706786 https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47541_offset,220#706789 |
Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.