Thread 'Anything and Everything to do with (WCG) World Community Grid'

Author	Message
MyrCu Send message Joined: 27 Aug 22 Posts: 41	Message 117134 - Posted: 21 Oct 2025, 13:24:00 UTC - in response to Message 117132. Last modified: 21 Oct 2025, 13:29:19 UTC And now there are some MCM WUs on my Wondows machine, too. The first one was ready in 2 Minutes 16 seconds. ID: 117134 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117135 - Posted: 21 Oct 2025, 13:38:34 UTC I've been getting so far 75 of the short test tasks. ID: 117135 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117136 - Posted: 21 Oct 2025, 13:42:45 UTC - in response to Message 117133. In reply to bill's message of 21 Oct 2025: It's a miracle! I got tasks! Whoops...they seem to only run 3:30 minutes each. Yes, they are special short test tasks, as previously mentioned by the WCG team on the Operational Status page, on October 18. Quote from the relevant part: "We are sending small batches of workunits out starting tonight with batch IDs in the range 9999900+ for MCM1 to test the new distributed partition-aware batch upserting app-specific create_work daemons. The few volunteers who get these workunits before we start releasing larger batches as we gain confidence that the new system is working as expected may notice these workunits have a much smaller number of signatures and run much faster than normal. These are still meaningful workunits, but key parameters such as number of signatures to test per workunit were reduced so we could get feedback quckly." ID: 117136 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117137 - Posted: 21 Oct 2025, 13:47:34 UTC And a new update from the team was just posted to the Operational Status page: October 21, 2025 Finally stress testing rather than correctness testing. Sent a batch of 100,000 workunits (fast running, not full size in case something crashed. Thank you for your patience and continued support. ID: 117137 · Reply Quote

bill Send message Joined: 11 Sep 15 Posts: 47	Message 117138 - Posted: 21 Oct 2025, 14:10:13 UTC - in response to Message 117136. Thanks for the info. I've been out of the country so I'm pretty much in the dark about things (as if this isn't my normal state). ID: 117138 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117139 - Posted: 21 Oct 2025, 14:49:11 UTC Another batch with a bit longer running tasks, are being sent out now. ID: 117139 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117142 - Posted: 21 Oct 2025, 19:51:19 UTC New test batch coming in now. The tasks seems to have more normal crunch times. ID: 117142 · Reply Quote

jay_e Send message Joined: 8 Mar 07 Posts: 126	Message 117147 - Posted: 22 Oct 2025, 2:13:58 UTC - in response to Message 117142. Last modified: 22 Oct 2025, 2:15:47 UTC Greetings, Sharing good news. Earlier today, I rcvd some MCM WU. Just now, I requested enough to fill 0.5 day of work. The new MCM wu have an estimated time of 1 day and 18 minutes. I am running Linux on an older laptop -1.9Ghz Intel i-7, and the actual times have been running an hour or two less. Cheers! Jay ID: 117147 · Reply Quote

robsmith Volunteer tester Help desk expert Send message Joined: 25 May 09 Posts: 1457	Message 117148 - Posted: 22 Oct 2025, 7:20:10 UTC - in response to Message 117147. The most recent batch of MCM tasks have very large guesstimate run times to start with, but in reality are taking about an hour on my computer (which is about twice as fast as yours. This is, in the main, down to these being a "new" type of task, and by design BOINC's initial runtime guesses are way too long, so don't worry, eventually BOINC will catch up with reality and give better guesses to start with. ID: 117148 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117155 - Posted: 23 Oct 2025, 1:19:14 UTC Last modified: 23 Oct 2025, 1:21:48 UTC More test batches (MCM1_9999XXX) are being sent out now. Don't expect finished tasks to validate though. The validators still seems to be a work in progress. Out of many hundreds of test tasks that I have crunched, only 3 have been validated. The tasks crunched before or during the migration, and those uploaded and reported when WCG came back, are all still "pending validation" But, who's in a hurry really? Life goes on, with or without WCG validating :-) ID: 117155 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117158 - Posted: 23 Oct 2025, 2:44:38 UTC I think the WCG scheduler is a bit busy now. Most of the time there is no problem, but from time to time the response to a request is: Scheduler request failed: Error 408 ID: 117158 · Reply Quote

Dave Help desk expert Send message Joined: 28 Jun 10 Posts: 3348	Message 117170 - Posted: 23 Oct 2025, 16:12:10 UTC The validators still seems to be a work in progress. Out of many hundreds of test tasks that I have crunched, only 3 have been validated. Yes, nowhere near one hundred never mind hundreds here but only one task validated since the resumption between desktop and Android. ID: 117170 · Reply Quote

bill Send message Joined: 11 Sep 15 Posts: 47	Message 117172 - Posted: 23 Oct 2025, 17:04:13 UTC - in response to Message 117155. Last modified: 23 Oct 2025, 17:05:38 UTC In reply to Grumpy Swede's message of 23 Oct 2025: Out of many hundreds of test tasks that I have crunched, only 3 have been validated. Hehehe...at one point I had something like 2000 waiting for them to come back on-line. They've since transitioned to never-never-land, hopefully not the great-bit-bucket-in-the-sky. ID: 117172 · Reply Quote

jay_e Send message Joined: 8 Mar 07 Posts: 126	Message 117194 - Posted: 24 Oct 2025, 14:59:47 UTC - in response to Message 117172. Last modified: 24 Oct 2025, 15:01:53 UTC In reply to Grumpy Bill's message of 23 Oct 2025: The Jurisca status said: October 21, 2025 Finally stress testing rather than correctness testing. Sent a batch of 100,000 workunits (fast running, not full size in case something crashed. Thank you for your patience and continued support. My assumption is/was that a stress test is run on a fully operational system. Doing a stress test without processing of completed results to both WU partners seems like an incomplete test - to me. ---- Also, I had several WU pending results that were sent to me on 8/29 and WUs returned on 9/24. Indeed, if you were my partner of any of these WU - and if they went to the Great Bit Bucket in The Sky, I would think that Jurisca/Krembil should tell us. With that in mind, Looking at the Operational Status, I see that it is server-centric. There is no status focused on the users/volunteers; For example. 1) Were WU sent to users lost? (whether or not they were completed? 2) If not lost, is the process the accredit returned WU existing? Repaired? In process? 3) If that process is restored, when will users/volunteers see their results 4) When is the ETA of Boinc Manager's graph of Host/User statistics going to work? (What happened to our work?) An explanation of events that are user-centric is ovedue. Have our expectations been dulled by a lack of communication? -- begin metaphor -- I understand the value of giving praise to the efforts of those working to save the ship. I am not pleased by the lack of lifeboat drills and the explanation to passengers about the size of the hole in the hull. Yes, thank you for telling us that the ship has bilge pumps. You have not explicitly told us the ship will be made sea-worthy again, only that a speed trial is in progress. -- end metaphor -- Thanks to Bill, Bryn, and Grumpy for their comments and questions, Jay ID: 117194 · Reply Quote

jay_e Send message Joined: 8 Mar 07 Posts: 126	Message 117197 - Posted: 24 Oct 2025, 16:02:02 UTC - in response to Message 117194. oops. something left out. https://www.worldcommunitygrid.org/ms/device/viewDevices.do?installedSince=0&lastResult=0 does not see any work done since 8/15/2025. Jay ID: 117197 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117202 - Posted: 24 Oct 2025, 20:36:03 UTC A new short update from Dylan, the WCG tech on the WCG forum: "blocked uploads to freeze Kafka topics while I try to fix the validation throughput" https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47520_offset,260#706749 ID: 117202 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117206 - Posted: 25 Oct 2025, 0:46:17 UTC And a new update on the WCG Operational Status page October 24, 2025 We have paused uploads and release of test batches while we work on the validation throughput issue. We think we have identified the root cause of the low validation rate for the MCM1 test batches, and we will send a few more test batches to confirm the fix does works. If this is the final fix and we see the expected validation rate for pairs of MCM1 uploads for new batches, we will replay the Kafka consumer on the upload events fired for test batches received earlier in the week, and this should idempotently allow the new batch assimilator to process those validations and assign credit. If the above goes well, we will schedule regular MCM1 batches to resume instead of the test batches. As volulnteers have noted, we have not yet reconciled uploads of regular MCM1 results submitted before we began sending test batches, and before the migration, but we have those files and will be able to do this in a batch update once the path for new workunits is working as described above. Naturally, we will resume ARP and MAM only after these issues are fully resolved. ID: 117206 · Reply Quote

robsmith Volunteer tester Help desk expert Send message Joined: 25 May 09 Posts: 1457	Message 117225 - Posted: 26 Oct 2025, 8:12:11 UTC - in response to Message 117194. My assumption is/was that a stress test is run on a fully operational system. Doing a stress test without processing of completed results to both WU partners seems like an incomplete test - to me. That entirely depends on what is being stress tested. If the test is of the ability to deliver and receive lots of tasks and results quickly then using small (incomplete) tasks is far better than complete tasks. Small tasks can also be used to make sure that other bits of BOINC's server-side activities are working correctly. Indeed this approach appears to have provided some data that's of use in resolving the current validation process bottleneck - getting back loads of tasks (albeit small ones) allows some human validation to take place and compare this with the computed validation (which is much harder with large results?). On your other questions about the lack of user related data - I agree, I think this is all part of the system they inherited from IBM, but is available in "BOINC-BOINC" as used by so many projects. ID: 117225 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117231 - Posted: 26 Oct 2025, 14:43:01 UTC Last modified: 26 Oct 2025, 15:10:51 UTC Something is happening now. Previous test batches are beginning to validate, and they have sent out at least one new test batch. I did not catch any of those though, because I had my computer turned off. As of now, no new task are available. Edit, added: I think the validation of previous test tasks stopped, well before all test task had been validated. ID: 117231 · Reply Quote

Grumpy Swede Send message Joined: 30 Mar 20 Posts: 741	Message 117235 - Posted: 26 Oct 2025, 23:39:17 UTC Two new messages from Dylan, the WCG tech: https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47541_offset,220#706786 https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47541_offset,220#706789 ID: 117235 · Reply Quote

Copyright © 2026 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.