Posts by zombie67 [MM]

21) Questions and Answers : Unix/Linux : NPU and TPU AI Co-processors (Message 643)
Posted 13 Oct 2020 by zombie67 [MM]
Post:
[...]In the end, GPU support and Android support are much higher on the "bang-for-your-developers-time" priority list than NPUs, so unless there's some really motivated developer who wants to step up and do just that, it'll probably stay that way.


Any ETA for GPU apps here?
22) Message boards : News : Badges! (Message 641)
Posted 11 Oct 2020 by zombie67 [MM]
Post:
Wow. Big jump from 1M to 500M.
23) Message boards : News : Badges! (Message 639)
Posted 11 Oct 2020 by zombie67 [MM]
Post:
What is the largest badge value? I have a badge for 1m, but my total is >5m.
24) Message boards : News : Badges! (Message 593)
Posted 5 Oct 2020 by zombie67 [MM]
Post:
Minor update:

I'll try moving over the old credit later this week when (hopefully) we'll have some scheduled downtime to move to the new server. For those who don't have badges yet, give me another week. That'll also coincide with the early adopter badge. I can practice on my own account first :)


Good luck!
25) Message boards : News : Badges! (Message 564)
Posted 3 Oct 2020 by zombie67 [MM]
Post:
3) Special Badges: These are special, one-off badges for special events. The first one is a reward for those who have supported our little project so far. We're going to grant an "Early Adopter" badge for anyone who has any type of credit on this project during its first 100 days. This project went live the night of June 30th, 2020, so that leaves just 1 week (October 8th!) to earn some credit and qualify for the badge.


My early adopter badge isn't showing. I joined 1 Jul 2020, and have over 5m in credit, generated from the start.
26) Questions and Answers : Issue Discussion : stats export interval (Message 147)
Posted 9 Jul 2020 by zombie67 [MM]
Post:
No worldwide stats are reported , since now. I'm allready waited 2 days.

Stats for this project are already exported. They have been showing up on Free-DC for about a week now.

https://stats.free-dc.org/proj/mlc
27) Message boards : News : MLDS release v0.911 (Message 78)
Posted 3 Jul 2020 by zombie67 [MM]
Post:
Looks like the decimal point in the application version got moved. My machine is showing it as 9.11, not .911.
28) Questions and Answers : Issue Discussion : No longer getting tasks (Message 60)
Posted 3 Jul 2020 by zombie67 [MM]
Post:
Been crunching ok since day 1, now today, no new tasks being sent.. nothing has changed on my end on any of the 3 machines.... Sever shows 17k unsent.


Any ideas? Something up?


Enable test applications in your project preferences.
29) Questions and Answers : Issue Discussion : All tasks error out after reboot (Message 37)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
my "short" tasks were already validated by other, different hosts
if there are no differences between the two, then the third is superfluous


If the interrupted tasks that error out and move to pending validation, if they still validate eventually, then we should all just interrupt all tasks to maximize points. What is the point in taking the time to run tasks to completion. Right?
30) Questions and Answers : Issue Discussion : All tasks error out after reboot (Message 35)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
I think the "pending validation" pool of tasks is going to grow large very fast. Reboots are going to create a lot of false "pending validation" tasks, that will then need a third task to be issued (and hopefully completed properly), to get the WU done. This is especially problematic now, because the recent linux updates are requiring reboot.
31) Questions and Answers : Issue Discussion : All tasks error out after reboot (Message 31)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
Hmm, continuing after reboot works for me. Is this one of the workunits?

https://www.mlcathome.org/mlcathome_ops/db_action.php?table=result&id=8910

That's a crash deep in libtorch, which is well below any code I wrote. So either something really bad is going on, or for some reason the snapshot got corrupted. I'll see if I can dig up the stdout and see what happened.


I am not allowed to log into that URL.

But no, that single task is not what I am referring to.

The errored out tasks I am talking about were still sitting on my machines. Also weird that they had not reported yet. Anyway, the errored out tasks are now reported and available to look at. Here is a sample:

https://www.mlcathome.org/mlcathome/result.php?resultid=9662

There were about 60 tasks on each machine that errored out after reboot.

(note: The errored-out tasks are now listed as validation pending. That is a different problem already reported.)
32) Questions and Answers : Issue Discussion : CPU extensions used? (Message 28)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
I am not sure how where to look at that web site to understand your answer. But I have a machine with AVX-512, and the run times are not noticeably faster. So I don't think it's being used. How to tell?
33) Questions and Answers : Issue Discussion : All tasks error out after reboot (Message 27)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
I saw that the tasks were checkpointing, which is great. So I decided to go ahead and reboot my machines for an unrelated issue. When it started up, all the tasks error out. Yikes.

Edit: The tasks are now listed as validation pending. That is a different problem already reported.
34) Questions and Answers : Issue Discussion : CPU extensions used? (Message 25)
Posted 2 Jul 2020 by zombie67 [MM]
Post:
Does this project use CPU extensions such as AVX/AVX2/AVX-512 if the CPU has those features available?


Previous 20

©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)