[TWIM Notes] Oct 27 2020
Author | Message |
---|---|
Joined: 30 Jun 20 Posts: 462 Credit: 21,406,548 RAC: 0 |
This Week in MLC@Home
Notes for Oct 27 2020
A weekly summary of news and notes for MLC@Home

Summary

This week we crossed the 1000 users-with-credit threshold! Thanks again to all our volunteers!

GPU week, part 1. This week was consumed with developing CUDA clients and releasing them to testing. The good news: when they work, they provide a nice performance boost. The bad news: there are a lot of kinks to work out yet.

Currently, we've released a Windows/CUDA and a Linux/CUDA binary. The Windows binary works if the user has the right environment (CUDA 10.2 is known to work). The Linux/CUDA binary was released just 48 hours ago and is currently broken in new and surprising ways not seen in our internal testing. Both apps will need some server-side changes so that work is sent only to hosts that meet these minimum requirements. Luckily, that's why we have the "mldstest" application: to find these issues before we release to the main channel!

Also, the GPU apps are much larger than our current CPU app and have wildly different resource requirements. So much so that we will likely release them as a separate app, with their own WUs, to run alongside the CPU apps. That way we can isolate GPU WUs from CPU WUs and keep everyone happy and crunching.

If you're interested in testing, please make sure you have "Run test applications?" and "Use NVIDIA GPU?" checked in your project preferences, and follow/post your experience in the forum.

News:
SingleDirectMachine 10002/10004
EightBitMachine 10001/10006
SingleInvertMachine 10001/10003
SimpleXORMachine 10000/10002
ParityMachine 884/10005
ParityModified 275/10005
EightBitModified 6492/10006
SimpleXORModified 10005/10005
SingleDirectModified 10004/10004
SingleInvertModified 10002/10002

Dataset 3 progress:
Overall (so far): 37600/40425
Milestone 1, 100x100: 10000/10000
Milestone 2, 100x1000: 37600/100000
Milestone 3, 100x10000: 37600/1000000

Last week's TWIM Notes: Oct 19 2020

Thanks again to all our volunteers!

-- The MLC@Home Admins |
Joined: 4 Oct 20 Posts: 6 Credit: 386,575 RAC: 0 |
Great news!!! OpenCL support anytime soon? Congratulations on the achievements! |
Joined: 30 Jun 20 Posts: 462 Credit: 21,406,548 RAC: 0 |
We're limited by what PyTorch (our underlying framework) supports well, which is CUDA and ROCm. That covers NVIDIA GPUs on Linux and Windows, and some AMD GPUs (discrete POLARIS and VEGA) on Linux. We would love to support OpenCL (or Vulkan Compute!), but PyTorch doesn't at the moment. Intel is doing its own thing (oneAPI/ideep/mkldnn/whatever they're calling it this week), which will likely support its newer GPUs (both discrete and integrated). Intel is also a heavy contributor to PyTorch, so I suspect those GPUs will be supported at some point.

Relying on a framework is a win overall: we're not writing all the math/algorithms ourselves but using tested, proven code, and we get GPU support "built in". But it does mean we're stuck with its shortcomings (limited to what it currently supports for GPUs, and no static linking). |
Joined: 20 Sep 20 Posts: 1 Credit: 186,100 RAC: 2 |
Thanks for the news. Exciting! I am looking forward to seeing how this works on my RTX 2070. |
Joined: 2 Jul 20 Posts: 7 Credit: 2,052,848 RAC: 8 |
Asking out of curiosity: how much volume will you be creating for the NVIDIA test work? I have adjusted my settings, but there do not seem to be any test tasks being generated.

Thanks,
Bill F

In October of 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic; There was no expiration date. |
Joined: 30 Jun 20 Posts: 462 Credit: 21,406,548 RAC: 0 |
At the moment, GPU testing WUs are paused while we try to figure out a bizarre issue with the Linux CUDA app, which is crashing from an external signal. We're not going to release any more test units until that's fixed. It's a real head-scratcher, as the app only crashes when run from BOINC, never when run standalone. We're in touch with other project admins and BOINC developers who have CUDA clients to get some help debugging.

When that's resolved, we'll throw some more testing WUs out and make a BOINC "app plan" that sets some minimum system requirements a host must meet before it gets GPU WUs. We'll also post requirements prominently on the web site and forums for all our existing and future clients, as that's something we could do a better job of publishing.

Going forward, we expect to have about an even split between GPU WUs and CPU WUs. |
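
For readers wondering what a minimum-requirements gate amounts to, here is a minimal sketch of the idea in Python. It is not BOINC's app plan mechanism or the project's actual configuration: the `Host` and `GpuRequirements` classes and the `eligible_for_gpu_work` function are hypothetical names made up for illustration, and the only value taken from this thread is CUDA 10.2, which the notes above report as known to work.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Host:
    """Hypothetical summary of what a volunteer's machine reports."""
    os_name: str                            # e.g. "linux" or "windows"
    gpu_vendor: Optional[str]               # e.g. "nvidia", or None if no usable GPU
    cuda_version: Optional[Tuple[int, int]]  # e.g. (10, 2), or None if CUDA is absent


@dataclass
class GpuRequirements:
    """Hypothetical minimums a GPU app might declare (illustrative values)."""
    supported_os: Tuple[str, ...] = ("linux", "windows")
    gpu_vendor: str = "nvidia"
    # Assumption: 10.2 is used as the floor because the notes say it is known to work.
    min_cuda_version: Tuple[int, int] = (10, 2)


def eligible_for_gpu_work(host: Host, reqs: GpuRequirements) -> bool:
    """Return True only if the host satisfies every stated minimum."""
    if host.os_name not in reqs.supported_os:
        return False
    if host.gpu_vendor != reqs.gpu_vendor:
        return False
    if host.cuda_version is None:
        return False
    # Tuples compare element by element, so (11, 0) >= (10, 2) is True.
    return host.cuda_version >= reqs.min_cuda_version


if __name__ == "__main__":
    reqs = GpuRequirements()
    print(eligible_for_gpu_work(Host("linux", "nvidia", (10, 2)), reqs))
    print(eligible_for_gpu_work(Host("linux", "nvidia", (10, 1)), reqs))
```

In a real deployment the server's scheduler would make this decision from what each host reports, so hosts that fall short would simply never be sent GPU WUs.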
©2023 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)