Posts by Hal Bregg

1) Questions and Answers : Issue Discussion : 195 (0x000000C3) EXIT_CHILD_FAILED error (Message 1379)
Posted 12 Oct 2021 by Hal Bregg
Post:
I didn't add more details about above errors.

Machine Learning Dataset Generator (test) v9.96 keep crashing on Ubuntu 20.04.3 LTS [5.4.0-88-generic|libc 2.31 (Ubuntu GLIBC 2.31-0ubuntu9.2)].
2) Questions and Answers : Issue Discussion : 195 (0x000000C3) EXIT_CHILD_FAILED error (Message 1378)
Posted 12 Oct 2021 by Hal Bregg
Post:
I trashed a lot of WUs today with following error

Machine Learning Dataset Generator v9.90 (Linux/x86_64) (libTorch: release/1.9)
HDF5-DIAG: Error detected in HDF5 (1.12.0) thread 0:
  #000: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5F.c line 793 in H5Fopen(): unable to open file
    major: File accessibility
    minor: Unable to open file
  #001: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5VLcallback.c line 3500 in H5VL_file_open(): open failed
    major: Virtual Object Layer
    minor: Can't open object
  #002: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5VLcallback.c line 3465 in H5VL__file_open(): open failed
    major: Virtual Object Layer
    minor: Can't open object
  #003: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5VLnative_file.c line 100 in H5VL__native_file_open(): unable to open file
    major: File accessibility
    minor: Unable to open file
  #004: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5Fint.c line 1707 in H5F_open(): unable to read superblock
    major: File accessibility
    minor: Read failed
  #005: /home/gitlab-runner/builds/-ZexzTs7/0/mlcathome/mlds/extern/hdf5/src/H5Fsuper.c line 412 in H5F__super_read(): file signature not found
    major: File accessibility
    minor: Not an HDF5 file
terminate called after throwing an instance of 'H5::FileIException'
16:16:01 (360255): mlds exited; CPU time 0.307757
16:16:01 (360255): app exit status: 0x86
16:16:01 (360255): called boinc_finish(195)


What is causing the errors?
3) Questions and Answers : Windows : Exit status -1073741515 (0xC0000135) STATUS_DLL_NOT_FOUND (Message 1345)
Posted 27 Aug 2021 by Hal Bregg
Post:
Update Appears the TEST APP is not working out 100% of the time either on that PC.
2 TASKS FALED with "195 (0x000000C3) EXIT_CHILD_FAILED"
6594296
6593714

.. and have 2 TEST APP tasks currently in progress running for over 1 hour each so far:
6595197
6593736

CRASH Details from the Windows Error reporting LOG:
Faulting application name: mlds.exe, version: 0.0.0.0, time stamp: 0x61135d4d
Faulting module name: torch_cpu.dll, version: 0.0.0.0, time stamp: 0x60c3de87

Exception code: 0xc0000005
Fault offset: 0x0000000005d00009
Faulting process id: 0x484
Faulting application start time: 0x01d7900d46609a5e
Faulting application path: C:\BOINCData\slots\4\mlds.exe
Faulting module path: C:\BOINCData\slots\4\torch_cpu.dll
Report Id: 87590a84-fc00-11eb-8363-f80f41b4b3e3
Faulting package full name:
Faulting package-relative application ID:


I just trashed nearly 40 WUs. With same error on Windows 10.
4) Questions and Answers : Issue Discussion : Memory requirement for CPU WUs (Message 1337)
Posted 25 Aug 2021 by Hal Bregg
Post:
Ugh. No the memory requirements for CPU shouldn't have gone up, looks like a set the gpu limits by accident. Will fix.within the hour but expect the existing wus to take a few days to flush out of the system.

My mistake, my apologies.


Thanks for the update.
5) Questions and Answers : Issue Discussion : Memory requirement for CPU WUs (Message 1331)
Posted 25 Aug 2021 by Hal Bregg
Post:
I rushed with above post and didn't read official announcement about new version for CPU. Still there was nothing about memory requirement so I guess WUs will need more RAM now.
6) Questions and Answers : Issue Discussion : Memory requirement for CPU WUs (Message 1330)
Posted 25 Aug 2021 by Hal Bregg
Post:
Hi,

Did memory requirement changed with latest batch of WUs?

<message priority="low">No tasks sent</message>
<message priority="notice">Machine Learning Dataset Generator needs 2670.29 MB RAM but only 1760.70 MB is available for use.</message>
<message priority="notice">Machine Learning Dataset Generator (test) needs 2670.29 MB RAM but only 1760.70 MB is available for use.</message>




©2021 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)