Task 304525

Name ParityMachine-1594150394-23106-6_0
Workunit 142784
Created 12 Jul 2020, 21:31:55 UTC
Sent 12 Jul 2020, 21:47:46 UTC
Report deadline 14 Jul 2020, 21:47:46 UTC
Received 1 Aug 2020, 12:04:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 716
Run time 14 min 19 sec
CPU time 12 min 57 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 2.17 GFLOPS
Application version Machine Learning Dataset Generator v9.20 x86_64-pc-linux-gnu
Peak working set size 693.39 MB
Peak swap size 1.02 GB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.3</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
malloc(): largebin double linked list corrupted (bk)
SIGABRT: abort called
Stack trace (29 frames):
../../projects/www.mlcathome.org_mlcathome/mlds_0.920_x86_64-pc-linux-gnu(boinc_catch_signal+0x70)[0x76e450]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x15540)[0x7f9b67b01540]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0xcb)[0x7f9b587383eb]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x12b)[0x7f9b58717899]
/lib/x86_64-linux-gnu/libc.so.6(+0x9038e)[0x7f9b5878238e]
/lib/x86_64-linux-gnu/libc.so.6(+0x984dc)[0x7f9b5878a4dc]
/lib/x86_64-linux-gnu/libc.so.6(+0x9b78d)[0x7f9b5878d78d]
/lib/x86_64-linux-gnu/libc.so.6(+0x9bcaf)[0x7f9b5878dcaf]
/lib/x86_64-linux-gnu/libc.so.6(posix_memalign+0x18c)[0x7f9b58791b6c]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libc10.so(_ZN3c109alloc_cpuEm+0x3e)[0x7f9b67b3904e]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libc10.so(+0x17f4a)[0x7f9b67b3af4a]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(THStorage_resize+0x3b)[0x7f9b59e4023b]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN2at6native7resize_ERNS_6TensorEN3c108ArrayRefIlEENS3_8optionalINS3_12MemoryFormatEEE+0x4ab)[0x7f9b59a31dfb]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0xdbb693)[0x7f9b59a32693]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN2at14TensorIterator13compute_shapeEv+0x4dd)[0x7f9b59aaeb8d]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN2at14TensorIterator5buildEv+0x3c)[0x7f9b59ab26fc]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN2at14TensorIterator8unary_opERNS_6TensorERKS1_b+0x12d)[0x7f9b59ab307d]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0xe82032)[0x7f9b59af9032]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0x1138b90)[0x7f9b59dafb90]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN2at6native4sqrtERKNS_6TensorE+0x86)[0x7f9b59b00776]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0x1138a80)[0x7f9b59dafa80]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0x10c364d)[0x7f9b59d3a64d]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0x2c96976)[0x7f9b5b90d976]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(+0x10c364d)[0x7f9b59d3a64d]
/tmp/.mount_mlds_0PMrkJY/usr/bin/../lib/libtorch_cpu.so(_ZN5torch5optim4Adam4stepESt8functionIFN2at6TensorEvEE+0x1810)[0x7f9b5bfd6a30]
../../projects/www.mlcathome.org_mlcathome/mlds_0.920_x86_64-pc-linux-gnu(_ZN7TrainerI15BitMachineModelN5torch5optim4AdamENS1_2nn7MSELossE17BitMachineDatasetE5trainEv+0x341)[0x52bafb]
../../projects/www.mlcathome.org_mlcathome/mlds_0.920_x86_64-pc-linux-gnu(main+0x2766)[0x51ee3d]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0x7f9b587191e3]
../../projects/www.mlcathome.org_mlcathome/mlds_0.920_x86_64-pc-linux-gnu[0x51c342]

Exiting...

</stderr_txt>
]]>
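
Note on the exit code: the report shows the same status three ways: 193, 0x000000C1 in the summary, and (0xc1, -63) in the stderr message. The sketch below shows how those numbers relate arithmetically. It assumes that -63 is the low byte of the code read as a signed 8-bit value. The helper name signed8 is hypothetical and is not part of the BOINC client.

```python
def signed8(code: int) -> int:
    """Reinterpret the low byte of an exit code as a signed 8-bit integer.

    Hypothetical helper for illustration only; not a BOINC API.
    """
    low = code & 0xFF                      # keep only the lowest 8 bits
    return low - 256 if low >= 128 else low


exit_code = 193  # value copied from the "Exit status" line of this report

# Decimal, hexadecimal, and signed 8-bit views of the same value
print(exit_code, hex(exit_code), signed8(exit_code))
```

All three values therefore describe one status, which the report labels EXIT_SIGNAL. This matches the SIGABRT raised after the malloc() corruption message in the trace above.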


©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)