Task 1226361

Name ParityMachine-1593664027-12490-18_1
Workunit 545073
Created 26 Aug 2020, 10:01:52 UTC
Sent 27 Aug 2020, 12:49:25 UTC
Report deadline 31 Aug 2020, 12:49:25 UTC
Received 27 Aug 2020, 12:59:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 716
Run time 4 min 54 sec
CPU time 4 min 27 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 2.88 GFLOPS
Application version Machine Learning Dataset Generator v9.55
x86_64-pc-linux-gnu
Peak working set size 668.23 MB
Peak swap size 901.00 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.3</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
terminate called after throwing an instance of 'torch::utils::FutureError'
  what():  s.isIntegral(false) INTERNAL ASSERT FAILED at "/home/gitlab-runner/builds/yt3u-Xpm/0/clemej/mlds/extern/pytorch/aten/src/ATen/ScalarOps.h":22, please report a bug to PyTorch. 
Exception raised from scalar_to_tensor at /home/gitlab-runner/builds/yt3u-Xpm/0/clemej/mlds/extern/pytorch/aten/src/ATen/ScalarOps.h:22 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x68 (0x7fa9b968b1a8 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libc10.so)
frame #1: <unknown function> + 0x11e2b52 (0x7fa9b397eb52 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #2: <unknown function> + 0x121c240 (0x7fa9b39b8240 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #3: at::native::mul(at::Tensor const&, c10::Scalar) + 0x3a (0x7fa9b39b851a in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #4: <unknown function> + 0x1771814 (0x7fa9b3f0d814 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #5: <unknown function> + 0x11117b5 (0x7fa9b38ad7b5 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #6: at::mul(at::Tensor const&, c10::Scalar) + 0x160 (0x7fa9b3e3f840 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #7: <unknown function> + 0x3183496 (0x7fa9b591f496 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #8: <unknown function> + 0x11117b5 (0x7fa9b38ad7b5 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #9: at::Tensor::mul(c10::Scalar) const + 0x160 (0x7fa9b3fc97f0 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #10: torch::autograd::generated::SubBackward0::apply(std::vector<at::Tensor, std::allocator<at::Tensor> >&&) + 0x1e3 (0x7fa9b5716453 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #11: <unknown function> + 0x353097b (0x7fa9b5ccc97b in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #12: torch::autograd::Engine::evaluate_function(std::shared_ptr<torch::autograd::GraphTask>&, torch::autograd::Node*, torch::autograd::InputBuffer&, std::shared_ptr<torch::autograd::ReadyQueue> const&) + 0x1852 (0x7fa9b5cc7832 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #13: torch::autograd::Engine::thread_main(std::shared_ptr<torch::autograd::GraphTask> const&) + 0x6ba (0x7fa9b5cc828a in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #14: torch::autograd::Engine::execute_with_graph_task(std::shared_ptr<torch::autograd::GraphTask> const&, std::shared_ptr<torch::autograd::Node>) + 0x417 (0x7fa9b5cc3827 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #15: torch::autograd::Engine::execute(std::vector<torch::autograd::Edge, std::allocator<torch::autograd::Edge> > const&, std::vector<at::Tensor, std::allocator<at::Tensor> > const&, bool, bool, std::vector<torch::autograd::Edge, std::allocator<torch::autograd::Edge> > const&) + 0x7e3 (0x7fa9b5cc5bd3 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #16: <unknown function> + 0x351bf75 (0x7fa9b5cb7f75 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #17: torch::autograd::backward(std::vector<at::Tensor, std::allocator<at::Tensor> > const&, std::vector<at::Tensor, std::allocator<at::Tensor> > const&, c10::optional<bool>, bool) + 0x85 (0x7fa9b5cb8db5 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #18: <unknown function> + 0x391b818 (0x7fa9b60b7818 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #19: <unknown function> + 0x325ab01 (0x7fa9b59f6b01 in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #20: at::Tensor::backward(at::Tensor const&, c10::optional<bool>, bool) const + 0x15a (0x7fa9b3fcf56a in /tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so)
frame #21: Trainer<BitMachineModel, torch::optim::Adam, torch::nn::MSELoss, BitMachineDataset>::train() + 0x32d (0x59b61b in ../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu)
frame #22: main + 0x29ff (0x58f52c in ../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu)
frame #23: __libc_start_main + 0xf3 (0x7fa9b1de51e3 in /lib/x86_64-linux-gnu/libc.so.6)
frame #24: _start + 0x29 (0x58c379 in ../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu)

SIGABRT: abort called
Stack trace (18 frames):
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu(boinc_catch_signal+0xd3)[0x7b9823]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x15540)[0x7fa9b2774540]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0xcb)[0x7fa9b1e043eb]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x12b)[0x7fa9b1de3899]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu[0x586bcc]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu(_ZN10__cxxabiv111__terminateEPFvvE+0x6)[0x7d25c6]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu[0x7d2631]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu[0x7d2785]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(+0xe6cea1)[0x7fa9b3608ea1]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(+0x351bf75)[0x7fa9b5cb7f75]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(_ZN5torch8autograd8backwardERKSt6vectorIN2at6TensorESaIS3_EES7_N3c108optionalIbEEb+0x85)[0x7fa9b5cb8db5]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(+0x391b818)[0x7fa9b60b7818]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(+0x325ab01)[0x7fa9b59f6b01]
/tmp/.mount_mlds_9P7Ix3K/usr/bin/../lib/libtorch_cpu.so(_ZNK2at6Tensor8backwardERKS0_N3c108optionalIbEEb+0x15a)[0x7fa9b3fcf56a]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu(_ZN7TrainerI15BitMachineModelN5torch5optim4AdamENS1_2nn7MSELossE17BitMachineDatasetE5trainEv+0x32d)[0x59b61b]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu(main+0x29ff)[0x58f52c]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0x7fa9b1de51e3]
../../projects/www.mlcathome.org_mlcathome/mlds_9.55_x86_64-pc-linux-gnu(_start+0x29)[0x58c379]

Exiting...

</stderr_txt>
]]>


©2022 MLC@Home Team
A project of the Cognition, Robotics, and Learning (CORAL) Lab at the University of Maryland, Baltimore County (UMBC)