Closed
Description
There was an ephemeral failure of TestMultipleVMs here (run as part of #356): https://buildkite.com/firecracker-microvm/firecracker-containerd/builds/1315#1dd8d6e8-953c-417b-97f7-e029aea0a82f
I haven't looked in depth yet, but a few lines in the logs for vmID 78 (which got one of the "os: process already finished" errors) look odd:
time="2019-12-06T00:08:44.729560758Z" level=warning msg="firecracker exited: exit status 148" runtime=aws.firecracker vmID=78
...
time="2019-12-06T00:08:44.729847979Z" level=error msg="error returned from VM wait" error="1 error occurred:\n\t* exit status 148\n\n" runtime=aws.firecracker vmID=78
...
time="2019-12-06T00:08:44.729855148Z" level=error msg="the VM returns unknown error" error="1 error occurred:\n\t* exit status 148\n\n" runtime=aws.firecracker vmID=78
...
time="2019-12-06T00:08:44.730000243Z" level=debug msg="stopVMM(): sending sigterm to firecracker" runtime=aws.firecracker vmID=78
...
time="2019-12-06T00:08:44.730102402Z" level=error error="rpc error: code = Unknown desc = os: process already finished"
I'm not sure off the top of my head what caused the exit code to be 148. We also shouldn't need to send SIGTERM if the VM has already exited (but that may be unrelated to the failure).