Skip to content

lstmtraining mutex lock failed #3655

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
stweil opened this issue Nov 21, 2021 · 2 comments
Open

lstmtraining mutex lock failed #3655

stweil opened this issue Nov 21, 2021 · 2 comments

Comments

@stweil
Copy link
Member

stweil commented Nov 21, 2021

Sometimes the training process lstmtraining raises an exception when the training is finished:

Finished! Selected model with minimal training error rate (BCER) = 2.849
libc++abi: terminating with uncaught exception of type std::__1::system_error: mutex lock failed: Invalid argument

Maybe that happens when the training finishes while the validation thread is still running.

@stweil stweil added the bug label Nov 21, 2021
@stweil stweil added this to the 5.0.0 milestone Nov 21, 2021
@stweil
Copy link
Member Author

stweil commented Nov 23, 2021

In the same context another error can also occur (I got it yesterday while examining the above issue):

Finished! Selected model with minimal training error rate (BCER) = 12.951
num_docs > 0:Error:Assert failed:in file ../../../src/ccstruct/imagedata.cpp, line 697

This is an error which was already reported in issue #757.

@amitdo
Copy link
Collaborator

amitdo commented Nov 24, 2021

A general note: maybe we can adopt some of the suggestions from the C++ Core Guidelines:
https://github.com/isocpp/CppCoreGuidelines/blob/master/CppCoreGuidelines.md#cp-concurrency-and-parallelism

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants