什么原因导致张量流"Check failed: ret == 0 (11 vs. 0)Thread creation via pthread_create() failed."

我在python中制作了一个discordbot，现在添加了"聊天机器人"；使用Tensorflow和NLTK。当我在本地运行机器人程序时，它运行得非常好，没有任何问题，但当我将它移到我的namecheap托管包中托管我的投资组合时，它开始出现错误，说：

OpenBLAS blas_thread_init: pthread_create failed for thread 29 of 64: Resource temporarily unavailable

nltk和tensorflow无法导入，机器人程序崩溃。

我在谷歌上搜索了一下，找到了一个解决方案，告诉在使用任何导入之前使用os.environ['OPENBLAS_NUM_THREADS'] = '1'。这解决了以前的错误，但现在它给出了另一个错误：

Check failed: ret == 0 (11 vs. 0)Thread creation via pthread_create() failed.

现在运行python main.py的完整输出是：

2021-06-10 11:18:19.606471: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2021-06-10 11:18:19.606497: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
2021-06-10 11:18:21.090650: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcuda.so.1'; dlerror: libcuda.so.1: cannot open shared object file: No such file or directory
2021-06-10 11:18:21.090684: W tensorflow/stream_executor/cuda/cuda_driver.cc:326] failed call to cuInit: UNKNOWN ERROR (303)
2021-06-10 11:18:21.090716: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:156] kernel driver does not appear to be running on this host (server270.web-hosting.com): /proc/driver/nvidia/version does not exist
2021-06-10 11:18:21.091042: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-06-10 11:18:21.092409: F tensorflow/core/platform/default/env.cc:73] Check failed: ret == 0 (11 vs. 0)Thread creation via pthread_create() failed.

为了不让这个问题太长，源文件已经托管在GitHub上：https://github.com/Nalin-2005/The2020CoderBot并且README.md告诉哪些文件包含机器人程序的哪个部分

该机器人程序托管在Namecheap共享主机上，有关服务器的详细信息和技术规格如下：

内存：1GB
存储：20GB SSD
CPU(使用cat /proc/cpuinfo | grep 'model name' | uniq(：Intel(R(Xeon(R(Gold 6140 CPU@2.30GHz

据我所知，这两个问题都是由RAM或CPU使用量有限引起的。但现在，Python脚本本身阻止了使用
那么，是什么原因导致了这种情况(如果我不正确(，我该如何解决？

经过一段时间的头脑风暴和谷歌搜索，我发现了Tensorflow Lite，它消耗的资源更少，但在我的服务器上提供了相同的性能*，我可以很容易地将它与以前的代码集成，以生成一个更具资源效率的模型。对于那些想知道如何将任何keras模型转换为Tensorflow lite的用户，以下是说明。

训练时，将model.save("/path/to/model.h5")替换为：

converter = tf.lite.TFLiteConverter.from_keras_model(model)
tflite_model = converter.convert()
with open("/path/to/model.tflite", "wb") as f:
f.write(tflite_model)

使用时，请使用：

model = tf.lite.Interpreter("/path/to/model.tflite")
model.allocate_tensors()
input_details = model.get_input_details()
output_details = model.get_output_details()
# prepare input data
model.set_tensor(input_details[0]['index'],input_data)
model.invoke()
output_data = model.get_tensor(output_details[0]['index'])
results = np.squeeze(output_data)

相关内容

最新更新

热门标签：