Python:如何在PyTables中存储numpy多维数组

如何使用PyTables将numpy多维数组放入HDF5文件中？

据我所知，我不能将数组字段放在pytables表中。

我还需要存储一些关于这个数组的信息，并能够对它进行数学计算

有什么建议吗？

可能有一种更简单的方法，但据我所知，这就是你要做的：

import numpy as np
import tables
# Generate some data
x = np.random.random((100,100,100))
# Store "x" in a chunked array...
f = tables.open_file('test.hdf', 'w')
atom = tables.Atom.from_dtype(x.dtype)
ds = f.createCArray(f.root, 'somename', atom, x.shape)
ds[:] = x
f.close()

如果要指定要使用的压缩，请查看tables.Filters。例如

import numpy as np
import tables
# Generate some data
x = np.random.random((100,100,100))
# Store "x" in a chunked array with level 5 BLOSC compression...
f = tables.open_file('test.hdf', 'w')
atom = tables.Atom.from_dtype(x.dtype)
filters = tables.Filters(complib='blosc', complevel=5)
ds = f.createCArray(f.root, 'somename', atom, x.shape, filters=filters)
ds[:] = x
f.close()

可能有一种更简单的方法来解决很多问题。。。我已经很久没有将pytables用于类似表的数据之外的任何其他内容了。

注意：使用pytables 3.0，f.createCArray被重命名为f.create_carray。它也可以直接接受数组，而无需指定atom、

f.create_carray('/', 'somename', obj=x, filters=filters)

相关内容

最新更新

热门标签：