小贝子编程

使用camelot的PDF文件名和坐标并获取表详细信息

本文关键字：获取详细信息坐标 camelot PDF 文件名使用 python pdf python-camelot
更新时间 : 2023-09-23
英文 : Using the PDF file names and coordinates for camelot and getting the table details

我有pdf图像在excel的一列随着它的camelot坐标每一个在各自的行，一个简短的看下面:-

file_name X1_Camelot Y1_Camelot x2_Camelot Y2_Camelot路径/to/pdf_file/folder/1.pdf 16 77 80 540路径/to/pdf_file/folder/2.pdf 20 300 40 260 .pdf路径/to/pdf_file/folder/3.pdf 40 90 200 340path/to/pdf_file/folder/4.pdf 20 50 100 440

我想写一个python脚本，它去每个pdf文件取每个camelot版本的坐标，然后将值放入下面的函数:-

tables = camelot.read_pdf('table_regions.pdf'， table_regions=['170,370,560,270'])表[0].df

我想从输入的csv文件有这些列的每个PDF的结果。

我尝试使用for循环和df.iterrows()，但它没有工作。

with open('data.csv', 'r') as file:
data = file.read().split('n')[1:]
for line in data:
fields = line.split(',')
path = fields[0]
x1,y1,x2,y2 = int(fields[1]),int(fields[2]),int(fields[3]),int(fields[4])

使用camelot的PDF文件名和坐标并获取表详细信息

相关内容

最新更新

热门标签：