Pdf表格转换为excel



Hello

我有这个pdf文件,它基本上是一个表单,我想提取所有的字段名称作为列名和相关信息,然后将其保存到excel文件中。

请帮我解决这个问题。

提前谢谢。

pdf 快照

您可以使用pdfplumber包。有一个裁剪功能,可以指定一个边界框,从中可以定义要提取的区域。还有一整套其他功能,可用于提取文本和表单字段等。例如:https://github.com/jsvine/pdfplumber#extracting-形式值

这与正则表达式的选择和匹配相结合通常很有用。不过,我建议你试一试。事实上,这并不像你想象的那么困难。

从PDF中提取文本的示例:

with pdfplumber.open(pdf_file) as pdf:
first_page = pdf.pages[0]
rows = first_page.extract_text().split('n')

然后,您可以使用pandas包将数据放入数据帧中,一旦数据采用这种格式,就可以将其发送到excel。


编辑:

根据新信息,您似乎正在处理一个基于XFA的PDF。从我最初的尝试中,我无法使用如上所述的pdfplumber。我的建议是使用PyPDF2。

从PDF文档中提取XML,然后使用它来获取所需的信息。我想说,REGEX在这里仍然是一个合适的方法。

从基于XFA的PDF中提取XML的代码:

import PyPDF2 as pypdf
def findInDict(needle, haystack):
for key in haystack.keys():
try:
value=haystack[key]
except:
continue
if key==needle:
return value
if isinstance(value,dict):            
x=findInDict(needle,value)            
if x is not None:
return x
pdfobject=open('Form CHG-1-010216.pdf','rb')
pdf=pypdf.PdfFileReader(pdfobject)
xfa=findInDict('/XFA',pdf.resolved_objects)
xml=xfa[7].getObject().getData()
print(xml)

(代码来源(

这是从PDF的数据集部分提取的XML:

b'n<xfa:datasets xmlns:xfa="http://www.xfa.org/schema/xfa-data/1.0/"n><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="Form8_Dtls"n><frm:Form8_Dtls xmlns:frm="http://www.mit.gov.in/eGov/BackOffice/schema/Form"n><frm:Form8n><frm:CIN dd:minOccur="0" dd:nullType="exclude"n/><frm:GLN dd:minOccur="0" dd:nullType="exclude"n/><frm:EmailID dd:minOccur="0" dd:nullType="exclude"n/><frm:ChargeType dd:minOccur="0" dd:nullType="exclude"n/><frm:Applicant dd:minOccur="0" dd:nullType="exclude"n/><frm:SrnForForm dd:minOccur="0" dd:nullType="exclude"n/><frm:ChargeID 
dd:minOccur="0" dd:nullType="exclude"n/><frm:SrnForForm2.28 dd:minOccur="0" dd:nullType="exclude"n/><frm:Beyond30Within300 dd:minOccur="0" dd:nullType="exclude"n/><frm:Beyond300 dd:minOccur="0" dd:nullType="exclude"n/><frm:ReasonDelay dd:minOccur="0" 
dd:nullType="exclude"n/><frm:WhthrChrgARCorAssignd dd:minOccur="0" dd:nullType="exclude"n/><frm:WhthrChrghldrAuth dd:minOccur="0" dd:nullType="exclude"n/><frm:UnCldShrCptl dd:minOccur="0" dd:nullType="exclude"n/><frm:Improperty dd:minOccur="0" dd:nullType="exclude"n/><frm:AnyIntrstImproperty dd:minOccur="0" dd:nullType="exclude"n/><frm:BookDebts dd:minOccur="0" dd:nullType="exclude"n/><frm:MovProperty dd:minOccur="0" dd:nullType="exclude"n/><frm:FloatngChrg dd:minOccur="0" dd:nullType="exclude"n/><frm:CallsMadeNotPaid dd:minOccur="0" dd:nullType="exclude"n/><frm:Ship dd:minOccur="0" dd:nullType="exclude"n/><frm:Goodwill dd:minOccur="0" dd:nullType="exclude"n/><frm:PatentLicence dd:minOccur="0" dd:nullType="exclude"n/><frm:tradeMark dd:minOccur="0" dd:nullType="exclude"n/><frm:Copyright dd:minOccur="0" dd:nullType="exclude"n/><frm:Others dd:minOccur="0" dd:nullType="exclude"n/><frm:OthersSpec dd:minOccur="0" dd:nullType="exclude"n/><frm:ConsrtmInvld dd:minOccur="0" dd:nullType="exclude"n/><frm:JointChrgInvld dd:minOccur="0" dd:nullType="exclude"n/><frm:NoOfChargeHolders 
dd:minOccur="0" dd:nullType="exclude"n/><frm:CategoryBank dd:minOccur="0" dd:nullType="exclude"n/><frm:IfCategoryOthers dd:minOccur="0" dd:nullType="exclude"n/><frm:ChargeHolderDetails dd:minOccur="0"n><cdt:Cin xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes" dd:minOccur="0" dd:nullType="exclude"n/><cdt:ChrgHldrName xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes" dd:minOccur="0" dd:nullType="exclude"n/><cdt:OptionalName xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes" dd:minOccur="0" dd:nullType="exclude"n/><cdt:ChargeHldrAddress xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n><cdt:AddressLnn><cdt:FirstLinen/><cdt:SecondLine dd:minOccur="0" dd:nullType="exclude"n/></cdt:AddressLnn><cdt:Cityn/><cdt:Staten/><cdt:Country dd:minOccur="0" dd:nullType="exclude"n/><cdt:CountryName dd:minOccur="0" dd:nullType="exclude"n/><cdt:Pincode dd:minOccur="0" dd:nullType="exclude"n/><cdt:Telephone dd:minOccur="0" dd:nullType="exclude"n/><cdt:Fax dd:minOccur="0" dd:nullType="exclude"n/><cdt:Email dd:minOccur="0" dd:nullType="exclude"n/><cdt:Mbl dd:minOccur="0" dd:nullType="exclude"n/></cdt:ChargeHldrAddressn></frm:ChargeHolderDetailsn><frm:InstrumentDesc dd:minOccur="0" dd:nullType="exclude"n/><frm:InstrumentCrtModDate dd:minOccur="0" dd:nullType="exclude"n/><frm:WhthrChrgCrMod dd:minOccur="0" dd:nullType="exclude"n/><frm:FrgnChargeRcptDate dd:minOccur="0" dd:nullType="exclude"n/><frm:AmtSecured 
dd:minOccur="0" dd:nullType="exclude"n/><frm:AmtSecChrgInWords dd:minOccur="0" dd:nullType="exclude"n/><frm:AmtSecChrgFrgnCurrncyDetails dd:minOccur="0" dd:nullType="exclude"n/><frm:TermsAndConditions dd:minOccur="0"n><frm:RateOfInt dd:minOccur="0" dd:nullType="exclude"n/><frm:TermsOfPaymnt dd:minOccur="0" dd:nullType="exclude"n/><frm:Margin dd:minOccur="0" dd:nullType="exclude"n/><frm:ExtntOperatnChrg dd:minOccur="0" dd:nullType="exclude"n/><frm:Others dd:minOccur="0" dd:nullType="exclude"n/></frm:TermsAndConditionsn><frm:ExstngChrgAcqDtls dd:minOccur="0"n><frm:InstrDate dd:minOccur="0" dd:nullType="exclude"n/><frm:InstrDescr dd:minOccur="0" dd:nullType="exclude"n/><frm:DateofAcq dd:minOccur="0" dd:nullType="exclude"n/><frm:ChrgAmnt dd:minOccur="0" dd:nullType="exclude"n/><frm:PropChrgPartclrs dd:minOccur="0" dd:nullType="exclude"n/></frm:ExstngChrgAcqDtlsn><frm:PropParticlars dd:minOccur="0" dd:nullType="exclude"n/><frm:NewPropParticlars dd:maxOccur="10" dd:minOccur="0" dd:nullType="exclude"n/><frm:PropOwnCmp dd:minOccur="0" dd:nullType="exclude"n/><frm:PropRegisteredName dd:minOccur="0" dd:nullType="exclude"n/><frm:DateOfLatestMod dd:minOccur="0" dd:nullType="exclude"n/><frm:PartclrsPresntMod dd:minOccur="0" dd:nullType="exclude"n/><frm:BoardResNo dd:minOccur="0" dd:nullType="exclude"n/><frm:AuthSigReslnDt dd:minOccur="0" dd:nullType="exclude"n/><frm:DesignationOne dd:minOccur="0" dd:nullType="exclude"n/><frm:DIN dd:minOccur="0" dd:nullType="exclude"n/><frm:DesignationTwo dd:minOccur="0" dd:nullType="exclude"n/><frm:DesignationThree dd:minOccur="0" dd:nullType="exclude"n/><frm:CharteredOrCostOrCompSec dd:minOccur="0" dd:nullType="exclude"n/><frm:AssociateorFellow dd:minOccur="0" dd:nullType="exclude"n/><frm:MembershipnumberorCertificate dd:minOccur="0" dd:nullType="exclude"n/><frm:CertificateNo dd:minOccur="0" dd:nullType="exclude"n/><frm:strBankName dd:minOccur="0" dd:nullType="exclude"n/><frm:CondonationFlag dd:minOccur="0" dd:nullType="exclude"n/><frm:HTF dd:minOccur="0" dd:nullType="exclude"n/><frm:IPresNumber dd:minOccur="0" dd:nullType="exclude"n/><frm:FormId dd:minOccur="0" dd:nullType="exclude"n/><frm:VersionNo dd:minOccur="0" dd:nullType="exclude"n/><frm:Form_Language dd:minOccur="0" dd:nullType="exclude"n/><frm:BoPreFilldataForm dd:minOccur="0"n><cdt:DateOfFiling xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/><cdt:DateOfSigning xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/><cdt:eFormSRN xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/><cdt:MngmtDispute xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes" dd:minOccur="0" dd:nullType="exclude"n/></frm:BoPreFilldataFormn><frm:MngmtDispute dd:minOccur="0" dd:nullType="exclude"n/><frm:LSI dd:minOccur="0" dd:nullType="exclude"n/><frm:HostVersion dd:minOccur="0" dd:nullType="exclude"n/><frm:HostAppName dd:minOccur="0" dd:nullType="exclude"n/><frm:TotalPageNo dd:minOccur="0" dd:nullType="exclude"n/><frm:EfmUniqueID dd:minOccur="0" dd:nullType="exclude"n/><frm:AttachmentNames dd:minOccur="0" dd:nullType="exclude"n/></frm:Form8n></frm:Form8_Dtlsn></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="CINDataConngetCINLLPINDetailsRequestDD"n><CINDataConnn><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getCINLLPINDetails xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:strCINLLPIN dd:nullType="xsi"n/></impl:getCINLLPINDetailsn></soap:Bodyn></CINDataConnn></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="CINSplitDataConn1getChrgHolderAddressWitCondRequestDD"n><CINSplitDataConn1n><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getChrgHolderAddressWitCond xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:strCompanyID dd:nullType="xsi"n/></impl:getChrgHolderAddressWitCondn></soap:Bodyn></CINSplitDataConn1n></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="ForeignCmpnyDataConngetForeignCompanyDetailsNewRequestDD"n><ForeignCmpnyDataConnn><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getForeignCompanyDetailsNew xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:strCompanyID dd:nullType="xsi"n/></impl:getForeignCompanyDetailsNewn></soap:Bodyn></ForeignCmpnyDataConnn></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="PrescrutinyServiceserviceForm8RequestDD"n><PrescrutinyServicen><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:serviceForm8 xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:objForm8PrescDDto dd:nullType="xsi"n><tns2:DBoardResDate xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:DDateOfLatestMod xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:DTempAcqstnDate xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:extntOperatnChrg xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:margin xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:others xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:partclrsPresntMod xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:propParticlars xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:rateOfInt xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:termsOfPaymnt xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:whthrChrgCrMod xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:DInstrCrtEvdDate xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:DInstrCrtModDate xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:DTempFrgnCharge xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:dupFlag xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:formID xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:formVersion xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:presNumber xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strChargeType xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strChrgCIN xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strChrgHldrName xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strCIN xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strCountryCode xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strDesignation xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strDINMembrshpNo xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strPinCode xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/><tns2:strStateCode xmlns:tns2="http://dto.eforms.business.mydca.dca21.com" dd:nullType="xsi"n/></impl:objForm8PrescDDton></impl:serviceForm8n></soap:Bodyn></PrescrutinyServicen></dd:dataDescriptionn><xfa:datan><frm:Form8_Dtls xmlns:frm="http://www.mit.gov.in/eGov/BackOffice/schema/Form"n><frm:Form8n><frm:CINn>U37100DL2004PTC128960</frm:CINn><frm:EmailIDn>vwcpl@yahoo.com</frm:EmailIDn><frm:ChargeTypen>CRTN</frm:ChargeTypen><frm:Applicantn>Company</frm:Applicantn><frm:Beyond30Within300n>Yes</frm:Beyond30Within300n><frm:ReasonDelayn>Due to some DSC problems</frm:ReasonDelayn><frm:UnCldShrCptln>2ONE</frm:UnCldShrCptln><frm:Impropertyn>2MMP</frm:Impropertyn><frm:AnyIntrstImpropertyn>2ONE</frm:AnyIntrstImpropertyn><frm:BookDebtsn>2ONE</frm:BookDebtsn><frm:MovPropertyn>2OVP</frm:MovPropertyn><frm:FloatngChrgn>2ONE</frm:FloatngChrgn><frm:CallsMadeNotPaidn>2ONE</frm:CallsMadeNotPaidn><frm:Shipn>2ONE</frm:Shipn><frm:Goodwilln>2ONE</frm:Goodwilln><frm:PatentLicencen>2ONE</frm:PatentLicencen><frm:tradeMarkn>2ONE</frm:tradeMarkn><frm:Copyrightn>2ONE</frm:Copyrightn><frm:Othersn>2THS</frm:Othersn><frm:OthersSpecn>movable / immovable fixed assets</frm:OthersSpecn><frm:ConsrtmInvldn>NO</frm:ConsrtmInvldn><frm:JointChrgInvldn>NO</frm:JointChrgInvldn><frm:NoOfChargeHoldersn>1</frm:NoOfChargeHoldersn><frm:CategoryBankn>NATB</frm:CategoryBankn><frm:ChargeHolderDetailsn><cdt:ChrgHldrName xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n>Others</cdt:ChrgHldrNamen><cdt:OptionalName xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n>State Bank of Patiala</cdt:OptionalNamen><cdt:ChargeHldrAddress xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n><cdt:AddressLnn><cdt:FirstLinen>MCG Pitampura</cdt:FirstLinen><cdt:SecondLinen>A-102, D-Mall, Netaji Subash Place</cdt:SecondLinen></cdt:AddressLnn><cdt:Cityn>Delhi</cdt:Cityn><cdt:Staten>DL</cdt:Staten><cdt:Countryn>IN</cdt:Countryn><cdt:CountryNamen>INDIA</cdt:CountryNamen><cdt:Pincoden>110034</cdt:Pincoden><cdt:Emailn>nimishdel@gmail.com</cdt:Emailn></cdt:ChargeHldrAddressn></frm:ChargeHolderDetailsn><frm:InstrumentDescn>Agreement of Loan Cum Hypothecation&#xD;Letter of Arrangement&#xD;Agreement of Mortgage</frm:InstrumentDescn><frm:InstrumentCrtModDaten>2015-12-30</frm:InstrumentCrtModDaten><frm:WhthrChrgCrModn>NO</frm:WhthrChrgCrModn><frm:AmtSecuredn>6000000.00</frm:AmtSecuredn><frm:AmtSecChrgInWordsn>Rupees Sixty Lacs  only</frm:AmtSecChrgInWordsn><frm:TermsAndConditionsn><frm:RateOfIntn>Term Loan of Rs.60.00 Lakh - @ 3.10 % above base rate, present 
effective rate 12.75 % p. a. with monthly rests.</frm:RateOfIntn><frm:TermsOfPaymntn>The Term Loan of Rs.60.00 Lakh shall be repayable in 26 quarterly installments of Rs.2,25,000/- each, commencing from June, 2016 (last installment of Rs.4,00,000/- )</frm:TermsOfPaymntn><frm:Marginn>41.19 %</frm:Marginn><frm:ExtntOperatnChrgn>100 percent.</frm:ExtntOperatnChrgn><frm:Othersn>THE ABOVE IS TO SECURE THE FOLLOWING CREDIT FACILITIES GRANTED TO THE COMPANY :-&#xD;1. Term Loan             - Rs.60.00 Lakh</frm:Othersn></frm:TermsAndConditionsn><frm:Form_Languagen>ENGL</frm:Form_Languagen><frm:BoPreFilldataFormn><cdt:DateOfFiling xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/><cdt:DateOfSigning xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/><cdt:eFormSRN xmlns:cdt="http://www.mit.gov.in/eGov/BackOffice/schema/ComplexDataTypes"n/></frm:BoPreFilldataFormn><frm:HostVersionn>22.00220191</frm:HostVersionn><frm:HostAppNamen>Reader</frm:HostAppNamen><frm:TotalPageNon>6</frm:TotalPageNon><frm:EfmUniqueIDn>Form89JOK7KCC3A8WGXDDT7VVA3ZKNRG</frm:EfmUniqueIDn><frm:LSIn>1</frm:LSIn><frm:ExstngChrgAcqDtlsn/><frm:NewPropParticlarsn>Hypothecation of all movable / immovable fixed assets of the company created out of the Term Loan.</frm:NewPropParticlarsn><frm:NewPropParticlarsn>Equitable mortgage of Shop No. CSC - 39, DDA Market, A Block, Saraswati Vihar, Delhi -110 034.</frm:NewPropParticlarsn><frm:PropOwnCmpn>NO</frm:PropOwnCmpn><frm:BoardResNon>03</frm:BoardResNon><frm:AuthSigReslnDtn>2015-12-14</frm:AuthSigReslnDtn><frm:DesignationOnen>DIRT</frm:DesignationOnen><frm:DINn>05153044</frm:DINn><frm:DesignationTwon>AACCS0143D</frm:DesignationTwon><frm:CharteredOrCostOrCompSecn>CA</frm:CharteredOrCostOrCompSecn><frm:AssociateorFellown>FW</frm:AssociateorFellown><frm:MembershipnumberorCertificaten>508508</frm:MembershipnumberorCertificaten><frm:CertificateNon>508508</frm:CertificateNon><frm:AttachmentNamesn>2014.pdf,2900.pdf</frm:AttachmentNamesn><frm:HTFn>NO</frm:HTFn><frm:IPresNumbern>0</frm:IPresNumbern><frm:FormIdn>Form8</frm:FormIdn><frm:VersionNon>30</frm:VersionNon></frm:Form8n><CompanyName_Cn>VEERA WASIR CONSULTANTS PRIVATE LIMITED</CompanyName_Cn><ExtractedVersionn>30</ExtractedVersionn><Hidden_FormLanguagen/><CompanyAdd_Cn>9 CSC DDA MARKETA-BLOCKnSARASWATI VIHARnDELHInDelhinINDIAn110085</CompanyAdd_Cn><hiddenEmailIDn/><Hidden_Ln>Agreement of Hyp.pdf:2014:Agreement of Mortgage.pdf:2900</Hidden_Ln><Err_Cn/><isDupFlagn>NO</isDupFlagn><PrescruitnyErr_Nn>-1</PrescruitnyErr_Nn><BOFiling_errMsgn/><BOFilingFlagn>NO</BOFilingFlagn><CheckForm_Cn>NO</CheckForm_Cn></frm:Form8_Dtlsn></xfa:datan><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="ChargeHoldersDataConngetBankDetailsRequestDD"n><ChargeHoldersDataConnn><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getBankDetails xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:strFormId dd:nullType="xsi"n/></impl:getBankDetailsn></soap:Bodyn></ChargeHoldersDataConnn></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="GetBOFilingDtlsgetFilingDtlsRequestDD"n><GetBOFilingDtlsn><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getFilingDtls xmlns:impl="http://common.userinterface.backoffice.dca21.com"n><eFilingInDDto dd:nullType="xsi"n><formId dd:nullType="xsi"n/><formUniqueId dd:nullType="xsi"n/></eFilingInDDton></impl:getFilingDtlsn></soap:Bodyn></GetBOFilingDtlsn></dd:dataDescriptionn><dd:dataDescription xmlns:dd="http://ns.adobe.com/data-description/" dd:name="NewCINDataConngetCINLLPINDetails_SCRequestDD"n><NewCINDataConnn><soap:Body xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"n><impl:getCINLLPINDetails_SC xmlns:impl="http://prefill.eforms.userinterface.mydca.dca21.com"n><impl:strCINLLPINn/></impl:getCINLLPINDetails_SCn></soap:Bodyn></NewCINDataConnn></dd:dataDescriptionn></xfa:datasetsn>'

您也可以从表单部分提取:

xml2=xfa[13].getObject().getData()
print(xml2)

最新更新