utf 8 - 在带有"安全"参数的 utf-8 字符串上使用 python 的urllib.quote_plus - utf 8 - Using python's urllib.quote_plus on utf-8 strings with 'safe' arguments 小贝子编程网

我在python代码中有一个unicode字符串：

name = u'Mayte_Martín'

我想将它与SPARQL查询一起使用，这意味着我应该使用"utf-8"对字符串进行编码，并对其使用urllib.quote_plus或requests.quote。然而，无论使用还是不使用"safe"参数，这两个quote函数的行为都很奇怪。

from urllib import quote_plus

没有"安全"参数：

quote_plus(name.encode('utf-8'))
Output: 'Mayte_Mart%C3%ADn'

带有"安全"参数：

quote_plus(name.encode('utf-8'), safe=':/')
Output: 
---------------------------------------------------------------------------
UnicodeDecodeError                        Traceback (most recent call last)
<ipython-input-164-556248391ee1> in <module>()
----> 1 quote_plus(v, safe=':/')
/usr/lib/python2.7/urllib.pyc in quote_plus(s, safe)
   1273         s = quote(s, safe + ' ')
   1274         return s.replace(' ', '+')
-> 1275     return quote(s, safe)
   1276 
   1277 def urlencode(query, doseq=0):
/usr/lib/python2.7/urllib.pyc in quote(s, safe)
   1264         safe = always_safe + safe
   1265         _safe_quoters[cachekey] = (quoter, safe)
-> 1266     if not s.rstrip(safe):
   1267         return s
   1268     return ''.join(map(quoter, s))
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 10: ordinal not in range(128)

问题似乎出在rstrip函数上。我试着做了一些改变，并称之为…

quote_plus(name.encode('utf-8'), safe=u':/'.encode('utf-8'))

但这并没有解决问题。这里可能有什么问题？

我在回答我自己的问题，这样它可能会帮助其他面临同样问题的人。

当您在执行任何其他操作之前在当前工作区中进行以下导入时，就会出现此特定问题。

from __future__ import unicode_literals

不知何故，这与下面的代码序列不兼容。

from urllib import quote_plus
name = u'Mayte_Martín'
quote_plus(name.encode('utf-8'), safe=':/')

不导入unicode_literals的相同代码也可以正常工作。

根据这个错误，这里有一个解决方法：

#!/usr/bin/env python
# -*- coding: utf-8 -*-
from __future__ import unicode_literals
from urllib import quote_plus
name = u'Mayte_Martín'
quote_plus(name.encode('utf-8'), safe=':/'.encode('utf-8'))

必须将quote或quote_plus方法中的两个参数encode都utf-8

#!/usr/bin/env python
# -*- coding: utf-8 -*-
from __future__ import unicode_literals
import urllib
name = u'Mayte_Martín'
print urllib.quote_plus(name.encode('utf-8'), safe=':/')

工作对我来说没有问题(Py2.7.9，Debian(

(我不知道答案，但我不能就声誉发表评论(

utf 8 - 在带有"安全"参数的 utf-8 字符串上使用 python 的urllib.quote_plus

相关内容

最新更新

热门标签：