从我的ec2实例,我试图运行一个需要html5-parse的python脚本。当我尝试运行python3 -m pip install html5_parser
时,我得到,
Collecting html5_parser
Using cached https://files.pythonhosted.org/packages/a6/78/6a80c13be6c02fd2fb615b473a6ceff5b28bafca184f428ab197f677fd7a/html5-parser-0.4.9.tar.gz
Complete output from command python setup.py egg_info:
Package libxml-2.0 was not found in the pkg-config search path.
Perhaps you should add the directory containing `libxml-2.0.pc'
to the PKG_CONFIG_PATH environment variable
No package 'libxml-2.0' found
Traceback (most recent call last):
File "<string>", line 1, in <module>
File "/tmp/pip-build-bo946lpx/html5-parser/setup.py", line 79, in <module>
include_dirs=include_dirs(),
File "/tmp/pip-build-bo946lpx/html5-parser/build.py", line 94, in include_dirs
return [x[2:] for x in pkg_config('libxml-2.0', '--cflags-only-I')]
File "/tmp/pip-build-bo946lpx/html5-parser/build.py", line 76, in pkg_config
val = subprocess.check_output([PKGCONFIG, pkg] + list(args)).decode('utf-8')
File "/usr/lib/python3.6/subprocess.py", line 356, in check_output
**kwargs).stdout
File "/usr/lib/python3.6/subprocess.py", line 438, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['pkg-config', 'libxml-2.0', '--cflags-only-I']' returned non-zero exit status 1.
----------------------------------------
Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-build-bo946lpx/html5-parser/
我如何在ubuntu上安装pkg-config,以便我最终可以安装html5-parser?
html5-parser
可以通过Ubuntu的apt
安装:
sudo add-apt-repository universe
sudo apt update
sudo apt install python3-html5-parser
html5_parser
需要lxml
。所以从https://lxml.de/installation.html要求:
sudo apt-get install libxml2-dev libxslt-dev python-dev
其余部分见https://html5-parser.readthedocs.io/en/latest/#unix:
pip install --no-binary lxml html5-parser