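How do I extract the src from the code below using bs4?
When I fetch the page with requests, the script shows up in the response, but I can't get the src out of it.
I tried page.get("src")
but it doesn't work.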
<script>
function hls() {
document.getElementById("videowrapper_hls").innerHTML = '<iframe src="/redirect/dHlFVXhUNU1wTHB5T2ZnblorQ1k4aVVyV1JKRjRkUEpnaExzeVpUUHZDWmJkMGF2N0pPa3RPdDRQYjFHYmhRWnljclVXa0l0V0hLK3VuSElFTUExNVE9PQ==" scrolling="no" frameborder="0" width="700" height="430" allowfullscreen="true" webkitallowfullscreen="true" mozallowfullscreen="true"></iframe>'
if ($('#videowrapper_video').length > 0) {
document.getElementById("videowrapper_video").innerHTML = ''
}
if ($('#videowrapper_fembed').length > 0) {
document.getElementById("videowrapper_fembed").innerHTML = ''
}
if ($('#videowrapper_gounlimited').length > 0) {
document.getElementById("videowrapper_gounlimited").innerHTML = ''
}
if ($('#videowrapper_mixdrop').length > 0) {
document.getElementById("videowrapper_mixdrop").innerHTML = ''
}
}
</script>
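The iframe is not a real tag in the page; it sits inside a JavaScript string literal within the <script> block, so bs4 sees it only as script text and has no iframe element to call .get("src") on. I'd suggest using a regular expression instead: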
html = '''<script>
function hls() {
document.getElementById("videowrapper_hls").innerHTML = '<iframe src="/redirect/dHlFVXhUNU1wTHB5T2ZnblorQ1k4aVVyV1JKRjRkUEpnaExzeVpUUHZDWmJkMGF2N0pPa3RPdDRQYjFHYmhRWnljclVXa0l0V0hLK3VuSElFTUExNVE9PQ==" scrolling="no" frameborder="0" width="700" height="430" allowfullscreen="true" webkitallowfullscreen="true" mozallowfullscreen="true"></iframe>'
if ($('#videowrapper_video').length > 0) {
document.getElementById("videowrapper_video").innerHTML = ''
}
if ($('#videowrapper_fembed').length > 0) {
document.getElementById("videowrapper_fembed").innerHTML = ''
}
if ($('#videowrapper_gounlimited').length > 0) {
document.getElementById("videowrapper_gounlimited").innerHTML = ''
}
if ($('#videowrapper_mixdrop').length > 0) {
document.getElementById("videowrapper_mixdrop").innerHTML = ''
}
}
</script>'''
import re
match = re.search(r'<iframe src="(.*?)"', html)
src = match.group(1) if match is not None else ''
print(src)
Output:
/redirect/dHlFVXhUNU1wTHB5T2ZnblorQ1k4aVVyV1JKRjRkUEpnaExzeVpUUHZDWmJkMGF2N0pPa3RPdDRQYjFHYmhRWnljclVXa0l0V0hLK3VuSElFTUExNVE9PQ==
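The extracted value is a relative path, so it usually has to be joined with the page URL before it can be requested. The following is a minimal sketch of that step, not a definitive implementation: `PAGE_URL`, `extract_iframe_src`, and the shortened `sample` token are hypothetical names and values chosen for illustration, and the pattern is slightly more permissive than the one above (it accepts single or double quotes and attributes before `src`).

```python
import re
from urllib.parse import urljoin

# Hypothetical page URL (assumption); use the URL you actually passed to requests.get().
PAGE_URL = "https://example.com/watch/123"

# Match the src of the first <iframe ...>, with either quote style
# and any attributes appearing before src.
IFRAME_SRC = re.compile(
    r"""<iframe\b[^>]*?\bsrc\s*=\s*(["'])(.*?)\1""",
    re.IGNORECASE | re.DOTALL,
)

def extract_iframe_src(page_text, base_url):
    """Return the absolute iframe URL found in page_text, or None if absent."""
    match = IFRAME_SRC.search(page_text)
    if match is None:
        return None
    # urljoin resolves a relative path such as /redirect/... against base_url.
    return urljoin(base_url, match.group(2))

# Shortened, made-up token standing in for the real /redirect/... value.
sample = '''<script>
function hls() {
document.getElementById("videowrapper_hls").innerHTML = '<iframe src="/redirect/abc123==" scrolling="no"></iframe>'
}
</script>'''

print(extract_iframe_src(sample, PAGE_URL))
```

Returning `None` rather than an empty string makes a missing iframe easy to tell apart from an empty `src`. In real use, `page_text` would be the `.text` of the response returned by requests.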