你可以使用Python的正则表达式库re来提取*.html网页名称。以下是一个示例代码:
import re
text = """
<p class="sub-links-paragraph"> <a class="sub-links-action" href="CREATEFTDCTRADERAPI.html">CreateFtdcTraderApi</a></p>
<p class="sub-links-paragraph"> <a class="sub-links-action" href="RELEASE.html">Release</a></p>
<p class="sub-links-paragraph"> <a class="sub-links-action" href="GETTRADINGDAY.html">GetTradingDay</a></p>
<p class="sub-links-paragraph"> <a class="sub-links-action" href="INIT.html">Init</a></p>
<p class="sub-links-paragraph"> <a class="sub-links-action" href="JOIN.html">Join</a></p>
<p class="sub-links-paragraph"> &nbs...
pattern = r'href="(.*\.html)"'
matches = re.findall(pattern, text)
for match in matches:
print(match)
运行这段代码将输出提取到的*.html网页名称列表:
CREATEFTDCTRADERAPI.html
RELEASE.html
GETTRADINGDAY.html
INIT.html
JOIN.html
REGISTERSPI.html
REGISTERFRONT.html
SUBSCRIBEPRIVATETOPIC.html
SUBSCRIBEPUBLICTOPIC...
内容由零声教学AI助手提供,问题来源于学员提问