bs4 funciona muy bien:
Código Python:
Ver original>>> from bs4 import BeautifulSoup
>>> content = BeautifulSoup("<html><p>Hello world!</p></html>")
>>> content
<html><head></head><body><p>Hello world!</p></body></html>
>>> content.get_text()
u'Hello world!'
>>>