We're back after a server migration that caused effbot.org to fall over a bit harder than expected. Expect some glitches.

How do I get data out of HTML?

Try Beautiful Soup:

http://www.crummy.com/software/BeautifulSoup

Beautiful Soup is more forgiving than other parsers in that it won’t choke on bad markup.

If you want to parse HTML into a structure compatible with Python’s ElementTree library, you can use the ElementSoup adapter:

http://effbot.org/zone/element-soup.htm

CATEGORY: tutor